scrapy can't crawl all links in a page


Question

I am trying to use Scrapy to crawl an AJAX website: http://play.google.com/store/apps/category/GAME/collection/topselling_new_free

I want to get all the links pointing to each game.

I inspected the elements of the page (screenshot: how the page looks), so I want to extract all links matching the pattern /store/apps/details?id=

But when I ran commands in the shell (screenshot: shell command), it returned nothing.

I've also tried //a/@href; that didn't work either, and I don't know what's going wrong.
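For reference, the extraction the question is after can be sketched with the standard library alone; `AppLinkParser` is an illustrative name, and the tiny HTML snippet is made up. As the answer explains, these links are injected by an AJAX POST request, so running a selector like this against the static page source finds nothing.

```python
from html.parser import HTMLParser

class AppLinkParser(HTMLParser):
    """Collect every <a href> that matches /store/apps/details?id=."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == 'a':
            href = dict(attrs).get('href', '')
            if href.startswith('/store/apps/details?id='):
                self.links.append(href)

# Hypothetical static HTML; the real page injects these anchors via AJAX.
html = ('<a href="/store/apps/details?id=com.example.game">Game</a>'
        '<a href="/about">About</a>')
parser = AppLinkParser()
parser.feed(html)
print(parser.links)  # ['/store/apps/details?id=com.example.game']
```

In a Scrapy shell the equivalent selector would be something like `response.xpath('//a[contains(@href, "/store/apps/details?id=")]/@href')`, which on this page returns an empty list for the reason given in the answer.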

  • Update: I can now crawl the first 120 links, after modifying the start URL and adding "formdata" as someone suggested, but no more links load after that.

Can someone help me?

Answer

It's actually an AJAX POST request that populates the data on that page. You won't see it in the Scrapy shell; instead of inspecting the element, check the Network tab, where you will find the request.

Make a POST request to the URL https://play.google.com/store/apps/category/GAME/collection/topselling_new_free?authuser=0 with formdata={'start':'0','num':'60','numChildren':'0','ipf':'1','xhr':'1'}

Increment start by 60 on each request to get the paginated results.
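The pagination arithmetic above can be sketched with the standard library; `BASE_URL` is the URL given in the answer, the form fields are the ones quoted there, and `page_formdata` is an illustrative helper name.

```python
from urllib.parse import urlencode

BASE_URL = ('https://play.google.com/store/apps/category/GAME/'
            'collection/topselling_new_free?authuser=0')

def page_formdata(page):
    # Each POST returns up to 60 results; 'start' advances by 60 per page.
    return {'start': str(page * 60), 'num': '60',
            'numChildren': '0', 'ipf': '1', 'xhr': '1'}

# URL-encoded bodies for the first three POST requests.
bodies = [urlencode(page_formdata(p)) for p in range(3)]
print(bodies[0])  # start=0&num=60&numChildren=0&ipf=1&xhr=1
```

In a Scrapy spider you would send each page with `scrapy.FormRequest(BASE_URL, formdata=page_formdata(p), callback=self.parse)`, incrementing `p` in the callback until a response comes back with no more links.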
