Recursive use of Scrapy to scrape webpages from a website
Question
I have recently started working with Scrapy. I am trying to gather some info from a large list which is divided into several pages (about 50). I can easily extract what I want from the first page by including it in the start_urls list. However, I don't want to add the links to all 50 pages to this list; I need a more dynamic way. Does anyone know how I can iteratively scrape web pages? Does anyone have any examples of this?
Thanks!
Answer
Use urllib2 to download a page. Then use either re (regular expressions) or BeautifulSoup (an HTML parser) to find the link to the next page you need. Download that page with urllib2. Rinse and repeat.
Scrapy is great, but you don't need it for what you're trying to do.
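To make the rinse-and-repeat loop concrete, here is a minimal sketch of the re-based variant. The page contents, URLs, and the "Next" link markup are made up for illustration; in a real crawl each page would come from urllib2.urlopen(url).read() (urllib.request.urlopen on Python 3) instead of a dict of fake pages.

```python
import re

# Fake pages standing in for downloaded HTML, so the pagination loop
# itself is easy to see. The URLs and markup are hypothetical.
FAKE_PAGES = {
    "/list?page=1": '<a href="/list?page=2">Next</a> item A',
    "/list?page=2": '<a href="/list?page=3">Next</a> item B',
    "/list?page=3": "item C",  # last page: no Next link
}

NEXT_LINK = re.compile(r'<a href="([^"]+)">Next</a>')

def find_next_page(html):
    """Return the URL of the next page, or None if there is none."""
    match = NEXT_LINK.search(html)
    return match.group(1) if match else None

def crawl(start_url):
    """Follow Next links page by page, collecting each page's HTML."""
    url, pages = start_url, []
    while url is not None:
        html = FAKE_PAGES[url]      # real code: urlopen(url).read()
        pages.append(html)
        url = find_next_page(html)  # rinse and repeat
    return pages

pages = crawl("/list?page=1")  # walks all three pages, then stops
```

The same loop works with BeautifulSoup by swapping the regex for something like soup.find("a", string="Next"), which is more robust against markup changes than a regular expression.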