Scrapy Limit Requests For Testing

Problem Description

I've been searching the Scrapy documentation for a way to limit the number of requests my spiders are allowed to make. During development I don't want to sit here and wait for my spiders to finish an entire crawl; even though the crawls are pretty focused, they can still take quite a while.

I want the ability to say, "After x requests to the site I'm scraping, stop generating new requests."

I was wondering if there is a setting for this that I may have missed, or some other way to do it using the framework, before I try to come up with my own solution.

I was considering implementing a downloader middleware that would keep track of the number of requests being processed and stop passing them to the downloader once a limit has been reached. But like I said, I'd rather use a mechanism already in the framework if possible.
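For reference, a minimal sketch of what such a middleware might look like; the class name RequestLimitMiddleware and the MAX_REQUESTS setting are made up for illustration and are not part of Scrapy:

# Hypothetical sketch, not a built-in Scrapy component.
from scrapy.exceptions import IgnoreRequest

class RequestLimitMiddleware:
    def __init__(self, max_requests):
        self.max_requests = max_requests
        self.count = 0

    @classmethod
    def from_crawler(cls, crawler):
        # Read the limit from the (made-up) MAX_REQUESTS setting; 0 means "no limit".
        return cls(crawler.settings.getint("MAX_REQUESTS", 0))

    def process_request(self, request, spider):
        if self.max_requests and self.count >= self.max_requests:
            # Drop the request instead of passing it on to the downloader.
            raise IgnoreRequest(f"Request limit of {self.max_requests} reached")
        self.count += 1
        return None  # continue normal processing

It would still have to be registered in DOWNLOADER_MIDDLEWARES and configured by hand, which is part of why a built-in mechanism would be preferable.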

Any ideas? Thanks.

Recommended Answer

You are looking for the CLOSESPIDER_PAGECOUNT setting of the CloseSpider extension:

An integer which specifies the maximum number of responses to crawl. If the spider crawls more than that, the spider will be closed with the reason closespider_pagecount. If zero (or not set), spiders won't be closed by number of crawled responses.
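For example, to stop a crawl after roughly 10 responses during development, the setting can be passed on the command line (the spider name and the value 10 are placeholders):

scrapy crawl myspider -s CLOSESPIDER_PAGECOUNT=10

or set per spider via custom_settings:

import scrapy

class MySpider(scrapy.Spider):
    name = "myspider"
    start_urls = ["https://example.com/"]  # placeholder
    # Close the spider once 10 responses have been crawled.
    custom_settings = {"CLOSESPIDER_PAGECOUNT": 10}

    def parse(self, response):
        pass

Note that the limit counts responses received, and the shutdown is graceful, so a few in-flight requests may still complete after the threshold is reached.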
