Scrapy Crawling Speed is Slow (60 pages / min)

Views: 541 | Posted: 2020/11/24 23:15:10 | Tags: python, http, scrapy, web-crawler


Problem description

I am experiencing slow crawl speeds with Scrapy (around 1 page/sec). I'm crawling a major website from AWS servers, so I don't think it's a network issue. CPU utilization is nowhere near 100%, and if I start multiple Scrapy processes the crawl speed is much faster.

Scrapy seems to crawl a bunch of pages, hang for several seconds, and then repeat.

I've tried playing with: CONCURRENT_REQUESTS = CONCURRENT_REQUESTS_PER_DOMAIN = 500

but this doesn't really seem to move the needle past about 20.
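For reference, here is a minimal sketch of how the two settings from the question might look in a project's settings.py. The three settings listed after them are not mentioned in the question; they are other built-in Scrapy settings that can also limit throughput, shown here with their documented default values as an assumption about what a project might be running with. Whether any of them explains the slowdown described here is not established, and their availability can depend on the Scrapy version.

```python
# settings.py (sketch): the two values from the question, written out.
CONCURRENT_REQUESTS = 500             # global cap on requests in flight
CONCURRENT_REQUESTS_PER_DOMAIN = 500  # cap on requests in flight per domain

# Assumption: not part of the question. Other Scrapy settings that can
# limit throughput, shown with their documented defaults.
CONCURRENT_REQUESTS_PER_IP = 0   # if non-zero, applied instead of the per-domain cap
DOWNLOAD_DELAY = 0               # seconds to wait between requests to the same site
AUTOTHROTTLE_ENABLED = False     # when True, delays are adjusted dynamically
```

Checking the effective values with `scrapy settings --get CONCURRENT_REQUESTS` (run inside the project) is one way to confirm that the overrides are actually being picked up.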
