Python 2.6: parallel parsing with urllib2

Problem description

I'm currently retrieving and parsing pages from a website using urllib2. However, there are many of them (more than 1000), and processing them sequentially is painfully slow.
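
For reference, the sequential version being described might look something like this minimal sketch (the URL list and the parse_page() function are hypothetical placeholders for the real site and parser):

    import urllib2

    def parse_page(html):
        pass  # stand-in for the real parsing logic

    # hypothetical list of pages to fetch
    urls = ["http://example.com/page/%d" % i for i in xrange(1, 1001)]

    for url in urls:
        html = urllib2.urlopen(url, timeout=30).read()  # one request at a time
        parse_page(html)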

I was hoping there was a way to retrieve and parse pages in a parallel fashion. If that's a good idea, is it possible, and how do I do it?

Also, what are "reasonable" values for the number of pages to process in parallel (I wouldn't want to put too much strain on the server or get banned because I'm using too many connections)?

Thanks!

Answer

You can always use threads (i.e. run each download in a separate thread). For large numbers of pages this could hog too many resources, in which case I recommend you take a look at gevent, and specifically this example, which may be just what you need.
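
A minimal sketch of the thread-based approach, assuming a hypothetical parse_page() function and a fixed pool of worker threads fed from a Queue (the pool size of 10 is just a guess, tied to the question about reasonable limits):

    import threading
    import urllib2
    from Queue import Queue  # Python 2.x module name

    NUM_WORKERS = 10  # assumed concurrency; tune to what the server tolerates

    def parse_page(html):
        pass  # stand-in for the real parsing logic

    def worker(queue):
        while True:
            url = queue.get()
            try:
                html = urllib2.urlopen(url, timeout=30).read()
                parse_page(html)
            except urllib2.URLError:
                pass  # log or retry as appropriate
            finally:
                queue.task_done()

    def fetch_all(urls):
        queue = Queue()
        for _ in xrange(NUM_WORKERS):
            t = threading.Thread(target=worker, args=(queue,))
            t.daemon = True  # workers exit with the main program
            t.start()
        for url in urls:
            queue.put(url)
        queue.join()  # wait until every queued URL has been processed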

(from gevent.org: "gevent is a coroutine-based Python networking library that uses greenlet to provide a high-level synchronous API on top of the libevent event loop")
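
This is not the example the answer refers to, but a rough sketch of what a gevent version could look like, assuming gevent is installed: monkey-patching makes urllib2's sockets cooperative, and the Pool caps the number of simultaneous connections.

    from gevent import monkey
    monkey.patch_all()  # patch sockets as early as possible

    import urllib2
    from gevent.pool import Pool

    def fetch(url):
        try:
            return url, urllib2.urlopen(url, timeout=30).read()
        except urllib2.URLError:
            return url, None  # mark failures instead of crashing the pool

    def fetch_all(urls, concurrency=10):  # concurrency value is a guess
        pool = Pool(concurrency)
        return pool.map(fetch, urls)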
