如何在python中与beautifulsoup并行地刮除多个html页面? [英] How to scrap multiple html page in parallel with beautifulsoup in python?

查看：53 发布时间：2020/5/14 0:09:28 python django multithreading beautifulsoup python-multithreading

本文介绍了如何在python中与beautifulsoup并行地刮除多个html页面?的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我正在使用Django网络框架在Python中制作一个网络抓取应用程序.我需要使用beautifulsoup库取消多个查询.这是我编写的代码的快照:

I'm making a webscraping app in Python with Django web framework. I need to scrap multiple queries using beautifulsoup library. Here is snapshot of code that I have written:

for url in websites:
    r = requests.get(url)
    soup = BeautifulSoup(r.content)
    links = soup.find_all("a", {"class":"dev-link"})

实际上，这里的网页抓取是按顺序进行的，我想以并行方式运行它.我对使用Python线程并不太了解. 有人可以告诉我，如何并行进行报废?任何帮助，将不胜感激.

Actually here the scraping of webpage is going sequentially, I want to run it in parallel manner. I don't have much idea about threading in Python. can someone tell me, How can I do scrap in parallel manner? Any help would be appreciated.

如何在python中与beautifulsoup并行地刮除多个html页面? [英] How to scrap multiple html page in parallel with beautifulsoup in python?

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录关闭

如何在python中与beautifulsoup并行地刮除多个html页面? [英] How to scrap multiple html page in parallel with beautifulsoup in python?

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录 关闭

登录关闭