How "download_slot" works within scrapy


Question

I've created a script in scrapy to parse the author name of different posts from its landing page and then pass it to the parse_page method using the meta keyword, in order to print the post content along with the author name at the same time.

I've used download_slot within the meta keyword, which allegedly makes the script run faster. Although it is not necessary to comply with the logic I tried to apply here, I would like to stick to it only to understand how download_slot works within any script and why. I searched a lot to learn more about download_slot, but I ended up with links like this one.

An example usage of download_slot (I'm not quite sure about it though):

from scrapy.crawler import CrawlerProcess
from scrapy import Request
import scrapy

class ConventionSpider(scrapy.Spider):
    name = 'stackoverflow'
    start_urls = ['https://stackoverflow.com/questions/tagged/web-scraping']

    def parse(self, response):
        # Grab each post summary on the landing page and pull out the
        # author name and the link to the full question.
        for link in response.css('.summary'):
            name = link.css('.user-details a::text').extract_first()
            url = link.css('.question-hyperlink::attr(href)').extract_first()
            nurl = response.urljoin(url)
            # Carry the author name along in meta; setting "download_slot"
            # also tells the downloader which slot to queue this request in.
            yield Request(nurl, callback=self.parse_page,
                          meta={'item': name, "download_slot": name})

    def parse_page(self, response):
        elem = response.meta.get("item")
        post = ' '.join(response.css("#question .post-text p::text").extract())
        yield {'Name': elem, 'Main_Content': post}

if __name__ == "__main__":
    process = CrawlerProcess({
        'USER_AGENT': 'Mozilla/5.0',
    })
    process.crawl(ConventionSpider)
    process.start()

The above script runs flawlessly.

My question: how does download_slot work within scrapy?

Answer

Let's start with the Scrapy architecture. When you create a scrapy.Request, the Scrapy engine passes it to the downloader to fetch the content. The downloader puts incoming requests into slots, which you can imagine as independent queues of requests. The queues are then polled and each individual request gets processed (its content gets downloaded).

Now, here's the crucial part. To determine which slot to put an incoming request into, the downloader checks request.meta for the download_slot key. If it is present, it puts the request into the slot with that name (and creates the slot if it doesn't exist yet). If the download_slot key is not present, it puts the request into the slot for the domain (more accurately, the hostname) that the request's URL points to.
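
A minimal sketch of that selection logic (simplified from what the downloader does; FakeRequest and get_slot_key here are illustrative stand-ins, not Scrapy's actual API):

from urllib.parse import urlparse

class FakeRequest:
    """Illustrative stand-in for scrapy.Request, just enough for this sketch."""
    def __init__(self, url, meta=None):
        self.url = url
        self.meta = meta or {}

def get_slot_key(request):
    # An explicit "download_slot" in meta wins ...
    if "download_slot" in request.meta:
        return request.meta["download_slot"]
    # ... otherwise the slot is named after the URL's hostname.
    return urlparse(request.url).hostname or ""

print(get_slot_key(FakeRequest("https://stackoverflow.com/q/1")))
# stackoverflow.com
print(get_slot_key(FakeRequest("https://stackoverflow.com/q/2",
                               meta={"download_slot": "some-author"})))
# some-author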

This explains why your script runs faster: you create multiple downloader slots because they are named after the authors. If you did not, every request would go into the same slot, based on the domain (which is always stackoverflow.com). Thus, you effectively increase the parallelism of downloading the content.
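
This matters because throttling settings such as DOWNLOAD_DELAY and CONCURRENT_REQUESTS_PER_DOMAIN are enforced per slot, not globally. A hedged sketch of how that plays out with the spider above (the values are illustrative, not recommendations):

from scrapy.crawler import CrawlerProcess

# With per-author slots as in the spider above, the delay and concurrency
# caps below apply to each author's queue separately, instead of
# serializing every request behind a single "stackoverflow.com" slot.
process = CrawlerProcess({
    'USER_AGENT': 'Mozilla/5.0',
    'DOWNLOAD_DELAY': 1,                   # wait ~1s between requests, per slot
    'CONCURRENT_REQUESTS_PER_DOMAIN': 2,   # max in-flight requests, per slot
})
process.crawl(ConventionSpider)  # the spider defined earlier
process.start()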

This explanation is a little simplified, but it should give you a picture of what's going on. You can check the Scrapy source code yourself.

