使用scrapy从多个网站中查找特定文本 [英] Using scrapy to find specific text from multiple websites

查看：151 发布时间：2020/4/26 9:36:41 web web-crawler scrapy keyword extraction

本文介绍了使用scrapy从多个网站中查找特定文本的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我想抓取/检查多个网站(在同一域中)的特定关键字.我已经找到了该脚本，但是找不到如何添加要搜索的特定关键字.脚本需要做的是找到关键字，并给出在其中找到它的链接的结果.谁能指出我在哪里可以阅读更多有关此的信息? 我一直在阅读 scrapy的文档，但似乎找不到.

I would like to crawl/check multiple websites(on same domain) for a specific keyword. I have found this script, but I can't find how to add the specific keyword to be search for. What the script needs to do is find the keyword, and give the result in which link it was found. Could anyone point me to where i could read more about this ? I have been reading scrapy's documentation, but I can't seem to find this.

谢谢.

class FinalSpider(scrapy.Spider):
name = "final"
allowed_domains = ['example.com']
start_urls = [URL % starting_number]
def __init__(self):
    self.page_number = starting_number

def start_requests(self):
    # generate page IDs from 1000 down to 501
    for i in range (self.page_number, number_of_pages, -1):
        yield Request(url = URL % i, callback=self.parse)

def parse(self, response):
    **parsing data from the webpage**

使用scrapy从多个网站中查找特定文本 [英] Using scrapy to find specific text from multiple websites

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

使用scrapy从多个网站中查找特定文本 [英] Using scrapy to find specific text from multiple websites

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭