Python Scrapy:allowed_domains 从数据库添加新域 [英] Python Scrapy : allowed_domains adding new domains from database

查看：43 发布时间：2021/7/16 22:15:59 screen-scraping web-scraping scrapy

本文介绍了Python Scrapy:allowed_domains 从数据库添加新域的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我需要向 allowed_domains 添加更多域，所以我没有收到过滤的异地请求".

I need to add more domains to allowed_domains , so I dnt get the " Filtered offsite request to ".

我的应用需要从数据库中获取 url，所以我无法手动添加它们.

My app gets urls to fetch from a database, so I cant add them manually.

我试图覆盖蜘蛛 init

喜欢这个

 def __init__(self):
        super( CrawlSpider, self ).__init__()
        self.start_urls = []
        for destination in Phpbb.objects.filter(disable=False):
                self.start_urls.append(destination.forum_link)

            self.allowed_domains.append(destination.link)

start_urls 很好，这是我要解决的第一个问题.但 allow_domains 没有影响.

start_urls was fine, this was my first issue to solve. but the allow_domains makes no affect.

我需要更改一些配置以禁用域检查?我不想要这个，因为我只想要数据库中的那些，但它现在可以帮助我禁用域检查.

I need to change some configuration in order to disable domain checking? I dont want this since I only want the ones from the database, but It could help me for now to disable domain check.

谢谢！！

Python Scrapy:allowed_domains 从数据库添加新域 [英] Python Scrapy : allowed_domains adding new domains from database

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

Python Scrapy:allowed_domains 从数据库添加新域 [英] Python Scrapy : allowed_domains adding new domains from database

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭