Prevent Custom Web Crawler from being blocked


Problem Description

I am creating a new web crawler in C# to crawl some specific websites. Everything goes fine, but the problem is that some websites block my crawler's IP address after a certain number of requests. I tried adding delays between my crawl requests, but that did not work.

Is there any way to prevent websites from blocking my crawler? Solutions like the following would help (but I need to know how to apply them):


  • simulating Googlebot or Yahoo Slurp (see the User-Agent sketch below)
  • using multiple IP addresses (even fake IP addresses) as the crawler's client IP

Any solution would help.
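Regarding the first idea, the simplest version is to send a search engine's User-Agent header with each request. Below is a minimal C# sketch using HttpClient; the target URL is a placeholder, and note that sites which verify real Googlebot traffic via reverse DNS will still detect the spoofed header.

```csharp
using System;
using System.Net.Http;
using System.Threading.Tasks;

class CrawlerUserAgent
{
    static async Task Main()
    {
        using var client = new HttpClient();

        // Googlebot's published desktop User-Agent string. Many sites verify
        // real Googlebot traffic via reverse DNS, so spoofing the header
        // alone may not be enough (and may violate a site's terms of service).
        client.DefaultRequestHeaders.UserAgent.ParseAdd(
            "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)");

        // Placeholder target URL.
        string html = await client.GetStringAsync("https://example.com/");
        Console.WriteLine($"Fetched {html.Length} characters");
    }
}
```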

Recommended Answer

If speed/throughput is not a huge concern, then probably the best solution is to install Tor and Privoxy and route your crawler through them. Your crawler will then have a randomly changing IP address.
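A minimal C# sketch of that setup, assuming Tor and Privoxy are running locally on their default ports (Privoxy listening on 8118 and forwarding to Tor's SOCKS port 9050):

```csharp
using System;
using System.Net;
using System.Net.Http;
using System.Threading.Tasks;

class TorCrawler
{
    static async Task Main()
    {
        // Privoxy listens on 127.0.0.1:8118 by default; it forwards to Tor
        // when its config contains a line like:
        //   forward-socks5 / 127.0.0.1:9050 .
        var handler = new HttpClientHandler
        {
            Proxy = new WebProxy("http://127.0.0.1:8118"),
            UseProxy = true
        };

        using var client = new HttpClient(handler);

        // check.torproject.org reports whether the request arrived via Tor.
        string html = await client.GetStringAsync("https://check.torproject.org/");
        Console.WriteLine(html.Contains("Congratulations")
            ? "Traffic is routed through Tor"
            : "Traffic is NOT routed through Tor");
    }
}
```

Tor builds new circuits periodically, so the exit IP changes over time; a new circuit (and thus a new exit IP) can also be requested explicitly via Tor's control port with the NEWNYM signal.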

This is a very effective technique if you need to crawl sites that do not want you crawling them. It also provides a layer of protection/anonymity by making the activities of your crawler very difficult to trace back to you.

Of course, if sites are blocking your crawler because it is just going too fast, then perhaps you should just rate-limit it a bit.
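A simple rate-limiting sketch in the same vein; the delay value and URLs below are placeholders, and the delay should be tuned to whatever the target site tolerates:

```csharp
using System;
using System.Net.Http;
using System.Threading.Tasks;

class RateLimitedCrawler
{
    static readonly HttpClient Client = new HttpClient();

    // Hypothetical delay; tune per target site.
    static readonly TimeSpan DelayBetweenRequests = TimeSpan.FromSeconds(5);

    static async Task Main()
    {
        // Placeholder URLs standing in for a real crawl frontier.
        string[] urls = { "https://example.com/page1", "https://example.com/page2" };

        foreach (var url in urls)
        {
            string html = await Client.GetStringAsync(url);
            Console.WriteLine($"{url}: {html.Length} characters");

            // Pause before the next request so the site is not hammered.
            await Task.Delay(DelayBetweenRequests);
        }
    }
}
```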

