我怎样才能prevent被屏幕刮我的asp.net网站? [英] How can I prevent my asp.net site from being screen scraped?

查看:114
本文介绍了我怎样才能prevent被屏幕刮我的asp.net网站?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

如何prevent我的asp.net 3.5的网站被屏幕刮我的竞争对手? 理想情况下,我想,以确保没有webbots或screenscrapers可以从我的网站中提取数据。

How can I prevent my asp.net 3.5 website from being screen scraped by my competitor? Ideally, I want to ensure that no webbots or screenscrapers can extract data from my website.

有没有一种方法来检测存在的WebBot或屏幕刮跑?

Is there a way to detect that there is a webbot or screen scraper running ?

推荐答案

这是可能的尝试检测画面撷取:

It is possible to try to detect screen scrapers:

使用cookies和时间,这将使它更难对于那些开箱屏幕刮刀。同时检查JavaScript的支持,最刮不用它。检查元数据浏览器以验证它确实是一个Web浏览器。

Use cookies and timing, this will make it harder for those out of the box screen scrapers. Also check for javascript support, most scrapers do not have it. Check Meta browser data to verify it is really a web browser.

您也可以为您在一分钟的请求,用户开着浏览器只能使一小部分每分钟的请求,这样的逻辑在服务器上检测到每分钟的请求过多可能presume该屏幕拼抢发生和prevent访问从有问题的IP地址为一段时间。如果这开始影响到爬虫,日志被阻止用户的IP地址,并根据需要开始使自己的IP地址。

You can also check for requests in a minute, a user driving a browser can only make a small number of requests per minute, so logic on the server that detects too many requests per minute could presume that screen scraping is taking place and prevent access from the offending IP address for some period of time. If this starts to affect crawlers, log the users ip that is blocked, and start allowing their IPs as needed.

您可以使用 http://www.copyscape.com/ 也proect您的内容,这至少会告诉你谁被重用你的数据。

You can use http://www.copyscape.com/ to proect your content also, this will at least tell you who is reusing your data.

请参阅这个问题也:

<一个href="http://stackoverflow.com/questions/396817/protection-from-screen-scraping">http://stackoverflow.com/questions/396817/protection-from-screen-scraping

另外看看

http://blockscraping.com/

关于屏幕抓取尼斯DOC:

Nice doc about screen scraping:

<一个href="http://www.realtor.org/wps/wcm/connect/5f81390048be35a9b1bbff0c8bc1f2ed/scraping_sum_jun_04.pdf?MOD=AJPERES&CACHEID=5f81390048be35a9b1bbff0c8bc1f2ed">http://www.realtor.org/wps/wcm/connect/5f81390048be35a9b1bbff0c8bc1f2ed/scraping_sum_jun_04.pdf?MOD=AJPERES&CACHEID=5f81390048be35a9b1bbff0c8bc1f2ed

如何prevent屏幕抓取:

How to prevent screen scraping:

<一个href="http://mvark.blogspot.com/2007/02/how-to-$p$pvent-screen-scraping.html">http://mvark.blogspot.com/2007/02/how-to-$p$pvent-screen-scraping.html

这篇关于我怎样才能prevent被屏幕刮我的asp.net网站?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆