如何抓取实现“我不是机器人”的网站? noCAPTCHA? [英] How do I scrape a website that implements the "I'm not a robot" noCAPTCHA?

查看:147
本文介绍了如何抓取实现“我不是机器人”的网站? noCAPTCHA?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述





我正试图抓住一个实现JavaScriptnoCAPTCHA ReCAPTCHA的网站,我不想聘请一些印度人来简单回答谜题...我想要实际上要么回答CAPTCHA,要么回过头来这样我就可以得到我想要的数据..



有问题的CAPTCHA是基于JAVAScript并且来自Google API ...它让你选中一个方框,上面写着我不是机器人,然后它会显示九个图像,你必须单击正确的图像才能释放CAPTCHA。 />


有没有人想出如何打破这些事情?



B

Hi,

I am trying to scrape a website that implements a JavaScript "noCAPTCHA ReCAPTCHA" and I don't want to hire some India people to simply answer the puzzles...I want to actually either answer the CAPTCHA or some how get past it so I can to the data I want..

The CAPTCHA in question is JAVAScript based and comes from the Google API...it has you check a box saying "I'm not a robot," and then it shows you nine images, and you must click the correct images for the CAPTCHA to release.

Has anybody figured out how to break these sorts of things?

B

推荐答案

你想要破坏反机器人保护以获得你的机器人程序做一些网站作者不希望你做的事情。

这是黑客活动,所以你在错误的地方。



否则,你主要有2个解决方案

- 创建一个能够阅读图片并从阅读中得到答案的应用程序。

- 创建一个应用程序记住所有图片及其答案,以便自动回答。

这两种解决方案都需要付出巨大努力,而不是比雇用一些来自印度的人来回答问题要便宜。



noCAPTCHA ReCAPTCHA专门用来打败你想做的事。
You want to defeat an anti-bot protection in order to have your bot program to do something that the author of the site don't want you to do.
that is hacking activity, so you are at the wrong place.

Otherwise, you have mainly 2 solutions
- Create an app that is able to read the pictures and get the answer from the reading.
- Create an app that memorise all the pictures and their answers, in order to answer automatically.
Both solutions require a huge effort that may not be cheaper than hiring some people from India to answer the questions.

The "noCAPTCHA ReCAPTCHA" is made especially to defeat what you want to do.


这篇关于如何抓取实现“我不是机器人”的网站? noCAPTCHA?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆