谷歌播放评论刮刮变化 [英] Google play review scraping changes
问题描述
最近Google Play界面的更新改变了HTML结构,但似乎也实施了一些抵制刮蹭的保护措施。现在有一个令牌参数,它可能会改变某种类型的会话ID,并且我无法生成,因为我不确定它是什么种子。此外,我发现它似乎阻止请求客户端发出多个不符合接口的呼叫,因为在未成功拨打电话后,我甚至无法在任何浏览器中加载Google Play界面。过了一段时间,这似乎超时了。不确定这一点,但这是我从我看到的结论。
谢谢
$ b感谢您购买此类似的问题,并找到解决方法吗? $ b
试试这个: www.scrape4me.com
确实显示错误,但输出内容:
http://scrape4me.com/api ?url = https%3A%2F%2Fplay.google.com%2Fstore%2Fapps%2Fdetails%3Fid%3Dcom.com2us.golfstarworldtour.normal.freefull.google.global.android.common& elm =& ch = ch
Over the past year or so I have created a number of scripts to scrape Android app reviews from Google Play. In the past this was working fine by mimicking the Google Play interface to call https://play.google.com/store/getreviews with the necessary parameters and parse the HTML results.
The recent updates to the Google Play interface changed the HTML structure, but also seems to implement some kind of protection against scraping. There is now a "token" parameter which changes, presumably some kind of session ID, and which I have not been able to generate as I'm not sure of what seeds it. Also I've found that it seems to block requesting clients that make multiple calls that don't conform to the interface, as after an unsuccessful call I can't even load the Google Play interface in any browser. After a while this seems to time out. Not certain of this, but it's what I've concluded from what I'm seeing.
Anyone found this similar problem, and found a way around it?
Thanks
Give this a try: www.scrape4me.com
It does show an error but it outpouts content:
http://scrape4me.com/api?url=https%3A%2F%2Fplay.google.com%2Fstore%2Fapps%2Fdetails%3Fid%3Dcom.com2us.golfstarworldtour.normal.freefull.google.global.android.common&elm=&ch=ch
这篇关于谷歌播放评论刮刮变化的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!