谷歌播放评论刮刮变化 [英] Google play review scraping changes

查看:147
本文介绍了谷歌播放评论刮刮变化的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

在过去一年左右,我创建了大量脚本来从Google Play中刮取Android应用评价。过去,通过模仿Google Play界面可以很好地工作,以便 https://play.google.com/存储/ getreviews 和必要的参数并解析HTML结果。

最近Google Play界面的更新改变了HTML结构,但似乎也实施了一些抵制刮蹭的保护措施。现在有一个令牌参数,它可能会改变某种类型的会话ID,并且我无法生成,因为我不确定它是什么种子。此外,我发现它似乎阻止请求客户端发出多个不符合接口的呼叫,因为在未成功拨打电话后,我甚至无法在任何浏览器中加载Google Play界面。过了一段时间,这似乎超时了。不确定这一点,但这是我从我看到的结论。



谢谢

$ b

感谢您购买此类似的问题,并找到解决方法吗? $ b

解决方案

试试这个: www.scrape4me.com



确实显示错误,但输出内容:

  http://scrape4me.com/api ?url = https%3A%2F%2Fplay.google.com%2Fstore%2Fapps%2Fdetails%3Fid%3Dcom.com2us.golfstarworldtour.normal.freefull.google.global.android.common& elm =& ch = ch 


Over the past year or so I have created a number of scripts to scrape Android app reviews from Google Play. In the past this was working fine by mimicking the Google Play interface to call https://play.google.com/store/getreviews with the necessary parameters and parse the HTML results.

The recent updates to the Google Play interface changed the HTML structure, but also seems to implement some kind of protection against scraping. There is now a "token" parameter which changes, presumably some kind of session ID, and which I have not been able to generate as I'm not sure of what seeds it. Also I've found that it seems to block requesting clients that make multiple calls that don't conform to the interface, as after an unsuccessful call I can't even load the Google Play interface in any browser. After a while this seems to time out. Not certain of this, but it's what I've concluded from what I'm seeing.

Anyone found this similar problem, and found a way around it?

Thanks

解决方案

Give this a try: www.scrape4me.com

It does show an error but it outpouts content:

http://scrape4me.com/api?url=https%3A%2F%2Fplay.google.com%2Fstore%2Fapps%2Fdetails%3Fid%3Dcom.com2us.golfstarworldtour.normal.freefull.google.global.android.common&elm=&ch=ch

这篇关于谷歌播放评论刮刮变化的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆