获取所有ios应用程序的全局列表 [英] Getting global list of all ios apps

查看:279
本文介绍了获取所有ios应用程序的全局列表的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我想在apple app store上做一些全局统计。一个瓶颈是至少获得所有当前活动应用程序的ID(这9位数字)。有没有人知道如何获得iOS应用商店中当前活动应用的所有ID的完整列表。甚至更好的特定类别的所有ID(比如教育)。



我尝试了几种方法,但没有一种可靠。




  • 抓取这些站点地图文件。它们引用了更多的站点地图文件,引用了更多的站点地图文件等等。一些站点地图采用XML格式,一些采用gzip格式,一些采用JSON格式。排除名称中包含 itunes_music 的任何站点地图,因为这些站点地图不包含应用。



    大约有 12,500个站点地图。每秒4个站点地图,您可以在一小时内下载站点地图。


  • 站点地图包含 loc 元素和 xhtml:link [href] [rel = alternate] 元素。解析XML,并获取这些值。


  • 解析语言环境/国家/地区代码和ID的每个URL。


  • 我以这种方式发现了719,344个ID。


    i want to do some global statistics on the apple app store. One bottleneck is getting getting at least the ids (these 9 digit numbers) of all currently active apps. Does anyone know a way to get a complete list of all ids of currently active apps on the iOS app store. Even better all ids of a specific category (say "education").

    I tried several ways but none is reliable.

    There is a list of apps at http://iphoneapplicationlist.com/apps/ipad/ but the list stops at November 2012. Also apple provides a search list at https://itunes.apple.com/us/genre/ios-education/id6017?mt=8 but this list turns out to be incomplete. (If I compare it with the other list there are more than 8000 apps missing, In particular my own one :-) ).

    So my qeustion is: provided one can oneself do some php and javascript programming is there a good way of getting all app ids of all apps.

    解决方案

    I cannot find a way to get all mobile apps, but the iTunes sitemaps files seem to contain a list of apps that are recently updated. (I'm not sure exactly which apps are contained in the sitemaps files -- I will update this answer once I work it out.)

    To get this list:

    1. Crawl

    2. Crawl those sitemap files. They reference more sitemap files, which reference more sitemap files, etc, etc. Some sitemaps are in XML format, some are in gzipped XML format and some are in JSON format. Exclude any sitemap that contains itunes_music in the name, as those do not contain apps.

      There are about 12,500 sitemaps of interest. At 4 sitemaps per second, you can download the sitemaps in under an hour.

    3. The sitemaps contain loc elements and xhtml:link[href][rel=alternate] elements. Parse the XML, and get these values.

    4. Parse each URL for the locale/country code and IDs.

    I discovered 719,344 IDs in this way.

    这篇关于获取所有ios应用程序的全局列表的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

    查看全文
    登录 关闭
    扫码关注1秒登录
    发送“验证码”获取 | 15天全站免登陆