如何提取Twitter的API历史的鸣叫? [英] How to extract the historical tweets from twitter API?

查看:367
本文介绍了如何提取Twitter的API历史的鸣叫?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我们需要一些电影历史的鸣叫。现在,我们试图从Twitter流API和搜索API。流API无法给我们一个参数来选择我们需要和搜索API只能给我们数据的一个或提前两周时间范围。有没有办法让我们提取2014-05-01历史鸣叫2014年7月1日(例如)?我发现下面的方式,是可以做到的是:

We need the historical tweets for some movies. Right now, we tried the streaming API and search API from twitter. The streaming API could not give us a parameter to choose the time range we need and the search API could only give us data one or two weeks in advance. Is there a way for us to extract the historical tweets from 2014-05-01 to 2014-07-01 (For example)? I found the following ways that is possible to do that:

1:Twitter的高级搜索
https://twitter.com/search-advanced?lang=en
它可以找到搜索结果,我需要。但是,我怎么能下载的搜索结果?反正是有写一个code和保存搜索结果?

1: Twitter advanced search https://twitter.com/search-advanced?lang=en It could find the search result I need. But how could I download the search result? Is there anyway to write a code and save the search result?

2:使用Twitter的网站分析颠倒一样。但它也有保存结果的难度。

2:Using the twitter analytic website like topsy. But it also has the difficulty to save the result.

3:看来Twitter4J这类一些软件包可以与帮助:
http://twitter4j.org/en/$c$c-examples.html
是否有任何的Python或R包能帮助我们做到这一点?

3: It seems that some packages like twitter4J could help with that: http://twitter4j.org/en/code-examples.html Is there any python or R package could help us to do that?

4:我们需要这个数据做研究。这是不是一个好的选择花很长的时间数据的提取。反正有一些专业的网站购买这些数据?

4: We need this data to do a research. It is not a good choice to spend a long time for the extraction of the data. Is there anyway to buy this data from some professional website?

推荐答案

您可以使用GNIP的历史Powertrack产品要做到这一点,但是这是针对企业而不是研究人员的商业产品。

You can use Gnip's Historical Powertrack product to do this, however this is a commercial product aimed at enterprises rather than researchers.

刮Twitter网站是违反服务条款和开发政策。

Scraping the Twitter website is against the Terms of Service and Developer Policy.

公共搜索API仅支持数据的7-9天,因此,即使使用Twitter或tweepy(R和Python期权)将不会使您能够检索您试图访问期间的数据。

The public search API only supports 7-9 days of data, so even using twitteR or tweepy (R and Python options) would not enable you to retrieve data from the period you are trying to access.

这篇关于如何提取Twitter的API历史的鸣叫?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆