获取来自Twitter的历史数据 [英] Getting historical data from Twitter

查看:2013
本文介绍了获取来自Twitter的历史数据的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

有关的研究项目,我想获得最后3个月的价值Twitter消息。技术挑战不谈,这可能吗?通过使用某种慢轮询机制,以保持在海湾的速度限制器?

For a research project I would like to get the last 3 months worth of Twitter messages. Technical challenges aside, is this possible? by using some sort of slow polling mechanism to keep the rate limiter at bay?

Twitter的API状态客户可以通过页面请求高达3,200状态和计数时间表REST API参数这些是每小时?每天?或者......永远?

The Twitter API states "Clients may request up to 3,200 statuses via the page and count parameters for timeline REST API" Are these per hour? Per day? or...ever?

有什么建议?难道甚至是理论上的可能?难道有人在​​做之前,类似的事情?

Any suggestions? Would it even be theoretically possible? Did some one do something similar before?

谢谢!
马尔科

Thanks! Marco

推荐答案

微博出了名的不作超过三周旧的可用的tweet。在某些情况下,你只能得到一个星期。你就要去存储鸣叫未来三个月更好。很多正确地怀疑他们甚至Twitter的坚持。

Twitter notoriously does not make "available" tweets older than three weeks. In some cases you can only get one week. You're better off storing tweets for the next three months. Many rightly doubt if they're even persisted by Twitter.

您是否在寻找随便什么鸣叫?如果是这样,检查出的流API的状态/样品方法。流API使用持久性HTTP插口,可以是一个痛苦的程序,但是当你得到它的工作这是相当优美。我建议设立一个小脚本转储从状态/样品鸣叫到一个数据库。之后几天,你应该有数据吨。

Are you looking for just any tweets? If so, check out the Streaming API's status/sample method. The streaming API uses persistent HTTP sockets that can be a pain to program, but it's quite graceful when you get it working. I'd recommend setting up a little script to dump tweets from status/sample into a DB. You should have a TON of data after just a few days.

这篇关于获取来自Twitter的历史数据的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆