什么是存储时间序列数据的最佳开源解决方案? [英] What is the best open source solution for storing time series data?

查看:236
本文介绍了什么是存储时间序列数据的最佳开源解决方案?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我有兴趣监控一些对象。我希望每15分钟得到大约10000个数据点。 (也许不是一开始,但这是'一般ballpark')。我也想获得每日,每周,每月和每年的统计数据。将数据保持在最高分辨率(15分钟)超过两个月并不重要。

I am interested in monitoring some objects. I expect to get about 10000 data points every 15 minutes. (Maybe not at first, but this is the 'general ballpark'). I would also like to be able to get daily, weekly, monthly and yearly statistics. It is not critical to keep the data in the highest resolution (15 minutes) for more than two months.

我正在考虑各种方法来存储此数据,查看经典关系数据库,或在无模式数据库(如SimpleDB)。

I am considering various ways to store this data, and have been looking at a classic relational database, or at a schemaless database (such as SimpleDB).

我的问题是,最好的办法是什么?我非常喜欢开源(和免费)解决方案,专有的昂贵的。

My question is, what is the best way to go along doing this? I would very much prefer an open-source (and free) solution to a proprietary costly one.

小注:我用Python编写此应用程序。

Small note: I am writing this application in Python.

推荐答案

HDF5 ,可以通过 h5py PyTables ,用于处理非常大的数据集。两个接口工作良好。例如,h5py和PyTables都有自动压缩,并支持Numpy。

HDF5, which can be accessed through h5py or PyTables, is designed for dealing with very large data sets. Both interfaces work well. For example, both h5py and PyTables have automatic compression and supports Numpy.

这篇关于什么是存储时间序列数据的最佳开源解决方案?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆