将 20[TB] 数据存储为向量/数组的 NoSql 解决方案? [英] NoSql solution to store 20[TB] of data, as vector/array?

查看：55 发布时间：2021/6/8 19:03:49 python nosql

本文介绍了将 20[TB] 数据存储为向量/数组的 NoSql 解决方案?的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我需要建立一个系统来有效地存储 &维护大量 (20 [TB]) 数据(并能够以向量"形式访问它).这是我的尺寸:

I need to build a system to efficiently store & maintain a huge amount (20 [TB]) of data (and be able to access it in 'vector' form). Here are my dimensions:

(1) 时间(以 YYYYMMDDHHMMSS 形式的整数给出)

(2) 字段(任意给定长度的字符串，代表医院名称)

(3) instrumentID(一个整数，表示工具的唯一ID)

我需要一种能够单独存储数据的方法，例如:

I will need a way to be able to store data individually, meaning, something like:

STORE 23789.46 作为instrumentID = 5 on field = 'Nhsdg' on time = 20040713113500

但是，我需要以下查询来运行FAST:给我时间戳Y"上字段X"的所有工具.

Yet, I would need the following query to to run FAST: give me all instruments for field 'X' on timestamp 'Y'.

为了构建这些系统，我得到了 60 台双核机器(每台机器有 1GB 内存，1.5TB 磁盘)

In order to build these systems, I am given 60 duo-core machines (each with 1GB of RAM, 1.5TB disk)

对合适的 NoSQL 解决方案有什么建议(最好与 Python 一起使用)?

Any recommendation on a suitable NoSQL soltuion (that would ideally work with python)?

注意:系统将首先存储历史数据(大约 20[TB]).每天我最多只会添加大约 200[MB].我只需要一个可以扩展的解决方案.我的用例只是一个简单的查询:在时间戳Y"上给我字段X"的所有工具

NOTE: the system will first store historical data (which is roughly 20[TB]). Every day I will add just about 200[MB] at most. I just need a solution that would scale and scale. my use case would be just a simple query: give me all instruments for field 'X' on timestamp 'Y'

将 20[TB] 数据存储为向量/数组的 NoSql 解决方案? [英] NoSql solution to store 20[TB] of data, as vector/array?

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录关闭

将 20[TB] 数据存储为向量/数组的 NoSql 解决方案? [英] NoSql solution to store 20[TB] of data, as vector/array?

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录 关闭

登录关闭