Cassandra and unstructured data

Question

I'm relatively new to NoSQL, but I've done a fair bit of toying with relational databases.

We are evaluating Cassandra for use in an environment where our data model might need to evolve fairly aggressively. I've seen it written in multiple places that Cassandra can store "structured, semi-structured and unstructured" data.

I understand the structured claim. It's obvious: a table has defined columns.

I think I understand the semi-structured claim. A row does not need to populate all columns.

But I'm not clear on the unstructured claim. Certainly you could store everything as a key-value blob but you'd have no means of searching by value (efficiently).

I've failed to find any resource on the net that describes best practices using unstructured data with Cassandra. Ideally, for our application semi-structured data would be sufficient; but I want to understand the unstructured claim in the event that it can add value for us.

Thanks.

Recommended answer

Cassandra is, at best, searchable for semi-structured data, and even then only through clustering keys and secondary indexes. Clustering keys are definitely an efficient way to search semi-structured data.
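
To illustrate why clustering keys make for efficient lookups, here is a minimal in-memory sketch in Python. It is not Cassandra code: ToyTable and its methods are hypothetical names, and the model only assumes what the answer relies on, namely that rows inside a partition are kept sorted by clustering key, so a range query becomes a seek into a sorted run. Rows store only the columns they were given, which is the semi-structured case from the question.

```python
from bisect import bisect_left, bisect_right
from collections import defaultdict


class ToyTable:
    """Hypothetical stand-in for one table: rows grouped by partition key
    and kept sorted by clustering key within each partition."""

    def __init__(self):
        self._keys = defaultdict(list)  # partition key -> sorted clustering keys
        self._rows = defaultdict(list)  # partition key -> rows, in the same order

    def insert(self, partition_key, clustering_key, **columns):
        keys = self._keys[partition_key]
        rows = self._rows[partition_key]
        i = bisect_left(keys, clustering_key)
        if i < len(keys) and keys[i] == clustering_key:
            # Same primary key: overwrite only the columns supplied (an upsert).
            rows[i] = {**rows[i], **columns}
        else:
            keys.insert(i, clustering_key)
            rows.insert(i, dict(columns))  # a row holds only the columns it was given

    def range_query(self, partition_key, start, end):
        """Return rows with start <= clustering key <= end in one partition."""
        keys = self._keys.get(partition_key, [])
        rows = self._rows.get(partition_key, [])
        lo = bisect_left(keys, start)   # binary search, no full scan
        hi = bisect_right(keys, end)
        return list(zip(keys[lo:hi], rows[lo:hi]))


table = ToyTable()
table.insert("sensor-1", 10, temperature=21.5)
table.insert("sensor-1", 20, temperature=22.0, humidity=40)
table.insert("sensor-1", 30, note="free text")
table.insert("sensor-2", 15, temperature=19.0)

result = table.range_query("sensor-1", 15, 30)
print(result)
```

The query touches a single partition and reads a contiguous slice of it. Searching by an arbitrary column value, such as the text in note, would require visiting every row in this model.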

Searching secondary-indexed data without specifying the partition key is not efficient. A few solutions help here, namely DSE Search (Solr with Cassandra) and Stargate. Both may also help when one of the columns is unstructured text.
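
The cost of an index query that omits the partition key can be sketched with a toy model, again in plain Python rather than real Cassandra code. ToyCluster, ToyNode, and the color column are hypothetical, and the CRC-based placement is only a stand-in for Cassandra's partitioner. The one assumption carried over from Cassandra is that each node indexes only the rows it stores, so a query without a partition key has to consult every node.

```python
import zlib
from collections import defaultdict


class ToyNode:
    def __init__(self):
        self.rows = {}                        # (partition key, clustering key) -> columns
        self.local_index = defaultdict(set)   # indexed value -> primary keys on this node


class ToyCluster:
    """Hypothetical cluster in which every node indexes only its own rows."""

    def __init__(self, node_count):
        self.nodes = [ToyNode() for _ in range(node_count)]

    def _owner(self, partition_key):
        # Deterministic stand-in for token-based placement.
        return zlib.crc32(partition_key.encode()) % len(self.nodes)

    def insert(self, partition_key, clustering_key, color):
        node = self.nodes[self._owner(partition_key)]
        node.rows[(partition_key, clustering_key)] = {"color": color}
        node.local_index[color].add((partition_key, clustering_key))

    def find_by_color(self, color, partition_key=None):
        """Return (matching primary keys, number of nodes contacted)."""
        if partition_key is not None:
            targets = [self.nodes[self._owner(partition_key)]]  # one node
        else:
            targets = self.nodes                                # every node
        matches = []
        for node in targets:
            for key in node.local_index.get(color, ()):
                if partition_key is None or key[0] == partition_key:
                    matches.append(key)
        return sorted(matches), len(targets)


cluster = ToyCluster(node_count=4)
cluster.insert("user-a", 1, "red")
cluster.insert("user-a", 2, "blue")
cluster.insert("user-b", 1, "red")
cluster.insert("user-c", 1, "green")

scoped, scoped_nodes = cluster.find_by_color("red", partition_key="user-a")
unscoped, unscoped_nodes = cluster.find_by_color("red")
print(scoped, scoped_nodes)
print(unscoped, unscoped_nodes)
```

With the partition key, the lookup goes to one node. Without it, the number of nodes contacted grows with the size of the cluster, regardless of how few rows match.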

Otherwise, it isn't a great idea to store unstructured data in Cassandra, as it may not be searchable without a key.
