Cassandra performance for long rows

Question

I'm looking at implementing a CF in Cassandra that has very long rows (hundreds of thousands to millions of columns per row).

Using entirely dummy data, I've inserted 2 million columns into a single row (evenly spaced). If I do a slice operation to get 20 columns, I notice a massive performance degradation as the slice moves further down the row.
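
For reference, a minimal sketch of the kind of slice read I'm timing, using the Hector client (the query.setRange call quoted later is Hector's SliceQuery API); the cluster, keyspace, column family, and row key names below are just placeholders, not my real schema:

import me.prettyprint.cassandra.serializers.LongSerializer;
import me.prettyprint.cassandra.serializers.StringSerializer;
import me.prettyprint.hector.api.Cluster;
import me.prettyprint.hector.api.Keyspace;
import me.prettyprint.hector.api.beans.ColumnSlice;
import me.prettyprint.hector.api.factory.HFactory;
import me.prettyprint.hector.api.query.QueryResult;
import me.prettyprint.hector.api.query.SliceQuery;

public class LongRowSliceTest {
    public static void main(String[] args) {
        // Placeholder names -- substitute your own cluster/keyspace/CF.
        Cluster cluster = HFactory.getOrCreateCluster("TestCluster", "localhost:9160");
        Keyspace keyspace = HFactory.createKeyspace("Keyspace1", cluster);

        // Fetch a slice of 20 columns starting at column name 1,800,000
        // (columns are named by evenly spaced Long values in this test).
        SliceQuery<String, Long, String> query = HFactory.createSliceQuery(
                keyspace, StringSerializer.get(), LongSerializer.get(), StringSerializer.get());
        query.setColumnFamily("LongRows");
        query.setKey("row1");
        query.setRange(1800000L, null, false, 20); // start, finish, reversed, count

        long t0 = System.nanoTime();
        QueryResult<ColumnSlice<Long, String>> result = query.execute();
        long elapsedMs = (System.nanoTime() - t0) / 1000000L;
        System.out.println(result.get().getColumns().size() + " columns in " + elapsedMs + "ms");
    }
}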

For most of the columns, I seem to be able to serve up slice results in 10-40ms, but as you get towards the end of the row, performance hits a wall, with response times climbing from 43ms at the 1,800,000 mark to 214ms at 1,900,000 and 435ms at 1,999,900! (All slices are of equal width.)

I'm at a loss to explain why there is such a massive degradation in performance towards the end of the row. Can someone please provide some guidance as to what Cassandra is doing internally to cause such a delay? Row caching is turned off, and pretty much everything else is a default Cassandra 1.0 installation.

Cassandra is supposed to support up to 2 billion columns per row, but at this rate of performance degradation it can't practically be used for very long rows.

Many thanks.

Caveat: I'm hitting this with 10 requests in parallel at a time, which is why they're a bit slower than I'd otherwise expect, but it's a fair test across all requests, and even running them all serially the same strange degradation shows up between the 1,800,000th and 1,900,000th record.

I've also noticed EXTREMELY bad performance when doing reverse slices for just a single item, even with only 200,000 columns per row:

query.setRange(end, start, false, 1);

Answer

psanford's comment led me to the answer. It turns out that Cassandra < 1.1.0 (1.1.0 is currently in beta) is slow at slicing long rows that are still in memtables (i.e. not yet flushed to disk), but performs much better on the same data once it has been flushed to SSTables on disk.

See http://mail-archives.apache.org/mod_mbox/cassandra-user/201201.mbox/%3CCAA_K6YvZ=vd=Bjk6BaEg41_r1gfjFaa63uNSXQKxgeB-oq2e5A@mail.gmail.com%3E and https://issues.apache.org/jira/browse/CASSANDRA-3545

With my example, the first 1.8 million columns had been flushed to disk, so slices over that range were fast, but the last ~200,000 columns hadn't been flushed yet and were still in memtables. Since memtable slicing is slow on long rows, that's why I saw bad performance towards the end of the row (my data was inserted in column order).
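
As an aside (my own suggestion, not something from the thread): you can check how much of a column family is still sitting in memtables with nodetool's cfstats command, which reports a "Memtable Columns Count" for each column family:

nodetool -h 127.0.0.1 cfstats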

This can be worked around by manually triggering a flush on the Cassandra nodes. A patch has been applied to 1.1.0 to fix the underlying problem, and I can confirm that it resolves the issue for me.
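
For anyone who wants to force the flush by hand, the command (my phrasing; the exact invocation isn't in the original post, but this is standard nodetool usage) looks like this, with your own host, keyspace, and column family names substituted:

nodetool -h 127.0.0.1 flush MyKeyspace MyColumnFamily

Leaving off the column family name flushes every column family in the keyspace.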

I hope this helps anyone else with the same problem.
