一次在多台服务器上清理Cassandra [英] Cassandra cleanup on several servers at once

查看:312
本文介绍了一次在多台服务器上清理Cassandra的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我们有一个18台服务器的大型Cassandra集群(在一台服务器上接近5T数据)

We have a big Cassandra cluster 18 Servers (on one server near 5T data )

http://docs.datastax.com/en/cassandra/2.0/cassandra/operations/ops_add_node_to_cluster_t.html -我们在此文档之后添加了一个新节点.

http://docs.datastax.com/en/cassandra/2.0/cassandra/operations/ops_add_node_to_cluster_t.html - We have added a new nodes following this documentation .

添加新服务器后,我们开始清理数据(nodetool清理)

After we have added new servers, we began the process of cleaning data (nodetool cleanup)

在文档建议中:在所有新节点运行之后,在每个先前存在的节点上运行nodetool cleanup以删除不再属于那些节点的密钥.等待清理在一个节点上完成,然后再进行下一个操作

In the documentation advise: After all new nodes are running, run nodetool cleanup on each of the previously existing nodes to remove the keys no longer belonging to those nodes. Wait for cleanup to complete on one node before doing the next)

但是在我们的情况下,清理一台服务器大约需要2至3天. 我的问题是我可以一次在2台或3台...

But cleanup for one server takes near 2 - 3 days in our case. My question is can I start cleaning at once on multiple servers, 2 or 3 ...

还是可能导致数据丢失?

Or it may lead to data loss ?

更多信息.

我们将cassandra 2.0.13与vnodes一起使用.另外,我们将文件保存在cassandra中.

We use cassandra 2.0.13 with vnodes . Also We keep files in blons in cassandra .

复制因子= 3

推荐答案

Cleanup不涉及任何其他节点,因此可以安全地并行运行.但是,由于清理可能会使用大量磁盘I/O,因此您可能希望一次运行一次以减少对性能的影响.

Cleanup doesn't involve any other nodes so it is safe to run in parallel. However, you may want to run on one at once to reduce the performance impact since cleanup may use lots of disk I/O.

这篇关于一次在多台服务器上清理Cassandra的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆