solr multicore vs sharding vs 1 big collection

Question

I currently have a single collection with 40 million documents and an index size of 25 GB. The collection gets updated every n minutes, and as a result the number of deleted documents is constantly growing. The data in the collection is an amalgamation of more than 1000 customers' records. The number of documents per customer is around 100,000 records on average.

That being said, I'm trying to get a handle on the growing number of deleted documents. Because of the growing index size, both disk space and memory are being used up, and I would like to reduce the index to a manageable size.
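Independently of how the data ends up being split, space held by deleted documents can be reclaimed through the update handler, either with a commit that expunges deletes or with a full optimize. A minimal sketch below builds the two request URLs, assuming a hypothetical Solr instance at `localhost:8983` and a collection named `mycollection`:

```shell
# Sketch with hypothetical host/collection names; run the URLs with curl.
SOLR="http://localhost:8983/solr"
COLLECTION="mycollection"

# Commit that also merges away segments dominated by deleted documents:
EXPUNGE_URL="${SOLR}/${COLLECTION}/update?commit=true&expungeDeletes=true"

# Full optimize: rewrites the whole index (expensive in I/O, reclaims all space):
OPTIMIZE_URL="${SOLR}/${COLLECTION}/update?optimize=true&maxSegments=1"

echo "$EXPUNGE_URL"
echo "$OPTIMIZE_URL"
```

In practice you would issue these with `curl "$EXPUNGE_URL"`. The expungeDeletes variant is cheaper than a full optimize because it only merges segments that consist mostly of deletes, which fits an index that is updated every few minutes.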

I have been thinking of splitting the data into multiple cores, one for each customer. This would allow me to manage the smaller collections easily, and creating/updating a collection would also be fast. My concern is that the number of collections might become an issue. Any suggestions on how to address this problem?
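For reference, creating one core per customer can be scripted against the CoreAdmin API. A minimal sketch with a hypothetical host and customer list follows; on Solr 4.x each core needs its own instanceDir containing a conf/ directory, typically copied from a shared template:

```shell
# Sketch (hypothetical names): build a CoreAdmin CREATE call per customer.
SOLR="http://localhost:8983/solr"
CUSTOMERS="acme globex initech"

for CUSTOMER in $CUSTOMERS; do
  CORE="customer_${CUSTOMER}"
  # Each core gets its own instanceDir; its conf/ is copied from a template.
  echo "${SOLR}/admin/cores?action=CREATE&name=${CORE}&instanceDir=${CORE}"
done
```

Each printed URL would be issued with curl as part of customer onboarding; the core name doubles as the routing key for that customer's queries and updates.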

Solr: 4.9
Index size: 25 GB
Max doc: 40 million
Doc count: 29 million

Thanks

Answer

I had a similar issue, with multiple customers and a large amount of indexed data.

I implemented it with version 3.4 by creating a separate core per customer.

That is, one core per customer. Creating a core is essentially creating a separate index, splitting the data much as we do in the case of sharding.

Here you are splitting the large indexed data into smaller, separate segments.

Whatever search happens will be carried out within the smaller indexed segment, so the response time will be faster.

I have almost 700 cores created as of now, and it's running fine for me.

As of now I have not faced any issues with managing the cores.

I would suggest going with a combination of cores and sharding.

It will help you achieve the following:

It allows a different configuration for each core, with different behavior, without any impact on the other cores.

You can perform actions like update, load, etc. on each core independently.
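As an illustration of that independence (hypothetical host and core names), updates and reloads are addressed to a single core, so acting on one customer's core leaves all the others untouched:

```shell
# Sketch: per-core operations with hypothetical names.
SOLR="http://localhost:8983/solr"

# Commit an update for only this customer's core:
UPDATE_URL="${SOLR}/customer_acme/update?commit=true"

# Reload only this core after a config change; other cores keep serving:
RELOAD_URL="${SOLR}/admin/cores?action=RELOAD&core=customer_acme"

echo "$UPDATE_URL"
echo "$RELOAD_URL"
```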
