Carrot2工作台无法处理大数据 [英] Carrot2 workbench not able to process large data

查看：86 发布时间：2020/10/3 2:22:22 xml cluster-analysis carrot2

本文介绍了Carrot2工作台无法处理大数据的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

我想使用胡萝卜2工作台对数据集进行聚类。我有一个包含 65536 文档的xml输入文件。我正在使用Lingo聚类算法。

I wanted to cluster my data-set using carrot2 workbench. I have an input xml file with 65536 documents. I am using Lingo clustering algorithm.

但是，当我开始该过程时，工作台将在几秒钟内将所有文档归入其他主题集群，并返回结果。

But when I start the process, the workbench returns the result within few seconds having all the documents in the "other topics" cluster.

我检查了具有较小数据集的聚类，并且得到了结果。

I have checked the clustering with smaller data-sets and I am getting the results.