如何设置Spark Kmeans初始中心 [英] how to set Spark Kmeans initial centers

查看：329 发布时间：2020/4/26 10:24:45 apache-spark machine-learning cluster-analysis k-means apache-spark-mllib

本文介绍了如何设置Spark Kmeans初始中心的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我正在使用Spark ML运行Kmeans.我有大量数据和三个现有中心，例如，三个中心是:[1.0,1.0,1.0],[5.0,5.0,5.0],[9.0,9.0,9.0]. 因此，我如何指示Kmeans中心是上述三个向量. 我看到Kmean对象具有种子参数，但是种子参数是长类型而不是数组.因此，如何告诉Spark Kmeans仅使用现有的中心进行聚类.

I'm using Spark ML for run Kmeans. I have bunch of data and three existing centers, for example the three centers are:[1.0,1.0,1.0],[5.0,5.0,5.0],[9.0,9.0,9.0]. So how can I indicate the Kmeans centers are the above three vectors. I saw Kmean object has seed parameter, but the seed parameter is an long type not an array. So how can I tell Spark Kmeans to only use the existing centers for clustering.

或者说，我不明白种子在Spark Kmeans中的含义，我认为种子应该是一组向量，它们代表在进行聚类之前指定的中心.

Or say, I didn't understand what does seed mean in Spark Kmeans, I suppose the seeds should be an array of vectors which represents the specified centers before running clustering.

如何设置Spark Kmeans初始中心 [英] how to set Spark Kmeans initial centers

问题描述

推荐答案

相关文章

AI人工智能最新文章

热门教程

热门工具

登录关闭

如何设置Spark Kmeans初始中心 [英] how to set Spark Kmeans initial centers

问题描述

推荐答案

相关文章

AI人工智能最新文章

热门教程

热门工具

登录 关闭

登录关闭