How to set Spark KMeans initial centers

Published: 2021/11/14 21:02:53 · Tags: apache-spark, machine-learning, cluster-analysis, k-means, apache-spark-mllib


Problem description

I'm using Spark ML to run KMeans. I have a set of data and three existing centers, for example [1.0,1.0,1.0], [5.0,5.0,5.0], and [9.0,9.0,9.0]. How can I specify that the KMeans centers are these three vectors? I saw that the KMeans object has a seed parameter, but it is of type Long, not an array. How can I tell Spark KMeans to use only the existing centers for clustering?

Put another way, I don't understand what the seed means in Spark KMeans. I assumed the seed would be an array of vectors representing the specified centers before clustering runs.
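To illustrate the distinction the question is circling, here is a minimal pure-Python sketch, not Spark's implementation. A seed is a single integer that fixes the random choice of starting centers, while explicit initial centers are a separate input to the algorithm. The names `assign`, `lloyd`, `random_init`, and `spark_kmeans_from_centers` are hypothetical helpers introduced for this example. The last helper assumes PySpark is installed and relies on the RDD-based `pyspark.mllib.clustering.KMeans.train`, which accepts an `initialModel` argument; as far as I know, the DataFrame-based `ml` estimator exposes only the seed for random initialization.

```python
import random


def assign(points, centers):
    """Return, for each point, the index of its nearest center (squared Euclidean)."""
    def dist2(p, c):
        return sum((a - b) ** 2 for a, b in zip(p, c))
    return [min(range(len(centers)), key=lambda j: dist2(p, centers[j]))
            for p in points]


def lloyd(points, centers, max_iter=20):
    """Plain k-means (Lloyd) iterations that start from the given centers."""
    centers = [list(c) for c in centers]
    for _ in range(max_iter):
        labels = assign(points, centers)
        new_centers = []
        for j, old in enumerate(centers):
            members = [p for p, label in zip(points, labels) if label == j]
            if members:
                # New center is the coordinate-wise mean of the cluster members.
                new_centers.append([sum(col) / len(members) for col in zip(*members)])
            else:
                # Keep the center of an empty cluster unchanged.
                new_centers.append(old)
        if new_centers == centers:
            break
        centers = new_centers
    return centers


def random_init(points, k, seed):
    """A seed only makes the random draw of starting centers reproducible."""
    return random.Random(seed).sample(points, k)


def spark_kmeans_from_centers(rdd, centers, max_iterations=20):
    """Hypothetical wrapper: start Spark's RDD-based KMeans from known centers.

    Assumes pyspark is installed and `rdd` is an RDD of numeric vectors.
    """
    from pyspark.mllib.clustering import KMeans, KMeansModel
    return KMeans.train(rdd, k=len(centers), maxIterations=max_iterations,
                        initialModel=KMeansModel(centers))
```

With the three centers from the question, `lloyd(points, [[1.0, 1.0, 1.0], [5.0, 5.0, 5.0], [9.0, 9.0, 9.0]])` refines those centers against the data. If the goal is only to assign points to the existing centers without moving them, `assign(points, centers)` alone covers that case.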
