What exactly is the initializationSteps parameter in k-means++ in Spark MLlib?

Published: 2021/11/14 23:26:24. Tags: apache-spark pyspark apache-spark-sql apache-spark-mllib

I know what k-means is, and I also understand the k-means++ algorithm. As far as I can tell, the only difference between them is how the initial k centers are found.

In the ++ version, we first choose one center at random, and then choose the remaining k-1 centers one at a time using a probability distribution that favors points far from the centers already chosen.
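The seeding step described above can be sketched in plain Python. This is only a minimal illustration, assuming the usual k-means++ rule of sampling each new center with probability proportional to its squared distance from the nearest center already chosen. It is not Spark's implementation, and the names `squared_distance` and `kmeans_pp_seed` are made up for this example.

```python
import random


def squared_distance(p, q):
    """Squared Euclidean distance between two equal-length tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def kmeans_pp_seed(points, k, seed=None):
    """Pick k initial centers from `points` with k-means++ style seeding.

    Hypothetical helper for illustration only; not part of Spark MLlib.
    """
    if not 1 <= k <= len(points):
        raise ValueError("k must be between 1 and len(points)")
    rng = random.Random(seed)

    # Step 1: the first center is chosen uniformly at random.
    centers = [rng.choice(points)]

    # d2[i] holds the squared distance from points[i] to its nearest center.
    d2 = [squared_distance(p, centers[0]) for p in points]

    # Step 2: pick the remaining k-1 centers, one per iteration.
    while len(centers) < k:
        if sum(d2) == 0:
            # Every point coincides with a chosen center; fall back to uniform.
            nxt = rng.choice(points)
        else:
            # Sample proportionally to squared distance (far points are likelier).
            nxt = rng.choices(points, weights=d2, k=1)[0]
        centers.append(nxt)
        # Refresh nearest-center distances to account for the new center.
        d2 = [min(old, squared_distance(p, nxt)) for old, p in zip(d2, points)]
    return centers


if __name__ == "__main__":
    data = [(0.0, 0.0), (0.0, 1.0), (10.0, 10.0), (10.0, 11.0), (20.0, 0.0)]
    print(kmeans_pp_seed(data, k=3, seed=0))
```

Note that this sketch adds exactly one center per pass over the data, so it needs k-1 sequential passes after the first pick.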

In MLlib's k-means implementation, what is the initializationSteps parameter?