MiniBatchKMeans gives different centroids after subsequent iterations

Problem description

I am using the MiniBatchKMeans model from the sklearn.cluster module in Anaconda. I am clustering a dataset that contains approximately 75,000 points. It looks something like this:

data = np.array([8,3,1,17,5,21,1,7,1,26,323,16,2334,4,2,67,30,2936,2,16,12,28,1,4,190...])

I fit the data using the process below.

from sklearn.cluster import MiniBatchKMeans

kmeans = MiniBatchKMeans(batch_size=100)
kmeans.fit(data.reshape(-1, 1))

This is all well and good, and I proceed to find the centroids of the data:

centroids = kmeans.cluster_centers_
print(centroids)

This gives me the following output:

array([[ 13.09716569], [ 2908.30379747], [ 46.05089228], [ 725.83453237], [ 95.39868475], [ 1508.38356164], [ 175.48099948], [ 350.76287263]])

But when I run the process again, using the same data, I get different values for the centroids, such as:

array([[ 29.63143489], [ 1766.7244898 ], [ 171.04417206], [ 2873.70454545], [ 70.05295277], [ 1074.50387597], [ 501.36134454], [ 8.30600975]])

Can anyone explain why this is?

Recommended answer

Read up on what mini-batch k-means is.

It will never even converge. Do one more iteration and the result will change again.
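
You can check that the run-to-run variation comes from random sampling and initialization rather than from your data: fixing the seed makes runs repeatable (a minimal sketch; random_state is scikit-learn's standard seeding parameter, not something from the original post):

from sklearn.cluster import MiniBatchKMeans

# With a fixed seed, the random batch sampling and initialization
# are deterministic, so both runs produce identical centroids.
# The result is still just one of many possible local solutions.
run_a = MiniBatchKMeans(batch_size=100, random_state=0).fit(data.reshape(-1, 1))
run_b = MiniBatchKMeans(batch_size=100, random_state=0).fit(data.reshape(-1, 1))
print(run_a.cluster_centers_)
print(run_b.cluster_centers_)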

It is designed for data sets so huge you cannot load them into memory at once. So you load one batch, pretend it is the full data set, and do one iteration. Repeat with the next batch. If your batches are large enough and random, the result will be "close enough" to be usable, though it is never optimal.
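
That streaming pattern looks roughly like this (a sketch under assumptions: partial_fit is scikit-learn's incremental-fitting API, and np.array_split stands in for reading batches of a too-large dataset from disk):

import numpy as np
from sklearn.cluster import MiniBatchKMeans

kmeans = MiniBatchKMeans(n_clusters=8, random_state=0)

# Treat each slice as one batch that fits in memory; each
# partial_fit call performs a single mini-batch update step.
for batch in np.array_split(data.reshape(-1, 1), 750):
    kmeans.partial_fit(batch)

print(kmeans.cluster_centers_)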

So:

  1. The mini-batch results are even more random than regular k-means results; they change every iteration.
  2. If you can load your data into memory, don't use mini-batch. Instead use a fast k-means implementation (most are surprisingly slow); see the sketch after this list.
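
For the second point, a minimal in-memory version (a sketch; KMeans is scikit-learn's standard full-batch implementation, and n_clusters=8 is assumed to match the eight centroids shown above):

from sklearn.cluster import KMeans

# Full-batch k-means: every iteration sees all ~75,000 points,
# and a fixed random_state makes the result reproducible.
kmeans = KMeans(n_clusters=8, n_init=10, random_state=0)
kmeans.fit(data.reshape(-1, 1))
print(kmeans.cluster_centers_)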

P.S. On one-dimensional data, sort your data set and then use an algorithm that benefits from the sorting, instead of k-means.
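
For instance, on sorted 1-D data every optimal cluster is a contiguous segment, so exact k-means can be solved by dynamic programming over cut points. A sketch of that idea (not from the original answer; the O(k*n^2) loops are illustrative, and real implementations use further speedups):

import numpy as np

def kmeans_1d(x, k):
    # Exact 1-D k-means: on sorted data the optimal clusters are
    # contiguous segments, so choose k-1 cut points by dynamic
    # programming. O(k * n^2) as written.
    x = np.sort(np.asarray(x, dtype=float))
    n = len(x)
    s = np.concatenate(([0.0], np.cumsum(x)))       # prefix sums
    s2 = np.concatenate(([0.0], np.cumsum(x * x)))  # prefix sums of squares

    def seg_cost(i, j):
        # sum of squared deviations of x[i:j] from its segment mean
        seg = s[j] - s[i]
        return (s2[j] - s2[i]) - seg * seg / (j - i)

    INF = float("inf")
    cost = [[INF] * (n + 1) for _ in range(k + 1)]
    cut = [[0] * (n + 1) for _ in range(k + 1)]
    cost[0][0] = 0.0
    for c in range(1, k + 1):
        for j in range(c, n + 1):
            for i in range(c - 1, j):
                v = cost[c - 1][i] + seg_cost(i, j)
                if v < cost[c][j]:
                    cost[c][j] = v
                    cut[c][j] = i

    # trace the cut points back to recover each segment's mean
    centroids, j = [], n
    for c in range(k, 0, -1):
        i = cut[c][j]
        centroids.append(x[i:j].mean())
        j = i
    return sorted(centroids)

Unlike MiniBatchKMeans, kmeans_1d(data, 8) returns the same, globally optimal centroids on every run.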
