sklearn 中可重现的 kmeans

Question

I am using the document clustering code available here .我正在使用此处提供的文档聚类代码。 I know that k-means is solving a non-convex problem and hence the results of optimization will differ every time I run it, but is there a way to make the clustering reproducible (maybe by fixing some random seed)?我知道 k-means 正在解决一个非凸问题，因此每次运行时优化的结果都会有所不同，但是有没有办法使聚类可重现（也许通过修复一些随机种子）？

Answer 1

You can fix the random_state parameter of K-means .您可以修复K-means的random_state参数。 In the following code I use 42:在下面的代码中，我使用了 42：

km = KMeans(n_clusters=true_k, init='k-means++', max_iter=100, n_init=1, 
                               verbose=opts.verbose,
                               random_state = 42)

You can check the documentation here .您可以在此处查看文档。

sklearn 中可重现的 kmeans

问题描述

1 个解决方案

解决方案1
2 已采纳 2016-04-04 17:33:10

sklearn 中可重现的 kmeans

问题描述

1 个解决方案

解决方案1 2 已采纳 2016-04-04 17:33:10

解决方案1
2 已采纳 2016-04-04 17:33:10