
How does sklearn.cluster.KMeans handle an init ndarray parameter with missing centroids (available centroids less than n_clusters)?

Original post: 2015-05-11 13:46:29 9 1 python/ scikit-learn/ k-means

In Python's sklearn KMeans (see documentation), I was wondering what happens internally when an ndarray of shape (n, n_features) is passed to the init parameter and n < n_clusters:

Does it drop the given centroids and just start a kmeans++ initialization, which is the default choice for the init parameter? (PDF paper kmeans++) (How does Kmeans++ work)
Does it keep the given centroids and fill in the remaining ones using kmeans++?
Does it keep the given centroids and fill in the remaining ones with random values?

I did not expect this method to run without a warning in this case, which is why I need to know how it handles it.

1 answer

If you give it a mismatching init, it will adjust the number of clusters, as you can see from the source. This is not documented, and I would consider it a bug. I'll propose a fix.
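Until the behaviour is settled, one way to avoid relying on it is to fill in the missing centroids yourself, so that the array passed to init always has n_clusters rows. The following is only a minimal sketch of the second option from the question, under some assumptions: `complete_init` is a hypothetical helper, not part of scikit-learn, and it uses plain k-means++-style D² sampling (each new centroid is drawn from the data with probability proportional to its squared distance to the nearest centroid already chosen), which is not necessarily identical to scikit-learn's own seeding.

```python
import random


def complete_init(X, partial_init, n_clusters, seed=0):
    """Pad partial_init up to n_clusters rows using D^2 sampling.

    Hypothetical helper (an assumption for illustration, not a
    scikit-learn function). X and partial_init are sequences of
    numeric rows that all have the same length.
    """
    rng = random.Random(seed)
    centers = [list(c) for c in partial_init]
    if len(centers) > n_clusters:
        raise ValueError("more initial centroids than n_clusters")
    if not centers and n_clusters > 0:
        # Nothing was supplied: start from one uniformly chosen point.
        centers.append(list(rng.choice(X)))
    while len(centers) < n_clusters:
        # Squared distance from each point to its nearest chosen centroid.
        d2 = [
            min(sum((a - b) ** 2 for a, b in zip(x, c)) for c in centers)
            for x in X
        ]
        if sum(d2) == 0:
            raise ValueError("not enough distinct points to fill the centroids")
        # Draw the next centroid with probability proportional to d2.
        centers.append(list(rng.choices(X, weights=d2, k=1)[0]))
    return centers
```

The returned list has exactly n_clusters rows with the supplied centroids first, so it can be passed as the init argument (for example `KMeans(n_clusters=3, init=full_init, n_init=1)`, where `full_init` is the helper's return value) without depending on how a mismatching shape is handled.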

how can I pass some nodes as init in sklearn.kmeans(n_clusters,init,....) in python

Number of distinct clusters in KMeans is less than n_clusters?

How to view cluster centroids for each iteration of n_init using skleans' KMeans

Define k-1 cluster centroids -- SKlearn KMeans

ImportError: cannot import name '_init_centroids' from 'sklearn.cluster._kmeans

sklearn.cluster.KMeans got "TypeError: __init__() got an unexpected keyword argument 'n_jobs'"

how to graph centroids with KMeans

How to extract and map cluster indices from sklearn.cluster.KMeans?

Could not link labels with centroids in Kmeans,Sklearn

How to plot centroids on image after kmeans clustering?


The technical posts on this site follow the CC BY-SA 4.0 license. If you reprint, please indicate the site URL or the original address. For any questions, please contact: yoyou2525@163.com.


solution1 1 ACCEPTED 2015-05-11 22:09:57