简体繁体中英

How reliable is the Elbow curve in finding K in K-Means?

原文 2018-09-26 09:41:01 3 1 python/ r/ cluster-analysis/ k-means/ word2vec

So I was trying to use the Elbow curve to find the value of optimum 'K' (number of clusters) in K-Means clustering.

The clustering was done for the average vectors (using Word2Vec) of a text column in my dataset (1467 rows). But looking at my text data, I can clearly find more than 3 groups the data can be grouped into.

I read the reasoning is to have a small value of k while keeping the Sum of Squared Errors (SSE) low. Can somebody tell me how reliable the Elbow Curve is? Also if there's something I'm missing.

Attaching the Elbow curve for reference. I also tried plotting it up to 70 clusters, exploratory. .

1 answers

The "Elbow" is not even well defined. So how can it be reliable?

You can "normalize" the values by the expected dropoff from splitting the data into k clusters and it will become a bit more readable. Unfortunately, I forgot the exact name of that.Calinski and Harabasz (1974) variance ratio criterion? If I recall the name correctly, that is essentially a rescaled version that makes much more sense.

Calculating optimal K value in K-means clustering with elbow curve

Scikit Learn - K-Means - Elbow - criterion

K-Means not resulting in elbow shape

K-Means for topic modelling - Elbow method

Elbow Method for K-Means in python

Finding Accuracy for this K-Means model

Finding mean of centers in k-means

K-means performance

Centroids in K-Means

Sequential k-means

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question Calculating optimal K value in K-means clustering with elbow curve Scikit Learn - K-Means - Elbow - criterion K-Means not resulting in elbow shape K-Means for topic modelling - Elbow method Elbow Method for K-Means in python Finding Accuracy for this K-Means model Finding mean of centers in k-means K-means performance Centroids in K-Means Sequential k-means

Related Tags

How reliable is the Elbow curve in finding K in K-Means?

Question

1 answers

solution1 1 ACCPTED 2018-09-27 06:03:46

solution1
1 ACCPTED 2018-09-27 06:03:46