简体繁体中英

How do I choose a linkage method for Hierarchical Agglomerative Clustering?

原文 2015-06-13 23:38:27 0 1 machine-learning/ cluster-analysis/ hierarchical-clustering

I understand that HAC has several options in terms of linkage functions. You have:

Single linkage which produces "straggly" clusters
Complete linkage which produces tight, spherical clusters
Average linkage which is sort of a compromise between the two
Ward's method, which is based more off the variance than actual distance

What I'm trying to figure out is, how do I know which one of these I want to use? Are there certain datasets where "straggly" clusters are preferable to spherical ones? Or is it more a function of what I intend to do with the clustering data?

1 answers

It depends on your data.

Single-linkage works reasonably well on clean data.

If you have dirty data, the other linkages may be better.

Ward is similar to k-means. It may be a good choice if you want to talk about centroids and data partitioned completely into disjoint subsets.

The other problem is that only SLINK (for single-linkabe) is fast. All the others usually work in O(n^3) so they are not usable on large data sets. Compare this to eg DBSCAN which runs in O(n log n) if done well, or kmeans in O(n)...

Agglomerative hierarchical clustering technique

OpenCV machine learning library for agglomerative hierarchical clustering

When to stop agglomerative hierarchical clustering - stopping criteria

How do I plug distance data into scipy's agglomerative clustering methods?

choose cluster in hierarchical clustering

How to do Hierarchical Clustering for large similarity matrix

How to print the data of each cluster in agglomerative clustering algorithm in python

single- linkage hierarchical cluster method cutting the tree

Hierarchical Clustering

Choosing the number of clusters in heirarchical agglomerative clustering with scikit

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question Agglomerative hierarchical clustering technique OpenCV machine learning library for agglomerative hierarchical clustering When to stop agglomerative hierarchical clustering - stopping criteria How do I plug distance data into scipy's agglomerative clustering methods? choose cluster in hierarchical clustering How to do Hierarchical Clustering for large similarity matrix How to print the data of each cluster in agglomerative clustering algorithm in python single- linkage hierarchical cluster method cutting the tree Hierarchical Clustering Choosing the number of clusters in heirarchical agglomerative clustering with scikit

Related Tags

How do I choose a linkage method for Hierarchical Agglomerative Clustering?

Question

1 answers

solution1 1 2015-06-14 09:08:53

solution1
1 2015-06-14 09:08:53