Solved – Automating determination of number of clusters from a kmeans cluster analysis

clusteringdata miningk-means

I use kmeans for clustering a set of data. However, I have to specify the number of clusters. The problem is that sometimes I need 2 and other times I need 3 clusters.

  • Is there a clustering algorithm that could incorporate that feature in it?

Best Answer

this is a great paper to start with:

Estimating the number of clusters in a data set via the gap statistics

It's really easy to implement something similary in any language.

Related Question