For the case of Gaussian Mixture Models, you can easily extend it and get a Dirichlet Process GMM (DP-GMM, or infinite GMM). It was first proposed in Rasmussen's paper. I like this paper very much because it introduces the concept very nicely.
The idea is to go non-parametric and let the data decide the number of clusters it is most comfortable with.
Imagine you decide that you have $K$ components. If $z_n$ is the cluster assignment of point $n$, you can put a prior over the assignments with mixing weights $\boldsymbol{\pi} = [\pi_1, \pi_2, \dots, \pi_K]$. If you put yet another prior over $\boldsymbol{\pi}$, you can avoid specifying the weights yourself. We use a Multinomial distribution for the assignments and a Dirichlet prior over $\boldsymbol{\pi}$. You would have:
$$
z_n \mid \boldsymbol{\pi} \sim \text{Multinomial}(\boldsymbol{\pi})\\
\boldsymbol{\pi} \mid \alpha \sim \text{Dirichlet}(\alpha)
$$
Similarly, you also put a prior on the latent parameters $\boldsymbol{\theta}_i$ of each component $i$:
$$
\boldsymbol{\theta}_i \sim G_0
$$
And you also have the likelihood, which gives the probability of the observed data point $x_n$ given that it belongs to component $i$:
\begin{align}
p(x_n \mid \boldsymbol{\theta}_i)
\end{align}
Up to here, this is a GMM with a fixed number of components or clusters.
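The finite generative model above can be sketched by forward sampling in a few lines of NumPy. This is only an illustrative sketch: the values of $K$, $\alpha$, the choice of $G_0$ (a wide Gaussian over component means), and the unit-variance Gaussian likelihood are my own assumptions, not taken from the original paper.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative settings (my assumptions): K components, N points,
# symmetric Dirichlet concentration alpha.
K, N, alpha = 3, 500, 1.0

# pi | alpha ~ Dirichlet(alpha, ..., alpha)
pi = rng.dirichlet(alpha * np.ones(K))

# theta_i ~ G_0: here G_0 draws each component mean from N(0, 5^2)
mu = rng.normal(0.0, 5.0, size=K)

# z_n | pi ~ Multinomial(pi), then x_n | z_n ~ N(mu_{z_n}, 1)
z = rng.choice(K, size=N, p=pi)
x = rng.normal(mu[z], 1.0)
```

Inference would go the other way: given $x$, you would infer $\boldsymbol{\pi}$, the $\boldsymbol{\theta}_i$ and the assignments $z$, e.g. with Gibbs sampling.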
Now, the trick is to let $K$ go to infinity [1]. If you do so, you end up with a Dirichlet Process prior, and you will be able to infer the number of clusters, the parameters of each cluster, and the cluster assignments.
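One concrete way to see this $K \to \infty$ limit is the Chinese Restaurant Process, which samples the cluster assignments directly without fixing $K$ in advance: each point joins an existing cluster with probability proportional to its size, or opens a new one with probability proportional to $\alpha$. A minimal sketch (the function name and parameter choices are illustrative, not from the source):

```python
import numpy as np

def crp_assignments(n_points, alpha, rng):
    """Sample cluster assignments from a Chinese Restaurant Process,
    the K -> infinity limit of the finite Dirichlet-Multinomial prior."""
    assignments = [0]  # the first point starts the first cluster
    counts = [1]       # number of points in each cluster so far
    for n in range(1, n_points):
        # join existing cluster k with prob counts[k] / (n + alpha),
        # open a new cluster with prob alpha / (n + alpha)
        probs = np.array(counts + [alpha]) / (n + alpha)
        k = rng.choice(len(probs), p=probs)
        if k == len(counts):
            counts.append(1)  # a new cluster is born
        else:
            counts[k] += 1
        assignments.append(k)
    return assignments

rng = np.random.default_rng(1)
z = crp_assignments(100, alpha=1.0, rng=rng)
n_clusters = len(set(z))  # the number of clusters the data "decided" on
```

Running this repeatedly gives a varying number of clusters; larger $\alpha$ tends to produce more of them.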
The idea of the Dirichlet Process as a basis for non-parametric models is quite popular.
I think I've seen it applied to HMMs, but I'm not sure about that.
[1] I wrote out the maths explicitly here, but it's only a draft; I would like to add some images and examples.
PS: See also this popular post: http://blog.echen.me/2012/03/20/infinite-mixture-models-with-nonparametric-bayes-and-the-dirichlet-process/
Best Answer
I like the article entitled Observations on the Use of Growth Mixture Models in Psychological Research. It is perhaps not as theoretical as you would like, but it is very enlightening. I think it is written in the context of longitudinal research, within the psychological realm, but a lot can be learned from it.
Edit: Actually, upon second reading there is quite a bit of theoretical/philosophical discussion in that paper! Seems relevant.
Edit: I would also like to add another paper to this answer, entitled What's a taxon? Meehl argues that there are true clusters in nature and provides a salient example: there are gophers, and there are chipmunks, but there are no gophmunks. This does a good job of defining a taxonic group, and highlights that such taxa may also be common in humans. A great deal of research has sought to answer such questions using latent cluster analyses and the like.