Solved – Supervised or Unsupervised Clustering

clusteringsupervised learning

I have a set of N samples belong to K classes. I am using k-means clustering with Euclidean distance in order to cluster these samples into K clusters. To help the k-means algorithm to group samples of a specific class into one cluster, I initialized the k-means algorithm so that the mean value of each cluster is the mean value of the samples of a specific class.

My question is not about the usefulness of what I mentioned above or the purpose of it, however, my question is:
In machine learning, is this called "supervised clustering", "semi-supervised clustering" or just "normal clustering initialized by means of samples of the real class"? I want to know the correct terminology when we use k-means initialized by real-class mean values.

Best Answer

K-means is ''unsupervised'' by definition: it does not take the labels into account.

You however performed a ''supervised initialization''.

So I'd call this an unsupervised algorithm that has been initialized in a supervised manner.

And no, I don't think it makes a lot of sense to do it this way.

Related Question