Solved – k-means clustering on percentages

k-means, machine learning

Can we do k-means clustering on percentage data (like 56%, 44%, 22%, 13%, etc.)?
The data set in question has several parts measured in percentages.

Best Answer

I don't see any reason not to. Percentage values are just ordinary numbers, each divided by another number.

If other parts of the data are not percentages, you may have to scale the data appropriately (or convert them to percentages as well), or choose the distance measure carefully.
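To illustrate the scaling point, here is a minimal sketch using only the standard library. The second column (a count on a much larger scale), the helper names `min_max_scale` and `kmeans`, and the choice of starting centroids are all made up for the example. Min-max scaling is only one possible choice, not necessarily the best one for your data.

```python
import math

def min_max_scale(rows):
    """Rescale every column to [0, 1] so no single feature dominates the distance."""
    cols = list(zip(*rows))
    lows = [min(c) for c in cols]
    highs = [max(c) for c in cols]
    return [
        [(v - lo) / (hi - lo) if hi > lo else 0.0
         for v, lo, hi in zip(row, lows, highs)]
        for row in rows
    ]

def kmeans(points, init_centroids, max_iter=100):
    """Plain Lloyd's algorithm with Euclidean distance and fixed starting centroids."""
    centroids = [list(c) for c in init_centroids]
    labels = None
    for _ in range(max_iter):
        # Assignment step: each point goes to its nearest centroid.
        new_labels = [
            min(range(len(centroids)), key=lambda j: math.dist(p, centroids[j]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        # Update step: each centroid becomes the mean of its members.
        for j in range(len(centroids)):
            members = [p for p, l in zip(points, labels) if l == j]
            if members:
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids

# Hypothetical data: column 0 is a percentage, column 1 is a raw count.
data = [
    [56.0, 1200.0],
    [44.0, 1150.0],
    [22.0, 300.0],
    [13.0, 350.0],
]

scaled = min_max_scale(data)
# Assumption for reproducibility: start from the first and last rows.
labels, centroids = kmeans(scaled, [scaled[0], scaled[3]])
print(labels)  # [0, 0, 1, 1]
```

Without the scaling step, the count column would contribute far more to the Euclidean distance than the percentage column simply because its values are larger.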
