I have got two databases $X_{S}$ and $X_{T}$ which are having different number of samples but the same size of feature-space. I am wandering which is the most efficient way to calculate the distance of those databases. How can I calculate the distance between their distributions?
Solved – Calculate the distribution distance between two datasets
distancedistributions
Related Question
- Solved – Distance measure between two multivariate normal distributions (with differing mean and covariances)
- Solved – Maximum Mean Discrepancy (distance distribution)
- Solved – Kullback-Leibler distance for comparing two distribution from sample points
- Solved – How to compute the correlation between two distance matrices
- Solved – Unit of the mahalalanobis distance between two individuals
Best Answer
If you do know their distributions, you can use the KL-divergence to calculate the distance between their distributions. The problem is when you do not have any knowledge about their pdf s. You can perhaps have a look at this paper. Good luck