Solved – P Wasserstein distance in Python

machine learningmathematical-statisticspythonscipy

I know the earth mover's distance is implemented here :

https://docs.scipy.org/doc/scipy/reference/generated/scipy.stats.wasserstein_distance.html

I need to compute the p-Wasserstein distance between two 1d distributions ( or samples from these distributions). The p-WD is given as follows

Anybody familiar with a Python implementation of the p-Wasserstein distance? anyhelp is appreciated!

Best Answer

This is implemented in the POT: Python Optimal Transport package, for samples (or, generally, discrete measures): use ot.wasserstein_1d. If you want to do it for weighted samples (or general discrete distributions with finite support), you can provide the a and b arguments.

Related Solutions

Solved – How to estimate the leafsize of the kd-tree

With this setting of 10, you should never have a leaf with a single point, unless your data set consists of exactly one point.

Because the splits are balanced in size, the previous level must have at more than 10 points. So the minimum size is 5, if you set the maximum to 10 (except if there are less than 5 data points total).

Solved – Determining shape parameter for Generalized Pareto Distribution Scipy

In Mathematica this works:

GPD = ParetoPickandsDistribution[2, 3, .07];
data = RandomVariate[GPD, 10^4];
FindDistributionParameters[data, ParetoPickandsDistribution[mu, sigma, eta]] ->
{mu -> 2.00036, sigma -> 2.96883, eta -> 0.07022}

where mu is the location parameter, sigma the scale parameter, and eta the shape parameter.

FindDistributionParameters can use 5 different methods (see the documentation), but I believe the default is maximum likelihood estimation (MLE). Mathematica has all the tools (Likelihood, LogLikelihood, FindMaximium, Maximize, and ParetoPickandsDistribution for the PDF) to do MLE from scratch, if that's your wont. There is a good explanation of MLE in Wikipedia.

Best Answer

Related Solutions

Solved – How to estimate the leafsize of the kd-tree

Solved – Determining shape parameter for Generalized Pareto Distribution Scipy

Related Question