Solved – get probabilities from kernel density estimation pdf

approximate-inferencedensity functiondensity-estimationkernel-smoothingprobability

I have data points located at $\mathbf{x}_i$ and I would like to a find quick and dirty way to calculate their probability of occurring (not the pdf) using kernel density estimation. Formally speaking, I know I must take the definite integral within the neighborhood of the points to get an actual probability. But that is not feasible, due to the large number of points and high dimensionality. Does anyone have an informal way to do this?

Best Answer

The problem here is that your question is contradictory. You are using a KDE with a continuous kernel, which means that you are estimating using a continuous distribution. For a continuous distribution, the probability of any outcome is zero (see e.g., here and here), so we usually measure by the probability density instead. However, you say that you want the probability of the point, not its density. You also make it clear that you want the probability of the individual point, not the probability of a neighbourhood containing that point.

Under these requirements, the estimated probability of the outcome is zero. This is not helpful, which is why we measure outcomes in a continuous distribution by their probability density instead of their probability.

Related Question