Solved – Is calculating a percentile the same as evaluating a cumulative density function

distributionsquantiles

I'm trying to make the jump from the idea of a percentile, say, over the real number line (where the nth percentile is simply the position in which n% of data points are below it, and 100-n% are above it), to the idea of the area under a probability density function.

If I want to know the 50% percentile from a set of numbers, I'll find the point in which half the numbers are below, half the numbers are above. That's the 50% percentile, and I'm done.

If I want to know the 50% percentile from a distribution, say, a Z-score, I'll evaluate the cdf from 0 – 50, and I'm done. Am I saying this correct?

This feel right intuitively, but I need some discussion to hammer it home. Or, I could be completely off…

Best Answer

You are close but not exactly right. Remember that the area under a probability distribution has to sum to 1. The cumulative density function (CDF) is a function with values in [0,1] since CDF is defined as $$ F(a) = \int_{-\infty}^{a} f(x) dx $$ where f(x) is the probability density function. Then 50th percentile is the total probability of 50% of the samples which means the point where CDF reaches 0.5. Or in more general terms, the p'th percentile is the point where the CDF reaches p/100.

Related Question