Solved – Importance of normal distribution

distributionsprobability

Why did the normal distribution become such a popular (important) distribution? I know one reason is because of CLT. Can you please give more reasons?

Best Answer

The main reason that the normal distribution is so popular is because it works (is at least good enough in many situations). The reason that it works is really because of the Central Limit Theorem. Rather than trying to look beyond the CLT, I think you (and others) should better appreciate the CLT (I have a cross-stitch of the CLT hanging on my wall as I type).

We usually teach and think about the CLT in terms of a sample mean (and that is a powerful use of the CLT), but it extends much further than that. The CLT also means that any variable that we measure that is the result of combining many effects (many relative to the degree of relationship between the different pieces) will be approximately normal.

For example: a person's height is determined by many small effects including genetics (there will be several genes that contribute to height), nutrition (not just good/bad, but what was actually eaten each day that the person was growing), environmental polutions (again each day contributed a small effect), and other things. So heights (within sex/race combinations) are approximately normal.

Annual rainfall for a specific area is the summation of the daily rainfall for the year and while the daily rainfall is probably very far from normal (zero inflated) when you add all those days together you get something much more normal.

Binomial distributions are just sums of Bernoullis and a Poisson distribution can be the sum of smaller Poissons, it should not be a surprise that either can be approximated by a normal (if enough pieces are added together).

Most exceptions come when common values are close to a natural boundary (rainfall in the desert, test scores where many students get 100% or close to it, etc.) or when there is a single (or small number) of very strong contributors (height including both sexes or with a spread of ages when kids are still growing). Otherwise there are many things that can be approximated using the normal distribution (and things become even more normal when you average them together from a sample).

So why do we need any more justification than the CLT (not to take away from the other great answers).

dismount soapbox

addition

Since it appears that at least 2 people want to see the cross-stitch (based on comments below) here is a picture:

enter image description here

I also have cross-stitches of Bayes theorem and the mean value theorem of integration, but they are off topic for this question.

Related Question