Arithmetic Mean – Why It Works

arithmeticmeansstatistics

I've been using the formula for the arithmetic mean all my life, but I'm not sure why it works.

My current intuition is this one:

The arithmetic mean is a number that when multiplied by the number of elements, gives you the sum of all the elements. Because of this fact, it can't be more than the maximum nor less than the minimum, and it should be located somewhat around the center.

But I was wondering if there are other intuitions out there? Why does this formula work? If in passing you could talk about the weighted average as well, that would be nice too.

Thanks

Best Answer

The simplest way to explain arithmetic mean is in terms of "equal sharing":

Abe has 12 cookies, Brianna has 8 cookies, and Chuck has 7 cookies. If they were to redistribute them so that they all have the same amount, how many would each get?

Obviously the way you answer this question is to find the total amount of cookies ($12 + 8 + 7 = 27$) and then divide the cookies among the three people ($27/3 = 9$). That's precisely what the computation of arithmetic mean does.

Is that what you're looking for?


Edited to add:

Here's another viewpoint that might help. We would like to find some number $N$ that is in the "middle" of the set $ \{12, 8, 7 \}$ (using the same numbers from the example above). What does "in the middle" mean? Well, one way to interpret this vague phrase is to imagine that we already had such an $N$ in hand, and we compute the three quantities $12-N, 8-N,7-N$. These three quantities tell us how far $N$ is from each of three pieces of information -- call these the "deviations".

What if we made a bad choice of $N$? For example, if each of the three deviations were positive, then that would mean that $N$ is smaller than each of the three original numbers, which we don't want. If each of the three deviations were negative, then that would mean that $N$ is larger than each of the three original numbers -- again bad. For $N$ to be in the middle, we would want some of the deviations to be positive and some of them to be negative. In fact, if we could choose $N$ so that the positive deviations exactly cancel out the negative deviations, then we will feel like we've really found the "middle".

Let's translate that now into a computation. We want to find $N$ such that $$(12-N) + (8-N) + (7-N) = 0$$ If you now think about what it would take to solve this equation, you will quickly realize that you end up adding the three numbers in your dataset together and then dividing by 3.