Solved – If the mean and median underestimate the true central tendency, why use them

descriptive statisticsmeanmedianmode

In skewed distributions, both the mean and the median can easily underestimate or overestimate the true central tendency. For example, have a look at this violin plot:

The median is shown here in red. It seems to overestimate the true central tendency. This is the same for the mean. The best central tendency description would be the mode of that distribution (which would get the peak).

This is a basic question: but why use mean and median for skewed distributions? Do people even use them? Wouldn't it make sense to get the maximum point of the Kernel density estimate?

Best Answer

In skewed distributions, both the mean and the median can easily underestimate or overestimate the true central tendency. [...] Wouldn't it make sense to get the maximum point of the Kernel density estimate?

No it wouldn't, at least not always.

Take as an example the exponential distribution: it is parametrized by $\lambda$, its expected value is $\lambda^{-1}$, the same as its mean, its median is $\lambda^{-1} \ln(2)$ and its mode is always $0$, regardless of parametrization. So all the values from this distribution are greater than or equal to the mode -- what does that tell us? Not much...

It is the opposite with the mean which marks the probability mass center; the median also provides us with similar information. Yes, $0$ is the most likely value, but we are more interested here in the tail of the distribution (since it's a tail-only distribution).

Finally, the mode would be a useless summary statistic if you would like to compare different exponential variables.

Check out also: If mean is so sensitive, why use it in the first place?

Mean:

1. Balance Point: Mean as the fulcrum

The best way to understand the concept of mean it to think of it as the balance point on a uniform rod. Imagine a series of data points, such as {1,1,1,3,3,6,7,10}. If each of these points are marked on a uniform rod and equal weights are placed at each point (as shown below) then the fulcrum must be placed at the mean of the data for the rod to balance.

This visual demonstration also leads to an arithmetic interpretation. The arithmetic rationale for this is that in order for the fulcrum to balance, the total negative deviation from the mean (on the left side of the fulcrum) must equal to the total positive deviation from the mean (on the right side). Hence, the mean acts as the balancing point in a distribution.

This visual allows an immediate understanding of the mean as it relates to the distribution of the data points. Other property of the mean that becomes readily apparent from this demonstration is the fact that the mean will always be between the min and the max values in the distribution. Also, the effect of outliers can be easily understood – that a presence of outliers would shift the balancing point, and hence, impact the mean.

2. Redistribution (fair share) value

Another interesting way to understand the mean is to think of it as a redistribution value. This interpretation does require some understanding of the arithmetic behind the calculation of the mean, but it utilizes an anthropomorphic quality – namely, the socialist concept of redistribution – to intuitively grasp the concept of the mean.

The calculation of the mean involves summing up all values in a distribution (set of values) and dividing the sum by the number of data points in the distribution.

$$ \bar{x} = (\sum_{i=1}^n{x_i})/n $$

One way to understand the rationale behind this calculation is to think of each data point as apples (or some other fungible item). Using the same example as before, we have eight people in our sample: {1,1,1,3,3,6,7,10}. The first person has one apple, the second person has one apple, and so on. Now, if one wants to redistribute the number of apples such that it is “fair” to everyone, you can use the mean of the distribution to do this. In other words, you can give four apples (i.e., the mean value) to everyone for the distribution to be fair/equal. This demonstration provides an intuitive explanation for the formula above: dividing the sum of a distribution by the number of data points is equivalent to partitioning the whole of the distribution equally to all of the data points.

3. Visual Mnemonics

These following visual mnemonics provide the interpretation of the mean in a unique way:

This is a mnemonic for the leveling value interpretation of the mean. The height of the A's crossbar is the mean of the heights of the four letters.

And this is another mnemonic for the balance point interpretation of the mean. The position of the fulcrum is roughly the mean of the positions of the M, E, and doubled N.

Median

Once the interpretation of mean as the balancing point on a rod is understood, the median can be demonstrated by an extension of the same idea: the balancing point on a necklace.

Replace the rod with a string, but keep the data markings and weights. Then at the ends, attach a second string, longer than the first, to form a loop [like a necklace], and drape the loop over a well-lubricated pulley.

Suppose, initially, that the weights are distinct. The pulley and loop balance when the same number of weights are to each side. In other words, the loop ‘balances’ when the median is the lowest point.

Notice that if one of the weights is slid way up the loop creating an outlier, the loop doesn’t move. This demonstrates, physically, the principle that the median is unaffected by outliers.

Mode

The mode is probably the easiest concept to understand as it involves the most basic mathematical operation: counting. The fact that it’s equal to the most frequently occurring data point leads to an acronym: “Most-often Occurring Data Element”.

The mode can also be thought of the most typical value in a set. (Although, a deeper understanding of ‘typical’ would lead to the representative, or average value. However, it’s appropriate to equate ‘typical’ with the mode based on the very literal meaning of the word ‘typical’.)

Sources:

The Median is a balance point -- Lynch, The College Mathematics Journal (2009)
Making Statistics Memorable: New Mnemonics and Motivations -- Lesser, Statistical Education, JSM (2011)
On the Use of Mnemonics for Teaching Statistics -- Lesser, Model Assisted Statistics and Applications, 6(2), 151-160 (2011)
What does the mean mean? – Watier, Lamontagne and Chartier, Journal of Statistics Education, Volume 19, Number 2 (2011)
Typical? Children's and Teachers' Ideas About Average – Russell and Mokros, ICOTS 3 (1990) OVERALL REFERENCE: http://www.amstat.org/publications/jse/v22n3/lesser.pdf

Best Answer

Related Solutions

Central Tendency – Should the Mean Be Used When Data Are Skewed?

Descriptive Statistics – Explaining Mean, Median, Mode in Layman’s Terms

Mean:

Median

Mode

Related Question