Solved – How to interpret the standard deviation of a directional dataset

circular statisticsstandard deviation

Lets say I start with a vonMises distribution dataset of compass directions:

x <- rvonmises(n=10, mu = circular(0, units="degrees", type="angles", zero=pi/2,modulo="2pi",template="geographics"), kappa=10)

I understand the angular mean:

mean(x)
# Circular Data: 
# Type = angles 
# Units = degrees 
# Template = geographics 
# Modulo = 2pi 
# Zero = 1.570796 
# Rotation = clock 
# [1] 4.796034

The mean is ~4.8$^\circ$ from true N. Fine.

But when I calculate the standard deviation, defined as $\sqrt{-2\ln{r}}$ (where $r$ is the mean resultant length), I get a number between 0 and infinity that I don't know how to interpret.

I vaguely understand that the reason for not adopting the arithmetic definition of the standard deviation is to preserve statistical properties of the directional data's distribution function. But at least I know that the arithmetic standard deviation is in the units of the original data, and can interpret it as the bounds of ~%60 of the data.

How is the angular standard deviation helpful as a descriptive statistic (i.e., how does one interpret its meaning)?

Best Answer

To get a better intuition of what is going on, first realize that the ~60% rule that you mention only holds for the normal distribution. Although the variance and the standard deviation have many useful properties for all kinds of statistical models, there is no strong 'interpretation' other than that a higher standard deviation denotes more dispersed data.

In the circular situation, the situation is similar. There is some motivation for using the circular standard deviation, but it is related to one specific model.

So, the interpretation is as follows. If we have a (normal) random variable $X \sim N(\mu, \sigma^2)$, then $X_c = X ~~ \text{mod} ~~ 2 \pi$ has a wrapped Normal distribution $WN(\mu, \rho)$, where $\rho$ is the resultant length and given by $\rho = e^{-\sigma^2 / 2}$. This means we could write the standard deviation of our original random variable as $\sigma = \sqrt{-2 \ln {\rho}},$ which you can of course recognize from the definition of the circular standard deviation.

So, in words, the circular standard deviation is the standard deviation of the normal distribution that, when wrapped, produces a wrapped normal distribution which has the same resultant length as the data set (from which we calculated the circular standard deviation).

So, it still might seem fairly arbitrary, but this is the strongest interpretation of this measure in the circular case.

Related Question