Solved – Standard deviation of a cluster

meanstandard deviation

I have a list of (x,y) pairs that form a cluster of points. I was able to find the mean of the cluster, but I'd like to be able to find the standard deviation of the cluster. How can I define the standard deviation? I defined the mean as the point (mean of x values, mean of y values).

What type of standard deviation value is typical for clusters like this? I'd imagine I could get a standard deviation of the spread, or something like that?

This looks good. Gaussian on a cluster:

Gaussian Data

How do I do this?

Best Answer

The two easiest approaches:

  1. One way to do it would be to count the standard deviation on $x$s and $y$s separately, to get you a box shape.

  2. If you prefer the circle shape, then simply given the mean $(\bar{x},\bar{y})$ and the points $(x_i,y_i)$ compute the distance $r_i = \sqrt{(\bar{x} - x_i)^2 + (\bar{y} - y_i)^2}$ and then calculate the standard deviation of the $r_i$'s around $0$. The resulting standard deviation (or if you divide by $\sqrt{N}$ then standard error) will be the radius of the circle.

Related Question