Solved – Where do the default values in the Elo ratings formulas come from

expected valuegamesnormal distributionprobabilityranking

After doing some reading about the Elo ratings system, I am trying to implement one. I have some questions on the default values in the formulas.

If player A has rating $r_a$ and player b has rating $r_b$. Then with

$x = \frac{r_a}{400}$ and $y = \frac{r_b}{400}$

The two main formulas are:

$E = \frac{10^x}{10^x + 10^y}$,

$r_{post} = r_{pre} + K * (S – S_{exp})$

where $E$ is expected score, also the probability of player A beating player B, and $r_{post}$ is a player's new ranking based on their expected score and actual score, 1 if they won 0 if they lost.

My question is where do the numbers 400 and 10 come from? Are they arbitrary? I know that in Elo with a pair of players that are 200 points apart, the better player is predicted to win ~75% of the time, is this why 400 was chosen, to satisfy this condition? My guess is this stems from the fact that Elo assumes players ratings are gaussian, and they all have the same standard deviation. What about 10, and why is it raised to an exponent? All I know is the formula comes from the Bradley–Terry model. Lastly, I've read the default average rating in the Elo system is 1500, is this connected to 400 as well?

Best Answer

I found a slightly different equation for the calculation of the estimate: E = 1/(1+10^((ra-rb)/400)). And the number 400 contained in the formula was chosen by Arpad Elo so, that the Elo numbers are as compatible as possible with the numbers of the formerly used rating system by Kenneth Harkness. In fact one can see the Harkness-Model as a stepwise linear approximation towards the Elo-Model. If the rating difference between player A and B, i.e. |R_B - R_A| is more than 400 points, the number 400 (or -400) is used instead of the real difference.

Related Question