[Math] How often does it happen that the oldest person alive dies

actuarial-sciencemarkov chainsstatistics

Today, we are brought the sad news that Europe's oldest woman died. A little over a week ago the oldest person in the U.S. unfortunately died. Yesterday, the Netherlands' oldest man died peacefully. The Gerontology Research Group keeps records: Guinness World Records.

If you live in a country with $N_{\text{country}}$ people, a continent with $N_{\text{continent}}$ people, and a world with $N_{\text{world}}$ people, during a year and on average, how often will you be notified (if you're paying attention to your quality tabloid) of the death of the oldest man/woman/person alive of your country/continent/world? (Note that a death will result in at most one notification.)


Edit, Suggestions (due to comments; thanks!) only:

Ultimately, I am looking for a realistic formula. That means that life tables are certainly allowed (but mind the ending, and note that some of the oldest people are older than the maximum in the tables).

I guess that the Gompertz–Makeham law of mortality, or any plausible model of late-life mortality, is also fair game.

However, these suggestions do not preclude some insight or realistic assumption that might not need any of those things.

Further potentially useful assumptions (with regard to a subpopulation) may include (if reasonable w.r.t. the question):

  1. Time is discrete.
  2. The number of births at any time is constant.
  3. The number of deaths at any time is constant.
  4. The number of births is equal to the number of deaths at any time.
  5. Time is defined in such a way that, at any time, one birth and one death occur.
  6. In the limit to infinite age, the probability of a person of such age dying goes to $1$.
  7. At any time, the oldest person(s) alive is (are) the most likely to die.
  8. At any time, an older person(s) alive is (are) more likely to die than any younger person alive.

(Although such assumptions aren't necessarily realistic, I don't immediately see how they would distort the outcome.)

Best Answer

I think that this question is best approached through careful modelling, rather than pure mathematics. Here's the approach I took. I don't claim that this is the perfect approach by any means, but it's a start.

Spoiler: My simulations give a rate of approximately once every 0.66 years, for a population of 7 billion people who share US mortality statistics.

First, I took the US mortality tables from the Center for Disease Control. They only go up to age 100, so I need to extrapolate beyond that. I fitted a power law to the hazard rate $h(a)$ which gives the probability of dying between age $a$ and $a+1$, getting

$$h(a) = 3.54 \times 10^{-15} \times a^{6.933}$$

I assume $h(a)=1$ in the case that my power law gives me a number above 1. This occurs at $a=122$, which seems realistic (the oldest person to ever live died at age 122).

I then simulated an evolving population until it converged on a stable distribution. I assume $N(a)$ people at age $A$, and a constant birth rate of $9\times 10^7$ people every year (chosen to give a stable popluation of 7 billion). The result is a reasonable-looking population pyramid:

enter image description here

Now that I have a stable population, I simulate again. For each age $a$, the number of people of age $a$ in year $t$ is the fraction of the population aged $a-1$ at time $t-1$ who don't die, i.e.

$$N(t,a) = (1-h(a-1)) \times N(t-1, a-1)$$

When appropriate I approximate the number of deaths with the normal distribution, but for small populations I use the binomial distribution. In the case that there are some deaths in the highest age bracket, I calculate the probability that the person who died was the oldest person in the world at that time, and record this as an event.

Taking the total number of events, and dividing by the number of years that I run the simulation for, gives an approximate rate. The punchline is that in my simulation, I see 15,234 events in 10,000 years, for an approximate rate of once in every 0.66 years.

Assuming a population of one billion people (the population of the developed world, to which the US mortality statistics are most likely to apply) we can see the following histogram, which gives the age of the oldest person in the world at the time they die. Comparing to the wikipedia page for oldest people it looks as though the numbers are too high by 1-2 years, but otherwise I'm surprised at how accurate this crude model is!

enter image description here

One final chart. This is how the number of deaths of the oldest living person each year varies as a function of the total population. Roughly, it seems to be linear in the logarithm of the population. I'd be interested to see a more rigorous mathematical treatment that can get this result out

enter image description here

Edit: I corrected a bug which was causing me to estimate the rate as too high. I was approximating the binomial $B(n,p)$ with a normal distribution $\sigma=np(1-p)$ rather than $\sigma^2=np(1-p)$.

Edit no. 2: It was pointed out in the comments that I had another bug, and I also realized that I wasn't ever checking for the possibility that more than 1 'oldest person' dies in a given year.