8/14: Substantially edited in response to comments: added to 1st part, added new 2nd and 4th parts
There's also some discussion underneath, and a link to a partial write-up of a case of cut-elimination involving the Ackermann function
If you haven't found the Gentzen-style proof illustrative, I recommend trying the infinitary proof. (Not everyone likes it, but it's at least an alternate perspective on it.) Pohlers' "Proof Theory: An Introduction" gives a nice presentation of the proof, and it includes the proof in PA that ordinals below $\epsilon_0$ are well-founded. (This includes the fact that you use $N$-quantifier induction to prove well-foundedness up to $N+1$ exponents, and provides some insight into why additional quantifiers have the effect they do.)
One nice feature of the infinitary proof is that the ordinals are outright bounds on the heights of proofs, so if you're comfortable visualizing infinitary proofs, the source of the increase in bounds is quite explicit.
It's worth noting that "infinitary" is a bit of a misnomer. The infinitary proof takes the perspective that a proof of $\forall x\phi(x)$ should be a computable function $f$ so that, for each $n$, $f(n)$ is a proof of $\phi(n)$. Since the functions are all computable, there's nothing genuinely infinitary about it. People usually ignore the "computable" part for expository purposes, since it makes sense without that requirement, at which point it does look infinitary, but this is just because many of the ideas are clearer without constantly rechecking that the operations we're describing really are computable.
$ $
A follow-up point about what ordinals mean in proof theory. As pointed out in the comments, $\omega$ doesn't really mean "infinite"; it means "an unspecified fixed integer". Similarly, $\omega+3$ really means "not only is this integer unspecified, but it will take three steps to figure out which integer it is". So in the infinitary proof of cut-elimination, when we say that a proof has height $\omega$, we mean that it is a proof of, say, $\forall x\phi(x)$, where the height of the proof of $\phi(n)$ is $g(n)$ with $g(n)\rightarrow\infty$.
A proof of height $\omega+3$ is a proof with three additional steps below a proof of height $\omega$. More interestingly, a proof of height $\omega+\omega$ might be a proof of $\forall x\phi(x)\vee\forall y\psi(y)$ where for each $n$, we have a proof of $\phi(n)\vee\forall y\psi(y)$ of height $\omega+n$. To find a quantifier-free instantiation of this proof, we'd have to first plug in an $n$ for $x$ and then, at the appropriate level, an $m$ for $y$ to get a proof of $\phi(n)\vee\psi(m)$ of height $g(n)+h(n,m)$.
$ $
Speaking very loosely, the intuitions you describe above are perfectly sensible, but they basically only correspond to what's happening with fairly simple (mostly one quantifier) formulas. Cut-elimination corresponds to extracting computable information, and when you cut formulas with multiple quantifiers together, information has to flow back and forth between the two sides.
Informally, I think of it like this: suppose I want to cut a proof of
$$(\forall x\exists y\phi)\vee\psi$$
with a proof of
$$(\exists x\forall y\neg\phi)\vee\psi'$$
(cut-elimination is usually easier to think of in a one-sided form). The second proof produces a first proposal for a value of $x$ witnessing $\forall y\neg\phi$. The first proof might refute it by exhibiting a $y$. The second proof can now use that value of $y$ to compute a second guess at what $x$ is, the first proof refutes it again, and so on back and forth. This could go back and forth a large number of times (on the order of the ordinal of the second proof---that is, not just a fixed number of times, but, if the ordinal is something like $\omega+1$, a number of times based on the value of the first $y$, or, if it's $\omega+\omega+1$, the first $y$ tells us how many steps we go before reaching a value of $y$ which in turn tells us how many further steps we go, etc.).
This description is literally true when viewed through the lens of the functional interpretation. When viewed in terms of infinitary cut-elimination, these "back and forths" correspond to interleaved cuts: we place many copies of the first proof (more precisely, proofs of $\exists y\phi(n,y)\vee\psi$ for various values of $n$) throughout the second proof. Each of these creates new cuts, which correspond to taking sections of the second proof and placing them inside the many copies of the first proof which we've just created.
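To make the dialogue concrete, here is a toy sketch in Python. The names (`interact`, `propose`, `refute`) and the example "formula" are made up purely for illustration; this is a cartoon of the interaction, not the actual functional interpretation:

```python
def interact(propose, refute, rounds):
    """Toy model of the cut as a dialogue: `propose` guesses an x,
    claiming forall y not-phi(x, y); `refute` tries to answer with
    a y witnessing phi(x, y); each refutation feeds the next guess."""
    history = []
    for _ in range(rounds):
        x = propose(history)
        y = refute(x)
        if y is None:            # no refutation found: the guess stands
            return x
        history.append((x, y))
    return None                  # ran out of the allotted rounds

# Hypothetical example: phi(x, y) says "x < y <= 10", so the claim
# "there is an x with no such y" first becomes true at x = 10.
propose = lambda history: history[-1][1] if history else 0
refute = lambda x: x + 1 if x + 1 <= 10 else None
```

Here `interact(propose, refute, 20)` settles on $x=10$ only after eleven guesses; the number of rounds depends on the refuting witnesses along the way, mirroring how the ordinal of the second proof bounds the length of the back and forth.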
$ $
I think your indexing on the relationship between the fast-growing hierarchy and proof-theoretic ordinals is off by an exponential. The proof-theoretic ordinal of $I\Sigma_1$ is $\omega^\omega$, but it only proves the primitive recursive functions total (i.e. $f_n$ with $n<\omega$ in the fast-growing hierarchy). I'm not finding a reference on the exact relationship, but I would guess that the gap is consistent---that in the way two-quantifier induction is like $\omega^{\omega^n}$, the Conway chain notation is like $\omega^{\omega^2}$. So, while two-quantifier induction suffices, dealing with Conway chain notation will look like a substantial use of two-quantifier induction (indeed, I think it involves three levels of nested induction, two of which are over two-quantifier formulas, which would line up well with an ordinal of $\omega^{\omega^2}$).
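To see the nested-induction flavor of Conway chain notation concretely, here is a sketch of an evaluator in Python, following the standard reduction rules; the recursion descends on the chain's length, its last entry, and its second-to-last entry, matching the nested inductions just mentioned. (Evaluating anything beyond tiny chains is hopeless, of course.)

```python
def conway(chain):
    """Evaluate a Conway chained-arrow expression given as a list of
    positive integers, by the standard reduction rules."""
    if len(chain) == 1:
        return chain[0]
    if len(chain) == 2:
        return chain[0] ** chain[1]                # a -> b  =  a^b
    if chain[-1] == 1:                             # X -> 1      =  X
        return conway(chain[:-1])
    if chain[-2] == 1:                             # X -> 1 -> p =  X
        return conway(chain[:-2])
    # X -> (p+1) -> (q+1)  =  X -> (X -> p -> (q+1)) -> q
    X = chain[:-2]
    return conway(X + [conway(X + [chain[-2] - 1, chain[-1]]), chain[-1] - 1])
```

For instance `conway([2, 3, 2])` is $2\uparrow\uparrow 3 = 16$, while `conway([2, 2, 2]) = 4` (any chain starting $2\to 2$ evaluates to $4$).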
Having said that, your question is still perfectly sensible: how does the back and forth I described above produce things that are very fast growing? Suppose the first proof, of $(\forall x\exists y\phi)\vee\psi$, has a sensible rate of growth, say, exponential. The second proof iterates the rate of growth of the first proof, and can iterate it an ordinal number of times based on the height of the second proof. The second proof could easily (given perhaps some additional cuts over smaller formulas) have, say, height $\omega^2$. That is, if we take $f_0(n)=n^n$, $f_{\alpha+1}(n)=f_\alpha(f_\alpha(n))$, and $f_\lambda(n)=f_{\lambda[n]}(n)$, we should hit around level $f_{\omega^2}(n)$ in this hierarchy.
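As a sketch, this hierarchy can be written out directly for ordinals below $\omega^2$, representing $\omega\cdot a+b$ as the pair $(a,b)$ (an ad hoc encoding chosen here for illustration) and using the fundamental sequence $(\omega\cdot a)[n]=\omega\cdot(a-1)+n$:

```python
def f(alpha, n):
    """Fast-growing hierarchy from the text, for ordinals below w^2.
    alpha = (a, b) encodes the ordinal w*a + b."""
    a, b = alpha
    if a == 0 and b == 0:
        return n ** n                      # base case: f_0(n) = n^n
    if b > 0:                              # successor: f_{alpha+1}(n) = f_alpha(f_alpha(n))
        return f((a, b - 1), f((a, b - 1), n))
    return f((a - 1, n), n)                # limit: f_{w*a}(n) = f_{w*(a-1)+n}(n)
```

Already $f_2(2) = f_1(256) = f_0(256^{256})$ is far too large to write down, which is the point.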
Best Answer
This is really an expansion of Aaron Bergman's and others' comments, as well as information that exists in the Wikipedia articles on Goodstein's theorem and epsilon numbers. I will sketch a proof of Goodstein's theorem and you'll have to tell me whether Kronecker would have accepted it as a legitimate argument. (I'm not sure if this is the best 'elementary-looking' proof, and I don't know if this would be considered a long proof. If somebody knows a simpler proof that looks elementary, please tell me. I'm also not sure how constructive this argument can be. As written, it certainly isn't.)
Let $T$ be the collection of finite rooted trees. For each natural number $n$, let $T_n \subset T$ be the collection of rooted finite trees of height at most $n$ (we'll say that the rooted tree with a single node has height $0$).
We will define a linear order $\sqsubset$ on $T$ inductively by defining it on each $T_n$. $T_0$ has a single element, so the order on $T_0$ is the trivial order. Now for any $n$, given the order on $T_n$, to any $a \in T_{n+1}$ we can associate a finite sequence of elements of $T_n$, which we'll write $s(a)$, by listing the children of the root node of $a$ in descending order according to $\sqsubset$ (which we have already defined). Given $a,b \in T_{n+1}$, we say that $a\sqsubset b$ if $a\neq b$ and at the first index $i$ at which $s(a)$ and $s(b)$ differ, either $s(a)(i)$ doesn't exist or it does and $s(a)(i) \sqsubset s(b)(i)$.
It is not hard to show that $\sqsubset$ is a linear order on $T_{n+1}$, that $\sqsubset$ as defined in $T_{n+1}$ agrees with $\sqsubset$ as defined on $T_n$, and that $T_n$ is an initial segment of $T_{n+1}$ under $\sqsubset$. Therefore we can treat $\sqsubset$ as a linear order on all of $T$. Note that while $T$ is an infinite set, the elements of $T$ are finite objects (and can be coded by natural numbers) and the relation $a\sqsubset b$ is computable.
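Since trees are finite objects, this is easy to try out. Here is a sketch in Python (the function names are mine), encoding a tree as the tuple of its children, kept in descending order; with this encoding, $\sqsubset$ in fact agrees with Python's built-in lexicographic comparison of nested tuples:

```python
from functools import cmp_to_key

def compare(a, b):
    """The order from the text: compare the child sequences s(a), s(b)
    position by position; at the first disagreement, a missing entry or
    a smaller subtree makes the whole tree smaller."""
    for x, y in zip(a, b):
        c = compare(x, y)
        if c != 0:
            return c
    return (len(a) > len(b)) - (len(a) < len(b))

def tree(*children):
    """Build a tree from its children, sorted into descending order."""
    return tuple(sorted(children, key=cmp_to_key(compare), reverse=True))

leaf = tree()   # the unique tree of height 0
```

For instance `compare(tree(leaf, leaf), tree(tree(leaf)))` is negative: the tree coding "$2$" sits below the tree coding "$\omega$".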
Lemma. For every $n$, any function $f : \mathbb{N} \rightarrow T_n$ which is non-increasing under $\sqsubset$ is eventually constant.

Proof. We will show this by induction on $n$. The proposition is obvious for $n=0$. Assume that we've shown it for $T_{n}$ and let $f : \mathbb{N} \rightarrow T_{n+1}$ be a non-increasing function. Clearly if $f(k)$ is ever the unique element of $T_0$, then we're done, so assume that $f(k)$ is never in $T_0$.
Claim: For every $i \in \mathbb{N}$, the partial function $k \mapsto s(f(k))(i)$ is eventually constant (i.e. either eventually always defined and a constant value or eventually always undefined).
Proof of claim. We will prove this by induction on $i$. For $i = 0$, we have that $k \mapsto s(f(k))(0)$ is a non-increasing function whose codomain is $T_n$. By the overall induction hypothesis (on $n$), this implies that $k \mapsto s(f(k))(0)$ is eventually constant.
Assume that we've shown the claim for some $i$ and consider the partial function $k \mapsto s(f(k))(i+1)$. If $k \mapsto s(f(k))(i)$ is eventually always undefined, then $k \mapsto s(f(k))(i+1)$ is also eventually always undefined and we're done, so assume that $k \mapsto s(f(k))(i)$ is eventually always defined and let it be constant on inputs larger than $m$. Consider $k \mapsto s(f(m + k))(i+1)$. If this is ever undefined, then it is undefined for any larger inputs as well, so assume that it is always defined. In this case, we have that this is a non-increasing function from $\mathbb{N}$ to $T_n$, so by the overall induction hypothesis we have that it must eventually be constant as well. So in both cases it is eventually constant.
Therefore by induction, $k \mapsto s(f(k))(i)$ is eventually constant for every $i \in \mathbb{N}$, as required. $\square_{\text{claim}}$
For each $i$, let $g(i)$ be the eventual value of the map $k \mapsto s(f(k))(i)$ (where we take $g(i)$ to be undefined if $s(f(k))(i)$ is undefined for sufficiently large $k$). We have that $g(i)$ is defined on an initial segment of the natural numbers, possibly all of them. Furthermore, since $s(a)$ is always in decreasing order, we have that $g(i)$ is a non-increasing partial function from $\mathbb{N}$ to $T_n$. This means that if $g(i)$ is defined for all $i$, by the induction hypothesis we have that it is eventually constant.
Assume for the sake of contradiction that $g(i)$ is defined for all $i$. Let $a$ be its eventual value and let $j$ be some index such that $g(j) = a$. Choose $m$ such that for all $k \in \mathbb{N}$, $s(f(m+k))(j) = a$. Since $k \mapsto s(f(m+k))(j)$ is constant, we have that $k\mapsto s(f(m+k))(j+1)$ is non-increasing, and in particular must be defined for all $k$, since otherwise $g(j+1)$ would be undefined. By construction we have that $s(f(m+k))(j+1) \sqsubseteq s(f(m+k))(j)$, so since the eventual value $g(j+1)$ of $k \mapsto s(f(m+k))(j+1)$ is also $a$ (as $g$ is non-increasing with eventual value $a$ and $g(j)=a$), we must actually have $s(f(m+k))(j+1) = a$ for all $k \in \mathbb{N}$. Iterating this argument shows by induction that for any $\ell \geq j$, $s(f(m+k))(\ell) = a$ for all $k \in \mathbb{N}$, so in particular $s(f(m))(\ell) = a$ for all $\ell \geq j$, but this contradicts the fact that $s(f(m))$ is a finite sequence.
Therefore $g(i)$ is undefined for all but finitely many $i$. Let $j$ be the largest index such that for all $i \leq j$, $g(i)$ is defined. Find $m$ such that $s(f(m+k))(i)$ is constant in $k$ for all $i \leq j$ and always undefined for $i = j+1$. Since $s(f(m+k))(j+1)$ is always undefined, we have that $s(f(m+k))(\ell)$ is always undefined for any $\ell > j$, which implies that $k \mapsto f(m+k)$ is in fact a constant map (a tree is determined by the sequence of children of its root), so we have that $f$ is eventually constant, as required. $\square$
Now from this lemma we immediately get that if $f : \mathbb{N} \rightarrow T$ is non-increasing under $\sqsubset$, then it is eventually constant, because each $T_n$ is an initial segment of $T$ and $f(0) \in T_n$ for some $n$. (Of course this is just the fact that $\varepsilon_0$ is well-ordered expressed with a particular notation for elements of $\varepsilon_0$, but I won't tell Kronecker if you don't.)
At this point the rest of the proof should be fairly familiar. Given any Goodstein sequence, $g_n$, we associate a sequence of elements of $T$ by converting the hereditary base-$(n+2)$ notation representation of $g_n$ to a finite rooted tree by the following: $0$ corresponds to the tree with a single node and the number $a_k (n+2)^k + a_{k-1} (n+2)^{k-1} + \dots + a_0$ corresponds to a tree where for each $\ell \leq k$, we attach $a_\ell$ copies of the tree associated to $\ell$ to the root node. It is clear that replacing $n+2$ by $n+3$ does not change the associated tree, but subtracting $1$ does produce a strictly smaller tree under $\sqsubset$. If we had a Goodstein sequence which did not terminate it would give us a function $f: \mathbb{N} \rightarrow T$ which was non-increasing but also not eventually constant, which contradicts our lemma. Therefore every Goodstein sequence must terminate.
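This correspondence is easy to experiment with. A sketch in Python (the helper names are mine): `to_tree(m, b)` builds the tree of the hereditary base-$b$ representation of $m$, `bump(m, b)` replaces the base $b$ by $b+1$, and `goodstein(m, steps)` runs the sequence:

```python
def to_tree(m, b):
    """Tree of the hereditary base-b representation of m: for each
    term d * b^k, attach d copies of the tree for k to the root."""
    children, k = [], 0
    while m > 0:
        m, d = divmod(m, b)
        children.extend([to_tree(k, b)] * d)
        k += 1
    # Children in descending order; on nested tuples built this way,
    # Python's lexicographic order agrees with the order from the lemma.
    return tuple(sorted(children, reverse=True))

def bump(m, b):
    """Rewrite m in hereditary base b, then replace every b by b+1."""
    result, k = 0, 0
    while m > 0:
        m, d = divmod(m, b)
        result += d * (b + 1) ** bump(k, b)
        k += 1
    return result

def goodstein(m, steps):
    """At most `steps` further entries of the Goodstein sequence from m."""
    seq, b = [m], 2
    while len(seq) <= steps and m > 0:
        m = bump(m, b) - 1
        b += 1
        seq.append(m)
    return seq
```

For instance `goodstein(3, 10)` returns `[3, 3, 3, 2, 1, 0]`; one can also check that `to_tree(bump(3, 2), 3) == to_tree(3, 2)` (bumping the base preserves the tree) while the subtraction strictly decreases it.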
Regarding the metamathematical properties of this proof, the place where we are using second order arithmetic is in the main induction in the lemma, because the statement is for all functions $f: \mathbb{N} \rightarrow T_n$, which is using a second-order quantifier. The thing is that we don't really need this full strength for Goodstein's theorem. It is good enough to know it for computable $f$ (since the function associated to a Goodstein sequence is computable), but note that in the inductive proof we use the induction hypothesis on $g(i)$, which is not a priori a computable function, even when $f$ itself is. Furthermore, even though $g(i)$ is in general only a partial function with finite domain (and hence a computable object), this is non-uniform. If we have a family of functions $f_p: \mathbb{N} \rightarrow T_n$ and we want to know the associated family $g_p(i)$, this will typically require the Turing jump of the family $f_p$ to compute. This is a sign that these proofs won't work very uniformly in Peano arithmetic, especially since unpacking a proof of the lemma for a specific $n$ will end up needing something like $n$ Turing jumps.
We can formalize an encoding of finite rooted trees as natural numbers in Peano arithmetic and prove many basic properties (e.g. everything before the lemma). For any finite $n$ and any family $f_p(k)$ of uniformly definable functions $f_p : \mathbb{N} \rightarrow T_n$, Peano arithmetic proves the following: if each $f_p$ is non-increasing under $\sqsubset$, then each $f_p$ is eventually constant.
The whole issue is that $\mathsf{PA}$ cannot prove this uniformly in $n$ for sufficiently complicated families $f_p$, such as, for instance, the family of computable functions.