Closed-form Ackermann variant function proof by induction

computer scienceinductionrecursion

Let $A$ be a variant of the Ackermann function.

$A(m,n)$=\begin{cases}2n&\text{if }m=0\\1&\text{if }m\ge1\text{ and }n=0\\2&\text{if }m\ge1\text{ and }n=1\\A\big(m-1,A(m,n-1)\big)&\text{if }m\ge1\text{ and }n\ge2.\end{cases}

A good guess for a closed form solution to this is as follows.

$A(m,n) = 2 \uparrow^{m} n$

The up-arrow is a notation created by Knuth where a single arrow represents exponentiation, double arrow represents tetration, triple arrow represents pentation and so on.

I am trying to prove this by induction but I am stuck because there are $2$ variables to possibly induct on. I was thinking of doing double induction over $m$ and $n$ where I prove the following:

$A(m,n) \implies A(m+1,n)$

$A(m,n) \implies A(m,n+1)$

If I assume the induction hypothesis holds for some $m$ and $n$, then I find it impossible to expand expressions like $A(m, A(m+1,n-1))$ from the first implication or $A(m-1, A(m,n))$ from the second implication.

I think my assumption that my induction hypothesis holds for some $m$ and $n$ is flawed and needs to be more nuanced than that. Any advice?

Best Answer

First, I think your definition is off a little. Your second case should be $1$, because $2^0 = 1$, etc.

Once that is fixed, your hypothesis is true. I was able to prove it in Agda.

Generally speaking, to prove something like this, it's natural for the structure of the induction to mirror the structure of the recursive definition (since in some ways, they are one and the same). So, for your case, you want an induction method based on cases:

$∀n. P(0,n)$
$∀m. P(1+m,0)$
$∀m. P(1+m,1)$
$∀m. (∀ n. P(m,n)) → ∀n. P(1+m,1+n) → P(1+m,2+n)$

This doesn't exactly match the particular structure of your definition in case 4, but it matches the justification for it. One recursive call is justified by decreasing on $n$ with the same $m$, and the other is justified by decreasing on $m$ with an arbitrarily large $n$.

And this isn't too difficult to derive from the normal principle of induction. To prove $∀m\ n. P(m,n)$:

First do induction on $m$
- Case 1 above covers the base case for this, where $m = 0$
- For the inductive step, we are given $∀n. P(m,n)$ and need to prove $∀n. P(1+m,n)$, do this by induction on $n$
  - When $n = 0$, use case 2 above
  - When $n = 1$, use case 3 above
  - Finally, we are given $P(1+m,1+n)$ and need to prove $P(1+m,2+n)$. This is case 4 above, since we already have that $∀n. P(m,n)$

So, by using induction twice, it's possible to build the sort of induction principle that directly corresponds to the recursive structure of your modified Ackermann function. Then it should be relatively simple to prove your theorem using this principle.

Related Solutions

[Math] Fibonacci Loop Invariants

First of all, yes, the loop invariant must be true before we start the loop, after each iteration, and after the loop completes. See https://en.wikipedia.org/wiki/Loop_invariant, http://www.cs.miami.edu/home/burt/learning/Math120.1/Notes/LoopInvar.html, any of several other pages from university course notes that you'll find if you search for "loop invariant", or (presumably) your textbook.

In the most general sense, finding a loop invariant is about constructing a mathematical proof, and there is not a simple set of instructions that will give a good invariant whenever one exists. But if you have an inductive definition of the result of the algorithm, as you do here, your invariant might also look like an inductive definition.

In this case you have that for all $i>1$, $F(i) = F(i-1) + F(i-2)$. You also know the values of $F(0)$ and $F(1)$. This suggests an algorithm in which before the first iteration of a loop, you store the values $F(0)$ and $F(1)$ in two variables, $A$ and $B$, respectively, and you maintain the invariant that $A=F(k)$ and $B=F(k+1)$ where $k$ is the number of iterations of the loop that have completed. You can prove that the invariant is maintained by showing that at the end of each loop, the value in $A$ is the value that previously was in $B$, and the value in $B$ is the previous value of $A+B$. That is, if at the end of $k$ iterations you had $F(k)$ stored in $A$ and $F(k+1)$ stored in $B$, at the end of the next iteration (that is, after $k+1$ iterations) you will have $F(k+1)$ stored in $A$ and $F(k+2) = F(k+1) + F(k)$ stored in $B$.

My $A$ and $B$ look suspiciously like your r and l, respectively. But note that in the algorithm I described, $B$ will store $F(n)$ after only $n-1$ iterations, while your algorithm is written to do $n$ iterations, and to return the larger number at the end of the $n$th iteration. Your algorithm resolves this by doing an extra iteration at the beginning, that is, it starts by setting l to $F(0)$ and r to $F(-1)$, respectively, where $F(-1)=1$ in order that $F(1) = F(0) + F(-1)$. So after $k$ iterations, where $k\geq 0$, your algorithm always has $F(k-1)$ stored in r and $F(k)$ stored in l.

Alternatively, rather than defining $F(-1) = 1$, you can write a special case into your loop invariant for the case when you have completed $0$ iterations. That is what your proposed invariant does, which is fine.

One of the potential difficulties of writing loop invariants is notational. The invariant as I stated it depends on knowing how many iterations of the loop have been completed. You have a variable i whose value during each loop is clearly defined, but whose values at the beginning and end of each loop are not clear to me. At the end of the first loop, does i store the value $0$ or the value $1$? You should have a clearly defined convention for some kind of variable associated with a loop control such as for i in [0,n), with clearly-defined values of this variable before the first iteration of the loop and at the end of each iteration of the loop. Your notes or textbook should have established a convention for how you construct such a variable for such a loop. Without a clearly established convention, I'm not sure whether l at the end of the last iteration is $f_n$ (which it is if $i$ in your invariant has the value $k-1$ at the end of $k$ iterations) or $f_{n+1}$ (which is what you get if $i$ denotes the number of iterations that have been completed).

EDIT: "How I would have done it," not necessarily how you should do it:

If I were writing this for a formal methods course (or possibly even in an ordinary program), I probably would have started by laying out the base cases in code like this (using = for an equality test, not assignment):

if n=0
  return 0
else if n=1
  return 1
else if n>1
  ... do loop ...
  return result

Note that the return value is undefined if $n<0$. For $n>1$, the loop would do $n-1$ iterations of the $A,B$ algorithm I described above, and the result would be the last value of $B$.

I might then notice that the outcomes would be the same if I wrote

if n=0
  return 0
else if n>0
  ... do loop ...
  return result

That is, one of the "if" conditions was superfluous. But I cannot get rid of the special handling of the case $n=0$ with the loop as written, because the loop depends on doing $n-1$ iterations to produce $F(0)$, and you can't do $-1$ iterations of a loop.

[Math] Is the proof by induction on binary trees correct

Here's a simpler inductive proof:

Induction start: If the tree consists of only one node, that node is clearly a leaf, and thus $S=0$, $L=1$ and thus $S=L-1$.

Induction hypothesis: The claim is true for trees of less than $n$ nodes.

Inductive step: Let's assume we've got a tree of $n$ nodes, $n>1$. Then its root node obviously isn't a leaf, but a stem, and thus there are two sub-trees attached to it, both obviously with less than $n$ nodes. Therefore for these trees, we have $S_k = L_k-1$. Thus we have $$S = 1 + S_l + S_r = 1 + (L_1-1) + (L_2-1) = L_1 + L_2 - 1 = L-1.$$

Best Answer

Related Solutions

[Math] Fibonacci Loop Invariants

[Math] Is the proof by induction on binary trees correct

Related Question