[Math] Why does the approximation for exponents $(a+b)^c \approx a^{c-bc} (a+1)^{cb}$ work

approximation, exponential function, exponentiation

I was working with some code involving exponents in an environment where exponents can only be calculated if the base of the exponent is an integer. I needed a good fast way to approximate this without causing overflow issues. I accidentally stumbled upon an incredible approximation method and I'm not sure why it works.

Suppose you have a power of the form $x^c$ where $x$ is not an integer, and you want to approximate its value using only powers whose bases are integers.

Break $x$ into two parts: an integer part and a fractional remainder. For example, $3.7\to 3 + 0.7$.

Therefore, $x\to(a+b)$ where $a$ is the integer part and $b$ (with $0 \le b < 1$) is the fractional part.

The approximation formula is:

$$(a+b)^c \approx a^{c-bc} (a+1)^{cb}$$

Or in my original form:

$$(a+b)^c \approx ((a+b)-b)^{c(1-b)} ((a+b)+(1-b))^{cb}$$

It's remarkably close to the right solution seemingly every time. Granted I've only been able to check about 100 cases, but I'm fascinated.

For example:

$$37.5^{28} \approx 37^{14}\cdot38^{14}$$

And sure enough, dividing the left side by the right gives a ratio of about 1.002, which is very close to 1.
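The recipe above can be checked numerically. What follows is a minimal Python sketch, not the original code from the question: the helper name `approx_pow` is made up for illustration, and it assumes $x \ge 1$ so that the integer part $a$ is at least 1.

```python
import math

def approx_pow(x, c):
    """Approximate x**c using only the integer bases a and a + 1.

    Hypothetical helper for illustration; assumes x >= 1 so that a >= 1.
    """
    a = math.floor(x)      # integer part of x
    b = x - a              # fractional part, 0 <= b < 1
    return a ** (c - b * c) * (a + 1) ** (c * b)

exact = 37.5 ** 28
approx = approx_pow(37.5, 28)
print(exact / approx)      # ratio of the exact value to the approximation
```

For $x = 37.5$ and $c = 28$ the helper reduces to $37^{14}\cdot 38^{14}$, so the printed ratio is the one quoted in the example.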

Edit: Thanks to RayDansh for pointing out in the comments that this is accurate iff $a+b$ is large. In fact, the larger $a$ gets, the more accurate this approximation seems to get.

Can anyone shed some light as to why this approximation method I've stumbled upon works?

Best Answer

Write your approximation as $$a^{c-bc} \cdot (a+1)^{cb}=a^c\left(1+\frac 1a\right)^{bc}.$$ Since the exact value is $(a+b)^c = a^c\left(1+\frac ba\right)^c$, your approximation amounts to $$\left(1+\frac 1a\right)^{b}\approx 1+\frac ba,$$ which is the first two terms of the binomial expansion. It will be reasonably accurate when $\frac ba \ll 1$. The next term is $\frac {b(b-1)}{2a^2}$.
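To get a feel for the size of the neglected term, one can compare the logarithm of the exact-to-approximate ratio, $c\left[\ln\left(1+\frac ba\right) - b\ln\left(1+\frac 1a\right)\right]$, with the leading correction $\frac{c\,b(1-b)}{2a^2}$ implied by the binomial term above. The Python sketch below does that; the function names are illustrative and do not come from the answer.

```python
import math

def log_ratio(a, b, c):
    """log of (a+b)**c / (a**(c-b*c) * (a+1)**(c*b)), via log1p for accuracy."""
    return c * (math.log1p(b / a) - b * math.log1p(1 / a))

def leading_correction(a, b, c):
    """Leading estimate of log_ratio: c * b * (1 - b) / (2 * a**2)."""
    return c * b * (1 - b) / (2 * a ** 2)

# The two columns agree better and better as a grows.
for a in (3, 37, 1000):
    print(a, log_ratio(a, 0.5, 28), leading_correction(a, 0.5, 28))
```

Both quantities shrink like $1/a^2$, which matches the observation that the approximation improves as $a$ gets larger.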

Related Solutions

[Math] Approximating $\pi$ using the sine function

The Taylor Series for $\sin(x)$ for $x$ near $\pi$ says $$ \sin(x)=\sin(\pi-x)=(\pi-x)-\frac{(\pi-x)^3}6+O\!\left((\pi-x)^5\right) $$ Thus $$ x+\sin(x)-\pi=\frac{(x-\pi)^3}6+O\!\left((\pi-x)^5\right) $$ That is, $$ x_{n+1}-\pi\sim\frac{(x_n-\pi)^3}6 $$ which means the number of correct digits more than triples with each iteration ($d_n=3d_{n-1}+0.778$).
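A short Python sketch of this convergence, assuming the iteration in question is $x_{n+1} = x_n + \sin(x_n)$ (which is what the identity above implies); the function name `refine_pi` and the starting value are chosen only for illustration.

```python
import math

def refine_pi(x, steps):
    """Apply x -> x + sin(x) `steps` times and return every iterate."""
    iterates = [x]
    for _ in range(steps):
        x = x + math.sin(x)
        iterates.append(x)
    return iterates

# Print each iterate with its distance from pi; the error is roughly
# cubed (and divided by 6) at every step until float precision runs out.
for x in refine_pi(3.0, 3):
    print(x, abs(x - math.pi))
```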

How much faster and how good is it to use this approximation for $\binom{n}{k}$ when $n \gg k$

Big-O notation is a bit tricky for calculations like this, because we have to know details of things such as how the computer multiplies Fraction objects or performs the ** operation in order to say how much more time those operations take when you pass very large numbers to them.

For the sake of comparing the functions, however, you might assume for simplicity (if you don't have better information) that operations like ** and math.sqrt take constant time. Under that assumption, your approx_comb function is $\mathcal O(1)$ and your comb function is $\mathcal O(k)$. If comb only takes about $10$ times as long as approx_comb when $k=1000,$ rather than $1000$ times as long, you might conclude that the constant factor in the running time of approx_comb is $100$ times as large as for comb.

But given all the uncertain details inside the two functions, I would say the best way to estimate their big-O performance is to run some examples with different numbers and see how the times scale. For example, does comb really scale linearly with $k$, and does it really not matter whether $n$ is $10000$ or $1000000000$?
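One possible way to run such an experiment is sketched below with the standard `timeit` module. The `comb` function is the exact Fraction-based product discussed here; the helper `best_time` and the particular values of $n$ and $k$ are assumptions made for illustration, and the timings will differ from machine to machine.

```python
import functools
import timeit
from fractions import Fraction
from operator import mul

def comb(n, k):
    """Exact binomial coefficient as a product of k fractions."""
    return int(functools.reduce(mul, (Fraction(n - i, i + 1) for i in range(k)), 1))

def best_time(fn, *args, repeat=3):
    """Smallest wall-clock time over `repeat` single calls, in seconds."""
    return min(timeit.repeat(lambda: fn(*args), number=1, repeat=repeat))

# Doubling k each time: a linear-time function would roughly double its time.
for k in (100, 200, 400, 800):
    print(k, best_time(comb, 10**6, k))
```

Repeating the loop with a different $n$ shows whether the running time depends on $n$ as well.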

Update: The simple assumption is definitely incorrect. Perhaps it is possible to control precision in a way that avoids long running times, but when $n = 1000000,$ the running times of the approximations as written are very sensitive to $k$ when $k > 100$. I did not look at enough data points to estimate the asymptotic time, but it seems clearly worse than $\mathcal O(k)$. For $k = 1000$ the running time is also somewhat sensitive to $n.$

In favor of the approximations, the comb function is also clearly worse than $\mathcal O(k)$.

Regarding accuracy, provided the individual operations don't suffer some kind of overflow error, approx_comb will always give a result larger than the true value for $k > 1,$ since then

$$ n^k > n(n-1)\cdots(n - k + 1). $$

You might want to take advantage of the fact that for $k > 1,$

$$ n(n-k+1) < \left(n - \frac{k - 1}2\right)^2 < n^2 $$

and similarly

$$ (n-j)(n-k+j+1) < \left(n - \frac{k - 1}2\right)^2 < n^2 $$

for $0 < j < k - j - 1.$ In other words, you can take the terms of $n(n-1)\cdots(n - k + 1)$ in pairs from both ends of the expression, working from the outside toward the middle, and the product of each pair is less than $\left(n - \frac{k - 1}2\right)^2$. If $k$ is even this accounts for all the terms $n(n-1)\cdots(n - k + 1)$, but if $k$ is odd you have a leftover term exactly equal to $n - \frac{k - 1}2$. In either case, you have that

$$ n(n-1)\cdots(n - k + 1) < \left(n - \frac{k - 1}2\right)^k < n^k, $$

so you can improve your approximation by substituting $n - \frac{k - 1}2$ for $n$ in your formula.
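The chain of inequalities can be checked with exact arithmetic. This is a small Python sketch; the helper names `falling_factorial` and `midpoint_power` are invented for illustration.

```python
from fractions import Fraction
from math import prod

def falling_factorial(n, k):
    """n * (n - 1) * ... * (n - k + 1), computed exactly."""
    return prod(n - i for i in range(k))

def midpoint_power(n, k):
    """(n - (k - 1)/2) ** k as an exact Fraction."""
    return (n - Fraction(k - 1, 2)) ** k

n, k = 1_000_000, 1000
exact = falling_factorial(n, k)
print(float(midpoint_power(n, k) / exact))   # slightly above 1
print(float(Fraction(n ** k, exact)))        # noticeably above 1
```

Using `Fraction` keeps the half-integer base exact when $k$ is even, so the comparison is not affected by floating-point rounding.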

By the way, int rounds downward rather than rounding to the nearest integer. Normally I would say use round or add $0.5$ to the result before calling int, but in this case the approximation is always greater than the true answer, which is an integer, so rounding down is appropriate.

You might also want to look at https://en.wikipedia.org/wiki/Binomial_coefficient#n_much_larger_than_k, which gives the approximation

$$ \binom nk \approx \exp\left( \left(n + \tfrac12\right) \ln\left(\frac{n + \tfrac12}{n - k + \tfrac12}\right) + k \ln\left(\frac{n - k + \tfrac12}k\right) - \frac{\ln(2\pi k)}2 \right). $$

In this case I'm not sure that rounding down is correct, so I would round to nearest.

For reference, here are some Python functions I tested:

from operator import mul
from fractions import Fraction
import functools
import math
from decimal import Decimal
import timeit

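def comb(n,k):
    # Exact value: product of the fractions (n-i)/(i+1) for i = 0..k-1.
    return int(functools.reduce(mul, (Fraction(n-i, i+1) for i in range(k)), 1))

def approx_comb_a(n,k):
    # Simple approximation: n^k in place of n(n-1)...(n-k+1),
    # i.e. (n*e/k)^k / sqrt(2*pi*k).
    n = Decimal(n)
    k = Decimal(k)
    base = n * Decimal(math.exp(1)) / k
    term2 = 2 * Decimal(math.pi) * k
    return int(base**k / term2.sqrt())

def approx_comb_b(n,k):
    # Same as approx_comb_a, but with n - (k - 1)/2 substituted for n.
    n = Decimal(n)
    k = Decimal(k)
    base = (n - Decimal(0.5) * (k - 1)) * Decimal(math.exp(1)) / k
    term2 = 2 * Decimal(math.pi) * k
    return int(base**k / term2.sqrt())

def approx_comb_c(n,k):
    # The approximation from the Wikipedia article, written as a product
    # of powers instead of exp(...) of a sum of logarithms.
    n1 = Decimal(n + 0.5)
    k = Decimal(k)
    nk = n1 - k
    base1 = n1 / nk
    base2 = nk / k
    term3 = 2 * Decimal(math.pi) * k
    return int(base1**n1 * base2**k / term3.sqrt())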

And here are some results:

>>> approx_comb_a(1000000,1000)/comb(1000000,1000)
1.6483088671229085
>>> approx_comb_b(1000000,1000)/comb(1000000,1000)
1.0001250494328289
>>> approx_comb_c(1000000,1000)/comb(1000000,1000)
1.0000833367611621

As you can see, all approximations are within a factor of $2$ of the correct result, but the simple approximation using $n^k$ has a $64.8\%$ error whereas the approximation using $(n - (k - 1)/2)^k$ has only about a $0.0125\%$ error, and the error for the third approximation is about $\frac23$ of that. Running times were not much different among the three approximations.
