[Math] Using Intermediate Value Theorem to prove # of polynomial roots

calculusreal-analysis

I've heard there's a proof out there of this, basically that (I think) you can use the intermediate value theorem to prove that an Nth-degree polynomial has no more than N roots.

I'm not in school anymore, just an interested engineer. Does anyone know where I can find this proof or any really strong hints on how to do it myself? I've been out of it for a while and I'm rusty.

Best Answer

An algebraic argument: The fact that a polynomial of degree $n$, where $n \ge 1$, has at most $n$ roots can be proved without using machinery from the calculus. When $n=1$, the result is clear, there is in fact precisely one root. Suppose now that we know that the result is true for any polynomial of degree $k$. We show that the result is true for any polynomial of degree $k+1$.

Let $P(x)$ have degree $k+1$. If $P(x)$ has no roots, we are finished. If $P(x)$ has a root $\alpha$, then the polynomial $x-\alpha$ divides $P(x)$. So $P(x)$ is identically equal to $(x-\alpha)Q(x)$ for some quotient polynomial $Q(x)$. Now $Q(x)$ has degree $k$, so by assumption has no more than $k$ roots. It follows that $P(x)$ has at most $k+1$ roots, namely the roots of $Q(x)$, together with $\alpha$.

The above proof works for polynomials with coefficients in any field.

A calculus argument: For polynomials with real coefficients, we can prove the result by using "calculus" tools. These are not really appropriate, since the calculus tools are far more sophisticated than the algebraic tools used in the above proof, and we end up with a proof that only applies to polynomials with real coefficients. But as an exercise, let's do it.

The tool that gives the quickest argument is the Mean Value Theorem, actually a special case of it usually called Rolle's Theorem. This says that if $f(x)$ is a function that is differentiable in the interval $[a,b]$, and $f(a)=f(b)=0$, there is a $c$, with $a<c<b$, such that $f'(c)=0$. Informally, between any two roots of $f(x)$, there is a root of the derivative $f'(x)$.

Back to polynomials. Suppose that we know that any polynomial of degree $k$, with real coefficients, has at most $k$ real roots. We want to show that a polynomial $P(x)$ of degree $k+1$ has at most $k+1$ real roots.

Suppose to the contrary that $P(x)$ has at least $k+2$ roots. Pick any $k+2$ of them, and list them in increasing order $\alpha_1$, $\alpha_2$, and so on up to $\alpha_{k+2}$. By Rolle's Theorem, between any two roots of $P(x)$, there is a root of $P'(x)$. That implies that $P'(x)$ has at least $k+1$ roots. But $P'(x)$ is a polynomial of degree $k$, so by assumption can have no more than $k$ roots. This completes the proof.

We can avoid Rolle's Theorem, and use only the Intermediate Value Theorem. We want to show that between any two roots of $P(x)$, there is a root of $P'(x)$. The easiest calculation uses the fact that if $\alpha$ is a root of $P(x)$, then $x-\alpha$ divides $P(x)$. It is slightly unpleasant, since we have to take special care with roots of multiplicity greater than $1$.

The Fundamental Theorem of Algebra: I would guess that what you were told is a distorted version of an important comment about proofs of what is usually called the Fundamental Theorem of Algebra. Roughly speaking, this result says that a polynomial of degree $n \ge 1$ with complex coefficients has, if you take multiplicities into account, exactly $n$ roots in the field $\mathbb{C}$ of complex numbers.

There is a large number of different proofs of this result. The classical ones depend on fairly subtle arguments about functions of two variables.
The end of the Wikipedia article has a list of useful references.

The following important result is a consequence of the Fundamental Theorem of Algebra. Conversely, the Fundamental Theorem of Algebra can derived fairly simply from it. The result mentions only real numbers.

Theorem: Let $P(x)$ be a polynomial of degree $\ge 1$, with real coefficients. Then $P(x)$ can be expressed as a product $A_1(x)A_2(x)\cdots A_k(x)$, where each $A_i(x)$ has real coefficients, and is of degree $1$ or $2$.

There are proofs of this that use a fair bit of algebra (in the modern sense of algebra) and very little function theory. The only bit of "calculus" needed is to to show that any odd degree polynomial has a real root, and any positive number has a square root, both of which are consequences of the Intermediate Value Theorem. Proofs that use so "little" machinery from analysis tend to boast about it by saying that they "only" use the Intermediate Value Theorem.

Related Question