Sequence of rationals converging to $\sqrt{2}$ via Newton’s method

calculusreal-analysis

That $a_{n+1}=\frac{a_n}{2} + \frac{1}{a_n}$, with $a_1=1$, is a sequence of rationals converging to $\sqrt{2}$ is well known.

I recently saw that the construction of this sequence can be done using Newton's method for finding solutions of $f(x)=0$ for $f(x)=x^2-2$. But I did not understand the steps rigorously.

(1) Let $x_0$ be first approximate root of $f(x)=x^2-2$.

(2) Linear approximation at $x_0$: the equation of (tangent) line (at $x_0$) is $y=f(x_0) + (x-x_0)f'(x_0)$.

(3) This line intersects $x$-axis and take the $x$-intersept to be next approximate root, so we solve for $y=0$. We get $x=x_0-\frac{f(x_0)}{f'(x_0)}$; declare this to be next approximation, so call this $x_1$.

(4) Repeat the process (3) with $x_0$ replaced by $x_1$, and declare $x_2=x_1 -\frac{f(x_1)}{f'(x_1)}$.

Now, if we take $x_0=2$, and do above iteration, we get the sequence of rationals stated in the beginning.

Q.1 Can we take other approximate root to be any other integer or rational number? What is the importance of $2$ taking as first approximate root?

Q.2 In (3), why it is better to declare the $x$-intersept of line in (2) to be next approximate root?


I had learned Newton's method around 12 years back (And not relooked after it), so I didn't get the clear picture of the above process.

Best Answer

Q1: 2 has no significance whatsoever, except that it recreates the sequence from the top of your post. In this case, as long as you pick a positive number, you'll converge to $\sqrt2$. They picked it because it's nice. In general, though, you'd want your first approximate to be a decent guess. And you can't start without picking something.

Q2: Because that's basically the only thing you have to work with. What else should the next approximate root be? Also, as pointed out in the comments below, the whole idea behind Newton's method is to take what we know about the function at a single point (value and derivative), and guess where the function crosses the $x$ axis as well as we can. Assuming that the function looks somewhat like its tangent at that point is a good approximation as long as we're already decently close to the $x$-axis intercept.

Related Question