[Math] Where did the idea of Hermite interpolation come from

interpolation, intuition, numerical-methods

I am given the Hermite interpolation formula directly in my textbook without ANY explanation of how it was first derived (obviously it was originally constructed with some sort of intuition).

The formula, for $n+1$ data points $x_0,\dots,x_n$ with values $f(x_0),\dots,f(x_n)$ and derivatives $f^{\prime}(x_0),\dots,f^{\prime}(x_n)$, is
$$H_{2n+1}(x) = \sum_{j=0}^n f(x_j)H_{n,j}(x) + \sum_{j=0}^n f^{\prime}(x_j)\hat H_{n,j}(x)$$

where
$$H_{n,j}(x) = [1 - 2(x - x_j)L^{\prime}_{n,j}(x_j)]\,L_{n,j}^2(x) $$

$$ \hat H_{n,j}(x) = (x-x_j) L_{n,j}^2(x) $$
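For concreteness, here is a small Python sketch of the formula as stated (the function names, the nodes, and the use of $f=\sin$ are my own illustration, not from the textbook); it evaluates $H_{2n+1}$ directly from the two basis families:

```python
import math

def lagrange_basis(nodes, j, x):
    """L_{n,j}(x) = product over i != j of (x - x_i)/(x_j - x_i)."""
    p = 1.0
    for i, xi in enumerate(nodes):
        if i != j:
            p *= (x - xi) / (nodes[j] - xi)
    return p

def lagrange_deriv_at_node(nodes, j):
    """L'_{n,j}(x_j) = sum over i != j of 1/(x_j - x_i)
    (logarithmic derivative of the product, evaluated at x_j)."""
    return sum(1.0 / (nodes[j] - xi) for i, xi in enumerate(nodes) if i != j)

def hermite_interp(nodes, fvals, dfvals, x):
    """Evaluate H_{2n+1}(x) from the two basis families of the formula above."""
    total = 0.0
    for j, xj in enumerate(nodes):
        L = lagrange_basis(nodes, j, x)
        dLj = lagrange_deriv_at_node(nodes, j)
        H = (1.0 - 2.0 * (x - xj) * dLj) * L * L   # value-carrying basis H_{n,j}
        Hhat = (x - xj) * L * L                    # derivative-carrying basis
        total += fvals[j] * H + dfvals[j] * Hhat
    return total
```

The closed form $L^{\prime}_{n,j}(x_j)=\sum_{i\neq j}1/(x_j-x_i)$ avoids symbolic differentiation; evaluating at the nodes should reproduce both the values and the derivatives of $f$.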
I DO understand the proof and why the polynomial agrees with the data and their derivatives.

I DO understand the intuition behind Lagrange polynomials.

So I am looking for the intuition behind the formula (how it was constructed), especially the construction of $H$ and $\hat H$, so that instead of memorizing it I can learn it!

Best Answer

Both kinds of interpolation formulas rely on the superposition principle (the sum of the effects of individual causes is the effect of the sum of the causes), and achieve a decomposition in which every point brings its own contribution. In effect, you form a basis of polynomials and take linear combinations thereof.

In the case of Lagrange, consider the special data set $f(x_i)=\delta_{ij}$: all ordinates but the $j^{th}$ are zero, and that one is one. This is easily achieved by forming the product of the factors $(x-x_i)$ for $i\neq j$ and normalizing it to one at $x_j$. From these $n+1$ basis polynomials, you can construct the interpolant for any ordinates.
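This delta property is easy to verify numerically; a minimal sketch with made-up nodes:

```python
nodes = [-1.0, 0.0, 2.0]

def L(j, x):
    # Cardinal polynomial for node j: product of (x - x_i)/(x_j - x_i) over i != j.
    p = 1.0
    for i, xi in enumerate(nodes):
        if i != j:
            p *= (x - xi) / (nodes[j] - xi)
    return p

# Delta property: L_j is 1 at its own node and 0 at every other node,
# so sum_j f(x_j) L_j(x) reproduces arbitrary ordinates exactly.
delta = [[L(j, xi) for xi in nodes] for j in range(len(nodes))]
```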

The generalization to Hermite follows the same idea. You form two families of polynomials: the first family carries the ordinates ($f(x_i)=\delta_{ij}$, $f'(x_i)=0$), and the second one carries the derivatives ($f(x_i)=0$, $f'(x_i)=\delta_{ij}$).

The rest is technical trickery, based on the idea that squaring a Lagrange polynomial turns its simple roots into double roots, so that the derivative also vanishes at those roots; this prepares candidates for both families.


More precisely, $L^2_j$ achieves $L_j^2(x_i)=\delta_{ij}$ and $(L_j^2)'(x_i)=0$ for $i\neq j$. The only defect is at $x_j$ itself, where the derivative $(L_j^2)'(x_j)=2L_j'(x_j)$ need not vanish.

Let us introduce the polynomial $Z_j=(x-x_j)L_j$, which vanishes at every node and satisfies $Z_j'(x_j)=1$.

To obtain the first family, we must kill that stray derivative at $x_j$ without disturbing anything else. The product $Z_jL_j=(x-x_j)L_j^2$ is the perfect tool: it vanishes at every node, and its derivative $(Z_jL_j)'=Z_j'L_j+Z_jL_j'$ equals $\delta_{ij}$ at the node $x_i$. Subtracting $(L_j^2)'(x_j)=2L_j'(x_j)$ times this product therefore cancels the derivative at $x_j$ and changes nothing at the other nodes: $$H_j=L_j^2-2L_j'(x_j)\,Z_jL_j=\left[1-2(x-x_j)L_j'(x_j)\right]L_j^2.$$

To obtain the second family, we need zero values at all nodes and derivative $\delta_{ij}$. The candidate $Z_j$ alone behaves correctly at $x_j$, but its derivative $Z_j'(x_i)=(x_i-x_j)L_j'(x_i)$ need not vanish at the other nodes; multiplying by $L_j$ repairs this, since $(Z_jL_j)'(x_i)=Z_j'(x_i)L_j(x_i)+Z_j(x_i)L_j'(x_i)=\delta_{ij}$, as desired. Hence: $$\hat H_j=Z_jL_j=(x-x_j)L_j^2.$$
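Putting both families to the test: a short Python sketch (invented nodes, derivatives approximated by central differences) that checks the four cardinal conditions $H_j(x_i)=\delta_{ij}$, $H_j'(x_i)=0$, $\hat H_j(x_i)=0$, $\hat H_j'(x_i)=\delta_{ij}$:

```python
nodes = [0.0, 1.0, 2.5]

def L(j, x):
    # Lagrange cardinal polynomial for node j
    p = 1.0
    for i, xi in enumerate(nodes):
        if i != j:
            p *= (x - xi) / (nodes[j] - xi)
    return p

def dLj(j):
    # L_j'(x_j), from the logarithmic derivative of the product
    return sum(1.0 / (nodes[j] - xi) for i, xi in enumerate(nodes) if i != j)

def H(j, x):
    # first family: carries the ordinate at x_j
    return (1.0 - 2.0 * (x - nodes[j]) * dLj(j)) * L(j, x) ** 2

def Hhat(j, x):
    # second family: carries the derivative at x_j
    return (x - nodes[j]) * L(j, x) ** 2

def num_deriv(g, x, h=1e-6):
    # central difference, accurate enough for low-degree polynomials
    return (g(x + h) - g(x - h)) / (2.0 * h)
```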
