[4]: For a (n+1) times differentiable function f:R→R and some ξ∈H{x0,⋯,xn}, the polynomial interpolation pn of f satisfies the following for some t∈R. f(t)−pn(t)=(n+1)!(t−x0)⋯(t−xn)f(n+1)(ξ)
H{a,b,c,⋯} represents the smallest interval that includes a,b,c,⋯.
Explanation
Polynomial interpolation is simplified in Korean as 다항식 보간법.
Condition degp≤n
The condition degp≤n implies there’s not always a guarantee that a p satisfying degp=n exists. For instance, when n=2, if the three points lie in a straight line as shown above, a p2(x)=a2x2+a1x+a0 passing through (n+1)=3 points does not exist, while a lower degree p1(x)=a1x+a0 does. This actually means that p2(x)=a2x2+a1x+a0 was found, but in cases where a2=0.
Lagrange’s Formula and Newton’s Divided Difference Formula are the same
Though formulas [2] and [3] appear different, they are essentially the same due to [1]. The core difference between the two formulas is merely how pn is represented. Functionally, the significant difference is that Newton’s Divided Difference Formula is easier to update when new data is added.
Error with the Actual Function
Theorem [4] shows how much difference there is between the interpolating pn and the actual function f. Generally, the divergence rate of (n+1)! is very fast, so the accuracy of interpolation pn increases with more data. However, this formula does not necessarily discuss convergence. For a simple example, consider when t is very, very far from H{x0,⋯,xn}. Moreover, if the value of f worsens with each differentiation, and that even surpasses the divergence rate of (n+1)!, it can turn weird.
Proof
[1]
Strategy: Show the uniqueness of the coefficients a0,a1,⋯,an of pn that satisfy yi=pn(xi)=a0+a1xi+⋯+anxin for all i=0,⋯,n by representing a system of (n+1) equations in matrix form and simultaneously ensuring the existence and uniqueness of the inverse matrix.
Define y:=y0y1⋮yn,X:=11⋮1x0x1⋮xn⋯⋯⋱⋯x0nx1n⋮xnn,a:=a0a1⋮an and represent the simultaneous equations in matrix form, and then
y0y1⋮yn=11⋮1x0x1⋮xn⋯⋯⋱⋯x0nx1n⋮xnna0a1⋮an
Now, the problem has changed to finding a solution a that satisfies y=Xa.
Since the x0,⋯,xn are different by assumption, detX=0, and X−1 exists. Thus, a=X−1y also exists. Meanwhile, since the inverse of a given matrix is unique, a is also unique.
Strategy: Define new dummy functions and avoid direct calculation by using their differentiability. Since the setting is complex, it is actually easier to understand from the back.
Claim: For E(x):=f(x)−pn(X), the following holds.
E(t)=(n+1)!(t−x0)⋯(t−xn)f(n+1)(ξ)
Part 1.
Firstly, if t is the same as the nodal points x0,⋯,xn, it trivially holds, so assume t is different from these nodal points.
Ψ(x):=G(x):=(x−x0)(x−x1)⋯(x−xn)E(x)−Ψ(t)Ψ(x)E(t)
Upon defining the new functions as above, since f, pn, and Ψ are (n+1) times differentiable, G is also (n+1) times differentiable.
Part 2.
Substituting x=xi into G leads to E(xi)=f(xi)−pn(xi)=0 and Ψ(xi)=0, thus
G(xi)=E(xi)−Ψ(t)Ψ(xi)E(t)=0
Substituting x=t into G leads to Ψ(t)Ψ(t)=1, thus
G(t)=E(t)−E(t)=0
Therefore, G has (n+2) different zero x0,⋯,xn,ts.
Part 3.
For convenience, let’s say xn+1:=t.
Rolle’s theorem: If function f(x) is continuous on [a,b], differentiable on (a,b), and f(a)=f(b), then there exists at least one c in (a,b) that satisfies f′(c)=0.
For all i=0,⋯,n,
G(xi)=0=G(xi+1)
hence, by Rolle’s theorem, there exists xi′∈H{xi,xi+1} satisfying g′(x’i)=0, and similarly for all i=0,⋯,(n−1),
g′(x’i)=0=g′(x’i+1)
hence, by Rolle’s theorem, there exists xi′′∈H{x’i,x’i+1} satisfying G′′(x’’i)=0. By inductively using Rolle’s theorem (n+1) times, it can be guaranteed that there exists ξ∈H{x0,⋯,xn+1} satisfying
G(n+1)(ξ)=0
Meanwhile, since pn is a n degree polynomial,
E(n+1)(x)=f(n+1)(x)
The highest degree term of Ψ is xn+1, so
Ψ(n+1)(x)=(n+1)!
Differentiating both sides of G(x)=E(x)−Ψ(t)Ψ(x)E(t)(n+1) times with respect to x gives
G(n+1)(x)=f(n+1)(x)−Ψ(t)(n+1)!E(t)
Substituting x=ξ into G(n+1) and f(n+1) yields
0=f(n+1)(ξ)−Ψ(t)(n+1)!E(t)
Since E(t)=f(t)−pn(t), it follows that
f(t)−j=0∑nf(xj)lj(t)=(n+1)!(t−x0)⋯(t−xn)f(n+1)(ξ)
■
Atkinson. (1989). An Introduction to Numerical Analysis(2nd Edition): p131. ↩︎