The Chain Rule of Differentiation in Analysis
📂AnalysisThe Chain Rule of Differentiation in Analysis
Theorem
If f:[a,b]→R is a continuous function and is differentiable at x∈[a,b], and if g:f([a,b])→R is differentiable at f(x)∈f([a,b]), and if we define h:[a,b]→R as follows.
h(t)=g(f(t))(a≤t≤b)
Then, h is differentiable at x and its value is as follows.
h′(x)=g′(f(x))f′(x)
Using the composite function symbol, it can be represented as:
(g∘f)′(x)=g′(f(x))f′(x)
Explanation
This result is commonly referred to as the chain rule.
Here, f′(x) is also called the inner derivative. If we denote y=f(x), z=g(y), and represent it using Leibniz’s notation, it can be expressed as follows.
dxdz=dydzdxdy
The reason why Leibniz’s notation is convenient is because the left side of the above equation looks as though it’s being simplified like the right side. dxdy is not “dx over dy” but the derivative of y, yet treating it like a fraction perfectly fits its meaning.
Proof
First, let’s define the function G as follows.
G(f(t)):={f(x)−f(t)g(f(x))−g(f(t))−g′(f(x))0f(t)=f(x)f(t)=f(x),(t∈[a,b])
Then, for all f(t), the following holds.
f(s)→f(t)limG(f(s))=G(f(t))
Since this is a condition for continuity, G is a continuous function. Furthermore, the following holds.
h(x)−h(t)=g(f(x))−g(f(t))=(f(x)−f(t))(g′(f(x))+G(f(t)))
Then, by the properties of limits, the equation below holds.
h′(x)======= t→xlimx−th(x)−h(t) t→xlimx−t(f(x)−f(t))(g′(f(x))+G(f(t))) t→xlim[g′(f(x))x−tf(x)−f(t)+G(f(t))x−tf(x)−f(t)] t→xlim[g′(f(x))x−tf(x)−f(t)]+t→xlim[G(f(t))x−tf(x)−f(t)] t→xlimg′(f(x))t→xlimx−tf(x)−f(t)+t→xlimG(f(t))t→xlimx−tf(x)−f(t) g′(f(x))f′(x)+0⋅f′(x) g′(f(x))f′(x)
■