The Chain Rule
Subtopic: Differentiation
The chain rule differentiates compositions: if
Introduction
You're driving. Your speed depends on the gear, and the gear depends on time. How does speed depend on time? Combine the effects: (change in speed per gear)
The chain rule makes this precise: the derivative of a composition is the product of the derivatives.
The Formula
If
In words: differentiate the outer function (leaving the inner alone), then multiply by the derivative of the inner.
Leibniz notation makes it memorable: the
Why It Works
Small changes propagate through compositions. If
The rates multiply.
Worked Example 1
Differentiate
Let
Multiple Compositions
For
Peel off layers one at a time, multiplying derivatives.
Common Patterns
These follow directly from the chain rule:
d(ƒ(x)n)/d(x)=n*ƒ(x)(n-1)ƒ′*(x) d(eƒ(x))/d(x)=eƒ(x)ƒ′*(x) d(ln(ƒ(x)))/d(x)=(ƒ′*(x))/ƒ(x) d(sin(ƒ(x)))/d(x)=cos(ƒ(x))ƒ′*(x)
Implicit Differentiation
The chain rule powers implicit differentiation. If
The
Related Rates
In related rates problems, quantities change with time. The chain rule connects their rates:
If you know how volume depends on radius and how radius changes with time, you can find how volume changes with time.
Applications
In neural networks, back propagation applies the chain rule to compute gradients through many layers.
In physics, the chain rule relates velocities and accelerations in different coordinate systems.
In economics, it computes how changes in input prices affect output costs through intermediate goods.
Summary
The chain rule differentiates compositions: