Chain Rule Article Index for
Chain
Website Links For
Chain
 

Information About

Chain Rule




In Calculus , the chain rule is a Formula for the Derivative of the composition of two Functions .

In intuitive terms, if a Variable , ''y'', depends on a second variable, ''u'', which in turn depends on a third variable, ''x''; then, the rate of Change of ''y'' with respect to ''x'' can be Computed as the Product of the rate of change of ''y'' with respect to ''u'' Multiplied By the rate of change of ''u'' with respect to ''x''.

Suppose, for example, that one is climbing a mountain at a rate of 0.5 Kilometres Per Hour . The Temperature is lower at higher elevations; suppose the rate by which it decreases is 6 °F per kilometre. If one multiplies 6 °F per kilometre by 0.5 kilometre per hour, one obtains 3 °F per hour. This calculation is a typical chain rule application.

In Algebra ic terms, the chain rule (of one variable) states that if the function ''f'' is Differentiable at ''g''(''x'') and the function ''g'' is differentiable at ''x''. The chain rule may be stated in any of several equivalent forms:

: (f \circ g)'(x) = f'(g(x)) g'(x),\,

or in the Leibniz Notation

: rac {df}{dx} = rac {df} {dg} \cdot rac {dg}{dx},

or

: rac {df} {dx} = rac {d} {dx} f(g(x)) = f'(g(x)) g'(x).

In Integration , the counterpart to the chain rule is the Substitution Rule .


THE GENERAL POWER RULE

The General Power Rule (GPR) is derivable, via the chain rule.


Example I

Consider f(x) = (x^2 + 1)^3. f(x) is comparable to h(g(x)) where g(x) = x^2 + 1 and h(x) = x^3; thus,


Example II

In order to differentiate the Trigonometric function
:f(x) = \sin(x^2),
one can write f(x) = h(g(x)) with h(x) = \sin x and g(x) = x^2. The chain rule then yields
:f'(x) = 2x \cos(x^2)
since h'(g(x)) = \cos (x^2) and g'(x) = 2x.


CHAIN RULE FOR SEVERAL VARIABLES

The chain rule works for functions of several variables as well. For example, if we have a function f(u(x, y), v(x, y)) where
:u(x, y) = 3x + y^2 and v(x, y) = \sin(xy), and if f=u+v,
then
:{\partial f \over \partial x}={\partial f \over \partial u}{\partial u \over \partial x}+{\partial f \over \partial v}{\partial v \over \partial x}=3 + \cos(xy)y.


PROOF OF THE CHAIN RULE

Let ''f'' and ''g'' be functions and let ''x'' be a number such that ''f'' is differentiable at ''g(x)'' and ''g'' is differentiable at ''x''. Then by the definition of differentiability,

: g(x+\delta)-g(x)= \delta g'(x) + \epsilon(\delta) \, where rac{\epsilon(\delta)}{\delta} o 0 \, as \delta o 0.

Similarly,
: f(g(x)+\alpha) - f(g(x)) = \alpha f'(g(x)) + \eta(\alpha) \, where rac{\eta(\alpha)}{\alpha} o 0 \, as \alpha o 0. \,

Now

where \alpha_\delta = \delta g'(x) + \epsilon(\delta) \,.
Observe that as \delta o 0, rac{\alpha_\delta}{\delta} o g'(x) and rac{\eta(\alpha_\delta)}{\delta} o 0. Hence

: rac{f(g(x+\delta))-f(g(x))}{\delta} o g'(x)f'(g(x))\mbox{ as } \delta o 0.


THE FUNDAMENTAL CHAIN RULE

The chain rule is a fundamental property of all definitions of derivative and is therefore valid in much more general contexts. For instance, if ''E'', ''F'' and ''G'' are ) of the composition ''g'' o ''f'' at the point ''x'' is given by

:\mbox{D}_x\left(g \circ f ight) = \mbox{D}_{f\left(x ight)}\left(g ight) \circ \mbox{D}_x\left(f ight).

Note that the derivatives here are Linear Maps and not numbers. If the linear maps are represented as Matrices (namely Jacobian s), the composition on the right hand side turns into a matrix multiplication.

A particularly clear formulation of the chain rule can be achieved in the most general setting: let ''M'', ''N'' and ''P'' be ''C''''k'' Manifold s (or even Banach-manifolds) and let

f


be differentiable maps. The derivative of ''f'', denoted by d''f'', is then a map from the Tangent Bundle of ''M'' to the tangent bundle of ''N'', and we may write

:\mbox{d}\left(g \circ f ight) = \mbox{d}g \circ \mbox{d}f.

In this way, the formation of derivatives and tangent bundles is seen as a Functor on the Category of ''C'' manifolds with ''C'' maps as morphisms.


TENSORS AND THE CHAIN RULE

See Tensor Field for an advanced explanation of the fundamental role the chain rule plays in the geometric nature of Tensor s.


HIGHER DERIVATIVES

Faà Di Bruno's Formula generalizes the chain rule to higher derivatives. The first few derivatives are
: rac{df}{dx} = rac{dg}{dx} rac{df}{dg}
:
rac{d^2 f}{d x^2}
= \left( rac{dg}{dx} ight)^2 rac{d^2 f}{d g^2}
+ rac{d^2 g}{dx^2} rac{df}{dg}

:
rac{d^3 f}{d x^3}
= \left( rac{dg}{dx} ight)^3 rac{d^3 f}{d g^3}
+ 3 rac{dg}{dx} rac{d^2 g}{d x^2} rac{d^2 f}{d g^2}
+ rac{d^3 g}{d x^3} rac{df}{dg}

:
rac{d^4 f}{d x^4}
= \left( rac{dg}{dx} ight)^4 rac{d^4 f}{dg^4}
+ 6 \left( rac{dg}{dx} ight)^2 rac{d^2 g}{d x^2} rac{d^3 f}{d g^3}
+ \left\{ 4 rac{dg}{dx} rac{d^3 g}{dx^3} + 3\left( rac{d^2 g}{dx^2} ight)^2 ight\}
rac{d^2 f}{d g^2}
+ rac{d^4 g}{dx^4} rac{df}{dg}