Definition of Directional Derivative
Buildup
Suppose a multivariable function $f : \mathbb{R}^{n} \to \mathbb{R}$ is given. When differentiating $f$, unlike in the single-variable case, one must specify the direction in which the rate of change is measured. A familiar example is the partial derivative, which considers the rate of change with respect to only one variable. For instance, the partial derivative of $f$ with respect to the variable $x_{i}$ accounts only for the change in the value of $f$ in the direction of $x_{i}$, that is, along the standard basis vector $\mathbf{e}_{i}$.
$$\frac{\partial f}{\partial x_{i}}(\mathbf{x}) = \lim_{t \to 0} \frac{f(\mathbf{x} + t\mathbf{e}_{i}) - f(\mathbf{x})}{t}$$
The directional derivative extends this idea to the rate of change in an arbitrary direction, rather than only in the direction of each individual variable.
Definition [1]
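Suppose a multivariable function $f : \mathbb{R}^{n} \to \mathbb{R}$ and a unit vector $\mathbf{u} \in \mathbb{R}^{n}$ are given. If the following limit exists, it is called the directional derivative of $f$ in the direction $\mathbf{u}$ at $\mathbf{x}$ and is denoted by $\nabla_{\mathbf{u}} f(\mathbf{x})$.
$$\nabla_{\mathbf{u}} f(\mathbf{x}) := \lim_{t \to 0} \frac{f(\mathbf{x} + t\mathbf{u}) - f(\mathbf{x})}{t}$$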
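As an illustration of the definition, the limit can be approximated numerically with a small step. The sketch below is not taken from the cited text: the helper name `directional_derivative`, the step size `h`, and the sample function $f(x, y) = x^{2} y$ are all assumptions made here for illustration, and a symmetric difference quotient is used as a stand-in for the limit because it is usually more accurate than a one-sided quotient.

```python
import math

def directional_derivative(f, x, u, h=1e-5):
    """Approximate the directional derivative of f at x in direction u.

    Hypothetical helper: uses the symmetric quotient
    (f(x + h*u) - f(x - h*u)) / (2*h) in place of the limit t -> 0.
    """
    norm = math.sqrt(sum(c * c for c in u))
    if norm == 0:
        raise ValueError("direction must be non-zero")
    # The definition assumes a unit vector, so normalize defensively.
    u = [c / norm for c in u]
    forward = [xi + h * ui for xi, ui in zip(x, u)]
    backward = [xi - h * ui for xi, ui in zip(x, u)]
    return (f(forward) - f(backward)) / (2 * h)

def f(p):
    # Sample function f(x, y) = x^2 * y, chosen only for illustration.
    x, y = p
    return x * x * y

# Direction (3, 4) is normalized to the unit vector (0.6, 0.8).
approx = directional_derivative(f, [1.0, 2.0], [3.0, 4.0])
print(approx)
```

For this sample, differentiating $g(t) = f(1 + 0.6t,\, 2 + 0.8t)$ by hand gives $g'(0) = 3.2$, so the printed value should be close to that.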
Explanation
Partial Differentiation
The definition of the directional derivative differs from that of the partial derivative only in that the standard basis vector $\mathbf{e}_{i}$, which signifies the direction of the variable $x_{i}$, is replaced by an arbitrary direction $\mathbf{u}$. Generalizing in this way shows that the partial derivative is a special case of the directional derivative.
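A short worked example may make the special case concrete. The function $f(x, y) = x^{2} y$ and the choice $\mathbf{u} = \mathbf{e}_{1} = (1, 0)$ are assumptions made here purely for illustration. Substituting them into the limit that defines the directional derivative gives

$$\nabla_{\mathbf{e}_{1}} f(x, y) = \lim_{t \to 0} \frac{(x + t)^{2} y - x^{2} y}{t} = \lim_{t \to 0} \left( 2xy + ty \right) = 2xy = \frac{\partial f}{\partial x}(x, y)$$

so the directional derivative along $\mathbf{e}_{1}$ coincides with the partial derivative with respect to $x$.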
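Common notations for the directional derivative include the following.
$$\nabla_{\mathbf{u}} f, \qquad D_{\mathbf{u}} f, \qquad \frac{\partial f}{\partial \mathbf{u}}, \qquad f^{\prime}_{\mathbf{u}}$$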
Suppose a unit vector $\mathbf{u}$ is fixed. Then each function $f$ determines the directional derivative $\nabla_{\mathbf{u}} f$, which means the vector $\mathbf{u}$ itself can be regarded as an operator acting on functions. For this reason, notations such as $\mathbf{u} f$ or $\mathbf{u}[f]$ are also used. In differential geometry in particular, tangent vectors are treated as operators, an idea summarized as "tangent vector = differentiation". Refer to See Also for more information.
The theorem below shows that the directional derivative can be expressed in terms of partial derivatives.
Furthermore, it can be shown that the value of the directional derivative $\nabla_{\mathbf{u}} f$ is greatest when $\mathbf{u}$ points in the same direction as the gradient $\nabla f$. Hence the direction of $\nabla f$ is the direction in which the rate of change of $f$ is highest. In this sense, one can think of the gradient notation $\nabla f$ as carrying no subscript on $\nabla$ because it corresponds to the directional derivative in 'the direction of the highest rate of change'.
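The claim about the maximizing direction can be sketched as follows. This assumes that $f$ is differentiable at $\mathbf{x}$ (so that the theorem below applies) and that $\nabla f(\mathbf{x}) \ne \mathbf{0}$; the symbol $\theta$, introduced here, denotes the angle between $\nabla f(\mathbf{x})$ and the unit vector $\mathbf{u}$.

$$\nabla_{\mathbf{u}} f(\mathbf{x}) = \nabla f(\mathbf{x}) \cdot \mathbf{u} = \| \nabla f(\mathbf{x}) \| \, \| \mathbf{u} \| \cos \theta = \| \nabla f(\mathbf{x}) \| \cos \theta \le \| \nabla f(\mathbf{x}) \|$$

Equality holds exactly when $\theta = 0$, that is, when $\mathbf{u} = \nabla f(\mathbf{x}) / \| \nabla f(\mathbf{x}) \|$, and the largest rate of change equals $\| \nabla f(\mathbf{x}) \|$.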
Theorem
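If $f$ is differentiable at $\mathbf{x}$, the following equation holds between the directional derivative $\nabla_{\mathbf{u}} f$ of $f$ and its gradient $\nabla f$.
$$\nabla_{\mathbf{u}} f(\mathbf{x}) = \nabla f(\mathbf{x}) \cdot \mathbf{u}$$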
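The identity between the directional derivative and the gradient can be checked numerically. The following is only a sketch under stated assumptions: the helper names `numerical_gradient` and `numerical_directional`, the step size `h`, the sample function $f(x, y, z) = xy + \sin z$, and the chosen point and direction are all hypothetical choices made for illustration. Both sides are approximated with symmetric difference quotients.

```python
import math

def numerical_gradient(f, x, h=1e-5):
    """Approximate each partial derivative with a symmetric difference."""
    grad = []
    for i in range(len(x)):
        forward = list(x)
        backward = list(x)
        forward[i] += h
        backward[i] -= h
        grad.append((f(forward) - f(backward)) / (2 * h))
    return grad

def numerical_directional(f, x, u, h=1e-5):
    """Approximate the limit in the definition along the unit vector u."""
    forward = [xi + h * ui for xi, ui in zip(x, u)]
    backward = [xi - h * ui for xi, ui in zip(x, u)]
    return (f(forward) - f(backward)) / (2 * h)

def f(p):
    # Sample smooth function, chosen only for illustration.
    x, y, z = p
    return x * y + math.sin(z)

x0 = [1.0, 2.0, 0.5]
u = [2.0 / 3.0, 1.0 / 3.0, 2.0 / 3.0]  # unit vector: 4/9 + 1/9 + 4/9 = 1

lhs = numerical_directional(f, x0, u)  # left-hand side: directional derivative
rhs = sum(g * c for g, c in zip(numerical_gradient(f, x0), u))  # gradient dot u
print(lhs, rhs)
```

The two printed values should agree up to the discretization and rounding error of the difference quotients. Agreement on one sample is a sanity check, not a proof.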
Proof
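Let $\gamma(t) = \mathbf{x} + t\mathbf{u}$ and $g(t) = f(\gamma(t))$. Differentiating $g$, since the derivative of the scalar function $f$ is its gradient, the chain rule gives
$$g^{\prime}(t) = \nabla f(\gamma(t)) \cdot \gamma^{\prime}(t) = \nabla f(\mathbf{x} + t\mathbf{u}) \cdot \mathbf{u}$$
Setting $t = 0$, we obtain the following.
$$g^{\prime}(0) = \nabla f(\mathbf{x}) \cdot \mathbf{u}$$
Furthermore, by the definition of the directional derivative, the following holds.
$$g^{\prime}(0) = \lim_{t \to 0} \frac{g(t) - g(0)}{t} = \lim_{t \to 0} \frac{f(\mathbf{x} + t\mathbf{u}) - f(\mathbf{x})}{t} = \nabla_{\mathbf{u}} f(\mathbf{x})$$
Comparing the two expressions for $g^{\prime}(0)$ yields $\nabla_{\mathbf{u}} f(\mathbf{x}) = \nabla f(\mathbf{x}) \cdot \mathbf{u}$.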
■
See Also
[1] Walter Rudin, Principles of Mathematical Analysis (3rd Edition, 1976), pp. 216-218