logo

Tangent Vector on Differentiable Manifold 📂Geometry

Tangent Vector on Differentiable Manifold

Buildup1

To define a tangent vector at each point on a differentiable manifold MM, let’s assume a differentiable curve α:(ϵ,ϵ)M\alpha : (-\epsilon , \epsilon) \to M is given. We would like to define the derivative dαdt(0)\dfrac{d \alpha}{dt}(0) at t=0t=0 in α\alpha as a tangent vector, like in differential geometry, but since the range of α\alpha is MM (since it’s not guaranteed to be a metric space), we cannot speak of the derivative of α\alpha. For this reason, tangent vectors on a manifold are defined as functions, namely operators. If you’ve studied differential geometry, treating vectors as operators should be familiar. See the following explanation.

Directional derivative

Let XTpM\mathbf{X} \in T_{p}M be a tangent vector at point pp of surface MM, and let α(t)\alpha (t) be a curve on MM. Then α:(ϵ,ϵ)M\alpha : (-\epsilon, \epsilon) \to M and α(0)=p\alpha (0) = p are satisfied, meaning X=dαdt(0)\mathbf{X} = \dfrac{d \alpha}{d t} (0). Now, let function ff be a differentiable function defined in some neighborhood of point pMp \in M on surface MM. Then, the directional derivative Xf\mathbf{X}f in the direction of X\mathbf{X} is defined as follows:

X:DR,where D is set of all differentiable functions near p \mathbf{X} : \mathcal{D} \to \mathbb{R}, \quad \text{where } \mathcal{D} \text{ is set of all differentiable functions near } p

Xf:=ddt(fα)(0) \mathbf{X} f := \dfrac{d}{dt_{}} (f \circ \alpha) (0)

As shown in the definition above, if there is a fixed tangent vector X\mathbf{X}, then every time ff is given, Xf\mathbf{X}f is determined. Therefore, a tangent vector is treated as an operator itself. The notation like Xf\mathbf{X}f is used because it is viewed from the perspective of an operator. Tangent vectors on a differential manifold are similarly defined as functions that map real space through the composition with some curve α\alpha every time a differentiable function ff is given on MM.

Definition

Let’s say MM is a nn-dimensional differentiable manifold. A differentiable function α:(ϵ,ϵ)M\alpha : (-\epsilon , \epsilon) \to M is called a differentiable curve at MM. Assuming α(0)=pM\alpha (0)=p\in M, let’s define the set D\mathcal{D} as the set of differentiable functions at pp.

D:={f:MRfunctions on Mthat are differentiable at p} \mathcal{D} := \left\{ f : M \to \mathbb{R} | \text{functions on } M \text{that are differentiable at } p \right\}

Then, the tangent vector α(0):DR\alpha^{\prime}(0) : \mathcal{D} \to \mathbb{R} at α(0)=p\alpha (0) = p is defined as the following function.

α(0)f=ddt(fα)(0),fD \alpha^{\prime} (0) f = \dfrac{d}{dt} (f\circ \alpha)(0),\quad f\in \mathcal{D}

The set of all tangent vectors at point pMp\in M is called the tangent space and is denoted as TpMT_{p}M.

Explanation

f:MRf : M \to \mathbb{R} and α:(ϵ,ϵ)M\alpha : (-\epsilon, \epsilon) \to M cannot be differentiated in the classical sense because their domains and ranges are not guaranteed to be metric spaces, but their composition fα:(ϵ,ϵ)Rf \circ \alpha : (-\epsilon, \epsilon) \to \mathbb{R} can be differentiated.

Since a tangent vector is determined whenever a differentiable curve α\alpha is given, it can be thought that there are as many tangent vectors as there are differentiable curves. Moreover, even if two tangent vectors X,Y\mathbf{X}, \mathbf{Y} are determined by two different curves α\alpha and β\beta, if Xf=Yf\mathbf{X}f = \mathbf{Y}f holds for all fDf \in \mathcal{D}, then X\mathbf{X} and Y\mathbf{Y} are considered the same tangent vector.

The reason the set of tangent vectors TpMT_{p}M is called a tangent space is that it is actually a nn-dimensional vector space.

From the theorem introduced below, it is possible to express the function value α(0)f\alpha^{\prime}(0)f of the tangent vector at point pp in terms of any coordinate system x:UM\mathbf{x} : U \to M concerning pp, and this value does not depend on the choice of x\mathbf{x}.

Example

Consider TpR3T_{p}\mathbb{R}^{3}. When a differentiable curve α:(ϵ,ϵ)R3\alpha : (-\epsilon, \epsilon) \to \mathbb{R}^{3} is determined, a 3-dimensional vector α(0)=v=(v1,v2,v3)R3\alpha^{\prime}(0) = \mathbf{v} = (v_{1}, v_{2}, v_{3}) \in \mathbb{R}^{3} is determined. Therefore, according to the definition, the tangent vector for f:R3Rf : \mathbb{R}^{3} \to \mathbb{R} is as follows:

Xf=d(fα)dt(0)=ifxidαidt(0)=ivifxi \mathbf{X}f = \dfrac{d (f\circ \alpha)}{d t}(0) = \sum \limits_{i} \dfrac{\partial f}{\partial x_{i}}\dfrac{d \alpha_{i}}{d t}(0) = \sum\limits_{i} v_{i} \dfrac{\partial f}{\partial x_{i}}

This is the same as the directional derivative in Euclidean space.

v[f]=vf=vf=ivifvi \mathbf{v}[f] = \nabla _{\mathbf{v}}f = \mathbf{v} \cdot \nabla f = \sum \limits_{i} v_{i} \dfrac{\partial f}{\partial v_{i}}

The directional derivative is essentially the same as treating the vector as an operator. Therefore, X\mathbf{X} can be considered an element of R3\mathbb{R}^{3}, and the following holds:

TpR3R3 T_{p}\mathbb{R}^{3} \approxeq \mathbb{R}^{3}

Theorem

Let’s say a differentiable curve α(0)=p\alpha (0) = p and a coordinate system x:UM\mathbf{x} : U \to M at point pp are given. (u1,,un)(u_{1}, \dots, u_{n}) are the coordinates of Rn\mathbb{R}^{n},

(x1(p),,xn(p))=x1(p) (x_{1}(p), \dots, x_{n}(p)) = \mathbf{x}^{-1}(p)

Then, the following formula holds:

α(0)f= i=1nxi(p)(fx)uip= i=1nxi(α(0))fxit=0 \begin{align*} \alpha ^{\prime} (0) f =&\ \sum \limits_{i=1}^{n}x_{i}^{\prime}(p) \left.\dfrac{\partial (f\circ \mathbf{x})}{\partial u_{i}}\right|_{p} \\ =&\ \sum \limits_{i=1}^{n}x_{i}^{\prime}(\alpha (0)) \left.\dfrac{\partial f}{\partial x_{i}}\right|_{t=0} \end{align*}

Here, we simply denote it as xi(0)=xi(α(0))x_{i}^{\prime}(0) = x_{i}^{\prime}(\alpha (0)). Therefore, α(0)\alpha^{\prime}(0) is defined as the following differential operator:

α(0)=i=1nxi(0)xit=0 \begin{equation} \alpha ^{\prime} (0) = \sum \limits_{i=1}^{n}x_{i}^{\prime}(0) \left.\dfrac{\partial }{\partial x_{i}}\right|_{t=0} \end{equation}

If we express it as coordinate vectors for basis {xit=0}\left\{ \left.\dfrac{\partial }{\partial x_{i}}\right|_{t=0} \right\}, it is as follows:

α(0)=[x1(0)xn(0)] \alpha ^{\prime} (0) = \begin{bmatrix} x_{1}^{\prime}(0) \\ \vdots \\ x_{n}^{\prime}(0) \end{bmatrix}

Proof

Choose a coordinate system x:URnM\mathbf{x} : U \subset \mathbb{R}^{n} \to M such that p=x(0)p = \mathbf{x}(0) is satisfied. Consider fα=fxx1αf\circ \alpha = f \circ \mathbf{x} \circ \mathbf{x}^{-1} \circ \alpha to express the tangent vector in terms of the coordinate system. Then, since xx1=I\mathbf{x} \circ \mathbf{x}^{-1} = I is an identity function, any choice of coordinate system is irrelevant. Now, think of fxf \circ \mathbf{x} and x1α\mathbf{x}^{-1} \circ \alpha as one function each, and consider fαf \circ \alpha as their composite function.

fα=(fx)(x1α) f \circ \alpha = (f \circ \mathbf{x}) \circ (\mathbf{x}^{-1} \circ \alpha)

First, consider fxf \circ \mathbf{x}. Since fx:RnRf \circ \mathbf{x} : \mathbb{R}^{n} \to \mathbb{R}, it can be expressed as follows and differentiated in the classical sense:

fx=fx(u)=fx(u1,u2,,un),u=(u1,,un)Rn f \circ \mathbf{x} = f \circ \mathbf{x} (u) = f \circ \mathbf{x} (u_{1}, u_{2}, \dots, u_{n}),\quad u=(u_{1},\dots,u_{n}) \in \mathbb{R}^{n}

x1α\mathbf{x}^{-1} \circ \alpha can also be expressed as follows since x1α:RRn\mathbf{x}^{-1} \circ \alpha : \mathbb{R} \to \mathbb{R}^{n}, and it can be differentiated in the classical sense:

x1α(t)= (x1(α(t)),x2(α(t)),,xn(α(t)))= (x1(t),x2(t),,xn(t)) \begin{align*} \mathbf{x}^{-1} \circ \alpha (t) =&\ (x_{1}(\alpha (t)), x_{2}(\alpha (t)), \dots, x_{n}(\alpha (t))) \\ =&\ (x_{1}(t), x_{2}(t), \dots, x_{n}(t)) \end{align*}

Note that xix_{i} is a function of xi:MRx_{i} : M \to \mathbb{R}, and xi(t)x_{i}(t) is a simplified notation of xi(α(t))x_{i}(\alpha (t)).

Thinking this way, fαf \circ \alpha is a composition of two functions, mapped as Rx1αRnfxR\mathbb{R} \overset{\mathbf{x}^{-1} \circ \alpha}{\longrightarrow} \mathbb{R}^{n} \overset{f\circ \mathbf{x}}{\longrightarrow} \mathbb{R}. Therefore, by the chain rule, the following holds:

ddt(fα)=ddt((fx)(x1α))=i=1n(fx)uid(x1α)idt=i=1n(fx)uidxidt \dfrac{d}{d t}(f \circ \alpha) = \dfrac{d}{dt} \left( (f\circ \mathbf{x}) \circ (\mathbf{x}^{-1} \circ \alpha) \right) = \sum \limits_{i=1}^{n}\dfrac{\partial (f\circ \mathbf{x})}{\partial u_{i}} \dfrac{d (\mathbf{x}^{-1} \circ \alpha )_{i}}{d t} = \sum \limits_{i=1}^{n}\dfrac{\partial (f\circ \mathbf{x})}{\partial u_{i}} \dfrac{d x_{i}}{d t}

Thus, we obtain the following:

α(0)f:= ddt(fα)(0)= i=1n(fx)uit=0dxidt(0)= i=1n(fx)uit=0xi(0)= i=1nxi(0)(fx)uit=0 \begin{align*} \alpha^{\prime}(0) f :=&\ \dfrac{d}{dt} (f\circ \alpha)(0) \\ =&\ \sum \limits_{i=1}^{n} \left.\dfrac{\partial (f\circ \mathbf{x})}{\partial u_{i}}\right|_{t=0} \dfrac{d x_{i}}{d t}(0) \\ =&\ \sum \limits_{i=1}^{n} \left.\dfrac{\partial (f\circ \mathbf{x})}{\partial u_{i}}\right|_{t=0} x_{i}^{\prime}(0) \\ =&\ \sum \limits_{i=1}^{n} x_{i}^{\prime}(0) \left.\dfrac{\partial (f\circ \mathbf{x})}{\partial u_{i}}\right|_{t=0} \end{align*}

Here, let’s define xit=0\left.\dfrac{\partial }{\partial x_{i}}\right|_{t=0} as the following operator:

xit=0f:=(fx)uit=0 \left.\dfrac{\partial }{\partial x_{i}}\right|_{t=0} f := \left.\dfrac{\partial (f\circ \mathbf{x})}{\partial u_{i}}\right|_{t=0}

Summarizing the meaning of fxi\dfrac{\partial f}{\partial x_{i}}:

ff cannot be differentiated since its domain is MM. Therefore, consider the composition with coordinate system x:RnM\mathbf{x} : \mathbb{R}^{n} \to M. This maps Rn\mathbb{R}^{n} to R\mathbb{R}, thus can be differentiated in the classical sense. Therefore, fxi\dfrac{\partial f}{\partial x_{i}} is defined as differentiating after composing ff with x\mathbf{x} in Euclidean space Rn\mathbb{R}^{n} at the uiu_{i}-th variable.

Finally, we obtain the following:

α(0)f= i=1nxi(0)(fx)uit=0= i=1nxi(0)xit=0f= i=1nxi(0)fxit=0 \begin{align*} \alpha^{\prime}(0) f =&\ \sum \limits_{i=1}^{n} x_{i}^{\prime}(0) \left.\dfrac{\partial (f\circ \mathbf{x})}{\partial u_{i}}\right|_{t=0} \\ =&\ \sum \limits_{i=1}^{n} x_{i}^{\prime}(0) \left.\dfrac{\partial }{\partial x_{i}}\right|_{t=0}f = \ \sum \limits_{i=1}^{n} x_{i}^{\prime}(0) \left.\dfrac{\partial f}{\partial x_{i}}\right|_{t=0} \end{align*}

    α(0)=i=1nxi(0)xit=0 \implies \alpha^{\prime}(0) = \sum \limits_{i=1}^{n} x_{i}^{\prime}(0) \left.\dfrac{\partial }{\partial x_{i}}\right|_{t=0}

See Also


  1. Manfredo P. Do Carmo, Riemannian Geometry (Eng Edition, 1992), p6-8 ↩︎