Matrix Differential Dissection
📂Vector AnalysisMatrix Differential Dissection
Definition
n×n The differential of a matrix X=[xij]a is defined as follows.
dX=dx11dx21⋮dxn1dx12dx22⋮dxn2⋯⋯⋱⋯dx1ndx2n⋮dxnn
Explanation
If the generalization of the differential dx of a univariate function to a vector is dx, then its generalization to a matrix is dX.
The total differential of a univariate function f:R→R:
df=dxdfdx
The total differential of a multivariable function f:Rn→R:
df=i∑∂xi∂fdxi=⟨∇xf,dx⟩=(∇xf)Tdx=[∂x1∂f⋯∂xn∂f]dx1⋮dxn
The total differential of a function of a matrix f:Rn×n→R:
df=i,j∑∂xij∂fdxij=⟨∇Xf,dX⟩=Tr((∇Xf)TdX)
The properties of the differential of a scalar dx remain unchanged.
d(ax)=adxd(x+y)=dx+dyd(xy)=ydx+xdy
Properties
For a variable matrix X,Y∈Rn×n and scalar α∈R, and a constant matrix A∈Rn×n, the following hold:
- d(αX)=αdX
- d(XT)=(dX)T
- d(AX)=AdX and d(XA)=(dX)A
- d(X+Y)=dX+dY
- d(XY)=(dX)Y+XdY
Proof
3.
Since [AX]ij=∑k=1naikxkj,
[d(AX)]ij=d(k=1∑naikxkj)=k=1∑naik(dxkj)
⟹d(AX)=AdX
■
5.
Since [XY]ij=∑k=1nxikykj,
[d(XY)]ij=d(k=1∑nxikykj)=k=1∑n(dxik)ykj+k=1∑nxikdykj
Therefore, we obtain the following.
[d(XY)]ij=k=1∑nxikdykj+k=1∑nxikdykj=[(dX)Y]ij+[XdY]ij⟹d(XY)=(dX)Y+XdY
■