For any two vectors b,d∈Rp and a positive definite matrix A∈Rp×p, the following inequality holds.
(bTd)2≤(bTAb)(dTA−1d)
The equivalence conditions for this to be an equality are represented as either b=cA−1d or d=cAb for some constant c∈R.
XT is the transpose matrix of the matrix X.
Explanation
This inequality is a generalization of the Cauchy-Schwarz inequality, which equates to the original Cauchy-Schwarz inequality when A is the identity matrix I. The right side of the inequality introduces a quadratic form, naturally making it highly applicable in mathematical statistics.
Proof
Part 1. The Inequality
Inverse and Square Root of a Positive Definite Matrix: Assuming the eigenpairs {(λk,ek)}k=1n of a positive definite matrix A are sorted as λ1>⋯>λn>0, its inverse A−1 and square root A matrices relative to the orthogonal matrix P=[e1⋯en]∈Rn×n and the diagonal matrix Λ=diag(λ1,⋯,λn) are as follows.
A−1=A=PΛ−1PT=k=1∑nλk1ekekTPΛPT=k=1∑nλkekekT
If A is a positive definite matrix, then its square root matrix is
A=PΛPT=k=1∑nλkekekT
thus being a transpose matrix, A1/2=(A1/2)T holds, and for the same reason, A−1 is also a transpose matrix.
When we set x:=A1/2b and y:=A−1/2d, according to the original Cauchy-Schwarz inequality (xTy)≤(xTx)(yTy),
=====≤≤=(bTd)2(bTA1/2A−1/2d)2(bT(A1/2)TA−1/2d)2((A1/2b)TA−1/2d)2((A1/2b)TA−1/2d)2(xTy)2(xTx)(yTy)((A1/2b)T(A1/2b))((A−1/2d)T(A−1/2d))(bTAb)(dTA−1d)
it can be generalized as such.
Part 2. The Equality
If a certain constant is c=0 so that b=0 or d=0, then the equality obviously holds. If neither are the zero vector and without loss of generality we assume d=cAb, then
bTd==bTcAbcbTAb
holds as well as
bTd==(c1A−1d)Tdc1dTA−1d
By multiplying the extreme sides of these two equations we obtain the following equality.
(bTd)2=(bTAb)(dTA−1d)