
Proof of the Hogg-Craig Theorem

Theorem

Let the sample $\mathbf{X} = \left( X_{1} , \cdots , X_{n} \right)$ follow the iid Normal distribution $X_{1} , \cdots , X_{n} \overset{\text{iid}}{\sim} N \left( 0, \sigma^{2} \right)$. Consider symmetric matrices $A_{1} , \cdots , A_{k} \in \mathbb{R}^{n \times n}$ and random variables $Q_{1} , \cdots , Q_{k}$ given by the random vector quadratic forms $Q_{i} := \mathbf{X}^{T} A_{i} \mathbf{X}$. Define the symmetric matrix $A$ and the random variable $Q$ as follows: $$ \begin{align*} A =& A_{1} + \cdots + A_{k} \\ Q =& Q_{1} + \cdots + Q_{k} \end{align*} $$ If $Q / \sigma^{2}$ follows a Chi-squared distribution $\chi^{2} \left( r \right)$, if $Q_{i} / \sigma^{2} \sim \chi^{2} \left( r_{i} \right)$ for $i = 1 , \cdots , k-1$, and if $Q_{k} \ge 0$, then $Q_{1} , \cdots , Q_{k}$ are independent and $Q_{k} / \sigma^{2}$ follows the Chi-squared distribution $\chi^{2} \left( r_{k} \right)$ with degrees of freedom $r_{k} = r - r_{1} - \cdots - r_{k-1}$.

Explanation

It might initially seem strange that it is $Q / \sigma^{2}$, and not $Q / n \sigma^{2}$, that follows a Chi-squared distribution, but it is precise to discuss $Q / \sigma^{2}$ because what is being added is not samples but matrices, as follows: $$ \begin{align*} Q =& Q_{1} + \cdots + Q_{k} \\ =& \mathbf{X}^{T} A_{1} \mathbf{X} + \cdots + \mathbf{X}^{T} A_{k} \mathbf{X} \\ =& \mathbf{X}^{T} \left( A_{1} + \cdots + A_{k} \right) \mathbf{X} \\ =& \mathbf{X}^{T} A \mathbf{X} \end{align*} $$
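As a quick numerical sanity check of this scaling, the following is a minimal sketch assuming NumPy is available, taking the centering matrix as a familiar symmetric idempotent $A$ so that $Q = \sum_{i} \left( X_{i} - \bar{X} \right)^{2}$; the simulated mean and variance of $Q / \sigma^{2}$ should come out near $n - 1$ and $2(n-1)$, the moments of $\chi^{2}(n-1)$.

```python
import numpy as np

rng = np.random.default_rng(0)
n, sigma, reps = 5, 2.0, 200_000

# Centering matrix: symmetric, idempotent, rank n - 1.
# Then Q = X^T A X = sum_i (X_i - Xbar)^2, so Q / sigma^2 should behave like chi^2(n - 1).
A = np.eye(n) - np.ones((n, n)) / n

X = rng.normal(0.0, sigma, size=(reps, n))
Q = np.einsum("ri,ij,rj->r", X, A, X)               # one quadratic form per simulated sample
print(np.mean(Q / sigma**2), np.var(Q / sigma**2))  # expect about n - 1 = 4 and 2(n - 1) = 8
```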

This theorem is used in the proof of Cochran’s theorem.

Proof 1

We prove the theorem by mathematical induction. First, consider the case $k = 2$.

Equivalence condition for Chi-squaredness of normal distribution random vector quadratic forms: Let the sample $\mathbf{X} = \left( X_{1} , \cdots , X_{n} \right)$ follow the iid Normal distribution $X_{1} , \cdots , X_{n} \overset{\text{iid}}{\sim} N \left( 0, \sigma^{2} \right)$. For a symmetric matrix $A \in \mathbb{R}^{n \times n}$ with rank $r \le n$, if the random vector quadratic form is set as $Q = \sigma^{-2} \mathbf{X}^{T} A \mathbf{X}$, the following holds. $$ Q \sim \chi^{2} (r) \iff A^{2} = A $$

Since $Q / \sigma^{2}$ follows a Chi-squared distribution, $A$ is an idempotent matrix.

Eigenvalues of an idempotent matrix: The eigenvalues of an idempotent matrix are only $0$ or $1$.

Since $A$ is a real symmetric matrix, it is orthogonally diagonalizable, and because its eigenvalues are only $0$ and $1$, there exists an orthogonal matrix $\Gamma$ satisfying the following, where $I_{r} \in \mathbb{R}^{r \times r}$ is the identity matrix and $O$ denotes zero matrices of the appropriate sizes. $$ \Gamma^{T} A \Gamma = \begin{bmatrix} I_{r} & O \\ O & O \end{bmatrix} $$
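To illustrate, the following minimal sketch (again assuming NumPy, with $A$ built by hand as the orthogonal projection onto a random $r$-dimensional column space) computes such a $\Gamma$ numerically: the eigenvalues come out as $r$ ones and $n - r$ zeros, and $\Gamma^{T} A \Gamma$ recovers the block form above.

```python
import numpy as np

rng = np.random.default_rng(1)
n, r = 6, 3

# A hypothetical symmetric idempotent A: the orthogonal projection onto the
# column space of a random n x r matrix, so rank(A) = r and A^2 = A.
M = rng.normal(size=(n, r))
A = M @ np.linalg.inv(M.T @ M) @ M.T

eigvals, Gamma = np.linalg.eigh(A)        # orthogonal diagonalization of a symmetric matrix
order = np.argsort(eigvals)[::-1]         # reorder so the eigenvalue-1 directions come first
Gamma = Gamma[:, order]

print(np.round(eigvals[order], 6))        # r ones followed by n - r zeros
print(np.round(Gamma.T @ A @ Gamma, 6))   # the block matrix diag(I_r, O)
```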

Conjugating $A = A_{1} + A_{2}$ by $\Gamma$ gives the following. $$ \begin{bmatrix} I_{r} & O \\ O & O \end{bmatrix} = \Gamma^{T} A_{1} \Gamma + \Gamma^{T} A_{2} \Gamma $$

Positive definiteness and eigenvalues: The necessary and sufficient condition for $A$ to be positive definite is that all eigenvalues of $A$ are positive.

Since $Q_{2} \ge 0$ was assumed, the matrix $A_{2}$ is positive semidefinite; and since $A$ and $A_{1}$ are idempotent, their eigenvalues are only $0$ and $1$, so they too are positive semidefinite by the eigenvalue characterization of semidefiniteness. Naturally, $\Gamma^{T} A \Gamma$, $\Gamma^{T} A_{1} \Gamma$, and $\Gamma^{T} A_{2} \Gamma$, which are these matrices conjugated by the orthogonal matrix $\Gamma$, are also positive semidefinite.

Properties of diagonal elements of positive definite matrices: Given a positive definite matrix $A = \left( a_{ij} \right) \in \mathbb{C}^{n \times n}$, the diagonal elements $a_{ii}$ of $A$ share the definiteness of $A$; in particular, they are positive if $A$ is positive definite and nonnegative if $A$ is positive semidefinite. Moreover, if a real symmetric matrix $A \in \mathbb{R}^{n \times n}$ is positive semidefinite and one of its diagonal elements is $0$, then the entire row and column containing that element are $0$.

Applying this to $\Gamma^{T} A_{1} \Gamma$ and $\Gamma^{T} A_{2} \Gamma$: their lower-right diagonal elements are nonnegative and sum to the corresponding diagonal elements of $\Gamma^{T} A \Gamma$, which are $0$, so those diagonal elements are themselves $0$, and with them the entire rows and columns through them. Therefore the following expression is possible for certain $G_{r} \in \mathbb{R}^{r \times r}$ and $H_{r} \in \mathbb{R}^{r \times r}$. $$ \begin{align*} \Gamma^{T} A \Gamma = & \Gamma^{T} A_{1} \Gamma + \Gamma^{T} A_{2} \Gamma \\ \implies \begin{bmatrix} I_{r} & O \\ O & O \end{bmatrix} =& \begin{bmatrix} G_{r} & O \\ O & O \end{bmatrix} + \begin{bmatrix} H_{r} & O \\ O & O \end{bmatrix} \end{align*} $$

Since $Q_{1} / \sigma^{2} \sim \chi^{2} \left( r_{1} \right)$, $A_{1}$ is also an idempotent matrix, and the following is obtained. $$ \left( \Gamma^{T} A_{1} \Gamma \right)^{2} = \Gamma^{T} A_{1} \Gamma = \begin{bmatrix} G_{r} & O \\ O & O \end{bmatrix} $$ Multiplying both sides of $\Gamma^{T} A \Gamma = \Gamma^{T} A_{1} \Gamma + \Gamma^{T} A_{2} \Gamma$ on the left by $\Gamma^{T} A_{1} \Gamma$ results in the following: $$ \begin{align*} \begin{bmatrix} I_{r} & O \\ O & O \end{bmatrix} =& \begin{bmatrix} G_{r} & O \\ O & O \end{bmatrix} + \begin{bmatrix} H_{r} & O \\ O & O \end{bmatrix} \\ \implies \Gamma^{T} A_{1} \Gamma \begin{bmatrix} I_{r} & O \\ O & O \end{bmatrix} =& \Gamma^{T} A_{1} \Gamma \cdot \Gamma^{T} A_{1} \Gamma + \Gamma^{T} A_{1} \Gamma \begin{bmatrix} H_{r} & O \\ O & O \end{bmatrix} \\ \implies \begin{bmatrix} G_{r} & O \\ O & O \end{bmatrix} \begin{bmatrix} I_{r} & O \\ O & O \end{bmatrix} =& \Gamma^{T} A_{1} \Gamma + \begin{bmatrix} G_{r} & O \\ O & O \end{bmatrix} \begin{bmatrix} H_{r} & O \\ O & O \end{bmatrix} \\ \implies \begin{bmatrix} G_{r} & O \\ O & O \end{bmatrix} =& \begin{bmatrix} G_{r} & O \\ O & O \end{bmatrix} + \begin{bmatrix} G_{r} H_{r} & O \\ O & O \end{bmatrix} \\ \implies \begin{bmatrix} O & O \\ O & O \end{bmatrix} =& \begin{bmatrix} G_{r} H_{r} & O \\ O & O \end{bmatrix} \\ \implies G_{r} H_{r} =& O \\ \implies \Gamma^{T} A_{1} \Gamma \, \Gamma^{T} A_{2} \Gamma =& O \\ \implies A_{1} A_{2} =& O \end{align*} $$
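The conclusion $A_{1} A_{2} = O$ can be checked numerically on a small example. The sketch below (assuming NumPy) builds, by hand, a rank-$r$ projection $A$ and an $A_{1}$ projecting onto a subspace of its range, so that $A$ and $A_{1}$ are idempotent and the remainder $A_{2} = A - A_{1}$ comes out positive semidefinite, matching the hypotheses; the product $A_{1} A_{2}$ then vanishes as derived.

```python
import numpy as np

rng = np.random.default_rng(2)
n, r, r1 = 6, 4, 2

# Hypothetical example matching the hypotheses: A is a rank-r projection,
# A_1 projects onto an r1-dimensional subspace of the range of A, and the
# remainder A_2 = A - A_1 is not constructed as a projection by hand.
V, _ = np.linalg.qr(rng.normal(size=(n, r)))        # orthonormal basis of range(A)
A = V @ V.T
W, _ = np.linalg.qr(V @ rng.normal(size=(r, r1)))   # r1-dimensional subspace inside range(A)
A1 = W @ W.T
A2 = A - A1

print(np.all(np.linalg.eigvalsh(A2) > -1e-10))      # A_2 is positive semidefinite
print(np.allclose(A1 @ A2, 0))                      # A_1 A_2 = O, as derived above
```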

Craig’s theorem: Let the sample $\mathbf{X} = \left( X_{1} , \cdots , X_{n} \right)$ follow the iid Normal distribution $X_{1} , \cdots , X_{n} \overset{\text{iid}}{\sim} N \left( 0, \sigma^{2} \right)$. For symmetric matrices $A, B \in \mathbb{R}^{n \times n}$, if random variables $Q_{1}$ and $Q_{2}$ are defined as the random vector quadratic forms $Q_{1} := \sigma^{-2} \mathbf{X}^{T} A \mathbf{X}$ and $Q_{2} := \sigma^{-2} \mathbf{X}^{T} B \mathbf{X}$, the following holds. $$ Q_{1} \perp Q_{2} \iff A B = O_{n} $$

Addition of Chi-squared random variables: If $X_{1} , \cdots , X_{n}$ are mutually independent and $X_i \sim \chi^2 ( r_{i} )$, then $$ \sum_{i=1}^{n} X_{i} \sim \chi ^2 \left( \sum_{i=1}^{n} r_{i} \right) $$

Since $A_{1} A_{2} = O$, Craig’s theorem shows that $Q_{1}$ and $Q_{2}$ are independent. Then, because $Q / \sigma^{2} = Q_{1} / \sigma^{2} + Q_{2} / \sigma^{2}$ with independent summands, the moment generating function of $Q_{2} / \sigma^{2}$ is that of $Q / \sigma^{2}$ divided by that of $Q_{1} / \sigma^{2}$, so $Q_{2} / \sigma^{2}$ follows a Chi-squared distribution with $\left( r - r_{1} \right)$ degrees of freedom. This settles the case $k = 2$.
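As a numerical illustration of this conclusion, the sketch below (assuming NumPy, with $A_{1}$ and $A_{2}$ constructed as projections onto disjoint sets of orthonormal directions so that the hypotheses of the $k = 2$ case hold) simulates $Q_{1} / \sigma^{2}$ and $Q_{2} / \sigma^{2}$: their sample correlation is near $0$, and the sample moments of $Q_{2} / \sigma^{2}$ match those of $\chi^{2}(r - r_{1})$.

```python
import numpy as np

rng = np.random.default_rng(3)
n, r1, r, sigma, reps = 6, 2, 5, 1.5, 200_000

# Hypothetical example: A_1 and A_2 project onto disjoint sets of orthonormal
# directions, so A = A_1 + A_2 is idempotent of rank r, A_1 of rank r1, Q_2 >= 0.
U, _ = np.linalg.qr(rng.normal(size=(n, n)))
A1 = U[:, :r1] @ U[:, :r1].T
A2 = U[:, r1:r] @ U[:, r1:r].T

X = rng.normal(0.0, sigma, size=(reps, n))
Q1 = np.einsum("ri,ij,rj->r", X, A1, X) / sigma**2
Q2 = np.einsum("ri,ij,rj->r", X, A2, X) / sigma**2

print(np.corrcoef(Q1, Q2)[0, 1])     # close to 0, consistent with independence
print(np.mean(Q2), np.var(Q2))       # close to r - r1 = 3 and 2(r - r1) = 6
```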


For the induction step, it is enough to see how the case $k = 3$ reduces to the case $k = 2$. Since $Q_{3} \ge 0$ is assumed, $A_{3}$ is positive semidefinite as before, and group the terms as follows: $$ A = A_{1} + \left( A_{2} + A_{3} \right) = A_{1} + B_{1} $$ With the grouping $B_{1} := A_{2} + A_{3}$, $B_{1}$ is still positive semidefinite and the corresponding quadratic form $Q_{2} + Q_{3}$ is still nonnegative, so applying the argument from the case $k = 2$ to $A = A_{1} + B_{1}$ gives $A_{1} B_{1} = B_{1} A_{1} = O$, and from this $B_{1}$ is idempotent: $$ \begin{align*} A = A^{2} =& \left( A_{1} + B_{1} \right)^{2} \\ =& A_{1}^{2} + A_{1} B_{1} + B_{1} A_{1} + B_{1}^{2} \\ =& A_{1} + O + B_{1}^{2} \\ \implies B_{1}^{2} =& A - A_{1} = B_{1} \end{align*} $$ Since $B_{1}$ is idempotent, $\left( Q_{2} + Q_{3} \right) / \sigma^{2}$ follows a Chi-squared distribution, so applying the case $k = 2$ again, now to $B_{1} = A_{2} + A_{3}$, gives $A_{2} A_{3} = O$ and $A_{3}^{2} = A_{3}$. Repeating the same grouping with $A = A_{2} + \left( A_{1} + A_{3} \right)$ gives $A_{1} A_{3} = O$, and continuing in this way completes the proof.
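Finally, a toy check of this induction step (assuming NumPy, with $A_{1}, A_{2}, A_{3}$ constructed by hand as projections onto disjoint orthonormal columns so that the hypotheses hold): the grouped matrix $B_{1} = A_{2} + A_{3}$ satisfies $A_{1} B_{1} = O$ and $B_{1}^{2} = B_{1}$, and all pairwise products $A_{i} A_{j}$ vanish, as the argument above concludes.

```python
import numpy as np

rng = np.random.default_rng(4)
n, r1, r2, r3 = 7, 2, 2, 1

# Toy k = 3 example: A_1, A_2, A_3 are projections onto disjoint sets of
# orthonormal columns of a random orthogonal matrix, so the hypotheses hold.
U, _ = np.linalg.qr(rng.normal(size=(n, n)))
A1 = U[:, :r1] @ U[:, :r1].T
A2 = U[:, r1:r1 + r2] @ U[:, r1:r1 + r2].T
A3 = U[:, r1 + r2:r1 + r2 + r3] @ U[:, r1 + r2:r1 + r2 + r3].T

B1 = A2 + A3                                   # the grouped term in the induction step
print(np.allclose(A1 @ B1, 0))                 # A_1 B_1 = O
print(np.allclose(B1 @ B1, B1))                # B_1 is idempotent
print(np.allclose(A2 @ A3, 0), np.allclose(A1 @ A3, 0))   # pairwise products vanish
```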


  1. Hogg et al. (2018). Introduction to Mathematical Statistics (8th Edition): p564. ↩︎