Expectation of Random Vectors 📂Mathematical Statistics

Expectation of Random Vectors

Definition ¹

$E \left( X \right) := \begin{bmatrix} E \left( X_{1} \right) \\ \vdots \\ E \left( X_{n} \right) \end{bmatrix}$ The expectation of a random vector $X = \left( X_{1} , \cdots , X_{n} \right)$ is defined as a vector of the expectations of its components, as shown above. Similarly, the matrix $\mathbf{X} = \left[ X_{ij} \right]$ of a random variable of size $m \times n$ is also defined as a matrix $E \left( \mathbf{X} \right) := \left[ E \left( X_{ij} \right) \right]$ that contains the expectation of each element.

Properties

[1] Linearity: If $\mathbf{X}_{1}$ and $\mathbf{X}_{2}$ are random matrices of size $m \times n$ and constant matrices $A_{1}, A_{2} \in \mathbb{R}^{k \times m}$ and $B \in \mathbb{R}^{n \times l}$ are given, then the following holds true: $\begin{align*} E \left( A_{1} \mathbf{X}_{1} + A_{2} \mathbf{X}_{2} \right) =& A_{1} E \left( \mathbf{X}_{1} \right) + A_{2} E \left( \mathbf{X}_{2} \right) \\ E \left( A_{1} \mathbf{X}_{1} B \right) =& A_{1} E \left( X_{1} \right) B \end{align*}$
[2] Trace: $E(\tr(\mathbf{X})) = \tr(E(\mathbf{X}))$

Proof

[1]

We will only show $E \left( A \mathbf{X} \right) = A E \left( \mathbf{X} \right)$ , and omit the rest.

Let’s denote $A = \begin{bmatrix} a_{ik}\end{bmatrix}$ as a $m \times p$ matrix and $\mathbf{X} = \begin{bmatrix} X_{kj}\end{bmatrix}$ as a $p \times n$ matrix. Then, by the definition of matrix multiplication and the expectation of matrices,

$\begin{align*} E(A \mathbf{X}) &= E \left( \begin{bmatrix} \sum\limits_{k=1}^{p} a_{ik}X_{kj} \end{bmatrix} \right) \\ &= \begin{bmatrix} E \left( \sum\limits_{k=1}^{p} a_{ik}X_{kj} \right) \end{bmatrix} \\ &= \begin{bmatrix} \sum\limits_{k=1}^{p} a_{ik} E \left( X_{kj} \right) \end{bmatrix} & \text{by linearity of$

■

[2]

$\mathbf{X} = \begin{bmatrix} X_{ij} \end{bmatrix}$ 를 $n \times n$ 행렬이라고 하자.

$\begin{align*} E(\tr(A)) &= E \left( \sum\limits_{i=1}^{n} X_{ii} \right) \\ &= \sum\limits_{i=1}^{n} E(X_{ii}) & \text{by linearity of$

■

Hogg et al. (2013). Introduction to Mathematical Statistcs(7th Edition): p125. ↩︎