logo

Block Matrices 📂Matrix Algebra

Block Matrices

Definition

Let’s say AA is a matrix m×nm \times n.

A=[a11a12a1na21a22a2nam1am2amn] A = \begin{bmatrix} a_{11} & a_{12} & \cdots & a_{1n} \\ a_{21} & a_{22} & \cdots & a_{2n} \\ \vdots & \vdots & \ddots & \vdots \\ a_{m1} & a_{m2} & \cdots & a_{mn} \\ \end{bmatrix}

Consider any vertical and horizontal lines that cut the matrix as follows.

A=[a11a12a13a14a15a1n1a1na21a22a23a24a25a2n1a2na31a32a33a34a35a3n1a3na41a42a43a44a45a4n1a4nam1am2am3am4am5am1namn] A = \left[ \begin{array}{cc|ccc|c|} a_{11} & a_{12} & a_{13} & a_{14} & a_{15} & \cdots & a_{1n-1} & a_{1n} \\ a_{21} & a_{22} & a_{23} & a_{24} & a_{25} & \cdots & a_{2n-1} & a_{2n} \\ \hline a_{31} & a_{32} & a_{33} & a_{34} & a_{35} & \cdots & a_{3n-1} & a_{3n} \\ a_{41} & a_{42} & a_{43} & a_{44} & a_{45} & \cdots & a_{4n-1} & a_{4n} \\ \hline \vdots & \vdots & \vdots & \vdots & \vdots & \ddots & \vdots & \vdots & \\ \hline a_{m1} & a_{m2} & a_{m3} & a_{m4} & a_{m5} & \cdots & a_{m-1n} & a_{mn} \end{array} \right]

The parts cut by each line are referred to as the blocks of AA.

A11=[a11a12a21a22],A12=[a13a14a15a23a24a25],,Akl=[am1namn] A_{11} = \begin{bmatrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{bmatrix},\quad A_{12} = \begin{bmatrix} a_{13} & a_{14} & a_{15}\\ a_{23} & a_{24} & a_{25} \end{bmatrix},\quad \cdots,\quad A_{kl} = \begin{bmatrix} a_{m-1n} & a_{mn}\end{bmatrix}

A matrix AA represented as blocks like the following is called a block matrix.

A=[A11A12A1lA21A22A2lAk1Ak2Akl] A = \begin{bmatrix} A_{11} & A_{12} & \cdots & A_{1l} \\ A_{21} & A_{22} & \cdots & A_{2l} \\ \vdots & \vdots & \ddots & \vdots \\ A_{k1} & A_{k2} & \cdots & A_{kl} \\ \end{bmatrix}

Explanation

Handling matrices as block matrices simplifies matrix calculations. In fact, the calculation of block matrices can be done just like normal matrix calculations.

A=[A11A12A21A22]andB=[B11B12B21B22]    AB=[A11B11+A12B21A11B12+A12B22A21B11+A22B21A21B12+A22B22] A = \begin{bmatrix} A_{11} & A_{12} \\ A_{21} & A_{22} \end{bmatrix}\quad \text{and} \quad B = \begin{bmatrix} B_{11} & B_{12} \\ B_{21} & B_{22} \end{bmatrix} \\[1em] \implies AB = \begin{bmatrix} A_{11}B_{11} + A_{12}B_{21} & A_{11}B_{12} + A_{12}B_{22} \\ A_{21}B_{11} + A_{22}B_{21} & A_{21}B_{12} + A_{22}B_{22} \end{bmatrix}

Therefore, if the matrix is converted into a block matrix form including zero matrices or identity matrices, the matrix multiplication can be easily calculated.

A=[A11IOA22]andB=[B11B12B21B22]    AB=[A11B11+B21A11B12+B22A22B21A22B22] A = \begin{bmatrix} A_{11} & I \\ O & A_{22} \end{bmatrix}\quad \text{and} \quad B = \begin{bmatrix} B_{11} & B_{12} \\ B_{21} & B_{22} \end{bmatrix} \\[1em] \implies AB = \begin{bmatrix} A_{11}B_{11} + B_{21} & A_{11}B_{12} + B_{22} \\ A_{22}B_{21} & A_{22}B_{22} \end{bmatrix}

A matrix split into row vectors and column vectors is also a block matrix.

A=[a11a12a1na21a22a2nam1am2amn]=[r1r2rm]=[c1c2cn] \begin{align*} A =& \begin{bmatrix} a_{11} & a_{12} & \cdots & a_{1n} \\ a_{21} & a_{22} & \cdots & a_{2n} \\ \vdots & \vdots & \ddots & \vdots \\ a_{m1} & a_{m2} & \cdots & a_{mn} \end{bmatrix} = \begin{bmatrix} \mathbf{r}_{1} \\ \mathbf{r}_{2} \\ \vdots \\ \mathbf{r}_{m} \end{bmatrix} \\ =& \begin{bmatrix} \mathbf{c}_{1} & \mathbf{c}_{2} & \cdots & \mathbf{c}_{n} \end{bmatrix} \end{align*}

Therefore, AxA \mathbf{x} can be represented as follows.

Ax=[c1c2cn][x1x2xn]=inxici \begin{align*} A \mathbf{x} &= \begin{bmatrix} \mathbf{c}_{1} & \mathbf{c}_{2} & \cdots & \mathbf{c}_{n} \end{bmatrix} \begin{bmatrix} x_{1} \\ x_{2} \\ \vdots \\ x_{n} \end{bmatrix} \\ &= \sum_{i}^{n} x_{i}\mathbf{c}_{i} \end{align*}

Furthermore, let’s say AA is a matrix m×pm \times p and ai\mathbf{a}_{i} is a row vector of AA, and BB is a matrix p×np \times n and bi\mathbf{b}_{i} is a column vector of BB. Then the multiplication of the two matrices is as follows.

AB=[a1a2am][b1b2bn]=[a1b1a1b2a1bna2b1a2b2a2bnamb1amb2ambn] AB = \begin{bmatrix} \mathbf{a}_{1} \\ \mathbf{a}_{2} \\ \vdots \\ \mathbf{a}_{m} \end{bmatrix} \begin{bmatrix} \mathbf{b}_{1} & \mathbf{b}_{2} & \cdots & \mathbf{b}_{n} \end{bmatrix} = \begin{bmatrix} \mathbf{a}_{1} \mathbf{b}_{1} & \mathbf{a}_{1} \mathbf{b}_{2} & \cdots & \mathbf{a}_{1} \mathbf{b}_{n} \\ \mathbf{a}_{2} \mathbf{b}_{1} & \mathbf{a}_{2} \mathbf{b}_{2} & \cdots & \mathbf{a}_{2} \mathbf{b}_{n} \\ \vdots & \vdots & \ddots & \vdots \\ \mathbf{a}_{m} \mathbf{b}_{1} & \mathbf{a}_{m} \mathbf{b}_{2} & \cdots & \mathbf{a}_{m} \mathbf{b}_{n} \\ \end{bmatrix}

Theorem

Let’s say A=[A1A2OA3]A = \begin{bmatrix} A_{1} & A_{2} \\ O & A_{3} \end{bmatrix} is a block matrix. The following holds for its determinant.

detA=detA1detA3 \det A = \det A_{1} \det A_{3}

Corollary

  • The determinant of the block matrix A=[A1A2OI]A = \begin{bmatrix} A_{1} & A_{2} \\ O & I \end{bmatrix} is the same as detA1\det A_{1}.
  • A matrix AA that can be expressed in the following form by taking permutations of rows and columns is called a reducible matrix, and its determinant detA\det A is either detBdetD\det B \det D or detBdetD-\det B \det D. A~=[BOCD] \widetilde{A} = \begin{bmatrix} B & O \\ C & D \end{bmatrix}

Proof

The block matrix AA can be decomposed into the product of three block matrices as follows.

A=[A1A2OA3]=[IA1+OOIA2+OIOA1+A3OOA2+A3I]=[IOOA3][A1A2OI]=[IOOA3][IA1+A2OIO+A2IOA1+IOOO+II]=[IOOA3][IA2OI][A1OOI] \begin{align*} A &= \begin{bmatrix} A_{1} & A_{2} \\ O & A_{3} \end{bmatrix} \\ &= \begin{bmatrix} IA_{1} + OO & IA_{2} + OI \\ OA_{1} + A_{3}O & OA_{2} + A_{3}I \end{bmatrix} \\ &= \begin{bmatrix} I & O \\ O & A_{3} \end{bmatrix} \begin{bmatrix} A_{1} & A_{2} \\ O & I \end{bmatrix} \\ &= \begin{bmatrix} I & O \\ O & A_{3} \end{bmatrix} \begin{bmatrix} IA_{1} + A_{2}O & IO + A_{2}I \\ OA_{1} + IO & OO + II \end{bmatrix} \\ &= \begin{bmatrix} I & O \\ O & A_{3} \end{bmatrix} \begin{bmatrix} I & A_{2} \\ O & I \end{bmatrix} \begin{bmatrix} A_{1} & O \\ O & I \end{bmatrix} \end{align*}

Since the determinant of the product is the same as the product of determinants

detA=det([IOOA3][IA2OI][A1OOI])=det([IOOA3])det([IA2OI])det([A1OOI]) \begin{align*} \det A &= \det \left( \begin{bmatrix} I & O \\ O & A_{3} \end{bmatrix} \begin{bmatrix} I & A_{2} \\ O & I \end{bmatrix} \begin{bmatrix} A_{1} & O \\ O & I \end{bmatrix} \right) \\ &= \det \left( \begin{bmatrix} I & O \\ O & A_{3} \end{bmatrix} \right) \det \left( \begin{bmatrix} I & A_{2} \\ O & I \end{bmatrix} \right) \det \left( \begin{bmatrix} A_{1} & O \\ O & I \end{bmatrix} \right) \end{align*}

Considering the Laplace expansion of the determinant,

det([IOOA3])=detA3,det([A1OOI])=detA1, \det \left( \begin{bmatrix} I & O \\ O & A_{3} \end{bmatrix} \right) = \det A_{3},\quad \det \left( \begin{bmatrix} A_{1} & O \\ O & I \end{bmatrix} \right) = \det A_{1},

anddet([IA2OI])=1 \text{and} \quad \det \left( \begin{bmatrix} I & A_{2} \\ O & I \end{bmatrix} \right)=1

we can see that

detA=detA1detA3 \det A = \det A_{1} \det A_{3}