Eigenvalues and Eigenvectors
📂Matrix AlgebraEigenvalues and Eigenvectors
Definition
Given a matrix n×n A, for a non-zero column vector 0 n×1 and a constant x, the following equation is referred to as the eigenvalue equation or the eigenvalue problem.
Ax=λx
For a given A, a x that satisfies the eigenvalue equation above is called the eigenvalue of A, and n×1 is called the eigenvector corresponding to the eigenvalue x of A.
Explanation
The definition above applies not only when λ∈R and x∈Rn, but also when λ∈C and x∈Cn. The condition “non-zero” is important because, as the following equation shows, if x=0, the equation always holds.
A0=0=λ0
Geometric Motivation
If the direction of a vector x remains the same after being transformed by the matrix A, this implies there exists some real number λ such that
Ax=λx
While the matrix A itself does not possess any notion of direction, if it has eigenvectors, it can be said to indicate a specific direction. Thus, such vectors x are named eigenvectors. Consider a matrix such as 2×2.
A=[6223]
Then, vector [21], when transformed by 2×2, becomes [147] and maintains the same direction. If we multiply vector [21] by λ=7, the length of the vector also matches, satisfying the eigenvalue equation
Ax[6223][21]=λx=7[21]
Hence, λ=7 is referred to as the eigenvalue. Upon closer inspection, although many eigenvectors can be found by scaling [21], the eigenvalue remains unchanged. Therefore, [21] is described as the eigenvector of A corresponding to the eigenvalue 7.
By extending this geometric discussion to a general context, eigenvalues algebraically satisfy the equation Ax=λx, and the eigenvectors are the non-trivial solutions to the equation for the given x.
Solving the Eigenvalue Equation
To find the eigenvalues, one starts with the eigenvalue equation. Simplifying the equation (1) yields:
⟹⟹⟹AxAx−λxAx−λIx(A−λI)x=λx=0=0=0
Here, the eigenvector must satisfy condition x=0. The linear system above will have a non-zero solution if and only if the inverse of (A−λI) does not exist, which is equivalent to the following equation.
det(A−λI)=0
Therefore, x satisfying the above equation becomes the eigenvalue of A. This equation is called the characteristic equation of A. For a n×n matrix, det(A−λI) becomes a polynomial of degree n, known as the characteristic polynomial.
Note that the eigenvalues of A+B may differ from the sum of the eigenvalues of A and B, and similarly, the eigenvalues of AB may not match the product of the eigenvalues of A and B. Furthermore, as it can be seen from solving the equation, there is no guarantee that eigenvalues will always be real numbers.
Examples
Finding Eigenvalues
As an example, consider again A=[6223]. Since A−λI=[6−λ223−λ], solving the characteristic equation of A yields:
⟹⟹⟹det(A−λI)(6−λ)(3−λ)−4λ2−9λ+18−4(λ−2)(λ−7)=0=0=0=0
Therefore, the eigenvalues of A are λ=2 and λ=7. By substituting 2 and 7 for x, one can find the corresponding eigenvectors for each eigenvalue. Here, only the case for λ=7 is presented.
Finding the Eigenvector Corresponding to λ=7
Substituting λ=7 into (1) and simplifying yields:
⟹⟹[6223][x1x2][6x1+2x22x1+3x2][−x1+2x22x1−4x2]=7[x1x2]=[7x17x2]=[00]
Solving this results in:
{−x1+2x22x1−4x2=0=0
⟹x1=2x2
Thus, for all 0 not equal to zero, the vector [2x2x2] is the eigenvector corresponding to λ=7. Typically, the simplest form or a unit vector with magnitude 1 is chosen. Substituting x2=1 gives the following eigenvector:
A=[21]
Properties
- For a positive integer k, if x is an eigenvalue of matrix A and n×1 is the corresponding eigenvector to x, then λk is an eigenvalue of Ak, and n×1 is the corresponding eigenvector to λk.