Skip to Tutorial Content

Click Start Over menu at the left bottom to start Back to Contents


1. Eigenvalues and Eigenvectors

Let \(\mathbf{A}\) be a \(n\times n\) square matrix. If a complex number \(\lambda\) and a nonzero \({n\times 1}\) vector \(\mathbf{x}\) satisfy

\[ \begin{equation} \mathbf{A} \mathbf{x} = \lambda \mathbf{x}, \end{equation} \label{eigenequation} \]

then we call \(\lambda\) an eigenvalue of \(\mathbf{A}\) and \(\mathbf{x}\) an eigenvector of \(\mathbf{A}\).

An eigenvalue \(\lambda\) is the solution of \(det(\mathbf{A}-\lambda\mathbf{I}) = 0\), where \(det\) stands for determinant. If \(\lambda\) is an \(k^{th}\) order of solution, then we say that \(\lambda\) has algebraic multiplicity \(\alpha_{\mathbf{A}} (\lambda) = k\). The dimension of the eigenspace \(\mathcal{E}_{\mathbf{A}}(\lambda)\equiv \{\mathbf{x} \mid \mathbf{A} \mathbf{x} = \lambda \mathbf{x} \}\), or the number of independent eigenvectors associated with \(\lambda\) are called the geometric multiplicity \(\gamma_{\mathbf{A}} (\lambda)\) of \(\lambda\). If \(\alpha_{\mathbf{A}} (\lambda)=1\), \(\lambda\) is called a simple eigenvalue of \(\mathbf{A}\).

Example 1. \(\mathbf{I}_{2\times 2}\) has one eigenvalue 1. Its algebraic multiplicity is 2. It has two independent eigenvectors \([0, 1]^T\) and \([1, 0]^T\). Its geometric multiplicity is 2.

A1 <- matrix(c(1, 0, 0, 1), nrow = 2)
eigen(A1)
## eigen() decomposition
## $values
## [1] 1 1
## 
## $vectors
##      [,1] [,2]
## [1,]    0   -1
## [2,]    1    0

Example 2. \(\mathbf{A}_2\) has one eigenvalue 1. Its algebraic multiplicity is 2. It has one independent eigenvector \([1, 0]^T\). So its geometric multiplicity is 1.

A2 <- matrix(c(1, 0, 1, 1), nrow = 2)
eigen(A2)
## eigen() decomposition
## $values
## [1] 1 1
## 
## $vectors
##      [,1]          [,2]
## [1,]    1 -1.000000e+00
## [2,]    0  2.220446e-16

Example 3. \(\mathbf{A}_3\) has two simple eigenvalues 2 and 1. Their corresponding eigenvectors are \([0, 1]^T\) and \([1, -1]^T\).

A3 <- matrix(c(1, 1, 0, 2), nrow = 2)
eigen(A3)
## eigen() decomposition
## $values
## [1] 2 1
## 
## $vectors
##      [,1]       [,2]
## [1,]    0  0.7071068
## [2,]    1 -0.7071068

Theorem 1. Geometric multiplicity and algebraic multiplicity can be different. But geometric multiplicity can never exceed algebraic multiplicity.

Proof. Let \(\mathbf{x}_1, \mathbf{x}_2, \cdots, \mathbf{x}_r\) be all the linearly independent eigenvectors associated with eigenvalue \(e\). So the geometric multiplicity of \(\mathbf{A}\) is \(r\). Adding \(n-r\) independent vectors \(\mathbf{x}_{r+1}, \mathbf{x}_{r+2}, \cdots, \mathbf{x}_n\) to form a basis of \(\mathbb{R}^n\), we can create an inversible matrix \(\mathbf{S} = [\mathbf{x}_1, \mathbf{x}_2, \cdots, \mathbf{x}_r, \mathbf{x}_{r+1}, \mathbf{x}_{r+2}, \cdots, \mathbf{x}_n]\) such that

\[ \mathbf{S}^{-1}\mathbf{A}\mathbf{S} = \left[ \begin{array}{cc} e\mathbf{I}_{r\times r} & ? \\ \mathbf{0} & ? \end{array} \right] \]

Since \(det(\mathbf{S}^{-1}(\mathbf{A}-\lambda\mathbf{I})\mathbf{S})=det(\mathbf{A}-\lambda\mathbf{I})\), the characteristic polynomial \(det(\mathbf{A}-\lambda\mathbf{I})\) contains factor \((\lambda - e)^r\). Therefore its algebraic multiplicity is not less than \(r\).

2. Generalized Eigenspaces

Let \(\mathbf{A}\) be a \(n\times n\) square matrix and \(\lambda\) an eigenvalue of \(\mathbf{A}\). Let \(\alpha_{\mathbf{A}} (\lambda)\) be the algebraic multiplicity of \(\lambda\). A nonzero \({n\times 1}\) vector \(\mathbf{x}\) satisfying

\[ \begin{equation} (\mathbf{A} - \lambda \mathbf{I})^{\alpha_{\mathbf{A}} (\lambda)} \mathbf{x} = 0 \end{equation} \label{geigenequation} \]

is called a generalized eigenvector of \(\mathbf{A}\) associated with eigenvalue \(\lambda\). The space \(\mathcal{G}_{\mathbf{A}}(\lambda)=\{\mathbf{x} \mid (\mathbf{A} - \lambda \mathbf{I})^{\alpha_{\mathbf{A}} (\lambda)} \mathbf{x} = \mathbf{0} \}\) is called the generalized eigenspace.

Theorem 2. Generalized Eigenspace Decomposition. If an \(n\times n\) matrix \(\mathbf{A}\) has \(m\) (\(m \leq n\)) distinct eigenvalues \(\lambda_j\), \(1\leq j \leq m\), then

\[ \begin{equation} \mathbb{C}^n = \mathcal{G}_{\mathbf{A}}(\lambda_1) \bigoplus \mathcal{G}_{\mathbf{A}}(\lambda_2) \bigoplus \cdots \bigoplus \mathcal{G}_{\mathbf{A}}(\lambda_m) \end{equation} \label{span} \]

Corollary 1. Dimension of Generalized Eigenspaces. For any eigenvalue \(\lambda\) of a matrix \(\mathbf{A}\),

\[ dim(\mathcal{G}_{\mathbf{A}}(\lambda)) = \alpha_{\mathbf{A}}(\lambda) \]

Proofs of Theorem 2 and Corollary 1 are not given here. But we will explain the construction of a basis in a generalized eigenspace using the Jordan chain. Suppose that \(\mathbf{A}\) has an eigenvalue \(\lambda\) whose geometricity is 1 and algebraic multiplicity 3. Let us compose three vectors:

\[ (\mathbf{A} - \lambda \mathbf{I}) \mathbf{x}_1 = \mathbf{0} \\ (\mathbf{A} - \lambda \mathbf{I}) \mathbf{x}_2 = \mathbf{x}_1 \\ (\mathbf{A} - \lambda \mathbf{I}) \mathbf{x}_3 = \mathbf{x}_2 \]

It is obvious that \(\mathbf{x}_1, \mathbf{x}_2, \mathbf{x}_3\) are in \(\mathcal{G}_{\mathbf{A}}(\lambda)\). If there is a constant vector \([c_1, c_2, c_3]^T\) such that \(c_1 \mathbf{x}_1 + c_2 \mathbf{x}_2 + c_3 \mathbf{x}_3 = \mathbf{0}\).

\((\mathbf{A} - \lambda \mathbf{I})^2 (c_1 \mathbf{x}_1 + c_2 \mathbf{x}_2 + c_3 \mathbf{x}_3) = c_3 \mathbf{x}_1 = \mathbf{0}\) yields \(c_3 = 0\);

\((\mathbf{A} - \lambda \mathbf{I}) (c_1 \mathbf{x}_1 + c_2 \mathbf{x}_2) = c_2 \mathbf{x}_1 = \mathbf{0}\) yields \(c_2 = 0\);

\(c_1 \mathbf{x}_1 = \mathbf{0}\) yields \(c_1 = 0\).

So \(\mathbf{x}_1, \mathbf{x}_2, \mathbf{x}_3\) are independent.

In R, we can first find a vector in the generalized eigenspace using nullspace(\((\mathbf{A} - \lambda \mathbf{I})^{\alpha_{\mathbf{A}} (\lambda)}\)) and then backward generate the Jordan chain (see Example 6).

3. Systems of Differential Equations

Consider the following first order, linear systems of ordinary differential equations with constant coefficients

\[ \begin{equation} \frac{d\mathbf{x}(t)}{dt}=\mathbf{A}\mathbf{x}(t), \; t > 0 \end{equation} \label{diffequation} \]

Given initial value \(\mathbf{x}(0)=\mathbf{x}_0\), we want to solve Eq. (\ref{diffequation}).

Case 1. \(\mathbf{A}\) has exactly \(n\) independent eigenvectors.

In this case, \(\mathbf{A}\) has \(n\) independent eigenvectors \(\mathbf{e}_1, \mathbf{e}_2, \cdots, \mathbf{e}_n\) corresponding to \(n\) eigenvalues \(\lambda_1, \lambda_2, \cdots, \lambda_n\) (possibly repeated). The initial value vector \(\mathbf{x}_0\) can be uniquely expressed as \(\mathbf{x}_0=\sum_{j=1}^{n}c_j \mathbf{e}_j\), where \([c_1, c_2, \cdots, c_n]^T\) is the coeficient vector. Therefore the solution to Eq. (\ref{diffequation}) can be written as

\[ \begin{equation} \begin{split} \mathbf{x}(t) & = e^{\mathbf{A}t}\mathbf{x}_0 \\ & = \sum_{j=1}^{n}c_j e^{\mathbf{A}t}\mathbf{e}_j \\ & = \sum_{j=1}^{n}c_j e^{\lambda_j t}\mathbf{e}_j \end{split} \end{equation} \label{case1} \]

Example 4. Find the solution to the following system

\[ \begin{aligned} & x_1' = x_1 + 2 x_2 \;\;\; x_1 (0) = 0 \\ & x_2' = 3 x_1 + 2 x_2 \;\;\; x_2(0) = -4 \end{aligned} \]

We first convert the system into matrix form:

\[ \frac{d}{dt}\left[\begin{array}{c} x_1(t)\\ x_2(t) \end{array} \right] = \left[ \begin{array}{cc} 1 & 2 \\ 3 & 2 \end{array} \right] \left[\begin{array}{c} x_1(t)\\ x_2(t) \end{array} \right], \]

A <- matrix(c(1, 3, 2, 2), nrow = 2)
Aeig <- eigen(A)
c0 <- solve(Aeig$vectors, c(0, -4))
c0
## [1]  2.884441 -2.262742
Aeig$values
## [1]  4 -1
Aeig$vectors
##            [,1]       [,2]
## [1,] -0.5547002 -0.7071068
## [2,] -0.8320503  0.7071068

Solution

\[ \mathbf{x}(t)=2.884441 e^{4t} \left[\begin{array}{c} -0.5547002 \\ -0.8320503 \end{array}\right] - 2.262742 e^{-t} \left[\begin{array}{c} -0.7071068 \\ 0.7071068 \end{array}\right] \]

Example 5. Solve the following initial value problem.

\[ \mathbf{x}' = \left[ \begin{array}{cc} 3 & 9 \\ -4 & -3 \end{array} \right]\mathbf{x} \;\;\; \mathbf{x}(0) = \left[ \begin{array}{c} 2 \\ -4 \end{array} \right] \]

A <- matrix(c(3, -4, 9, -3), nrow = 2)
Aeig <- eigen(A)
c0 <- solve(Aeig$vectors, c(2, -4))
c0
## [1] 1.20185+3.469443i 1.20185-3.469443i
Aeig$values
## [1] 0+5.196152i 0-5.196152i
Aeig$vectors
##                       [,1]                  [,2]
## [1,]  0.8320503+0.0000000i  0.8320503+0.0000000i
## [2,] -0.2773501+0.4803845i -0.2773501-0.4803845i

Solution

\[ \mathbf{x}(t)=2.884441 e^{+5.196152it} \left[\begin{array}{c} 0.8320503+0.0000000i \\ -0.2773501+0.4803845i \end{array}\right] - 2.262742 e^{-5.196152it} \left[\begin{array}{c} 0.8320503+0.0000000i \\ -0.2773501-0.4803845i \end{array}\right] \]



Case 2. \(\mathbf{A}\) has less than \(n\) independent eigenvectors.

Lemma 1. If \(\lambda\) is an eigenvalue of \(\mathbf{A}\) with algebraic multiplicity \(\alpha\), then for any generalized eigenvector \(\mathbf{x}\) associated with \(\lambda\)

\[ \begin{equation} \begin{split} e^{\mathbf{A}t} \mathbf{x}=\sum_{p=0}^{\alpha - 1}\frac{t^p}{p!}\left[\sum_{k=0}^{\infty}\frac{t^{k-p}}{(k-p)!}(\lambda \mathbf{I})^{k-p} \right](\mathbf{A} - \lambda \mathbf{I})^p \mathbf{x}. \end{split} \end{equation} \label{Taylor1} \]

Proof. Since

\[ \begin{equation} \begin{split} \frac{(\mathbf{A}t)^k}{k!} \mathbf{x}& = \frac{t^k}{k!}(\lambda \mathbf{I} + \mathbf{A} - \lambda \mathbf{I})^k \mathbf{x}\\ & = \frac{t^k}{k!}\sum_{p=0}^{\alpha - 1}\frac{k!}{(k-p)!p!}(\lambda \mathbf{I})^{k-p} (\mathbf{A} - \lambda \mathbf{I})^p \mathbf{x}\\ & = \sum_{p=0}^{\alpha - 1}\frac{t^p}{p!}\frac{t^{k-p}}{(k-p)!}(\lambda \mathbf{I})^{k-p} (\mathbf{A} - \lambda \mathbf{I})^p \mathbf{x}, \end{split} \end{equation} \]

\[ \begin{equation} \begin{split} e^{\mathbf{A}t}\mathbf{x}=\sum_{k=0}^{\infty}\frac{(\mathbf{A}t)^k}{k!} \mathbf{x} & = \sum_{k=0}^{\infty}\sum_{p=0}^{\alpha - 1}\frac{t^p}{p!}\frac{t^{k-p}}{(k-p)!}(\lambda \mathbf{I})^{k-p} (\mathbf{A} - \lambda \mathbf{I})^p \mathbf{x}\\ & = \sum_{p=0}^{\alpha - 1}\frac{t^p}{p!}\left[\sum_{k=0}^{\infty}\frac{t^{k-p}}{(k-p)!}(\lambda \mathbf{I})^{k-p} \right](\mathbf{A} - \lambda \mathbf{I})^p \mathbf{x}. \end{split} \end{equation} \label{Taylor2} \]

In this case, \(\mathbf{A}\) has at least one eigenvalue whose geometric multiplicity is less than its algebraic multiplicity. Let us consider a special case where the eigenvalue \(\lambda_1\) has algebraic multiplicity 3 and geometric multiplicity 1. Three independent vectors in the generalized eigenspace of \(\lambda_1\) are chosen as described in Section 2, i.e.

\[ (\mathbf{A} - \lambda_1 \mathbf{I}) \mathbf{x}_1 = \mathbf{0} \\ (\mathbf{A} - \lambda_1 \mathbf{I}) \mathbf{x}_2 = \mathbf{x}_1 \\ (\mathbf{A} - \lambda_1 \mathbf{I}) \mathbf{x}_3 = \mathbf{x}_2 \]

So a basis can be formed as \(\{\mathbf{x}_1, \mathbf{x}_2, \mathbf{x}_3, \mathbf{e}_4, \cdots, \mathbf{e}_n\)}$ corresponding to \(n\) eigenvalues \(\lambda_1, \lambda_1, \lambda_1, \lambda_4, \cdots, \lambda_n\) (possibly repeated). The initial value vector \(\mathbf{x}_0\) can be uniquely expressed as \(\mathbf{x}_0=c_1\mathbf{x}_1 + c_2\mathbf{x}_2 + c_3\mathbf{x}_3 + \sum_{j=4}^{n}c_j \mathbf{e}_j\), where \([c_1, c_2, \cdots, c_n]^T\) is the coeficient vector.

For \(\mathbf{x}_1\), using Eq. (\ref{Taylor1}), we obtain \(e^{\mathbf{A}t} \mathbf{x}_1 = e^{\lambda_1 t} \mathbf{x}_1\).

For \(\mathbf{x}_2\), using Eq. (\ref{Taylor1}), we obtain \(e^{\mathbf{A}t} \mathbf{x}_2 = e^{\lambda_1 t} \mathbf{x}_2 + t e^{\lambda_1 t} \mathbf{x}_1\).

For \(\mathbf{x}_3\), using Eq. (\ref{Taylor1}), we obtain \(e^{\mathbf{A}t} \mathbf{x}_3 = e^{\lambda_1 t} \mathbf{x}_3 + t e^{\lambda_1 t} \mathbf{x}_2 + \frac{t^2}{2!} e^{\lambda_1 t} \mathbf{x}_1\).

Therefore the solution to Eq. (\ref{diffequation}) can be written as

\[ \begin{equation} \begin{split} \mathbf{x}(t) & = e^{\mathbf{A}t}\mathbf{x}_0 \\ & = c_1 e^{\mathbf{A}t}\mathbf{x}_1 + c_2 e^{\mathbf{A}t}\mathbf{x}_2 + c_3 e^{\mathbf{A}t}\mathbf{x}_3 + \sum_{j=4}^{n}c_j e^{\mathbf{A}t}\mathbf{e}_j \\ & = (c_1 + c_2 t + c_3 \frac{t^2}{2!} )e^{\lambda_1 t}\mathbf{x}_1 + (c_2 + c_3 t) e^{\lambda_1 t}\mathbf{x}_2 + c_3 e^{\lambda_1 t}\mathbf{x}_3 + \sum_{j=4}^{n}c_j e^{\lambda_j t}\mathbf{e}_j \end{split} \end{equation} \label{case2} \]

Example 6. Solve the following initial value problem.

\[ \mathbf{x}' = \left[ \begin{array}{cc} 7 & 1 \\ -4 & 3 \end{array} \right]\mathbf{x} \;\;\; \mathbf{x}(0) = \left[ \begin{array}{c} 2 \\ -5 \end{array} \right] \]

Step 1. Find eigenvalues and eigenvectors

A <- matrix(c(7, -4, 1, 3), nrow = 2)
Aeig <- eigen(A)
Aeig$values
## [1] 5 5
Aeig$vectors
##            [,1]       [,2]
## [1,] -0.4472136  0.4472136
## [2,]  0.8944272 -0.8944272

Step 2. Find generalized eigenvectors

library(pracma)
M <- (A - Aeig$values[1] * diag(2)) %*% (A - Aeig$values[1] * diag(2))
K <- nullspace(M)
# Jordan chain
(A - Aeig$values[1] * diag(2)) %*% K
##      [,1] [,2]
## [1,]    2    1
## [2,]   -4   -2
(A - Aeig$values[1] * diag(2)) %*% (A - Aeig$values[1] * diag(2)) %*% K
##      [,1] [,2]
## [1,]    0    0
## [2,]    0    0

Step 3. Construct a basis

B <- cbind(c(1, -2), K[,2])
B
##      [,1] [,2]
## [1,]    1    0
## [2,]   -2    1
# decompose initial value
C <- solve(B, c(2, -5))
C
## [1]  2 -1

Solution

\[ (2 - t) e^{5t} \left[\begin{array}{c} 1 \\ -2 \end{array} \right] - e^{5t} \left[\begin{array}{c} 0 \\ 1 \end{array} \right] \]

References

Kuang Z. and Yang M., On the index of the eigenvalue of a transport operator. Transport Theory and Statistical Physics 1995, 24 (9), 1411 - 1418.

https://www.statmethods.net/advstats/matrix.html

http://linear.ups.edu/scla/html/frontmatter.html

https://tutorial.math.lamar.edu/Classes/DE/DE.aspx

Eigenvalues and Differential Equations