Eigensystems by Naresh
Eigensystems by Naresh
2 Diagonalization 14
2.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
2.2 Similarity of matrices . . . . . . . . . . . . . . . . . . . . . . . 15
2.3 Diagonalizable matrices . . . . . . . . . . . . . . . . . . . . . . 16
1
1 Eigenvalues and Eigenvectors
1.1 Introduction
Sometimes, the effect of multiplying a vector by a matrix is to produce simply
a scalar multiple
of thatvector.
For example,
3 2 1 5 1
= =5 .
−4 9 1 5 1
We often want to know to what extent this phenomenon occurs for a given
matrix. That is, we ask for all the instances of the equation AX = λX, where
A is given matrix but we know neither the vector X nor the scalar λ. Because
there are two unknowns on the right multiplied together, this is not a linear
problem! If λ were fixed, and we only wanted to find all the appropriate
X − vectors, it would be a standard linear problem. Even easier is the case
when x is known, and we only want to determine λ, each of these is a linear
problem. But it is nonlinear if λ and X are both unknown. Therefore, we
must expect to encounter some obstacles in solving this problem.
2
1.2 Eigenvalues
First we note that the vector 0 always solve the equation AX = λX.
We rule out that case as uninteresting.
The properly posed question is therefore:
For what scalars λ is there a nontrivial solution (X ̸= 0) to the equation
AX = λX where the matrix A is given.
This is known as the eigenvalue problem for the given matrix A.
Challenge
1.1. Find the characteristic equation and the eigenvalues of the
2 1
matrix .
4 −1
Solution: The
characteristicequation of A is
2−λ 1
Det(A − λI) = = 0. Next we expand the determinant and
4 −1 − λ
obtain: (2 − λ)(−1 − λ) − 4 = 0. This leads to λ2 − λ − 6 = 0.
3
Therefore λ2 − λ − 6 = 0 is the required characteristic equation of A solving
it for λ gives the eigenvalues of A.
Hence, the given matrix A has precisely two eigenvalues λ = 3 and λ = −2.
7 0 0
Challenge 1.2. Find the eigenvalues of the matrix −8 3 0 .
0 0 0
Solution: The
characteristic equation
of A is
7−λ 0 0
Det(A − λI) = −8 3 − λ 0 = 0.
0 0 0−λ
On expanding we obtain: (7 − λ)(3 − λ)(−λ) = 0.
Hence, the given matrix A has three eigenvalues λ = 7, 3, 0.
4
4 6 6
Challenge 1.3. Find the eigenvalues of the matrix 1 3 2 .
−1 −5 −2
Solution: The characteristic polynomial is
λ3 − T race(A)λ2 + (A11 + A22 + A33 )λ − Det(A)
= λ3 − 5λ2 + (4 + (−2) + 6)λ − 4
= λ3 − 5λ2 + 8λ − 4.
Solving λ3 − 5λ2 + 8λ − 4 = 0 gives λ = 1, 2, 2 as the eigenvalues.
0 1
Challenge 1.4. Find the eigenvalues of the matrix .
−1 0
5
1.3 Eigenvectors
In previous section we learned how to find the eigenvalues of the given matrix
A. Here in this section, we are going to find the eigenvectors of matrix A by
using its eigenvalues. i.e., we will solve AX = λX in which A and λ(which
we are able to compute now) are known, so this is a linear problem to solve.
1 2
Challenge 1.5. Find the eigenvalues and eigenvectors of A = .
4 3
Solution: The characteristic polynomial is
λ2 − T race(A)λ + Det(A) = λ2 − 4λ − 5.
Solving λ2 − 4λ − 5 = 0 gives λ = 5, −1 as the eigenvalues.
Now we fix one eigenvalue and find the corresponding eigenvector/s.
Let λ = 5 be fixed,
we want to find a vector which satisfy AX = λX
which (A −λI)X
is equivalent to =0
1−5 2 x1 0
i.e., = ,
4 3 − 5 x2 0
−4 2 x1 0
= which gives x2 = 2x1 .
4 −2 x2 0
Therefore X = (x1 , x2 )t = (x1 , 2x1 )t = x1 (1, 2)t .
So X1 = (1, 2)t is an eigenvector corresponding to λ = 5.
Note: Any nonzero multiple of X1 is also an eigenvector of A corresponding
to λ = 5.
Now fix λ = −1,
1+1 2 x1 0
consider (A − λI)X = 0 i.e., = ,
4 3 + 1 x 2 0
2 2 x1 0
= which gives x2 = −x1 .
4 4 x2 0
Ttherefore X = (x1 , x2 )t = (x1 , −x1 )t = x1 (1, −1)t .
So X2 = (1, −1)t is an eigenvector corresponding to λ = −1.
6
2 −1 1
Challenge 1.6. Find the eigenvalues and eigenvectors of A = 1 2 −1 .
1 −1 2
Solution: The characteristic polynomial is
λ3 − T race(A)λ2 + (A11 + A22 + A33 )λ − Det(A)
= λ3 − 6λ2 + (3 + 3 + 5)λ − 6
= λ3 − 6λ2 + 11λ − 6.
Solving λ3 − 6λ2 + 11λ − 6 = 0 gives λ = 1, 2, 3 as the eigenvalues.
For λ = 1,
2 − 1 −1 1 x1 0
consider (A − I)X = 0, 1 2 − 1 −1 x2 = 0 ,
1 −1 2 − 1 x3 0
1 −1 1 x1 0
1 1 −1 x2 = 0 , R2 − R1 , R3 − R1 gives
1 −1 1 x3 0
1 −1 1 x1 0
0 2 −2 x2 = 0 .
0 0 0 x3 0
This implies that x1 − x2 + x3 = 0 and x2 = x3 substituting in the first
equation gives x1 = 0.
Therefore X = (x1 , x2 , x3 ) = (0, x2 , x2 ) = x2 (0, 1, 1).
Hence X1 = (0, 1, 1) is an eigenvector corresponding to λ = 1.
Note: As the rank of the coefficient matrix is 2 and the number of unknowns
are 3 therefore there is 3 − 2 = 1 eigenvector corresponding to λ = 2.
For λ = 2,
2 − 2 −1 1 x1 0
consider (A − 2I)X = 0, 1 2 − 2 −1 x2 = 0 ,
1 −1 2 − 2 x3 0
0 −1 1 x1 0
1 0 −1 x2 = 0 .
1 −1 0 x3 0
This implies that x2 = x3 , x1 = x3 and x1 = x2 .
Therefore X = (x1 , x2 , x3 ) = (x1 , x1 , x1 ) = x1 (1, 1, 1).
Hence X2 = (1, 1, 1) is an eigenvector corresponding to λ = 2.
For λ = 3,
2 − 3 −1 1 x1 0
consider (A − 3I)X = 0, 1 2 − 3 −1 x2 = 0 ,
1 −1 2 − 3 x3 0
7
−1 −1 1 x1 0
1 −1 −1 x2 = 0 , R2 + R1 , R3 − R2 gives
1 −1 −1 x3 0
−1 −1 1 x1 0
0 −2 0 x2 = 0 .
0 0 0 x3 0
This implies that x1 + x2 − x3 = 0 and x2 = 0 substituting in first equation
gives x1 = x3 .
Therefore X = (x1 , x2 , x3 ) = (x3 , 0, x3 ) = x3 (1, 0, 1).
Hence X3 = (1, 0, 1) is an eigenvector corresponding to λ = 3.
3 2 4
Challenge 1.7. Find the eigenvalues and eigenvectors of A = 2 0 2 .
4 2 3
Solution: The characteristic polynomial is
λ − T race(A)λ2 + (A11 + A22 + A33 )λ − Det(A)
3
4 2 −5 x 3
0
−5 2 4 x1 0
0 − 36 18 x2 = 0 R3 + 5 R2 gives
5 5 2
0 18 −9 x 3
0
−5 2 4 x1 0
0 − 36 18 x2 = 0 .
5 5
0 0 0 x3 0
This implies that −5x1 + 2x2 + 4x3 = 0 and x3 = 2x2 substituting in the first
equation gives x1 = 2x2 .
Therefore X = (x1 , x2 , x3 ) = (2x2 , x2 , 2x2 ) = x2 (2, 1, 2).
Hence X1 = (2, 1, 2) is an eigenvector corresponding to λ = 8.
For λ = −1,
8
3+1 2 4 x1 0
consider (A + I)X = 0, 2 0+1 2 x2 = 0 ,
4 2 3+1 x3 0
4 2 4 x1 0
2 1 2 x2 = 0 , R2 − 1 R1 , R3 − R1 gives
2
4 2 4 x 3
0
4 2 4 x1 0
0 0 0 x2 = 0 .
0 0 0 x3 0
This implies that 4x1 + 2x2 + 4x3 = 0 i.e., x2 = −2x1 − 2x3 .
Therefore X = (x1 , x2 , x3 ) = (x1 , −2x1 −2x3 , x3 ) = x1 (1, −2, 0)+x2 (0, −2, 1).
Hence X2 = (1, −2, 0) and X3 = (0, −2, 1) are eigenvectors corresponding to
λ = −1.
Note: As the rank of the coefficient matrix (A + I) is 1 and the number of
unknowns are 3 therefore there are 3 − 1 = 2 eigenvectors corresponding to
λ = −1.
6 −6 6 x 3 0
7 −1 1 x1 0
0 0 0 x2 = 0 .
0 − 367
36
7
x3 0
9
This implies that 7x1 − x2 + x3 = 0 and x3 = x2 substituting in the first
equation gives x1 = 0.
Therefore X = (x1 , x2 , x3 ) = (0, x2 , x2 ) = x2 (0, 1, 1).
Hence X1 = (0, 1, 1) is an eigenvector corresponding to λ = −4
For λ = 2,
3−2 −1 1 x1 0
consider (A − 2I)X = 0, 7 −5 − 2 1 x2 = 0 ,
6 −6 2−2 x3 0
1 −1 1 x1 0
7 −7 1 x2 = 0 , R2 − 7R1 , R3 − 6R1 gives
6 −6 0 x3 0
1 −1 1 x1 0
0 0 0 x2 = 0 .
0 0 −6 x3 0
This implies that x1 − x2 + x3 = 0 and x3 = 0 substituting in the first equa-
tion gives x1 = x2 .
Therefore X = (x1 , x2 , x3 ) = (x2 , x2 , 0) = x2 (1, 1, 0).
Hence X2 = (1, 1, 0) is an eigenvector corresponding to λ = 2.
1 0 0
Challenge 1.9. Find the eigenvalues and eigenvectors of A = 0 1 0 .
0 0 1
Solution: The characteristic polynomial is
λ − T race(A)λ2 + (A11 + A22 + A33 )λ − Det(A)
3
= λ3 − 3λ2 + (1 + 1 + 1)λ − 1
= λ3 − 3λ2 + 3λ − 3.
Solving λ3 − 3λ2 + 3λ − 3 = 0 gives λ = 1, 1, 1 as the eigenvalues.
For λ = 1,
1−1 0 0 x1 0
consider (A − I)X = 0, 0 1−1 0 x2 = 0 ,
0 0 1−1 x3 0
0 0 0 x1 0
0 0 0 x2 = 0 .
0 0 0 x3 0
This means there is no condition on x1 , x2 , x3 .
Therefore X = (x1 , x2 , x3 ) = x1 (1, 0, 0) + x2 (0, 1, 0) + x3 (0, 0, 1).
Hence X1 = (1, 0, 0), X2 = (0, 1, 0) and X3 = (0, 0, 1) are eigenvectors corre-
10
sponding to λ = 1.
Note: As the rank of the coefficient matrix is 0 and the number of unknowns
are 3, therefore there are 3 − 0 = 3 eigenvectors corresponding to λ = 1,
Some facts:
Or
11
As the given matrix is lower triangular, so the eigenvalues are the diagonal
elements i.e., λ = −1, −2, −3. are the eigenvalues of A.
For λ = −1,
−1 + 1 0 0 x1 0
consider (A + I)X = 0, 2 −3 + 1 0 x2 = 0 ,
1 4 −2 + 1 x3 0
0 0 0 x1 0
2 −2 0 x2 = 0 .
1 4 −1 x3 0
This implies that x1 + 4x2 − x3 = 0 and x1 = x2 substituting in the first
equation gives x3 = 5x2 .
Therefore X = (x1 , x2 , x3 ) = (x2 , x2 , 5x2 ) = x2 (1, 1, 5).
Hence X1 = (1, 1, 5) is an eigenvector of A corresponding to λ = −1.
For λ = −2,
−1 + 2 0 0 x1 0
consider (A + 2I)X = 0, 2 −3 + 2 0 x2 = 0 ,
1 4 −2 + 2 x3 0
1 0 0 x1 0
2 −1 0 x2 = 0 .
1 4 0 x3 0
This implies that x1 = 0 and 2x1 − x2 = 0 which gives x2 = 0 and there is
no condition on x3 .
Therefore X = (x1 , x2 , x3 ) = (0, 0, x3 ) = x3 (0, 0, 1).
Hence X2 = (0, 0, 1) is an eigenvector of A corresponding to λ = −2.
For λ = −3,
−1 + 3 0 0 x1 0
consider (A + 3I)X = 0, 2 −3 + 3 0 x2 = 0 ,
1 4 −2 + 3 x3 0
2 0 0 x1 0
2 0 0 x2 = 0 .
1 4 1 x3 0
This implies that x1 + 4x2 + x3 = 0 and x1 = 0 substituting in the first
equation gives x3 = −4x2 .
Therefore X = (x1 , x2 , x3 ) = (0, x2 , −4x2 ) = x2 (0, 1, −4).
Hence X3 = (0, 1, −4) is an eigenvector of A corresponding to λ = −3.
As the eigenvalues of A are −1, −2, −3.
So the eigenvalues of A2 + 5A − 2I are
(−1)2 + 5(−1) − 2, (−2)2 + 5(−2) − 2, (−3)2 + 5(−3) − 2
12
i.e., the eigenvalues of A2 + 5A − 2I are −6, − 8, − 8.
We know that eigenvectors of A2 + 5A − 2I are same as that of A.
So eigenvectors of A2 + 5A − 2I are X1 = (1, 1, 5), X2 = (0, 0, 1) and X3 =
(0, 1, −4).
More Challenges:
Q.1. Find the eigenvalues and eigenvectors of the following matrices
8 −8 −2
1. 4 −3 −2 . Answer: λ = 1, 2, 3.
3 −4 1
2 1 1
2. 2 3 2 . Answer: λ = 1, 1, 7.
3 3 4
1 2 2
3. 2 1 2 . Answer: λ = −1, −1, 5.
2 2 1
2 3 4
4. 0 2 −1 . Answer: λ = 1, 2, 2.
0 0 1
−2 2 −3
5. 2 1 −6 . Ans: λ = 5, −3, −3. X1 = (1, 2, −1), X2 = (2, −1, 0), X3 =
−1 −2 0
(3, 0, 1).
1/3 2/3 2/3
6. 2/3 1/3 −2/3 . Ans: λ = 1, 1, −1. X1 = (1, −1, 0), X2 =
2/3 −2/3 1/3
(1, 0, −1), X3 = (1, −1, −1).
13
2 Diagonalization
2.1 Introduction
Diagonalizable matrices are of interest because diagonal matrices are espe-
cially easy to handle. In this section we will try to express the given matrix A
as A = P DP −1 where D is diagonal matrix and P is some invertible matrix.
Not all matrices are expressible in such way but the matrices which has such
expression are called diagonalizable matrix, we will define diagonalizability
formally ahead. Once we have such expression then we can easily find An as
An = (P DP −1 )(P DP −1 ) · · · (P DP −1 )
= P D(P −1 P )D(P −1 P )D · · · D(P −1 P )P −1 = P Dn P −1 . Similarly we will be
able to compute eA , tan(A) and manymore functions of A which we will dis-
cuss in detail.
14
2.2 Similarity of matrices
Definition 2.1 (Similar). Two n × n matrices A and B are called similar if
there exists an n × n invertible matrix P such that B = P −1 AP.
2 −3 1 0
For example: A = is similar to B = because
1 −2 0 −1
3 1
P = works. To check B = P −1 AP, we check that P B = AP and
1 1
3 −1
observe that P B = = AP.
1 −1
Some properties of similar matrices:
If A and B are two similar matrices then the following holds:
1. Rank(A)=Rank(B).
In this section we saw some examples which tells us that there are matrices
which are similar and there are matrices which are not similar.
The next section focuses on the matrices which are similar to diagonal ma-
trices to which we call as diagonalizable matrices.
15
2.3 Diagonalizable matrices
Definition 2.2 (Diagonalizable). A square matrix A is said to be diagonal-
izable if it is similar to diagonal matrix. i.e., if there exist a diagonal matrix
D and an invertible matrix P (called modal matrix) such that A = P DP −1 .
Some facts:
3. Necessary and sufficient condition and above point ensures that there
are n linearly independent eigenvectors for a diagonalizable matrix.
16
7. Suppose matrix A has distinct eigenvalues i.e., A.M (λ) = 1 for all the
eigenvalues of A then G.M (λ) = 1 for all the eigenvalues of A by above
point. This implies that A.M (λ) = G.M (λ) = 1, for all the eigenvalues
of A. So we can state the following
If a matrix A has distinct eigenvalues then it is diagonalizable,
converse of this statement is not true.
17
−7 2 −3 x1 0
2 −4 −6 x2 = 0 , R1 − 7R3 , R2 + 2R3 gives
−1 −2 −5 x3 0
0 16 32 x1 0
0 −8 −16 x2 = 0 , R1 − 2R2 gives
−1 −2 −5 x3 0
0 0 0 x1 0
0 −8 −16 x2 = 0 .
−1 −2 −5 x3 0
This implies that x2 = −2x3 and −x1 − 2x2 − 5x3 = 0 substituting first
equation in second equation gives x1 = −x3 . Therefore X = (x1 , x2 , x3 ) =
(−x3 , −2x3 , x3 ) = x3 (−1, −2, 1).
Hence X3 = (−1, −2, 1) is an eigenvector corresponding to λ = 5.
We obtained 2 linearly independent eigenvectors corresponding to λ = −3
and 1 eigenvector corresponding to λ = 5,
therefore G.M (−3) = 2 and G.M (5) = 1.
Henceby necessary and sufficient
condition, A is diagonalizable.
5 0 0 −1 −2 3
D = 0 −3 0 , P = −2 1 0 are the required diagonal and
0 0 −3 1 0 1
modal matrix respectively.
18
8−1 −8 −2 x1 0
consider (A − I)X = 0, 4 −3 − 1 −2 x2 = 0 ,
3 −4 1−1 x3 0
7 −8 −2 x1 0
4 −4 −2 x2 = 0 , R1 − R2 , R2 − 4 R3 gives
3
3 −4 0 x 3
0
3 −4 0 x1 0
0 4 −2 x2 = 0 , R1 − R3 gives
3
3 −4 0 x3 0
0 0 0 x1 0
0 4 −2 x2 = 0 .
3
3 −4 0 x3 0
4
This implies that 3 x2 = −2x3 i.e., 2x2 = 3x3 and 3x1 = 4x2 ,
if we substitute x3 = 2t then x2 = 3t and x1 = 4t,
then X = (x1 , x2 , x3 ) = (4t, 3t, 2t) = t(4, 3, 2).
Hence X1 = (4, 3, 2) is an eigenvector corresponding to λ = 1.
For λ = 2,
8−2 −8 −2 x1 0
consider (A − 2I)X = 0, 4 −3 − 2 −2 x2 = 0 ,
3 −4 1−2 x3 0
6 −8 −2 x1 0
4 −5 −2 x2 = 0 , R1 − 2R3 , R2 − 4 R3 gives
3
3 −4 −1 x 3
0
0 0 0 x1 0
0 1 − 2 x2 = 0 .
3 3
3 −4 −1 x3 0
This implies that 3x1 − 4x2 − x3 = 0 and x2 = 2x3 substituting in the first
equation gives x1 = 3x3 .
Therefore X = (x1 , x2 , x3 ) = (3x3 , 2x3 , x3 ) = x3 (3, 2, 1).
Hence X1 = (3, 2, 1) is an eigenvector corresponding to λ = 2.
For λ = 3,
8−3 −8 −2 x1 0
consider (A − 3I)X = 0, 4 −3 − 3 −2 x2 = 0 ,
3 −4 1−3 x3 0
5 −8 −2 x1 0
4 −6 −2 x2 = 0 , R1 − 2R3 , R2 − 4 R3 gives
3
3 −4 −2 x3 0
19
5 −8 −2 x1 0
0 2 − 2 x2 = 0 .
5 5
0 45 − 54 x3 0
This implies that 5x1 − 8x2 − 2x3 = 0 and x2 = x3 substituting in the first
equation gives x1 = 2x3 .
Therefore X = (x1 , x2 , x3 ) = (2x3 , x3 , x3 ) = x3 (2, 1, 1).
HenceX1 = (2, 1,1) is aneigenvector corresponding to λ = 3.
1 0 0 4 3 2
D = 0 2 0 , P = 3 2 1 are the required diagonal and modal
0 0 3 2 1 1
matrix respectively.
By referring to the problems solved in previous section we observe that
More challenges:
Check whether the following matrices are diagonalizable? If it is, then find
the diagonal matrix and modal matrix.
4 2 −2 1 0 0 2 1 0
1. −5 3 2 Ans: D = 0 2 0 , P = 1 1 1 .
−2 4 1 0 0 5 4 2 1
2 1 0
2. 0 2 1 Ans: λ = 2, 2, 2. Here A.M (2) = 3 but G.M (2) = 1.
0 0 2
Hence the given matrix is not diagonalizable.
−17 18 −6
3. −18 19 −6 Ans: λ = 2, 1, 1. Here A.M (1) = 2 but G.M (1) = 1
−9 9 2
Hence the given matrix is not diagonalizable.
20
2 −1 1
4. 2 2 −1 Ans: λ = 1, 1, 1. Here A.M (1) = 3 but G.M (1) = 1.
1 2 −1
Hence the given matrix is not diagonalizable.
21
3 Cayley Hamilton theorem
3.1 Introduction
We turn now to a remarkable theorem, named after Arthur Cayley and Sir
William Rowan Hamilton. This deals with the polynomials of the square
matrices. The theorem helps us to find A−1 if exists and also gives answer
for the higher degree polynomials of the matrix.
3.2 Theorem
Theorem 3.1 (Cayley-Hamilton). Every square matrix satisfies its own
characteristic polynomial, i.e., If p(λ) is characteristic polynomial of a square
matrix A then p(A) = 0.
7 2 −2
Challenge 3.1. Verify Cayley-Hamilton theorem for A = −6 −1 2
6 2 −1
−1
and hence find A if exists.
Solution: The characteristic polynomial of A is given by
λ − T race(A)λ2 + (A11 + A22 + A33 )λ − Det(A)
3
22
79 − 125 + 49 − 3 26 − 40 + 14 −26 + 40 − 14 0 0 0
= −78 + 120 − 42 −25 + 35 − 7 26 − 40 + 14 = 0 0 0 .
78 − 120 + 42 26 − 40 + 14 −25 + 35 − 7 0 0 0
In the process of finding characteristic polynomial of A, we found that Det(A) =
3, so A−1 exists.
We verified that A3 − 5A2 + 7A − 3I = 0.
Multiplying both side by A−1 we get,
A−1 A3 − 5A−1 A2 + 7A−1 A − 3A−1 I = 0 i.e., A2 − 5A + 7I − 3A−1 = 0.
−1 1 2
Hence A = 3 (−A + 5A− 7I)
25 8 −8 7 2 −2 1 0 0
= 31 − −24 −7 8 + 5 −6 −1 2 − 7 0 1 0
24 8 −7 6 2 −1 0 0 1
−3 −2 2
1
=3 6 5 −2 .
−6 −2 5
1 2 −2
Challenge 3.2. Verify Cayley-Hamilton theorem for A = −1 3 0
0 −2 1
−1 4
and hence find A if exists, and A .
23
1 2 −2 1 0 0
9 −1 3 0 − 0 1 0
0 −2 1 0 0 1
−13 + 5 + 9 − 1 42 − 60 + 18 −2 + 20 − 18 0 0 0
= −11 + 20 − 9 9 − 35 + 27 − 1 10 − 10 = 0 0 0 .
10 − 10 −22 + 40 − 18 −3 − 5 + 9 − 1 0 0 0
In the process of finding characteristic polynomial of A,
we found that Det(A) = 1, so A−1 exists.
We verified that A3 − 5A2 + 9A − I = 0.
Multiplying both side by A−1 we get,
A−1 A3 − 5A−1 A2 + 9A−1 A − A−1 I = 0 i.e., A2 − 5A + 9I − A−1 = 0.
−1 2
Hence A = −A + 5A −9I
−1 12 −4 1 2 −2 1 0 0 3 2 6
= − −4 7 2 +5 −1 3 0 −9 0 1 0 = 1 1 2 .
2 −8 1 0 −2 1 0 0 1 2 2 5
4 3 2
To find A multiply the verified equation A − 5A + 9A − I = 0 by A,
4 3 2
A = 5A − 9A + A
we get
−13 42 −2 −1 12 −4 1 2 −2
= 5 −11 9 10 − 9 −4 7 2 + −1 3 0
10 −22 −3 2 −8 1 0 −2 1
−55 104 24
= −20 −15 32 .
32 −42 13
2 1 1
Challenge 3.3. Find the characteristic equation of the matrix A = 0 1 0
1 1 2
and hence find the matrix represented by A8 − 5A7 + 7A6 − 3A5 + A4 − 5A3 +
8A2 − 2A + I.
Solution: The characteristic polynomial of A is given by
λ3 − T race(A)λ2 + (A11 + A22 + A33 )λ − Det(A)
= λ3 − 5λ2 + (2 + 3 + 2)λ − 3
= λ3 − 5λ2 + 7λ − 3.
Therefore λ3 − 5λ2 + 7λ − 3 = 0 is the required characteristic equation.
Now dividing the polynomial λ8 − 5λ7 + 7λ6 − 3λ5 + λ4 − 5λ3 + 8λ2 − 2λ + 1
by the characteristic polynomial λ3 − 5λ2 + 7λ − 3,
we get the quotient λ5 + λ and the remainder λ2 + λ + 1.
So λ8 − 5λ7 + 7λ6 − 3λ5 + λ4 − 5λ3 + 8λ2 − 2λ + 1 = (λ3 − 5λ2 + 7λ − 3)(λ5 +
24
λ) + λ2 + λ + 1.
In terms of A we have,
A8 − 5A7 + 7A6 − 3A5 + A4 − 5A3 + 8A2 − 2A + I = (A3 − 5A2 + 7A − 3I)(A5 +
A) + A2 + A + I.
But by Cayley-Hamilton theorem A3 − 5A2 + 7A − 3I = 0.
8 7 6 5 4 3 2 2
A − 5A
Therefore + 7A − 3A +A − 5A + 8A − 2A + I = A + A + I.
2 1 1 2 1 1 5 4 4
2
A = 0 1 0
0 1 0 = 0 1 0 .
1 1 2 1 1 2 4 4 5
5 4 4 2 1 1 1 0 0 8 5 5
Hence A2 +A+I = 0 1 0 + 0 1 0 + 0 1 0 = 0 3 0 .
4 4 5 1 1 2 0 0 1 5 5 8
1 2
Challenge 3.4. Apply Cayley-Hamilton theorem to A = and
2 −1
deduce that A8 = 625I.
Solution: The characteristic polynomial of A is given by
2
λ − T race(A)λ + Det(A)
= λ2 − 0λ − 5 = λ2 − 5.
So by Cayley-Hamilton theorem A2 − 5I = 0 i.e., A2 = 5I.
Therefore A8 = 54 I = 625I.
1 2 0
Challenge 3.5. Verify Cayley-Hamilton theorem for A = 2 −1 0
0 0 −1
−2
and hence find A if exists.
Solution: The characteristic equation is
λ + λ2 − 5λ − 5 = 0.
3
3 2 2
whether A + A − 5A − 5I= 0For this we
To check need A and A3 .
1 2 0 1 2 0 5 0 0
A2 = 2 −1 0 2 −1 0 = 0 5 0
0 0 −1 0 0 −1 0 0 1
5 0 0 1 2 0 5 10 0
A3 = A2 A = 0 5 0 2 −1 0 = 10 −5 0
0 0 1 0 0 −1 0 0 −1
5 10 0 5 0 0 1 2 0
A3 + A2 − 5A − 5I = 10 −5 0 + 0 5 0 − 5 2 −1 0 −
0 0 −1 0 0 1 0 0 −1
25
1 0 0
5 0 1 0
0 0 1
5+5−5−5 10 − 10 0 0 0 0
= 10 − 10 −5 + 5 + 5 − 5 0 = 0 0 0
0 0 −1 + 1 + 5 − 5 0 0 0
Multiply the verified equation A3 + A2 − 5A − 5I = 0 by A−1 (which exists
as Det(A) = 5)
−1 −1
we get, 2
A + A −5I − 5A = 0 i.e.,A = 51 (A2 + A
− 5I)
5 0 0 1 2 0 1 0 0 1 2 0
= 15 0 5 0 + 2 −1 0 − 5 0 1 0 = 15 2 −1 0 .
0 0 1 0 0 −1 0 0 1 0 0 −5
1 2 0 1 2 0 5 0 0
Now A−2 = (A−1 )2 = 51 2 −1 0 15 2 −1 0 = 25 1
0 5 0 .
0 0 −5 0 0 −5 0 0 25
More challenges:
2 −1 1
Q.1. Find the characteristic equation of the matrix A = −1 2 −1
1 −1 2
−1
and verify Cayley-Hamilton theorem and hence, obtain A if exists.
1 3 7
Q.2. Find the characteristic equation of the matrix A = 4 2 3 and
1 2 1
hence, find the matrix represented by A − 4A − 20A − 34A − 4A3 − 20A2 −
7 6 5 4
33A + I.
cos θ sin θ
Q.3. Verify Cayley-Hamilton theorem for A = and obtain
− sin θ cos θ
A−1 .
1 2
Q.4. Verify Cayley-Hamilton theorem for A = and evaluate 2A4 −
2 2
5A3 − 7A + 6I.
26
4 Functions of square matrix
4.1 Introduction
In this section, we use the concepts which were discussed in the previous
sections and apply it to find the function f (A) = An , eA , tan(A), etc.
where A is given.
27
3. cA = P cD P −1 , where c is constant.
28
3 1
Challenge 4.2. If A = 2
1
2
3 , find eA and 4A .
2 2
29
Therefore X = (x1 , x2 ) = (x2 , x2 ) = x2 (1, 1).
Hence X1 = (1, 1) is an eigenvector corresponding to λ = 3.
For λ = −3,
−1 + 3 4 x1 0
consider (A + 3I)X = 0 i.e., = .
2 1+3 x2 0
This imples that x1 = −2x2 .
Therefore X = (x1 , x2 ) = (−2x2 , x2 ) = x2 (−2, 1).
Hence X2 = (−2, 1) is an eigenvector corresponding to λ =−3.
3 0 1 −2 1 2
So D = ,P = and P −1 = 13 .
0 −3 1 1 −1 1
−1
Therefore
tan(A)
= P tan(D)P
1 −2 tan 3 0 1 1 2
=
1 1 0 tan(−3)
3 −1 1
− tan 3 4 tan 3 −1 4
= 13 = tan3 3 .
2 tan 3 tan 3 2 1
Hence 3 tan(A) = (tan 3)A.
30
(Note that the given matrix is upper triangular, so eigenvalues are the diag-
onal elements.)
Here f (A) = A100 and A is of order 3 × 3.
So let A100 = a2 A2 + a1 A + a0 I, in terms of λ the equation becomes
λ100 = a2 λ2 + a1 λ + a0 .
The above equality is satisfied by the eigenvalues of A.
Putting λ = 1, 2, −1 we get the equations
a2 + a1 + a0 = 1 (3)
a2 + a0 = 1 (6)
a1 = 0 (7)
31
Since the eigenvalue is repeated, so the eigenvalue will also satisfy the deriva-
tive of λ50 = a1 λ + a0 i.e., 50λ49 = a1
Putting λ = −1, we get
a1 = −50 (9)
Solving both the equation gives a0 = −49 and a1 = −50.
50 −100 −150 49 0
∴ A = a1 A + a0 I = −50A − 49I = − =
150 200 0 49
−149 −150
.
150 151
π π4
Challenge 4.6. If A = , find cos A.
0 π2
Solution: The given matrix is upper triangular, so the eigenvalues of the
matrix A are the diagonal elements, i.e., λ = π, π2 .
Let cos A = a1 A + a0 I in terms of λ the equation becomes
cos λ = a1 λ + a0 .
The above equation is satisfied by the eigenvalues of A.
Putting λ = π, we get
a1 π + a0 = −1 (10)
Putting λ = π2 , we get
a1 π
+ a0 = 0 (11)
2
On solving, we get a1 = − π2 and a0 = 1.
π π4 −1 − 12
2 2 1 0
Therefore cos A = − π A + I = − π + =
0 π2 0 1 0 0
1 0 0
Challenge 4.7. If A = 1 0 1 , find A50 .
0 1 0
Solution: The characteristic equation is
λ − λ2 + ((−1) + 0 + 0)λ − (−1) = 0 i.e., λ3 − λ2 − λ + 1 = 0
3
32
The above equality is satisfied by the eigenvalues of A. Putting λ = −1, we
get
a2 − a1 + a0 = 1 (12)
Putting λ = 1, we get
a2 + a1 + a0 = 1 (13)
Now as the eigen values λ = 1 is repeated, so λ = 1 will satify the derivative
of λ50 = a2 λ2 + a1 λ + a0 = 0 i.e., 50λ49 = 2a2 λ + a1 . Putting λ = 1, we get
50 = 2a2 + a1 (14)
33
References
[1] Linear Algebra, Theory and application, Ward Cheney and David Kin-
caid.
34