Eigenvalues and Diagonalization Explained
Eigenvalues and Diagonalization Explained
39. Prove that the intersection of any two distinct eigenspaces of Working withTechnology
a matrix A is {0}. T1. For the given matrix A, find the characteristic polynomial
and the eigenvalues, and then use the method of Example 7 to find
True-False Exercises bases for the eigenspaces.
TF. In parts (a)–(f) determine whether the statement is true or
−8 33 38 173 −30
false, and justify your answer. 0
0 −1 −4 0
(a) If A is a square matrix and Ax = λx for some nonzero scalar
λ, then x is an eigenvector of A. A=
0 0 −5 −25 1
0 0 1 5 0
(b) If λ is an eigenvalue of a matrix A, then the linear system 4 −16 −19 −86 15
(λI − A)x = 0 has only the trivial solution.
T2. The Cayley–Hamilton Theorem states that every square ma-
(c) If the characteristic polynomial of a matrix A is trix satisfies its characteristic equation; that is, if A is an n × n
p(λ) = λ2 + 1, then A is invertible. matrix whose characteristic equation is
λ## + c1 λn−1 + · · · + cn = 0
(d) If λ is an eigenvalue of a matrix A, then the eigenspace of A
corresponding to λ is the set of eigenvectors of A correspond- then An + c1 An−1 + · · · + cn = 0.
ing to λ. (a) Verify the Cayley–Hamilton Theorem for the matrix
0 1 0
(e) The eigenvalues of a matrix A are the same as the eigenvalues
of the reduced row echelon form of A. A = 0 0 1
2 −5 4
(f ) If 0 is an eigenvalue of a matrix A, then the set of columns of (b) Use the result in Exercise 28 to prove the Cayley–Hamilton
A is linearly independent. Theorem for 2 × 2 matrices.
def(x) 0
5.2 Diagonalization
=
In this section we will be concerned with the problem of finding a basis for Rn that consists
b =-. of eigenvectors of an n × n matrix A. Such bases can be used to study geometric properties
of A and to simplify various numerical computations. These bases are also of physical
A1,
significance in a wide variety of applications, some of which will be considered later in this
text.
bI -
The Matrix Diagonalization Products of the form P −1AP in which A and P are n × n matrices and P is invertible
Problem will be our main topic of study in this section. There are various ways to think about
A →P −1AP
in which the matrix A is mapped into the matrix P −1AP . These are called similarity
transformations. Such transformations are important because they preserve many prop-
determinant since I
I
erties of the matrix A. For example, if we let B = P −1AP , then A and B have the same
T-
=
1
= det(A) det(P ) = det(A)
det(P )
5.2 Diagonalization 303
Property Description
We will find the following terminology useful in our study of similarity transforma-
tions.
Note that if B is similar to A, then it is also true that A is similar to B since we can
express A as A = Q−1 BQ by taking Q = P −1 . This being the case, we will usually say
that A and B are similar matrices if either is similar to the other.
Because diagonal matrices have such a simple form, it is natural to inquire whether
a given n × n matrix A is similar to a matrix of this type. Should this turn out to be
the case, and should we be able to actually find a diagonal matrix D that is similar to
A, then we would be able to ascertain many of the similarity invariant properties of A
directly from the diagonal entries of D . For example, the diagonal entries of D will
be the eigenvalues of A (Theorem 5.1.2), and the product of the diagonal entries of D
will be the determinant of A (Theorem 2.1.2). This leads us to introduce the following
terminology.
The following theorem and the ideas used in its proof will provide us with a roadmap
for devising a technique for determining whether a matrix is diagonalizable and, if so,
for finding a matrix P that will perform the diagonalization.
304 Chapter 5 Eigenvalues and Eigenvectors
Part (b) of Theorem 5.2.1 is THEOREM 5.2.1 If A is an n × n matrix, the following statements are equivalent.
equivalent to saying that there (a) A is diagonalizable.
is a basis for R n consisting of
eigenvectors of A. Why?
(b) A has n linearly independent eigenvectors.
Proof (a) ⇒ (b) Since A is assumed to be diagonalizable, it follows that there exist an
invertible matrix P and a diagonal matrix D such that P −1AP = D or, equivalently,
X
AP = PD (1)
If we denote the column vectors of P by p1 , p2 , . . . , pn , and if we assume that the
diagonal entries of D are λ1 , λ2 , . . . , λn , then by Formula (6) of Section 1.3 the left side
of (1) can be expressed as
AP = A[p1 p2 · · · pn ] = [Ap1 Ap2 · · · Apn ]
and, as noted in the comment following Example 1 of Section 1.7, the right side of (1)
can be expressed as
PD = [λ1 p1 λ2 p2 · · · λn pn ]
Thus, it follows from (1) that
Ap1 = λ1 p1 , Ap2 = λ2 p2 , . . . , Apn = λn pn (2)
Since P is invertible, we know from Theorem 5.1.5 that its column vectors p1 , p2 , . . . , pn
are linearly independent (and hence nonzero). Thus, it follows from (2) that these n
column vectors are eigenvectors of A.
X
entries, then
AP = A[p1 p2 · · · pn ] = [Ap1 Ap2 · · · Apn ]
= [λ1 p1 λ2 p2 · · · λn pn ] = PD
Since the column vectors of P are linearly independent, it follows from Theorem 5.1.5
that P is invertible, so that this last equation can be rewritten as P −1AP = D , which
shows that A is diagonalizable.
Whereas Theorem 5.2.1 tells us that we need to find n linearly independent eigen-
vectors to diagonalize a matrix, the following theorem tells us where such vectors might
be found. Part (a) is proved at the end of this section, and part (b) is an immediate
consequence of part (a) and Theorem 5.2.1 (why?).
-
THEOREM 5.2.2
Remark Part (a) of Theorem 5.2.2 is a special case of a more general result: Specifically, if
λ1 , λ2 , . . . , λk are distinct eigenvalues, and if S1 , S2 , . . . , Sk are corresponding sets of linearly
independent eigenvectors, then the union of these sets is linearly independent.
5.2 Diagonalization 305
Procedure for Theorem 5.2.1 guarantees that an n × n matrix A with n linearly independent eigen-
Diagonalizing a Matrix vectors is diagonalizable, and the proof of that theorem together with Theorem 5.2.2
suggests the following procedure for diagonalizing A.
=
Step 1. Determine first whether the matrix is actually diagonalizable by searching for
n linearly independent eigenvectors. One way to do this is to find a basis for
each eigenspace and count the total number of vectors obtained. If there is
a total of n vectors, then the matrix is diagonalizable, and if the total is less
than n, then it is not.
XB
Step 2. If you ascertained that the matrix is diagonalizable, then form the matrix
P = [p1 p2 · · · pn ] whose column vectors are the n basis vectors you ob-
x
tained in Step 1.
Step 3. P −1AP will be a diagonal matrix whose successive diagonal entries are the
eigenvalues λ1 , λ2 , . . . , λn that correspond to the successive columns of P .
⑧
Solution In Example 7 of the preceding section we found the characteristic equation of
A to be
(λ − 1)(λ − 2)2 = 0
and we found the following bases for the eigenspaces:
−1 0 −2
I λ = 2: p1 = 0 , p2 = 1 ;
1
0
I
λ= 1: p3 = 1
1
- -
In general, there is no preferred order for the columns of P . Since the i th diagonal
entry of P −1AP is an eigenvalue for the i th column vector of P , changing the order of
the columns of P just changes the order of the eigenvalues on the diagonal of P −1AP .
Thus, had we written
−1 −2 0
P = 0 1 1
1 1 0
O
det (XI-A) 0
=
(???)
=-
-=
pAp
(= i]
=
(b-
as) is) 0
=
(b z)(d(x 3)
-
-
2)
+
0
=
(3 -
2)(32 - 3 2)
+
0
=
6 - 2 0
=
or ( - 36 2
+ 0
=
2)
Size (b 1)(x 0
=
-
-
X-2
↑
or =
0
et
sexamet
:(i) 1=?] :1i]
306 Chapter 5 Eigenvalues and Eigenvectors
1 0 0
A= 1
−3
2
5
0
2
I
Solution The characteristic polynomial of A is
' '
-
'λ−1
' 0 0 ''
y
'
det(λI − A) = ' −1
'
' 3
'
λ−2 ' = (λ − 1)(λ − 2)2
'
0 I -
o) (b 1)(x
=
-
-
−5 λ−2'
d !, x 2
=
0
(I A)*
=
-
=
so the characteristic equation is =
(λ − 1)(λ − 2)2 = 0
is(to
I he and the distinct eigenvalues of A are λ = 1 and λ = 2. We leave it for you to show that
Elt
bases for the eigenspaces are
:)
1
0
81
λ = 1: p1 = − ; λ = 2: p2 = 0
8
1 1
--- v
et
=
Since A is a 3 × 3 matrix and there are only two basis vectors in total, A is not diago-
nalizable.
it
t
Alternative Solution If you are concerned only in determining whether a matrix is di-
Let agonalizable and not with actually finding a diagonalizing matrix P , then it is not nec-
essary to compute bases for the eigenspaces—it suffices to find the dimensions of the
st
eigenspaces. For this example, the eigenspace corresponding to λ = 1 is the solution
space of the system
0 0 0 x1 0
−1 −1 0 x2 = 0
3 −5 −1
aftt
x3 0
Since the coefficient matrix has rank 2 (verify), the nullity of this matrix is 1 by Theo-
rem 4.8.2, and hence the eigenspace corresponding to λ = 1 is one-dimensional.
The eigenspace corresponding to λ = 2 is the solution space of the system
1 0 0 x1 0
−1 0 0 x2 = 0
3 −5 0 x3 0
This coefficient matrix also has rank 2 and nullity 1 (verify), so the eigenspace corre-
sponding to λ = 2 is also one-dimensional. Since the eigenspaces produce a total of two
basis vectors, and since three are needed, the matrix A is not diagonalizable.
5.2 Diagonalization 307
E X A M P L E 3 Recognizing Diagonalizability
We saw in Example 3 of the preceding section that
0 1 0
A = 0 0 1
4 −17 8
-
√ √
has three distinct eigenvalues: λ = 4, λ = 2 + 3, and λ = 2 − 3. Therefore, A is
diagonalizable and
4 0 0
√
P −1AP = 0 2+ 3 0
# √
0 0 2− 3
for some invertible matrix P . If needed, the matrix P can be found using the method
shown in Example 1 of this section.
−1 2 4 0
Y distinct
Ibil
0 7
O 3 1
A=
0 0 0 5 8 -
0 0 0 0 −2
is a diagonalizable matrix with eigenvalues λ1 = −1, λ2 = 3, λ3 = 5, λ4 = −2.
- - - -
Eigenvalues of Powers of a Since there are many applications in which it is necessary to compute high powers of a
Matrix square matrix A, we will now turn our attention to that important problem. As we will
=
see, the most efficient way to compute Ak , particularly for large values of k , is to first
diagonalize A. But because diagonalizing a matrix A involves finding its eigenvalues and
x eigenvectors, we will need to know how these quantities are related to those of Ak . As an
=
illustration, suppose that λ is an eigenvalue of A and x is a corresponding eigenvector.
Then
-
O A2 x = A(Ax) = A(λx) = λ(Ax) = λ(λx) = λ2 x
- . =
G-
which shows not only that λ2 is a eigenvalue of A2 but that x is a corresponding eigen-
vector. In general, we have the following result.
Note that diagonalizability is THEOREM 5.2.3 If k is a positive integer, λ is an eigenvalue of a matrix A, and x is
not a requirement in Theo- a corresponding eigenvector, then λk is an eigenvalue of Ak and x is a corresponding
rem 5.2.3. eigenvector.
A= 1 2 0
−3 5 2 >=
2
x5 b
b =
x2,
=
128] ⑰)]
↓ t
=
eigenvalues
-
of
308 Chapter 5 Eigenvalues and Eigenvectors
Computing Powers of a The problem of computing powers of a matrix is greatly simplified when the matrix is
Matrix diagonalizable. To see why this is so, suppose that A is a diagonalizable n × n matrix,
that P diagonalizes A, and that
λ1 0 ··· 0
0 λ2 ··· 0
P −1AP = .. .. .. = D
. . .
pAP:D"
0 0 · · · λn
Squaring both sides of this equation yields
λ21 0 ··· 0
0 0
O
⑧
λ22 ···
** A*I* PpYp"
(P −1AP )2 = .. .. .. = D 2
. . .
=
0 0 · · · λ2n
-
We can rewrite the left side of this equation as
I. A"l
PDYp-
=
(P-.
−1
AP )2 = P −1APP −1AP = P −1 AIAP = P −1A2 P
from which we obtain the relationship P −1A2P = D 2 . More generally, if k is a positive
Al PDp
integer, then a similar computation will show that
= λk1 0 ··· 0
0 λk2 ··· 0
P −1AkP = D k = .. .. ..
. . .
0 0 · · · λkn
which we can rewrite as
k
λ1 0 ··· 0
0 λk2 ··· 0
Ak = PD kP −1 = P .. .. .. P −1 (3)
.
⑳
. . -
0 0 · · · λkn
Formula (3) reveals that rais-
ing a diagonalizable matrix A E X A M P L E 6 Powers of a Matrix
to a positive integer power has
the effect of raising its eigen- Use (3) to find A13 , where
values to that power. 0 0 −2
A = 1 2 1
1 0 3
Solution We showed in Example 1 that the matrix A is diagonalized by
−1 0 −2
P = 0 1 1
1 0 1
and that
2 0 0
−1
D = P AP = 0 2 0
0 0 1
det(3I-A) 0 =
(972)
=
a
=
(x-
=
).*ist:
z(d z)(3(d -
-
3) 2)
+
=
0 (3 z)(3
=
-
z2)
+
0
=
(x z)(x
= -
-
z)(
=1 -
25(d -
1) =
0
[())
=
2
b.
/
-i) E]:(E
when
!
2
=)
= :]())
1. =)()=1]
I
->
0
2x, 2kz
+ =
0-
x, 2x, 0
=
x, x,
= +
-
-
0
-x x
left,
+
=
at tell
te
and SER Let
St,
aiy-s et aft
P [iii]
=
%
=Ppp- D PAP
A
=
D
[82 I
=
pil pi
2 as i
=
-8: I
2
--
[ i
10 010
i?"i I
I ⑧ I
i
10
[ ·!!: I
i
o
R,isR2 I -
R2
->
)iI=. I
=R2 R3
I!8 =]
I
I
+
->
o I 2
-,
I p=[102]
1 I
00' I I
I
W
2123 + R2 =
10
S
-
->
o 102 -
10-1
-
1RB + RI 0 01-10
-
I
": Dp-
w
A P
/?? Ig
5.2 Diagonalization 309
E
−1 0 −2 213 0 0 1 0 2
...
A13 = PD 13 P −1 = 0 1 1 0 213 0 1 1 1 (4)
1 0 1 0 0 113 −1 0 −1
−8190 0 −16382
= 8191 8192 8191
8191 0 16383
Remark With the method in the preceding example, most of the work is in diagonalizing A.
Once that work is done, it can be used to compute any power of A. Thus, to compute A1000 we
need only change the exponents from 13 to 1000 in (4).
Geometric and Algebraic Theorem 5.2.2(b) does not completely settle the diagonalizability question since it only
Multiplicity guarantees that a square matrix with n distinct eigenvalues is diagonalizable; it does not
preclude the possibility that there may exist diagonalizable matrices with fewer than n
distinct eigenvalues. The following example shows that this is indeed the case.
A full excursion into the study of diagonalizability is left for more advanced courses,
but we will touch on one theorem that is important for a fuller understanding of diago-
nalizability. It can be proved that if λ0 is an eigenvalue of A, then the dimension of the
eigenspace corresponding to λ0 cannot exceed the number of times that λ − λ0 appears
as a factor of the characteristic polynomial of A. For example, in Examples 1 and 2 the
characteristic polynomial is
(λ − 1)(λ − 2)2
Thus, the eigenspace corresponding to λ = 1 is at most (hence exactly) one-dimensional,
and the eigenspace corresponding to λ = 2 is at most two-dimensional. In Example 1
310 Chapter 5 Eigenvalues and Eigenvectors
c1 = c2 = · · · = cr = 0 (7)
cr+1 vr+1 = 0
5.2 Diagonalization 311
22. Show that the matrices 29. In the case where the matrix A in Exercise 28 is diagonalizable,
find a matrix P that diagonalizes A. [Hint: See Exercise 30 of
1 1 1 3 0 0
Section 5.1.]
A = 1 1 1 and B = 0 0 0
1 1 1 0 0 0 In Exercises 30–33, find the standard matrix A for the given lin-
ear operator, and determine whether that matrix is diagonalizable.
are similar. If diagonalizable, find a matrix P that diagonalizes A.
23. We know from Table 1 that similar matrices have the same 30. T (x1 , x2 ) = (2x1 − x2 , x1 + x2 )
rank. Show that the converse is false by showing that the
matrices 31. T (x1 , x2 ) = (−x2 , −x1 )
* + * +
1 0 0 1 32. T (x1 , x2 , x3 ) = (8x1 + 3x2 − 4x3 , −3x1 + x2 + 3x3 ,
A= and B =
0 0 0 0 4x1 + 3x2 )
have the same rank but are not similar. [Suggestion: If they
33. T (x1 , x2 , x3 ) = (3x1 , x2 , x1 − x2 )
were similar, then there would be an invertible 2 × 2 matrix P
for which AP = PB . Show that there is no such matrix.] 34. If P is a fixed n × n matrix, then the similarity transformation
24. We know from Table 1 that similar matrices have the same A →P −1AP
eigenvalues. Use the method of Exercise 23 to show that the
can be viewed as an operator SP (A) = P −1AP on the vector
converse is false by showing that the matrices
space Mnn of n × n matrices.
* + * +
1 1 1 0 (a) Show that SP is a linear operator.
A= and B =
0 1 0 1 (b) Find the kernel of SP .
have the same eigenvalues but are not similar. (c) Find the rank of SP .
25. If A, B , and C are n × n matrices such that A is similar to B Working with Proofs
and B is similar to C , do you think that A must be similar to
35. Prove that similar matrices have the same rank and nullity.
C ? Justify your answer.
36. Prove that similar matrices have the same trace.
26. (a) Is it possible for an n × n matrix to be similar to itself ?
Justify your answer. 37. Prove that if A is diagonalizable, then so is Ak for every positive
(b) What can you say about an n × n matrix that is similar to integer k.
0n×n ? Justify your answer. 38. We know from Table 1 that similar matrices, A and B , have
(c) Is is possible for a nonsingular matrix to be similar to a the same eigenvalues. However, it is not true that those eigen-
singular matrix? Justify your answer. values have the same corresponding eigenvectors for the two
matrices. Prove that if B = P −1AP , and v is an eigenvector of
27. Suppose that the characteristic polynomial of some matrix A B corresponding to the eigenvalue λ, then P v is the eigenvec-
is found to be p(λ) = (λ − 1)(λ − 3)2 (λ − 4)3 . In each part, tor of A corresponding to λ.
answer the question and explain your reasoning.
39. Let A be an n × n matrix, and let q(A) be the matrix
(a) What can you say about the dimensions of the eigenspaces
of A? q(A) = an An + an−1 An−1 + · · · + a1 A + a0 In
(b) What can you say about the dimensions of the eigenspaces
(a) Prove that if B = P −1AP , then q(B) = P −1 q(A)P .
if you know that A is diagonalizable?
(b) Prove that if A is diagonalizable, then so is q(A).
(c) If {v1 , v2 , v3 } is a linearly independent set of eigenvectors
of A, all of which correspond to the same eigenvalue of A, 40. Prove that if A is a diagonalizable matrix, then the rank of A
what can you say about that eigenvalue? is the number of nonzero eigenvalues of A.
28. Let ( ) 41. This problem will lead you through a proof of the fact that
a b the algebraic multiplicity of an eigenvalue of an n × n matrix
A=
c d A is greater than or equal to the geometric multiplicity. For
Show that this purpose, assume that λ0 is an eigenvalue with geometric
multiplicity k.
(a) A is diagonalizable if (a − d)2 + 4bc > 0.
(a) Prove that there is a basis B = {u1 , u2 , . . . , un } for R n
(b) A is not diagonalizable if (a − d)2 + 4bc < 0. in which the first k vectors of B form a basis for the
[Hint: See Exercise 29 of Section 5.1.] eigenspace corresponding to λ0 .
5.3 Complex Vector Spaces 313
(b) Let P be the matrix having the vectors in B as col- (g) If there is a basis for R n consisting of eigenvectors of an n × n
umns. Prove that the product AP can be expressed as matrix A, then A is diagonalizable.
( )
λ0 Ik X (h) If every eigenvalue of a matrix A has algebraic multiplicity 1,
AP = P
0 Y then A is diagonalizable.
[Hint: Compare the first k column vectors on both sides.] (i) If 0 is an eigenvalue of a matrix A, then A2 is singular.
(c) Use the result in part (b) to prove that A is similar to
( ) Working withTechnology
λ0 Ik X
C= T1. Generate a random 4 × 4 matrix A and an invertible 4 × 4
0 Y matrix P and then confirm, as stated in Table 1, that P −1AP and
and hence that A and C have the same characteristic poly- A have the same
nomial. (a) determinant.
(d) By considering det(λI − C), prove that the charac- (b) rank.
teristic polynomial of C (and hence A) contains the factor
(c) nullity.
(λ − λ0 ) at least k times, thereby proving that the algebraic
multiplicity of λ0 is greater than or equal to the geometric (d) trace.
multiplicity k. (e) characteristic polynomial.
(f ) eigenvalues.
True-False Exercises
TF. In parts (a)–(i) determine whether the statement is true or T2. (a) Use Theorem 5.2.1 to show that the following matrix is
false, and justify your answer. diagonalizable.
(a) An n × n matrix with fewer than n distinct eigenvalues is not −13 −60 −60
diagonalizable.
A = 10 42 40
(b) An n × n matrix with fewer than n linearly independent eigen- −5 −20 −18
vectors is not diagonalizable. (b) Find a matrix P that diagonalizes A.
(c) If A and B are similar n × n matrices, then there exists an (c) Use the method of Example 6 to compute A10 , and check your
invertible n × n matrix P such that PA = BP . result by computing A10 directly.
(d) If A is diagonalizable, then there is a unique matrix P such T3. Use Theorem 5.2.1 to show that the following matrix is not
that P −1AP is diagonal. diagonalizable.
(e) If A is diagonalizable and invertible, then A−1 is diagonaliz- −10 11 −6
able.
A = −15 16 −10
(f ) If A is diagonalizable, then AT is diagonalizable. −3 3 −2