Textbook on Matrices for Students
Textbook on Matrices for Students
co/
n a' s
ri s h
K TEXT BOOK on
M atrices
(For B.A. and [Link]. IInd year students of All Colleges affiliated to universities in Uttar Pradesh)
By
A. R. Vasishtha A. K. Vasishtha
Retired Head, Dep’t. of Mathematics [Link]., Ph.D.
(Kanpur Edition)
Dedicated
to
Lord
Krishna
Authors & Publishers
[Link]
P reface
This book on MATRICES has been specially written according to the latest
Unified Syllabus to meet the requirements of the B.A. and [Link]. Part-II
Students of all Universities in Uttar Pradesh.
The subject matter has been discussed in such a simple way that the students
will find no difficulty to understand it. The proofs of various theorems and
examples have been given with minute details. Each chapter of this book
contains complete theory and a fairly large number of solved examples.
Sufficient problems have also been selected from various university examination
papers. At the end of each chapter an exercise containing objective questions has
been given.
We have tried our best to keep the book free from misprints. The authors
shall be grateful to the readers who point out errors and omissions which, inspite
of all care, might have been there.
The authors, in general, hope that the present book will be warmly received
by the students and teachers. We shall indeed be very thankful to our colleagues
for their recommending this book to their students.
The authors wish to express their thanks to Mr. S.K. Rastogi, M.D.
and Mr. Sugam Rastogi, Executive Director and entire team of
KRISHNA Prakashan Media (P) Ltd., Meerut for bringing out this book in
the present nice form.
The authors will feel amply rewarded if the book serves the purpose for
which it is meant. Suggestions for the improvement of the book are always
welcome.
Syllabus
M atrices
U.P. UNIFIED (w.e.f. 2012-13)
Section-A: Matrices
Unit-1: Symmetric and skew-symmetric matrices, Hermitian and skew-Hermitian
matrices, Orthogonal and unitary matrices, Triangular and diagonal matrices, Rank of a
matrix, Elementary transformations, Echelon and normal forms, Inverse of a matrix by
elementary transformations.
Unit-2: Characteristic equation, Eigen values and eigen vectors of a matrix, Cayley-
Hamilton's theorem and its use in finding inverse of a matrix, Application of matrices to
solve a system of linear (both homogeneous and non-homogeneous) equations,
Consistency and general solution, Diagonalization of square matrices with distinct eigen
values, Quadratic forms.
[Link]
B rief C ontents
Dedication.........................................................................(v)
Preface ...........................................................................(vi)
Syllabus ........................................................................(vii)
Brief Contents ...............................................................(viii)
SECTION
B
MATRICES
C hapters
1. Matrices
2. Rank of a Matrix
1.
3. Linear Equations
1.
4. Eigenvalues and Eigenvectors
4.
1.
5. Quadratic Forms
4.
[Link]
M-3
1
Matrices
6 x + 8 y − 3 z = 1, 4 x − 2 y + z = 2.
Here x, y and z are unknowns and their coefficients are all numbers. Arranging the
coefficients in the order in which they occur in the equations and enclosing them in
square brackets, we obtain a rectangular array of the form
2 9 7
3 4 −3
6 .
8 −3
4 −2 1
[Link]
M-4
This rectangular array is an example of a matrix. The horizontal lines (→ ) are called
()
rows or row vectors, and vertical lines ↓ are called Columns or column vectors of the
matrix. There are 4 rows and 3 columns in this matrix. Therefore it is a matrix of the
type 4 × 3. The numbers 3, 4, − 3, 2 etc. constituting this matrix are called its
elements. The difference between a matrix and a number should be clearly
understood. A matrix is not a number. It has got no numerical value. It is a new thing
formed with the help of numbers. It is just an ordered collection of numbers arranged
in the form of a rectangular array. Simply 7 is a number. But in our notation of
matrices [7] is a matrix of the type 1 × 1and we cannot have 7 = [7]. We cannot have a
relation of equality between a matrix and a number.
We shall use capital letters (in bold type or in italic type) to denote matrices.
0 0 0
5 0 1
Thus A= and B = 0 0 0
6 1 72 × 3 0
0 0 3 × 3
Sometimes we also use the brackets ( ) or the double bars, , in place of square
brackets [ ] to denote matrices.
2 2 3 + 5 i 9 7 7
Thus A= , B = ,C = ,
2 2 −4 3 − 5 i 7 7
1.2 Matrix
Definition: A set of mn numbers (real or complex) arranged in the form of a rectangular array
having m rows and n columns is called an m × n matrix [ to be read as ‘m by n’ matrix. ].
The numbers a11, a12 etc. of this rectangular array are called the elements of the
matrix. The element aij belongs to the ith row and the j th column and is sometimes
called the (i, j)th element of the matrix . Thus in the element aij the first suffix i will
always denote the number of the row and the second suffix j, the number of the column
in which the element occurs. In a matrix, the number of rows and the columns need
not be equal.
0 1 2 3
2 3 1 0
A=
5 0 1 1
0 0 1 24 × 4
1 0 0
1 0
For example, I3 = 0 1 0 and I2 =
0 0 1
0 1
[Link]
M-6
0 0 0 0 0 0 0
For example, 0 0 0 0 and 0 0 0
0 0 0 0 3 × 4 0 0 0 3 × 3
2
Y = −9 is a column matrix of the type 3 × 1.
11
3 × 1
1 2 3 9
1 2 3
Example: The matrix is a submatrix of the matrix A = 7 11 6 5 as
0 2 1 0
2 1 8
it can be obtained from A by omitting the second row and the fourth column.
[Link]
M-7
(ii) the elements in the corresponding places of the two matrices are the same i.e., aij = bij for
each pair of subscripts i and j.
If two matrices A and B are equal, we write A = B. If two matrices A and B are not equal,
we write A ≠ B. If two matrices are not of the same size, they cannot be equal.
For example, if
3 2 −1 1 −2 7
A= and B = ,
4 −3 12 × 3 3 2 −12 × 3
[Link]
M-8
3 +1 2−2 − 1 + 7 4 0 6
then A+B= = .
4 + 3 −3+2 1 − 1 7 −1 0 2 × 3
Important Note: It should be noted that addition is defined only for matrices which
are of the same size. If two matrices A and B are of the same size, they are said to be
conformable for addition. If the matrices A and B are not of the same size, we cannot
find their sum.
A + B = B + A.
(ii) Matrix addition is associative: If A, B, C be three matrices each of the type m × n,then
(A + B) + C = A + (B + C).
Negative of matrix. Definition: Let A = [aij]m × n. Then the negative of the matrix A is
defined as the matrix [− aij]m × n and is denoted by –A.
− A + A = O = A + ( − A).
Here O is the null matrix of the type m × n. It is identity element for matrix addition.
(v) Cancellation laws hold good in the case of addition of matrices i.e., if A, B, C
are three m × n matrices, then
3 2 −1
For example, if k = 2 and A = ,
4 −3 12 × 3
2 × 3 2×2 2 × − 1 6 4 −2
then 2A = = .
2 × 4 2× −3 2 × 1 8 −6 22 × 3
( p + q)A = pA + qA.
p (qA) = ( pq)A.
(− k )A = − (kA) = k (− A).
In the product AB, the matrix A is called the pre-factor and the matrix B is called the
post-factor. Also we say that the matrix A has been post-multiplied by the matrix B.
and the matrix B has been pre-multiplied by the matrix A.
i.e., the (i, k )th element c ik of the matrix AB is obtained by multiplying the
corresponding elements of the ith row of A and the k th column of B and then adding the
products. The rule of multiplication is row-by-column multiplication i. e., in the
process of multiplication we take the rows of A and the columns of B. The element c11
of the matrix AB is obtained by adding the products of the corresponding elements of
the first row of A and the first column of B. The element c12 of the matrix AB is
obtained by adding the products of the corresponding elements of the first row of A
and the second column of B. Similarly the element c21 of the matrix AB is obtained by
adding the products of the corresponding elements of the second row of A and the first
column of B. In this way we multiply two matrices A and B.
a11 a12
b11 b12
For example, if A = a21 a22 ,B=
a b21 b22 2 × 2
31 a32 3 × 2
Important Note: If the product AB exists, then it is not necessary that the product BA
will also exist. For example, if A is a 4 × 5 matrix and B is a 5 × 3 matrix, then the
product AB exists while the product BA does not exist.
2 2 0 1 2 3 4
Example : If A = 3 2 1 and B = 2 0 1 2 then find AB . Does BA exist ?
1 0 1 3 1 0 5
[Link]
M-11
Solution : The matrix A is of the type 3 × 3 and the matrix B is of the type 3 × 4. Since the
number of columns of A is equal to the number of rows of B therefore AB is defined i. e.,
the product AB exists and it will be a matrix of the type 3 × 4.
Then c11 = the sum of the products of the corresponding elements of the first row of A
and the first column of B.
c12 = the sum of products of the corresponding elements of the first row of A and the
second column of B.
c13 = the sum of the products of the corresponding elements of the first row of A and
the third column of B.
c23 = the sum of the products of the corresponding elements of the second row of A
and the third column of B.
c32 = the sum of the products of the corresponding elements of the third row of A and
the second column of B, and so on.
2 1 0 1 2 3 4
AB = 3 2 1 × 2 0 1 2
1 0 13 × 3 3 1 0 53 × 4
2.1 + 1.2 + 0.3 2.2 + 1.0 + 0.1 2.3 + 1.1 + 0.0 2.4 + 1.2 + 0.5
= 3.1 + 2.2 + 1.3 3.2 + 2.0 + 1.1 3.3 + 2.1 + 1.0 3.4 + 2.2 + 1.5
1.1 + 0.2 + 1.3 1.2 + 0.0 + 1.1 1.3 + 0.1 + 1.0 1.4 + 0.2 + 1.5
4 4 7 10
= 10 7 11 21
4 3 3 9 3 × 4.
Since the number of the columns of B is not equal to the number of rows of A, therefore
the product BA does not exist.
A (B + C) = AB + AC,
(iv) If A be any m × n matrix and On,p be an n × p null matrix, then AOn,p = Om,p where Om,p
is an m × p null matrix.
(v) The equation AB = O does not necessarily imply that at least one of the matrices A and B
must be a zero matrix.
Or
The product of two matrices can be a zero matrix while neither of them is a zero matrix
(vi) In the case of matrix multiplication if AB = O, then it does not necessarily imply that BA = O.
(vii) If A be an m × n matrix, In denotes the n-rowed unit matrix, it can be easily seen that
AIn = A = Im A.
Thus in an upper triangular matrix all the elements below the principal diagonal are
zero.
(ii) Lower Triangular Matrix. Definition: A square matrix A = [aij] is called a lower
triangular matrix if aij = 0 whenever i < j.
[Link]
M-13
Thus in a lower triangular matrix all the elements above the principal diagonal are
zero.
a11 0 0 … 0
a21 a22 0 … 0
For example a31 a32 a33 … 0 is a lower triangular matrix of the size n × n.
… … … … …
an1 an2 an3 … ann n × n
(iii) Diagonal matrix. Definition: A square matrix A = [aij]n × n whose elements above and
below the principal diagonal are all zero, i. e., aij = 0 for all i ≠ j, is called a diagonal matrix.
Thus a diagonal matrix is both upper and lower triangular. An n-rowed diagonal matrix
whose diagonal elements in order are d1, d2 , d3,,… dn will often be denoted by the
symbol Diag. [d1, d2 ,…, dn].
(iv) Scalar Matrix. Definition: A diagonal matrix whose diagonal elements are all equal is
called a scalar matrix.
k 0 … 0
0 k … M
If S = 0 0 … M
M M … M
0 0 … k
is an n-rowed scalar matrix each of whose diagonal elements is equal to k and A is any
n-rowed square matrix, then
AS = SA = kA.
2 −2 −4
A = −1 3 4
1 −2 −3
[Link]
M-14
−5 −8 0
A= 3 5 0
1 2 −1
ab b2
A=
2
is nilpotent of index 2.
− a − ab
1 −3 4
A = −1 3 4 is nilpotent of index 2.
1 −3 −4
(ii) The number of rows may or may not be equal to number of columns in a matrix
while in a determinant the number of rows is equal to the number of columns.
[Link]
M-15
(iii) Interchanging the rows and columns, a different matrix is formed while in a
determinant, an interchange of rows and columns does not change the value of
the determinant.
| A| ≠ 0 or | A| = 0
A = [aij]m × n ,
1 2 3
1 2 3 4
2 3 4
A = 2 3 4 1 is the 4 × 3 matrix A′ = .
3 3 4 2
4 2 13 × 4
4 1 14 × 3
The first row of A is the first column of A′. The second row of A is the second column of
A′ . The third row of A is the third column of A′.
(i) (A′ )′ = A;
The above law (iv) is called the reversal law for transposes i. e., the transpose of the product
is the product of the transposes taken in the reverse order.
[Link]
M-16
⇒ | A| ⋅| A| = 1 [∵| A ′| = | A|]
⇒ | A|2 = 1 ⇒ | A| = ± 1 ⇒ | A| ≠ 0
⇒ A is invertible.
A ′ A = I = AA ′ .
Proof: Since A and B are both n-rowed square matrices, therefore AB is also an n-rowed
square matrix.
Now (AB)′ = B ′ A ′.
= B ′ (A ′ A) B
= B ′ IB [∵ A ′ A = I]
= B′ B
= I. [∵ B ′ B = I]
0 2β γ
Example 1: Determine the values of α, β, γ when α β − γ is orthogonal.
α −β γ
[Link]
M-17
0 2β γ
Solution : Let A = α β − γ .
α −β γ
0 α α
Then A′ = 2β β −β
γ −γ γ
If A is orthogonal, then AA ′ = I.
0 2β γ 0 α α 1 0 0
α β − γ 2β β −β = 0 1 0
α −β γ γ −γ γ 0 0 1
4β2 + γ 2 = 1 …(1)
2β2 − γ 2 = 0 …(2)
α2 + β2 + γ 2 = 1. …(3)
1 1
From (1) and (2), we get β = ± ,γ = ± .
6 3
1
From (3), we get α = ± .
2
1 1 1
Hence α=± ,β = ± ,γ = ± .
2 6 3
cos θ sin θ
is orthogonal.
− sin θ cos θ
cos θ sin θ
Solution: Let A= .
− sin θ cos θ
cos θ − sin θ
Then A′ = .
sin θ cos θ
[Link]
M-18
1 2 2
1
2 1 −2 is orthogonal.
3
−2 2 −1
1 2 2
1
Solution: Let A= 2 1 −2 .
3
−2 2 −1
1 2 −2
1
Then A′ = 2 1 2
3
2 −2 −1
1 2 2 1 2 −2
1
We have AA′ = 2 1 −2 2 1 2
9
−2 2 −1 2
−2 −1
9 0 0 1 0 0
1
= 0 9 0 = 0 1 0 = I3 .
9 0
0 9 0 0 1
If z1 and z2 are two complex numbers, then it can be easily seen that
Conjugate of a Matrix
Definition: The matrix obtained from any given matrix A on replacing its elements by the
corresponding conjugate complex numbers is called the conjugate of A and is denoted by A.
Thus if A = [aij]m × n, then A = [aij]m × n where aij denotes the conjugate complex of aij.
If A be a matrix over the field of real numbers, then obviously A coincides with A.
2 + 3 i 4 − 7i 8 2 − 3 i 4 + 7i 8
Example: If A = , then A = .
−i 6 9 + i i 6 9 − i
(i) (A) = A
Obviously the conjugate of the transpose of A is the same as the transpose of the
conjugate of A i. e.,
( A ′) = (A)′ = A θ .
where b ji = aij i. e., the ( j, i)th element of A θ = the conjugate complex of the (i, j)th
element of A.
1 + 2 i 2 − 3i 3 + 4 i
Example: If A = 4 − 5 i 5 + 6i 6 − 7 i ,
8 7 + 8i 7
1 + 2 i 4 − 5i 8
then A′ = 2 − 3 i 5 + 6i 7 + 8 i
3 + 4 i 6 − 7i 7
[Link]
M-20
1 − 2 i 4 + 5i 8
θ
and (A ′ ) = A = 2 + 3 i 5 − 6i 7 − 8 i .
3 − 4 i 6 + 7i 7
(i) (A θ )θ = A
a h g p
1 i −2 i
h b f q 2 4
For example, g , i −2 4 ,
f c r 4 3
−2 i 4 3
p q r s
Theorem 1: A necessary and sufficient condition for a matrix A to be symmetric is that A and A′
are equal.
∴ 2 aii = 0 or aii = 0.
0 h g 0 −3 i −4
For example, the matrices − h 0 f and 3 i 0 8 are skew-symmetric
− g −f 0 4 −8 0
matrices.
A ′ = − A.
1 2 − 3i 3 + 4 i
a b + ic
For example, , 2 + 3 i 0 4 − 5 i are Hermitian matrices.
b − ic d 3 − 4 i
4 + 5i 2
∴ aii is real for all i. Thus every diagonal element of a Hermitian matrix must be
real.
A Hermitian matrix over the field of real numbers is nothing but a real symmetric
matrix.
∴ aii + aii = 0
Thus the diagonal elements of a skew-Hermitian matrix must be pure imaginary numbers or zero.
0 −2 − i −i 3 + 4i
For example, the matrices ,
2 − i 0 −3 + 4 i 0
are skew-Hermitian matrices. A skew-Hermitian matrix over the field of real numbers
is nothing but a real skew-symmetric matrix.
A θ = − A.
We have (kA)′ = kA ′
= kA. [∵ A ′ = A]
We have (kA)′ = kA ′ = k (− A) [∵ A ′ = − A]
= − (kA).
= (− i)A θ [∵ i = − i]
= − (iA θ )
= − (iA) [∵ A θ = A].
Example 6: If A and B are symmetric matrices, then show that AB is symmetric if and only if A
and B commute i. e., AB = BA. (Lucknow 2008)
We have (AB)′ = B ′ A ′
= BA [∵ A ′ = A, B ′ = B]
= AB. [∵ AB = BA]
AB = BA.
= B ′ A ′ = BA.
Example 7: If A be any matrix, then prove that AA′ and A ′ A are both symmetric matrices.
= AA ′ [∵ (A ′ )′ = A].
Again (A ′ A)′ = A ′ (A ′ )′ = A ′ A.
= B′ A ′ B [since (B ′ )′ = B]
= B ′ AB.
Hence B ′ AB is symmetric.
Then A ′ = − A.
Now (B ′ AB)′ = B ′ A ′ (B ′ )′ = B ′ A ′ B = B ′ (− A) B
= − (B ′ A) B = − B ′ AB.
i 3 + 2i −2− i
− 3 + 2 i 0 3 − 4i
2−i − 3 − 4i − 2 i
is skew-Hermitian.
−i 3 − 2i − 2 + i
= − 3 − 2 i 0 3 + 4i
2+i − 3 + 4i + 2 i
−i −3 − 2 i 2+i
∴ A = (A)′ = 3 − 2 i 0 −3 + 4 i
−2 + i 3 + 4i 2 i
i 3 + 2i − 2 − i
= − − 3 + 2 i 0 3 − 4 i = − A.
2−i − 3 − 4i − 2 i
3 2 − 3i 3 + 5 i
Example 10: If A = 2 + 3 i 5 i , prove that A is a Hermitian matrix.
3 − 5 i −i 7
3 2 + 3i 3 − 5 i
Then A = 2 − 3 i 5 −i .
3 + 5 i i 7
3 2 − 3i 3 + 5 i
θ
∴ A = (A) ′ = 2 + 3 i 5 i =A
3 − 5 i −i 7
− 2 + 3 i 1− i 2+i
Example 11: Express 3 4 − 5i 5 as the sum of a Hermitian and a
1 1+ i − 2 + 2 i
skew-Hermitian matrix.
1 1
A= (A + A θ ) + (A − A θ ),
2 2
1 1
where (A + A θ ) is a Hermitian matrix and (A − A θ ) is a skew-Hermitian matrix.
2 2
− 2 + 3 i 1− i 2+i
Let A= 3 4 − 5i 5
1 1+ i − 2 + 2 i
− 2 − 3 i 1+ i 2−i
= 3 4 + 5i 5
1 1− i − 2 − 2 i
− 2 − 3 i 3 1
θ
∴ A = (A)′ = 1 + i 4 + 5i 1− i
2−i 5 − 2 − 2 i
−4 4−i 3 + i
1 θ 1
Now (A + A ) = 4 + i 8 6 − i
2 2
3 − i 6+i − 4
1 3 1
−2 2−
2
i + i
2 2
1 1
= 2 + i 4 3 − i,
2 2
3 − 1 i 3+
1
i −2
2 2 2
6i −2− i 1 + i
1 θ 1
Again (A − A ) = 2 − i − 10 i 4 + i
2 2
− 1 + i −4+ i 4 i
[Link]
M-26
1 1 1
3i − 1−
2
i + i
2 2
1 1
= 1− i −5 i 2 + i,
2 2
− 1 + 1 i −2+
1
i 2i
2 2 2
Example 12: Express the following matrix as the sum of a symmetric and a skew-symmetric
matrix :
1 2 4
−2 5 3.
−1 6 3
1 2 4 1 −2 −1
Solution: Let A = −2 5 3; so that A′ = 2 5 6.
−1 6 3 4 3 3
1 2 4 1 −2 −1 2 0 3
∴ A + A ′ = −2 5 3 + 2 5 6 = 0 10 9
−1 6 3 4 3 3 3 9 6
1 0 3 /2
1
or (A + A ′ ) = 0 5 9 / 2, which is a symmetric matrix.
2 3 / 2
9/2 3
1 2 4 1 −2 −1 0 4 5
Again A − A ′ = −2 5 3 − 2 5 6 = −4 0 −3
−1 6 3 4 3 3 −5 3 0
0 2 5/2
1
or (A − A ′ ) = −2 0 −3 / 2, which is a skew-symmetric matrix.
2 −5 / 2
3/2 0
1 0 3 / 2 0 2 5/2
Thus A= 0 5 9 / 2 + −2 0 −3 / 2,
3 / 2 9/2 3 −5 / 2 3/2 0
where the first matrix is symmetric and the second matrix is skew-symmetric.
[Link]
M-27
Example 13: Show that every square matrix is uniquely expressible as the sum of a symmetric
matrix and a skew-symmetric matrix.
1 1
We have P′ = (A + A ′ ) ′ = (A + A ′ )′ [∵ (kA)′ = kA ′ ]
2 2
1
= { A ′ + (A ′ )′ } [∵(A + B)′ = A ′ + B ′ ]
2
1
= (A ′ + A) [∵ (A ′ )′ = A]
2
1
= (A + A ′ ) = P.
2
1 1 1
Again Q′ = (A − A ′ ) ′ = (A − A ′ )′ = { A ′ − (A ′ )′ }
2 2 2
1 1
= (A ′ − A) = − (A − A ′ ) = − Q.
2 2
Thus we have expressed the square matrix A as the sum of a symmetric and a
skew-symmetric matrix.
∴ A + A ′ = 2 R and A − A ′ = 2S.
1 1
This gives R= (A + A ′ ) and S = (A − A ′ ).
2 2
Thus R = P and S = Q.
Thus the adjoint of a matrix A is the transpose of the matrix formed by the cofactors of
A i. e., if
Note: Sometimes the adjoint of a matrix is also called the adjugate of that matrix.
A (adj A) = | A| I n = (adj A) A.
Note: For the products AB, BA to be both defined and be equal, it is necessary that A
and B are both square matrices of the same order. Thus non-square matrices cannot
possess inverse.
Existence of the Inverse. Theorem. The necessary and sufficient condition for a square
matrix A to possess the inverse is that | A| ≠ 0 .
[Link]
M-29
1
Important. If A be an invertible matrix, then the inverse of A is Adj. A. It is usual to denote
| A|
the inverse of A by A −1 .
1
Thus, A −1 = Adj . A, provided | A| ≠ 0 .
| A|
Thus the necessary and sufficient condition for a matrix to be invertible is that it is
non-singular.
2 −1 3
A = −5 3 1.
−3 2 3
2 −5 −3
T
Solution: (i) We have the transpose of A = A = −1 3 2 .
3 1 3
2 −1 3
(ii) We have | A| = −5 3 1
−3 2 3
= 2 (9 − 2) − (− 1) (− 15 + 3) + 3 (− 10 + 9)
= 14 − 12 − 3 = − 1.
3 1 −5 1
Then A11 = = 7, A12 = − = 12,
2 3 −3 3
−5 3
A13 = = − 1,
−3 2
−1 3 2 3
A21 = − = 9, A22 = = 15,
2 3 −3 3
2 −1
A23 = − = − 1,
−3 2
−1 3 2 3
A31 = = − 10 , A32 = − = − 17,
3 1 −5 1
2 −1
A33 = = 1.
−5 3
7 12 −1
B= 9 15 −1 .
−10 −17 1
7 9 −10
Now adj. A (= the transpose of the matrix B) = 12 15 −17 .
−1 −1 1
7 9 −10
−1 1 1
(iii) We have A = adj. A = 12 15 −17
| A| (− 1) −1
1 1
7 9 −10 −7 −9 10
= (−1) 12 15 −17 = −12 −15 17 .
−1 −1 1 1 1 −1
Note: To check that the answer is correct the students should verify that AA −1 = I.
Hence A θ A = I implies AA θ = I.
Proof: Since A and B are both n-rowed square matrices, therefore AB is also an n-rowed
square matrix.
Now (AB)θ = B θ A θ .
= B θ (A θ A) B
= B θ IB [∵ A θ A = I]
= Bθ B
= I. [∵ B θ B = I]
1 1 1+ i
B= is unitary.
3 1 − i −1 (Bundelkhand 2006)
1 1 1− i
Solution: We have B= .
3 1 + i −1
1 1 1+ i
∴ B θ = (B)′ = .
3 1 − i −1
[Link]
M-32
1 1 1 + i 1 1+ i
Now Bθ B =
3 1 − i −1 1 − i −1
1 1.1 + (1 + i) (1 − i) 1. (1 + i) + (1 + i) (−1)
=
3 (1 − i).1 + (−1).(1 − i) (1 − i) (1 + i) + (−1) (−1)
1 3 0 1 0
= = = I.
3 0 3 0 1
∴ B is unitary.
α + iγ −β + iδ
Example 15: Show that the matrix A = is a unitary matrix, if
β + iδ α − iγ
α2 + β2 + γ 2 + δ2 = 1.
α + iγ −β + iδ
Solution: We have A=
β + iδ α − iγ
α + iγ β + iδ
∴ A′ =
− β + iδ α − iγ
α − iγ β − iδ
Aθ = A′ = .
− β − iδ α + iγ
α + iγ −β + iδ α − iγ β − iδ 1 0
∴ =
β + iδ α − iγ − β − iδ α + iγ 0 1
α2 + β2 + γ 2 + δ2
or
αβ − iβγ + iαδ + γδ − αβ − iαδ + iβγ − δγ
α2 + β2 + γ 2 + δ2 0 1 0
or =
0 α2 + β2 + γ 2 + δ2 0 1
if α2 + β2 + γ 2 + δ2 = 1.
⇒ (A θ )−1. A −1 = A −1 (A θ )−1 = I
⇒ (A −1)θ . A −1 = A −1 (A −1)θ = I.
Comprehensive Exercise 1
−1 2 2
1
1. (i) Show that the matrix 2 −1 2 is orthogonal.
3
2 2 −1
1 1 1
−
3 6 2
1 2
− 0 is orthogonal.
3 6
1 1 1
3 6 2
1 1 i
2. (i) Show that the matrix = is unitary.
2 − i −1
1 + i − 1 + i
2 is unitary.
(ii) Prove that the matrix 2
1+ i 1− i
2 2 (Rohilkhand 2010)
2 1 −1
3. (i) Let A = −1 2 1, find A + A ′ and A − A ′ and hence express A as the
3 4 1
(ii) Write the following matrix as the sum of a symmetric and a skew-symmetric
1 2 3
matrix : 4 5 6. (Lucknow 2010)
7 0 0
[Link]
M-34
0 6 7
4. (i) Show that the matrix −6 0 8 is skew-symmetric.
−7 −8 0
1/ 6 −2 / 6 1 / 6
(ii) Show that the matrix A ,where A = −2 / 6 4/6 −2 / 6 is symmetric.
1/ 6 −2 / 6 1 / 6
(Rohilkhand 2010)
1 1− i 2
5. If A = 1 + i 3 i , prove that A is a Hermitian matrix.
2 −i 0
(Luckhnow 2011; Bundelkhand 06)
6. Prove that the matrix A2 is symmetric if either A is symmetric or A is
skew-symmetric.
11. Show that all positive integral powers of a symmetric matrix are symmetric.
13. If A be any square matrix, prove that A + A θ , AA θ , A θ A are all Hermitian and
A − A θ is skew-Hermitian.
14. Show that every square matrix is uniquely expressible as the sum of a Hermitian
matrix and a skew-Hermitian matrix.
(Purvanchal 2007, 10; Lucknow 09)
15. Show that every square matrix A can be uniquely expressed as P + iQ where P and
Q are Hermitian matrices.
(Lucknow 2007, 10)
[Link]
M-35
A nswers 1
4 0 2 0 2 −4
3. (i) A + A ′ = 0 4 5; A − A ′ = −2 0 −3 ;
2 5 2 4 3 0
4 0 2 0 2 −4
1 1
A = 0 4 5 + −2 0 −3
2 2
2 5 2 4 3 0
1 3 5 0 −1 −2
(ii) A = 3 5 3 + 1 0 3
5 3 0 2 −3 0
True or False
Write ‘T’ for true and ‘F’ for false statement.
A nswers
True or False
1. T 2. T
o
[Link]
M-37
2
Rank of a Matrix
2 4 1 9 1
0 5 2 5 2
A=
1 9 7 3 4
3 −2 8 1 84 × 5.
Thus 2 4 1 9 2 4 9 1
0 5 2 5 0 5 5 2
, , etc.
1 9 7 3 1 9 3 4
3 −2 8 1 3 −2 1 8
If we leave two columns and one row from A, we shall get a square submatrix of A of
order 3.
Thus 2 4 1 4 1 9 5 2 5
0 5 2 , 5 2 5 , 9 7 3 , etc.
1 9 7 9 7 3 −2 8 1
If we leave three columns and two rows from A , we shall get a square submatrix of A of
order 2.
Thus 2 4 4 1 5 2
, , , etc. are 2-rowed minors of A.
0 5 5 2 9 7
Definition: A number r is said to be the rank of a matrix A if it possesses the following two
properties :
(i) There is at least one square submatrix of A of order r whose determinant is not equal to zero.
(ii) If the matrix A contains any square submatrix of order r + 1, then the determinant of every
square submatrix of A of order r + 1 should be zero.
In short the rank of a matrix is the order of any highest order non-vanishing minor of the matrix.
Thus the rank of a matrix A is the order of any highest order square submatrix of A whose
determinant is not equal to zero.
It is obvious that the rank r of an (m × n) matrix can at most be equal to the smaller of
the numbers m and n , but it may be less.
If there is a matrix A which has at least one non-zero minor of order n and there is no
minor of A of order n + 1, then the rank of A is n. Thus the rank of every non-singular
matrix of order n is n. The rank of a square matrix A of order n can be less than n if and
only if A is singular i. e., |A| = 0.
Note 1 : Since the rank of every non-zero matrix is ≥ 1, we agree to assign the rank, zero, to
every null matrix :
Important : The following two simple results will help us very much in finding the
rank of a matrix:
(i) The rank of a matrix is ≤ r, if all (r + 1) - rowed minors of the matrix vanish.
(ii) The rank of a matrix is ≥ r, if there is at least one r-rowed minor of the matrix which is not
equal to zero.
Example :
1 0 0
(a) Let A = I3 = 0 1 0 be a unit matrix of order 3.
0 0 1
0 0 0
(b) Let A = 0 0 0 .
0 0 0
1 2 3
(c) Let A = 2 3 4 .
0 2 2
1 2 3
(d) Let A = 3 4 5 .
4 5 6
Therefore the rank of A is less than 3. Now there is at least one minor of A of order 2,
1 2
namely which is not equal to zero. Hence rank A = 2.
3 4
3 1 2
(e) Let A = 6 2 4 .
3 1 2
Also each 2-rowed minor of A is equal to zero. But A is not a null matrix. Hence rank
A = 1.
2 4 3 2
(f) Let A= .
3 5 1 4
2 4
Here we see that there is at least one minor of A of order 2 i. e., which is not
3 5
equal to zero. Also there is no minor of A of order greater than 2. Hence the rank of
A = 2.
(i) Every row of A which has all its entries 0 occurs below every row which has a non-zero entry.
(ii) The number of zeros before the first non-zero element in a row is less than the number of such
zeros in the next row.
Important result. The rank of a matrix in Echelon form is equal to the number of non-zero
rows of the matrix.
0 1 2 3
Example: Find the rank of the matrix 0 0 1 −1 .
0 0 0 0
The matrix A has one zero row. We see that it occurs below every non-zero row.
[Link]
M-41
Further the number of zeros before the first non-zero element in the first row is one.
The number of zeros before the first non-zero element in the second row is two. Thus
the number of zeros before the first non-zero element in any row is less than the
number of such zeros in the next row.
2.3 Theorem
The rank of the transpose of a matrix is the same as that of the original matrix.
Proof. Let A be any matrix and A′ be the transpose of the matrix A. Let rank A = r and
rank A′ = s. Then to prove that r = s.
We have rank A = r
⇒ there exists at least one r-rowed square submatrix, say R, of A such that
| R ′| = | R| ≠ 0
| R ′| = | R| ≠ 0
⇒ rank A′ ≥ r ⇒ s ≥ r. ...(1)
Since (A ′ )′ = A, therefore interchanging the roles of A and A′ in the above result (1), we
have
r ≥ s. …(2)
1 2 3
1 2 3
(i) 2 1 0 , (ii)
0 2 4 5
1 2 (Bundelkhand 2009)
[Link]
M-42
1 2 3
Solution : (i) Let A = 2 1 0 .
0 1 2
= 2 − 8 + 6 = 0.
1 2
But there is at least one minor of order 2 of the matrix A, namely which is not
2 1
equal to zero. Hence rank A = 2.
1 2 3
(ii) Let A= .
2 4 5
1 3
Here there is at least one minor of order 2 of the matrix A, namely which is not
2 5
equal to 0. Also there is no minor of the matrix A or order greater than 2. Hence rank
A = 2.
2 4 2
A = 2 1 2 ?
1 0 x
2 4 2
i. e., 2 1 2 ≠0
1 0 x
or 2 ( x − 0 ) − 4 (2 x − 2) + 2 (0 − 1) ≠ 0
or 2x − 8x + 8 − 2 ≠ 0
or − 6x + 6 ≠ 0
or x ≠ 1.
[Link]
M-43
1 5 4
Example 3: For which value of ‘b’ the rank of the matrix A = 0 3 2 is 2?
b 13 10
1 5 4
i. e., 0 3 2 =0
b 13 10
or 1 (30 − 26) − 5 (0 − 2 b) + 4 (0 − 3 b) = 0
or 4 + 10 b − 12 b = 0
or 4 − 2b = 0
or b = 2.
Example 4: Find the values of a so that rank (A) < 3, where A is the matrix
3 a − 8 3 3
A= 3 3a − 8 3 .
3 3 3 a − 8
3 a − 8 3 3
i. e., 3 3a − 8 3 =0
3 3 3 a − 8
Applying R1 → R1 + R2 + R3 , we get
3a − 2 3a − 2 3a − 2
3 3a − 8 3 =0
3 3 3a − 8
1 1 1
or (3 a − 2) 3 3a − 8 3 =0
3 3 3a − 8
[Link]
M-44
1 0 0
(3 a − 2) 3 3 a − 11 0 =0
3 0 3 a − 11
or (3 a − 2) (3 a − 11) (3 a − 11) = 0
2 11 11
or a= , , .
3 3 3
Example 5: Prove that the points ( x1, y1), ( x2 , y2 ) and ( x3 , y3 ) are collinear if and only if the
rank of the matrix.
x1 y1 1
A = x2 y2 1 is less than three.
x y3 1
3 (Kanpur 2001; Bundelkhand 07)
x1 y1 1
We are to prove that the rank of matrix A = x2 y2 1 is less than three.
x y3 1
3
x1 y1 1
1
⇒ x2 y2 1 =0
2
x3 y3 1
x1 y1 1
⇒ x2 y2 1 =0
x3 y3 1
x1 y1 1
⇒ The rank of matrix x2 y2 1 is less than 3.
x3 y3 1
x1 y1 1
The rank of the given matrix x2 y2 1 is less than 3.
x3 y3 1
rank A < 3
x1 y1 1
⇒ x2 y2 1 =0
x3 y3 1
x1 y1 1
1
⇒ x2 y2 1 = 0.
2
x3 y3 1
Example 6: A is a non-zero column and B a non-zero row matrix, show that rank (AB) = 1 .
a11
a21
Solution : Let A = a31 and B = [b11 b12 b13 … b1n]
…
am1
Since A and B are non-zero matrices, therefore the matrix AB will also be non-zero. The
matrix AB will have at least one non-zero element obtained by multiplying
corresponding non-zero elements of A and B .
All the two-rowed minors of A obviously vanish. But A is a non-zero matrix. Hence
rank A = 1.
[Link]
M-46
Comprehensive E xercise 1
1 2 3 4 1 2 −7 5
(v) 2 4 6 8 (vi) 0 5 0 8
3 6 9 12 0 0 0 −3
1 2 3
(vii) .
2 4 5
2. Show that the rank of a matrix is ≥ the rank of every sub-matrix thereof.
3. Show that the rank of a matrix does not alter on affixing any number of
additional rows or columns of zeros.
0 1 0 0
0 0 1 0
4. If A = , find the rank of A and A2 .
0 0 0 1
0 0 0 0
5. Under what conditions the rank of the following matrix is 3 ? Is it possible for the
2 4 2
rank to be 1? Why ? A = 3 1 4.
1 0 x
(Kanpur 2010)
6. A is an n-rowed square matrix of rank (n − 1) , show that Adj. A is not a null matrix.
A nswers 1
7
5. x≠ ; No, because one minor of order 2 of A is non-zero
5
2. The multiplication of the elements of any row (or column) by any non-zero number.
3. The addition to the elements of any other row (or column) the corresponding elements of any
other row (or column) multiplied by any number.
3. The addition of k times the j th row to the ith row will be denoted by Ri → Ri + kRj.
1 4 2 9
B = 8 19 17 11
3 7 8 4 3 × 4.
1 4 2 9
C = 6 15 3 9 .
3 7 8 4
1
Now if we apply the elementary transformation R2 → R2 to the matrix C , we see that
3
the matrix C transforms back to the matrix A.
0 0 1 1 0 0 1 2 0
0 1 0 , 0 4 0 , 0 1 0
1 0 0 0 0 1 0 0 1
are the elementary matrices obtained from I3 by subjecting it to the elementary operations
C1 ↔ C3 , R2 → 4 R2 , R1 → R1 + 2 R2 respectively.
(i) Eij will denote the E-matrix obtained by interchanging the ith and j th rows of a unit
matrix. The students can easily see that the matrices obtained by interchanging
the ith and j th rows or the ith and j th columns of a unit matrix are the same.
Therefore Eij will also denote the elementary matrix obtained by interchanging
the ith and j th columns of a unit matrix.
[Link]
M-49
(ii) Ei (k ) will denote the E-matrix obtained by multiplying the ith row of a unit matrix
by a non-zero number k. It can be easily seen that the matrices obtained by
multiplying the ith row or the ith column of a unit matrix by k are the same.
Therefore Ei (k ) will also denote the elementary matrix obtained by multiplying
the ith column of a unit matrix by a non-zero number k.
(iii) Eij (m) will denote the elementary matrix, obtained by adding to the elements of
the ith row of a unit matrix, the products by any number m of the corresponding
elements of the j th row. It may be easily seen that the E-matrix, Eij (m) can also be
obtained by adding to the elements of the j th column of a unit matrix, the
products by m of the corresponding elements of the ith column.
Now it is very interesting to note that the elementary transformations of a matrix can
also be obtained by algebraic operations on the same by the corresponding elementary
matrices. In this connection we have the following theorem.
2.7 Theorem
Every elementary row (column) transformation of a matrix can be obtained by pre-multiplication
(post-multiplication) with the corresponding elementary matrix.
We shall first prove that every elementary row transformation of a product AB of two
matrices A and B can be obtained by subjecting the pre-factor A to the same
elementary row transformation. Similarly every elementary column transformation
of a product AB of two matrices A and B can be obtained by subjecting the post-factor B
to the same elementary column transformation.
Let R1, R2 , R3 ,…, R m denote the row vectors of the matrix A and C1, C2 , …, Cp denote
the column vectors of the matrix B.
[Link]
M-50
R1
R2
We can then write, A = R3 , B = [C1 C2 C3 … Cp ].
…
R m
Now if σ denotes any elementary row transformation, it is quite obvious from the
above representation that (σ A) B = σ (AB). For example, if σ denotes the elementary
row transformation R1 ↔ R2 , it is quite obvious that (σA) B = σ (AB).
Similarly it is quite obvious that if the columns C1, C2 ,…, Cp of B be subjected to any
elementary column transformation, the columns ofAB are also subjected to the same
elementary column transformation. Hence the result.
A = Im A.
σ A = σ (I m A) = (σ Im ) A = EA,
1 4 2
Example: Let A = 2 7 1 .
3 8 4
7 20 10
B = 2 7 1 .
3 8 4
1 0 0
I3 = 0 1 0 ,
0 0 1
1 0 2
then the E-matrix E thus obtained is E = 0 1 0 .
0 0 1
1 0 2 1 4 2
Now EA = 0 1 0 2 7 1
0 0 1 3 8 4
1⋅ 1 + 0 ⋅ 2 + 2 ⋅ 3 1⋅ 4 + 0 ⋅ 7 + 2 ⋅ 8 1 ⋅ 2 + 0 ⋅ 1 + 2 ⋅ 4
= 0 ⋅ 1 + 1 ⋅ 2 + 0 ⋅ 3 0 ⋅ 4 + 1⋅ 7 + 0 ⋅ 8 0 ⋅ 2 + 1 ⋅ 1 + 0 ⋅ 4
0 ⋅ 1 + 0 ⋅ 2 + 1 ⋅ 3 0 ⋅ 4 + 0 ⋅ 7 + 1⋅ 8 0 ⋅ 2 + 0 ⋅ 1 + 1 ⋅ 4
7 20 10
= 2 7 1 = B.
3 8 4
Let Eij denote the elementary matrix obtained by interchanging the ith and j th rows of a
unit matrix.
The interchange of the ith and j th rows of Eij will transform Eij to the unit matrix. But
every elementary row transformation of a matrix can be brought about by
pre-multiplication with the corresponding elementary matrix. Therefore the row
transformation which changes Eij to I can be effected on pre-multiplication by Eij.
[Link]
M-52
Similarly, we can show that the elementary matrix corresponding to the E-operation Ci ↔ C j is
its own inverse.
(ii) The inverse of the E-matrix corresponding to the E-operation Ri → kRi, (k ≠ 0 ), is the
E-matrix corresponding to the E-operation Ri → k −1 Ri.
Let Ei (k ) denote the elementary matrix obtained by multiplying the elements of the ith
row of a unit matrix I by a non-zero number k.
Similarly, we can show that the inverse of the E-matrix corresponding to the E-operation
Ci → kCi, k ≠ 0 , is the E-matrix corresponding to the E-operation Ci → k −1 Ci.
(iii) The inverse of the E-matrix corresponding to the E-operation Ri → Ri + kRj is the E-matrix
corresponding to the E-operation Ri → Ri − kRj.
Let Eij (k ) denote the elementary matrix obtained by adding to the elements of the ith
row of a unit matrix I, the products by any number k of the corresponding elements of
the j th row of I.
If we add to elements of the ith row of Eij (k ), the products by −k of the corresponding
elements of its j th row, then this row operation will transform Eij(k ) to the unit matrix I.
Now this row transformation of Eij(k ) can be effected on pre-multiplication by the
corresponding elementary matrix Eij (− k ).
Similarly, we can show that the inverse of the E-matrix corresponding to the E-operation
Ci → Ci + kC j is the E-matrix corresponding to the E-operation Ci → Ci − kC j.
From the above theorem, we thus conclude that the inverse of an elementary matrix
is also an elementary matrix of the same type.
[Link]
M-53
Proof: Let A = [aij] be an m × n matrix of rank r. We shall prove the theorem in three
stages.
Let B be the matrix obtained from the matrix A by the E-transformation Rp ↔ Rq . Let r
be the rank of A and s be the rank of B. Then to prove that r = s.
We have rank A = r ⇒ there exists at least one r-rowed square submatrix, say R, of A
such that | R| ≠ 0 .
Let S be the r-rowed square submatrix of B which has the same rows as are in R though
they may be in different relative positions.
∴ | R | ≠ 0 ⇒ |S| ≠ 0 .
∴ rank B ≥ r ⇒ S ≥ r.
Hence r = s.
Case II. Multiplication of the elements of a row by a non-zero number does not change the rank.
Now if| B0| be any (r + 1) -rowed minor of B there exists a uniquely determined minor
| A0| of A such that
| B0| = | A0| ( this happens if the pth row of B is one of those rows which
or | B0| = k | A0|
(this happens when pth row is retained while obtaining B0 from B).
[Link]
M-54
Since the matrix A is of rank r, therefore every (r + 1)-rowed minor of A vanishes i. e.,
| A0| = 0 . Hence | B0 | = 0 . Thus we see that every (r + 1)-rowed minor of B also vanishes.
Therefore, s (the rank of B) cannot exceed r ( the rank of A).
∴ s ≤ r.
Also, since A can be obtained from B by E-transformation of the same type i. e.,
Rp → (1 / k ) Rp , therefore, by interchanging the roles of A and B we find that
r ≤ s.
Thus r = s.
Case III. Addition to the elements of a row the products by any number k of the corresponding
elements of any other row does not change the rank .
The transformation Rp → Rp + kRq has changed only the pth row of the matrix A. Also
the value of a determinant does not change if we add to the elements of any row the
corresponding elements of any other row multiplied by some number k.
Therefore, if no row of the sub-matrix A0 is part of the pth row of A, or if two rows of A0
are parts of the pth and q th rows of A, then | B0| = | A0|.
Again, if a row of A0 is a part of the pth row of A, but no row is a part of the q th row, then
| B0| = | A0| + k |C0| , where C0 is an (r + 1) -rowed square matrix which can be obtained
from A0 by replacing the elements of A0 in the row which corresponds to the pth row of
A by the corresponding elements in the q th row of A. Obviously all the r + 1rows of the
matrix C0 are exactly the same as the rows of some (r + 1)-rowed square sub-matrix of A,
though arranged in some different order. Therefore|C0| is ± 1times some (r + 1)-rowed
minor of A. Since the rank of A is r, therefore, every (r + 1)-rowed minor of A is zero, so
that | A0| = 0 ,|C0| = 0 , and consequently | B0| = 0 .
Thus we see that every (r + 1)-rowed minor of B also vanishes. Hence, s (the rank of B)
cannot exceed r ( the rank of A).
∴ s ≤ r.
[Link]
M-55
Also, since A can be obtained from B by an E-transformation of the same type i. e.,
Rp → Rp − kRq , therefore, interchanging the roles of A and B, we have r ≤ s.
Thus r = s.
We have thus shown that rank of a matrix remains unaltered by any E-row
transformation. Therefore we can also say that the rank of a matrix remains unaltered
by a series of elementary row transformations.
Similarly we can show that the rank of a matrix remains unaltered by a series of
elementary column transformations.
Finally, we conclude that the rank of a matrix remains unaltered by a finite chain of elementary
operations.
Since A is a non-zero matix, therefore A has at least one element different from zero,
say apq = k ≠ 0.
By interchanging the pth row with the first row and the q th column with the first
column respectively, we obtain a matrix B whose leading element is equal to k which is
not equal to zero.
Multiplying the elements of the first row of the matrix B by 1/k, we obtain a matrix C
whose leading element is equal to unity.
[Link]
M-56
Subtracting suitable multiples of the first column of C from the remaining columns,
and suitable multiples of the first row from the remaining rows, we obtain a matrix D in
which all elements of the first row and first column except the leading element are
equal to zero.
1 0 0 0 … 0
0
0 A1
Let D= ,
0
…
0
where A1 is an (m − 1) × (n − 1) matrix.
If now, A1 be a non-zero matrix, we can deal with it as we did with A. If the elementary
operations applied to A1 for this purpose be applied to D, they will not affect the first
row and the first column of D . Continuing this process, we shall finally obtain a matrix
M, such that
Ik O
M= .
O O
The matrix M has the rank k. Since the matrix M has been obtained from the matrix A
by elementary transformations and elementary transformations do not alter the rank,
therefore we must have k = r.
Ir O
Hence every m × n matrix of rank r can be reduced to the form by a finite chain
O O
of elementary transformations.
Note. The above form is usually called the first canonical form or normal form of a
matrix.
Corollary 1: The rank of an m × n matrix A is r if and only if (iff) it can be reduced to the form
Ir O
by a finite chain of E-operations.
O O
[Link]
M-57
The condition is necessary. The proof has been given in the above theorem.
The condition is also sufficient. The matrix A has been transformed into the form
Ir O
by elementary transformations which do not alter the rank of the matrix.
O O
Ir O
Since the rank of the matrix is r, therefore the rank of the matrix A must also
O O
be r.
Ir O
PAQ = .
O O
Ir O
Proof: If A be an m × n matrix of rank r, it can be transformed into the form by
O O
elementary operations. Since E-row (column) operations are equivalent to
pre-(post)-multiplication by the corresponding elementary matrices, we have the
following result :
Ir O
Ps Ps − 1 … P1A Q1Q2 …Qt = .
O O
Now each elementary matrix is non-singular and the product of non-singular matrices
is also non-singular. Therefore if P = Ps Ps − 1 … P2 P1 and Q = Q1 Q2 , …Qt , then P and Q
are non-singular matrices. Hence
Ir O
PAQ = .
O O
The following three properties of the relation ‘~’ in the set of all m × n matrices are
quite obvious :
[Link]
M-58
Therefore the relation ‘~’ in the set of all m × n matrices is an equivalence relation.
Example 7: If A and B be two equivalent matrices, then show that rank A = rank B.
Example 8: Show that if two matrices A and B have the same size and the same rank, they are
equivalent.
Solution: Let A and B be two m × n matrices of the same rank r. Then by 2.9, we have
Ir O Ir O
A~ and also B ~ .
O O O O
Ir O Ir O
B~ implies ~ B.
O O O O
Ir O Ir O
A~ and ~ B implies A ~ B.
O O O O
Example 9: (i) Use elementary transformations to reduce the following matrix A to triangular
form and hence find rank A :
[Link]
M-59
5 3 14 4
A = 0 1 2 1.
1 −1 2 0
(Bundelkhand 2010)
8 1 3 6
(ii) Find the rank of the matrix 0 3 2 2.
−8 −1 −3 4
1 −1 2 0
A ~ 0 1 2 1 by R1 ↔ R3
5 3 14 4
1 −1 2 0
~ 0 1 2 1 by R3 → R3 − 5 R1
0 8 4 4
1 −1 2 0
~ 0 1 2 1 by R3 → R3 − 8 R2 .
0 0 −12 −4
The last equivalent matrix is in Echelon form (or in triangular form). The number of
non-zero rows in this matrix is 3. Therefore its rank is 3. Hence rank A = 3.
(ii) Let us denote the given matrix by A. To find the rank of A, we shall reduce it to
1
Echelon form. Performing the column operation C1 → C1, we get
8
1 1 3 6 1 1 3 6
A~ 0 3 2 2 ~ 0 3 2 2 by R3 → R3 + R1.
−1 −1 −3 4 0 0 0 10
The last equivalent matrix is in Echelon form. The number of non-zero rows in this
matrix is 3. Therefore its rank is 3. Hence rank A = 3.
1 2 1
Example 10: Is the matrix −1 0 2 equivalent to I3 ?
2 1 −3
(Meerut 2008)
[Link]
M-60
1 2 1
Solution: Let A = −1 0 2.
2 1 −3
1 2 1 1 2 1
We have | A | = −1 0 2 = 0 2 3 R2 + R1, R3 − 2 R1
2 1 −3 0 −3 −5
= − 10 + 9 = − 1 i. e., ≠ 0.
Thus the matrix A is non-singular. Hence it is of rank 3. The rank of I3 is also 3. Since A
and I3 are matrices of the same size and the same rank, therefore A ~ I3 .
1 −1 2 −3
4 1 0 2 Ir O
Example 11: Reduce the matrix, A = to the nor mal form and
0 3 0 4 O O
0 1 0 2
hence determine its rank. (Meerut 2001, 09B, 10)
1 0 0 0
4 5 −8 14
~ by C2 → C2 + C1, C3 → C3 − 2 C1, C4 → C4 + 3 C1
0 3 0 4
0 1 0 2
1 0 0 0
0 5 −8 14
~ by R2 → R2 − 4 R1
0 3 0 4
0 1 0 2
1 0 0 0
0 1 0 2
~ by R2 ↔ R4
0 3 0 4
0 5 −8 14
1 0 0 0
0 1 0 0
~ by C4 → C4 − 2 C2
0 3 0 −2
0 5 −8 4
[Link]
M-61
1 0 0 0
0 1 0 0
~ by R3 → R3 − 3 R2 , R4 → R4 − 5 R2
0 0 0 −2
0 0 −8 4
1 0 0 0
0 1 0 0
~ by C3 ↔ C4
0 0 −2 0
0 0 4 −8
1 0 0 0
0 1 0 0 1 1
~ by C3 → − C3 , C4 → − C4
0 0 1 0 2 8
0 0 −2 1
1 0 0 0
0 1 0 0
~ by R4 → R4 + 2 R3
0 0 1 0
0 0 0 1
2 −1 3 4
0 3 4 1
(i) 2 3 7 5
2 5 11 6 (Kanpur 2010)
−2 −1 −3 −1
1 2 3 −1
(ii) 1 0 1 1
0 1 1 −1 (Meerut 2003, 09 B, 10B)
Solution : (i) Let us denote the given matrix by A. Performing the elementary
operations R3 → R3 − R1, R4 → R4 − R1, we see that
[Link]
M-62
2 −1 3 4 2 −1 3 4
0 3 4 1 0 3 4 1
A~ ~ by R4 → R4 − 2 R2
0 4 4 1 0 4 4 1
0 6 8 2 0 0 0 0
2 −1 3 4
0 12 16 4
~ by R2 → 4 R2 , R3 → 3 R3
0 12 12 3
0 0 0 0
2 −1 3 4
0 12 16 4
~ by R3 → R3 − R2 .
0 0 −4 −1
0 0 0 0
The last equivalent matrix is in Echelon form. The number of non-zero rows in this
matrix is 3. Therefore its rank is 3. Hence rank A = 3.
(ii) Let us denote the given matrix by A. Performing the elementary operation
R1 ↔ R2 , we see that
1 2 3 −1
−2 −1 −3 −1
A~
1 0 1 1
0 1 1 −1
1 2 3 −1
0 3 3 −3
~ by R2 → R2 + 2 R1, R3 → R3 − R1
0 −2 −2 2
0 1 1 −1
1 2 3 −1
0 1 1 −1 1 1
~ by R2 → R2 , R3 → − R3
0 1 1 −1 3 2
0 1 1 −1
1 2 3 −1
0 1 1 −1
~ by R3 → R3 − R2 , R4 → R4 − R2 .
0 0 0 0
0 0 0 0
[Link]
M-63
The last equivalent matrix is in Echelon form. The number of non-zero rows in this
matrix is 2. Therefore rank A = 2.
2 −2 0 6
4 2 0 2
Example 13: Find the rank of the matrix A = by reducing it to normal form.
1 −1 0 3
1 −2 1 2
(Meerut 2008; Avadh 08; Rohilkhand 09)
1 1
Solution : Performing the operation R1 → R1, R2 → R2 , we see that
2 2
1 −1 0 3 1 −1 0 3
2 1 0 1 0 3 0 −5
A~ ~
1 −1 0 3 0 0 0 0
1 −2 1 2 0 −1 1 −1
by R2 → R2 − 2 R1, R3 → R3 − R1, R4 → R4 − R1
1 0 0 0
0 3 0 −5
~ by C2 → C2 + C1, C4 → C4 − 3 C1
0 0 0 0
0 −1 1 −1
1 0 0 0
0 −1 1 −1
~ by R2 ↔ R4
0 0 0 0
0 3 0 −5
1 0 0 0
0 1 −1 1
~ by R2 → (−1) R2
0 0 0 0
0 3 0 −5
1 0 0 0
0 1 −1 1
~ by R4 → R4 − 3 R2
0 0 0 0
0 0 3 −8
[Link]
M-64
1 0 0 0
0 1 0 0
~ by C3 → C3 + C2 , C4 → C4 − C2
0 0 0 0
0 0 3 −8
1 0 0 0
0 1 0 0
~ by R4 ↔ R3
0 0 3 −8
0 0 0 0
1 0 0 0
0 1 0 0 1 1
~ by C3 → C3 , C4 → − C4
0 0 1 1 3 8
0 0 0 0
1 0 0 0
0 1 0 0
~ by C4 → C4 − C3 ,
0 0 1 0
0 0 0 0
I3 O
which is the normal form . Hence rank A = 3.
O O
Example 14: Find two non-singular matrices P and Q such that PAQ is in the normal form where
1 1 1
A = 1 −1 −1 .
3 1 1
1 1 1 1 0 0 1 0 0
1 −1 −1 = 0 1 0 A 0 1 0 ⋅
3 1 1 0 0 1 0 0 1
Now we go on applying E-operations on the matrix A (the left hand member of the
above equation) until it is reduced to the normal form. Every E-row operation will also
be applied to the pre-factor I3 (or its transform) of the product on the right hand
[Link]
M-65
member of the above equation and every E-column operation to the post-factor I3 (or
its transform).
1 1 1 1 0 0 1 0 0
0 −2 −2 = −1 1 0 A 0 1 0 .
0 −2 −2 −3 0 1 0 0 1
1 0 0 1 0 0 1 −1 −1
0 −2 −2 = −1 1 0 A 0 1 0 .
0 −2 −2 −3 0 1 0 0 1
1
Performing R2 → − R2 , we get
2
1 0 0 1 0 0 1 −1 −1
1 1
0 1 1 = − 0 A 0 1 0 .
0 2 2
−2 −2 −3 0 1 0 0 1
Performing R3 → R3 + 2 R2 , we get
1 0 0 1 0 0 1 −1 −1
1 1
0 1 1 = − 0 A 0 1 0 .
0 2 2
0 0 −2 −1 1 0 0 1
Performing C3 → C3 − C2 , we get
1 0 0 1 0 0 1 −1 0
1 1
0 1 0 = − 0 A 0 1 −1 .
0 2 2
0 0 −2 −1 1 0 0 1
I2 O
∴ PAQ = ,
O O
1 0 0 1 −1 0
1 1
where P= − 0 , Q = 0 1 −1 .
−2 2
1 0 1
2 −1 0
I2 O
Since A~ , therefore rank A = 2.
O O
[Link]
M-66
Comprehensive Exercise 2
8 0 0 1
1 −1 3 6
1 0 8 1
10. 1 3 −3 −4 11.
5 0 0 1 8
3 3 11
0 1 1 8 (Kumaun 2008)
0 1 −3 −1
1 0 1 1
12. 3 1 0 2
1 1 −2 0 (Avadh 2010)
[Link]
M-67
1 1 1
1 −3 4 7
13. (Kumaun 2008) 14. 2 2 2
9 1 2 0 3
3 3
2 1 3 4 5 6
15. 4 7 13 16. 5 6 7
4 −3 −1 7 8 9
1 0 2 1
1 1 1
0 1 −2 1
17. 1 18. a b c
−1 4 0 a3
b3 c 3
−2 2 8 0 (Meerut 2011)
19. With the help of elementary transformations find the rank of the following
matrix:
1 1 2 3
1 3 0 3
1 −2 −3 −3
1 1 2 3 (Rohilkhand 2010)
20. Reduce the following matrix to its Echelon form and find its rank :
1 3 4 5
3 9 12 9
−1 −3 −4 −3
(Meerut 2004B; Rohilkhand 06, 10)
1 2 3
21. Find the rank of the matrix A = 2 3 4 after reducing it to normal form.
3 5 7
0 1 2 −1
1 0 1 1
22. Reduce the matrix to normal form and find its rank.
3 1 0 2
1 1 −2 0
4 0 2 3 9 0 2
(ii) 3 1 0 , 7 −2 0 1
5 0 0 8 1 1 5
25. Find the ranks of A, B, A + B, AB and BA where
1 1 −1 −1 −2 −1
A = 2 −3 4, B = 6 12 6 .
3 −2 3 5 10 5
26. Show that if A and B are equivalent matrices, then there exist non-singular
matrices P and Q such that B = PAQ.
27. Show that the rank of a matrix is not altered if a column of it is multiplied by a
non-zero scalar.
28. (i) What is the rank of a non-singular matrix of order n?
(ii) What is the rank of an elementary matrix?
A nswers 2
1. 3 2. 2 3. 3 4. 2 5. 2 6. 2 7. 3 8. 4 9. 3
10. 3 11. 4 12. 2 13. 2 14. 1 15. 2 16. 2 17. 3
18. rank (A ) = 3 if a ≠ b ≠ c and a + b + c ≠ 0; rank (A ) = 2 if a ≠ b ≠ c and
a + b + c = 0; Also rank (A ) = 2 if a = b ≠ c ; and rank (A ) = 1 if a = b = c
19. 3 20. 2 21. 2 22. 3 23. (i) 3 (ii) 2
24. (i) No, since the rank of the first matrix is 4 and that of the second matrix is 2.
(ii) No, since the marices are not of the same type
25. Rank A = 2; Rank B = 1; Rank A + B = 2;
Rank AB = 0; Rank BA = 1
28. (i) n. (ii) Equal to the order of the matrix
[Link]
M-69
Ir O
PAQ = . …(1)
O O
Ir O
PAQ1 Q2 ..... Qt = ⋅ …(2)
O O
G
PA = ⋅
O
[Link]
M-70
Since elementary transformations do not alter the rank, therefore the rank of the
G
matrix PA is the same as that of the matrix A which is r. Thus the rank of the matrix
O
is r and therefore the rank of the matrix G is also r as the matrix G has r rows and last
G
m − r rows of the matrix consist of zeros only.
O
Ir O
PAQ = ⋅ …(1)
O O
Ir O
P1 P2 .... Ps AQ = . …(2)
O O
successively and affecting the corresponding row transformations in the R.H.S. of (2),
we get a relation of the form AQ = [H O].
Now elementary transformations do not alter the rank. Therefore the rank of the
matrix AQ is the same as that of A which is r. Thus the rank of the matrix [H O] is r and
[Link]
M-71
therefore the rank of the matrix H is also r as the matrix H has r columns and the last
n − r columns of the matrix [H O] consists of zero only.
Let A and B be two m × n and n × p matrices respectively. Let r1, r2 be the ranks of A and
B respectively and let r be the rank of the product AB.
To prove r ≤ r1 and r ≤ r2 .
Since A is an m × n matrix of rank r1, therefore there exists a non-singular matrix P such
G
that PA = , where G is an r1 × n matrix of rank r1 and O is (m − r1) × n.
O
G
∴ PAB = B.
O
Since the rank of a matrix does not alter by multiplying it with a non-singular matrix,
therefore
G
∴ Rank of the matrix B = r.
O
G
Since the matrix G has only r1 non-zero rows, therefore the matrix B cannot have
O
more than r1 non-zero rows which arise by multiplying the r1 non-zero rows of G with
the columns of B.
G
∴ Rank of the matrix B is ≤ r1 i. e., t ≤ r1
O
= r2 .
2.15 Theorem
Every non-singular matrix is row equivalent to a unit matrix.
Proof. We shall prove the theorem by induction on n, the order of the matrix. If the
matrix be of order 1 i. e., if A =| a11|, the theorem obviously holds.
Let us assume that the theorem holds for all non-singular matrices of order n − 1.
Let A = [aij] be an n × n non-singular matrix. The first column of the matrix A has at
least one element different from zero, for otherwise we shall have |A | = 0 and the
matrix A will not be non-singular.
Let ap1 = k ≠ 0 .
By interchanging the p th row with the first row (if necessary), we obtain a matrix B
whose leading element is equal to k which is not equal to zero.
Multiplying the elements of the first row of the matrix B by 1 / k, we obtain a matrix C
whose leading element is equal to unity.
Subtracting suitable multiples of the first row of C from the remaining rows, we obtain
a matrix D in which all elements of the first column except the leading element are
equal to zero.
By adding suitable multiples of the second, third, …, n th rows to the first row of M, we
obtain the matrix In.
Corollary 1: If A be an n-rowed non-singular matrix, there exist E-matrices E1, E2 , ...., Et such
that
If A be an n-rowed non-singular matrix, there exist E-matrices E1, E2 , ...., Et such that
Pre-multiplying both sides of the relation (1) by (Et Et − 1 ...... E2 E1)−1, we get
−1
or A = E1−1E2−1 ..... Et − 1E−t 1.
[Link]
M-74
Since the inverse of an elementary matrix is also an elementary matrix of the same type
hence we get the result.
Now we go on applying E-row transformations only to the matrix A and the pre-factor
In of the product InA till we reach the result In = BA.
1 2 1
Example 15: Find the inverse of the matrix A = 3 2 3 by using E-transformations.
1 1 2
(Avadh 2006, 08; Purvanchal 09; Rohilkhand 09; Lucknow 09)
1 2 1 1 0 0
Solution : We write A = I3 A, i. e., 3 2 3 = 0 1 0 A.
1 1 2 0 0 1
Now we go on applying E-row transformations to the matrix A (the left hand member
of the above equation) until it is reduced to the form I3 . Every E-row transformation
will also be applied to the prefactor I3 (or its transform) of the product on the right
hand side of the above equation.
1 2 1 1 0 0
0 −4 0 = −3 1 0 A.
0 −1 1 −1 0 1
Now we should try to make 1 in the place of the second element of the second row of
1
the matrix on the left hand side. So applying R2 → − R2 , we get
4
1 2 1 1 0 0
3 1
0 1 0 = − 0 A.
0 4 4
−1 1 −1 0 1
Now we shall make zeros in the place of the second elements of the first and third rows
with the help of the second row. So applying R1 → R1 − 2 R2 , R3 → R3 + R2 , we get
1 1
− 0
1 0 1 2 2
3 1
0 1 0 = − 0 A.
0 4 4
0 1 1 1
− − 1
4 4
[Link]
M-76
Now the third element of the third row is already 1. So to make the third element of
the first row zero, we apply R1 → R1 − R3 , and we get
1 3
− −1
1 0 1 3 4
3 1
0 1 0 = − 0 A.
0 4 4
0 1 1 1
− − 1
4 4
1 3
− 4 −1
4
3 1
Thus I3 = BA, where B = − 0 .
4 4
− 1 −
1
1
4 4
1 3
− 4 −1
4
3 1
∴ A −1 =B= − 0 ⋅
4 4
− 1 −
1
1
4 4
0 1 2 2
1 1 2 3
Example 16: Find the inverse of the matrix A = by using E-transformations.
2 2 2 3
2 3 3 3
0 1 2 2 1 0 0 0
1 1 2 3 0 1 0 0
Solution : We write = A.
2 2 2 3 0 0 1 0
2 3 3 3 0 0 0 1
Performing R1 ↔ R2 , we get
1 1 2 3 0 1 0 0
0 1 2 2 1 0 0 0
2 = A.
2 2 3 0 0 1 0
2 3 3 3 0 0 0 1
[Link]
M-77
1 1 2 3 0 1 0 0
0 1 2 2 1 0 0 0
0 = A.
0 −2 −3 0 −2 1 0
0 1 −1 −3 0 −2 0 1
Performing R4 → R4 − R2 , R1 → R1 − R2 , we get
1 0 0 1 − 1 1 0 0
0 1 2 2 1 0 0 0
0 = A.
0 −2 −3 0 −2 1 0
0 1 −3 −5 −1 −2 0 1
Performing R4 → R4 − R2 , R1 → R1 − R2 , we get
1 0 0 1 − 1 1 0 0
0 1 2 2 1 0 0 0
0 = A.
0 −2 −3 0 −2 1 0
0 0 −3 −5 −1 −2 0 1
1
Performing R3 → − R3 , we get
2
1 0 0 1 − 1 1 0 0
0 1 2 2 1 0 0 0
0 3 = 1 A.
0 1 0 1 − 0
2 2
0 0 −3 −5 −1 −2 0 1
Performing R2 → R2 − 2 R3 , R4 → R4 + 3 R3 , we get
1 0 0 1 −1 1 0 0
0 1 0 −1 1 −2 1 0
0 3 = 1
0 1 0 1 − 0 A.
2 2
1 3
0 0 0 − −1
2
1 −
2
1
[Link]
M-78
Performing R4 → − 2 R4 , we get
1 0 0 1 −1 1 0 0
0 1 0 −1 1 −2 1 0
0 3 = 1 A.
0 1 0 1 − 0
2 2
0 0 0 1 2 −2 3 −2
3
Performing R3 → R3 − R4 , R2 → R2 + R4 , R1 → R1 − R4 we get
2
1 0 0 0 −3 3 −3 2
0 1 0 0 3 −4 4 −2
0 = A.
0 1 0 −3 4 −5 3
0 0 0 1 2 −2 3 −2
−3 3 −3 2
3 −4 4 −2
∴ A −1 = ⋅
−3 4 −5 3
2 −2 3 −2
Comprehensive Exercise 3
1 2 1
1. Reduce the matrix A = −1 0 2 to I3 by E-row transformations only.
2 1 3
2. Compute the inverse of the following matrices by using elementary
row-transformations :
1 2 3 i −1 2 i
(i) 2 4 5 (ii) 2 0 2
3 5 6 −1 0 1
−1 −3 3 −1
1 3 3
1 1 −1 0
(iii) (iv) 1 4 3
2 −5 2 −3 1
3 4
−1 1 0 1
(Avadh 2007)
[Link]
M-79
0 1 2
(v) 1 2 3
3 1 1
(Rohilkhand 2011; Bundelkhand 08)
A nswers 3
1 1
1 −3 2 0 4
−
2
1 3i i
2. (i) −3 3 −1 (ii) −1
157 4 2
−2 −1 0 0 1 1
4 2
0 2 1 3
7 −3 −3
1 1 −1 −2
(iii) (iv) −1 1 0
1 2 0 1 −1
0 1
−1 1 2 6
1 1 1
2 −
2 2
(v) −4 3 −1
5 3 1
−
2 2 2
(a) rank A = r + 1
(b) rank A = r
(d) rank A ≥ r + 1
[Link]
M-80
(b) rankA = 1
(c) rank A = m
(d) rank A = n
1 2 3 0
2 4 3 2
3. The rank of the matrix is
3 2 1 3
6 8 7 5
(a) 1 (b) 2
(c) 3 (d) 4
4. If A and B are two matrices such that rank of A = m and rank of B = n, then
(a) rank (A B) = m n
10. If two matrices A and B have the same size and the same rank, they are ……… .
1 0
11. The rank of a matrix is ………
0 1 (Agra 2008)
True or False
Write ‘T’ for true and ‘F’ for false statement.
1. The rank of the transpose of a matrix is the same as that of the original matrix.
A nswers
True or False
1. T 2. F 3. F
4. T 5. T
o
[Link]
M-83
3
Linear Equations
e shall devote this chapter to the study of the nature of solutions of a system of
W linear equations with the help of the theory developed in the preceding
chapters. Before discussing the solutions of linear equations, we shall discuss concepts
of linearly dependent and linearly independent sets of vectors.
3.1 Vectors
Definition: Any ordered n-tuple of numbers is called an n-vector. By an ordered n-tuple we
mean a set consisting of n numbers in which the place of each number is fixed. If
x1, x2 , ...., xn be any n numbers, then the ordered n -tuple X = ( x1, x2 , ..., xn) is called an
n-vector. The ordered triad ( x1, x2 , x3 ) is called a 3-vector. Similarly (1, 0 , 1, − 1) and
(1, 8, − 5, 7) are 4-vectors. The n numbers x1, x2 , ..., xn are called components of the
n-vector X = ( x1, x2 , ..., xn). A vector may be written either as a row vector or as a column
vector. If A be a matrix of the type m × n, then each row of A will be an n-vector and each
column of A will be an m-vector. A vector whose components are all zero is called a zero
vector and will be denoted by O.
If k be any number and X be any vector, then relative to the vector X, k is called a scalar.
Equality of two vectors. Two n-vectors X and Y where X = ( x1, x2 ,..., xn) and
Y = ( y1, y2 , ..., yn) are said to be equal if and only if their corresponding components
are equal i. e., if xi = yi, for all i = 1, 2,.., n.
For example if
then X = Y.
X = ( x1, x2 , …, xn),
then by definition
The vector k X is called the scalar multiple of the vector X by the scalar k.
and 0 X = (0 , 0 , 0 ).
(i) X + Y = Y + X. (ii) X + (Y + Z) = (X + Y) + Z.
A set of r n-vectors X1, X2 , …, X r is said to be linearly dependent if there exist r scalars (numbers)
k1, k2 , ..., k r , not all zero, such that
k1X1 + k2 X2 + .. + k r X r = O,
A set of r n-vectors X1, X2 , ..., X r is said to be linearly independent if every relation of the type
k1X1 + k2 X2 + … + k r X r = O
implies k1 = k2 = k3 … = k r = 0 .
Example 1: Show that the vectors X1 = (1, 2, 4), X2 = (3, 6, 12) are linearly dependent.
Thus there exist numbers k1 = 3, k2 = − 1 which are not all zero such that
k1X1 + k2 X2 = O.
Example 2: Show that the set consisting only of the zero vector, O, is linearly dependent.
Solution : Let X = (0 , 0 , 0 , ...., 0 ) be an n-vector whose components are all zero. Then
the relation kX = O is true for some non-zero value of the number k. For example,1X = O
and 1 ≠ 0.
Example 3: Show that the vectors X1 = (1, 2, 3) and X 2 = (4, − 2, 7) are linearly independent.
i.e., (k1 + 4 k2 , 2 k1 − 2 k2 , 3 k1 + 7 k2 ) = (0 , 0 , 0 ).
Equating the corresponding components, we get
k1 + 4 k2 = 0 , 2 k1 − 2 k2 = 0 , 3 k1 + 7 k2 = 0 .
The only common values of k1 and k2 which satisfy these equations are k1 = 0 , k2 = 0 .
X1 = (1, 0 , 0 ), X2 = (0 , 1, 0 ), X3 = (0 , 0 , 1)
is linearly independent.
k1X1 + k2 X2 + k3 X3 = O,
i. e., k1(1, 0 , 0 ) + k2 (0 , 1, 0 ) + k3 (0 , 0 , 1) = (0 , 0 , 0 ),
i. e., (k1, 0 , 0 ) + (0 , k2 , 0 ) + (0 , 0 , k3 ) = (0 , 0 , 0 ),
i. e., (k1, k2 , k3 ) = (0 , 0 , 0 ).
X = k1X1 + k2 X2 + … + k r X r ,
(i) If a set of vectors is linearly dependent, then at least one member of the set can be expressed as
a linear combination of the remaining members.
(ii) If a set of vectors is linearly independent then no member of the set can be expressed as a linear
combination of the remaining members.
[Link]
M-87
It is important to note that every sub-space of Vn contains the zero vector, being the
scalar product of any vector with the scalar zero.
i. e., (k1 + k2 ) a
which is also a member of S. Also the scalar multiple by any scalar x of any vector k1a of
S is the vector ( xk1)a which is again a member of S.
which is again a member of S. Thus S is a vector subspace and we say that S is a spanned
by the vectors a, b and c. More generally, if a1, a2 , ..., a r be a set of r fixed vectors of Vn,
then the set S of all n-vectors of the form p1a1 + p2 a2 + … pr a r where p1, p2 , ...., pr are
any scalars is a vector subspace of Vn.
[Link]
M-88
en = (0 , 0 , 0 , ..., 1)
We have already shown that these vectors are linearly independent. Moreover any
vector a = (a1, a2 , ..., an) of Vn is expressible as a = a1e1 + a2 e2 + a3 e3 + … + anen. Hence
the vectors e1, e2 , e3 , ..., en constitute a basis of Vn.
Theorem: A basis of a subspace, S, can always be selected out of a set of vectors which span S.
Let a1, a2 , ..., a r be a set of vectors which spans a subspace S. If these vectors are linearly
independent, they already constitute a basis of S as they span S. In case they are
linearly dependent, some member of the set is a linear combination of the members.
Deleting this member we obtain another set which also spans.
For examlple, if
But it is important to note that the number of members in any one basis of a subspace is the
same as in any other basis. This number is called the dimension of the subspace.
[Link]
M-89
We have already shown that one basis of Vn possesses n members. Therefore every
basis of Vn must possess n members. Thus Vn is of dimension n. In a particular the
dimension of V3 is 3.
Also it can be easily shown that if r be the dimension of a subspace S and if a1, a2 ,… a k
be a linearly independent set of vectors belonging to S, then we can always find vectors
a k + 1, a k + 2 , .... a r such that the vectors a1, a2 , . . . , a k , a k + 1,..., a r constitute a basis of S.
In other words we can say that every linearly independent set of vectors belonging to a
subspace S can always be extended so as to constitute a basis of S.
Moreover if r be the dimension of a subspace S, then every set of more than r members
of S will be linearly dependent.
Row rank of a matrix. Definition: The maximum number of linearly independent rows
of a matrix A is said to be the row rank of the matrix A.
We shall now prove that the sum of the row rank and the row nullity of a matrix is equal to the
number of rows, i. e.,
r + s = m.
Proof: Since the row space R of A is spanned by the row vectors of A, therefore it will
be a set of all vectors of the form
i. e., of the form ( x1a11 + x2 a21 + … + xm am1, x1a12 + x2 a22 + .... + xm am2 ,
Let u1, u2 , ..., u s be a basis of the subspace S of Vm generated by all vectors X such that
XA = O. Then, we have
u1A = u2 A = … = u s A = O.
Since the vectors u1, u2 , ..., u s belong to Vm and form a linearly independent set,
therefore we can find vectors u s + 1, u s + 2 , …, u m in Vm such that the vectors
u1, u2 , ..., u s , u s + 1, ..., u m constitute a basis of Vm . Then every vector X belonging to Vm
can be expressed in the form
X = h1u1 + h2 u2 + .... + hm u m .
i. e., as (h1u1 + h2 u2 + … + hm u m ) A
i. e., as h1u1A + h2 u2 A + … + hs u s A + hs + 1u s + 1A + hs + 2 u s + 2 A + … + hm u m A
i. e., as hs + 1u s + 1A + hs + 2 u s + 2 A + … + hm u m A.
k s + 1u s + 1A + k s + 2 u s + 2 A + … + k m u m A = O
implies (k s + 1u s + 1 + k s + 2 u s + 2 + … + k m u m )A = O,
But the vectors u1, u2 , ..., u m are linearly independent. Therefore a relation of the form
k s + 1u s + 1 + k s + 2 u s + 2 + … + k m u m = p1u1 + p2 u2 + .....+ ps u s will exist if and only if
k s + 1 = k s + 2 = … = k m = 0 . Hence the vectors u s + 1A, u s + 2 A , ..., u m A are linearly
independent and form a basis of R. Thus the dimension of R is m − s.
Hence r=m−s or r + s = m.
[Link]
M-91
Proof. Let A be any given m × n matrix. Let B be a matrix row equivalent to A. Since B
is obtainable from A by a finite chain of E-row operations and every E-row operation is
equivalent to pre-multiplication by the corresponding E-matrix, there exist E-matrices
E1, E2 , ..., Ek each of the type m × m such that
B = Ek Ek − 1 … E2 E1A, i. e., B = PA ,
Let us write
where the matrix A has been expressed as a matrix of its row sub-matrices
R1, R2 , ..., R m .
[Link]
M-92
From the product of the matrices on the R.H.S. of (1), we observe that the rows of the
matrix B are
p21R1 + p22 R2 + … + p2 m R m ,
… … … … …
… … … … …
Thus we see that the rows of B are all linear combinations of the rows R1, R2 , ...., R m of
A. Therefore every member of the row space of B is also a member of the row space of A.
Similarly by writing A = P−1B and giving the same reasoning we can prove that every
member of the row space of A is also a member of the row space of B. Therefore the row
spaces of A and B are identical.
Thus we see that elementary row operations do not alter the row space of a matrix.
Hence the row rank of a matrix remains invariant under E-row transformations.
Or
Post-multiplication by a non-singular matrix does not alter the column rank of a matrix.
Proof: Proceeding in the same way as in 3.10, we can show that post-multiplication
with a non-singular matrix does not alter the column space and therefore the column
rank of a matrix.
Note. Since every n-rowed E-matrix is obtained from In by single E-operation (row or
column operation as may be desired), therefore the row rank and column rank of an
E-matrix are equal to n.
[Link]
M-93
Proof: Let A be any given m × n matrix and let B be a matrix row equivalent to A. Then
there exists a non-singular matrix P such that B = PA.
BX = (PA) X = P (AX) = PO = O.
Thus we see that the matrices A and B have the same right nullities and consequently
their column ranks are equal.
Similarly we can prove that column equivalent matrices have the same row rank.
3.13 Theorem
If r be the row rank of an m × n matrix A then there exists a non-singular matrix, P such that
K
PA = ,
O
Proof. If the row rank r of A is zero, we have nothing to prove. Therefore let us
assume that r > 0. The matrix A has then r linearly independent rows. By elementary
row operations on A we can bring these linearly independent rows in the first r places.
Since the last m − r rows are now linear combinations of the first r rows, they can be
made zero by E-row operations without altering the first r rows.
Thus we see that the matrix A is row equivalent to a matrix B such that
K
B= ,
O
K
PA = ⋅
O
AR = [L O],
Proof: Let, s be the row rank and r, the rank of an m × n matrix A. Since the matrix A is
of row rank s, therefore by 3.13, there exists a non-singular matrix P such that
K
PA = , where K is an s × n matrix.
O
Now we know that pre-multiplication by a non-singular matrix does not alter the rank
of a matrix.
But each minor of order (s + 1) of the matrix PA involves atleast one row of zeros.
∴ Rank (PA) ≤ s.
∴ r≤s
Again, since the rank of the matrix A is r, therefore by 2.12 of chapter 2 there exists a
non-singular matrix R such that
G
RA = ,
O
where G is an r × n matrix.
Now we know that pre-multiplication by a non-singular matrix does not alter the row
rank of a matrix.
[Link]
M-95
But the matrix RA has only r non-zero rows. Therefore the row rank of RA can, at the
most, be equal to r.
∴ s≤ r
Hence r = s.
Proof: Let the matrix A′ be the transpose of the matrix A. Then the columns of A are
the rows of A′ .
∴ the column rank of A = the row rank of A′ = the rank of A′ = the rank of A.
Thus from theorems 1 and 2, we conclude that the rank, row rank and column rank of a
matrix are all equal. In other words we can say that the maximum number of linearly
independent rows of a matrix is equal to the maximum number of its linearly
independent columns and is equal to the rank of the matrix.
Thus we have proved that the row rank, the column rank and the rank of a matrix are
all equal. Therefore sometimes the rank of a matrix is also defined as the maximum number of
linearly independent row vectors or column vectors.
We shall now first consider systems of linear homogeneous equations and then
proceed to discuss systems of non-homogeneous linear equations.
AX = O. …(2)
The matrix A is called the coefficient matrix of the system of equations (1).
Again suppose X1 and X2 are two solutions of (2). Then their linear combination k1X1 + k2 X2 ,
where k1 and k2 are any arbitrary numbers, is also a solution of (2).
Therefore the collection of all the solutions of the system of equations AX = O forms a sub-space of
the n-vector space Vn.
Proof:
Since the rank of the coefficient matrix A is r, therefore it has r linearly independent
columns. Without loss of generality we can suppose that the first r columns from the
left of the matrix A are linearly independent, because it amounts only to renaming the
components of X.
The matrix A can be written as A = [C1, C2 , …, Cr , … Cn] 1 × n , where C1, C2 , ..., Cn are the
column vectors of the matrix A each of them being an m-vector.
The vectors X1, X2 , ..., X n − r form a linearly independent set. For, if we have a relation
of type
l1X1 + l2 X2 + … + ln − r X n − r = O, …(4)
then comparing the (r + 1)th, (r + 2)th, … , nth components on both sides of (4), we get
− l1 = 0 , − l2 = 0 , ..., − ln − r = 0 ,
It can now be easily seen that every solution of the equation AX = O is some suitable
linear combination of these n − r solutions X1, X2 , ...., X n − r .
Suppose the vector X, with components x1, x2 , ..., xn is any solution of the equation
AX = O. The the vector
AX = O.
Therefore
X = − xr + 1X1 − xr + 2 X2 − … − xnX n − r .
{ X1, X2 , …, X n − r }
forms a basis of the vector space of all the solutions of the system of equations AX = O.
Case II. If r < n, we shall have n − r linearly independent solutions. Any linear
combination of these n − r solutions will also be a solution of AX = O. Thus in this case
the equation AX = O will have an infinite number of solutions.
Case III. Suppose m < n i. e., the number of equations is less than the number of
unknowns. Since r ≤ m, therefore r is definitely less than n. Hence in this case the given
system of equations must possess a non-zero solution. The number of solutions of the
equation AX = O will be infinite.
X = c1X1 + c2 X2 + … + c k X k ,
If r = n, the zero solution (trivial solution) will be the only solution. If r < n, there will
be an infinity of solutions.
[Link]
M-100
Example 5: Does the following system of equations possess a common non-zero solution?
x + 2 y + 3 z = 0, 3 x + 4 y + 4 z = 0 , 7 x + 10 y + 12 z = 0 . (Lucknow 2005)
Solution : The given system of equations can be written in the form of the single matrix
equation
1 2 3 x 0
AX = 3 4 4 y = 0 = O.
7 10 12 z 0
We shall start reducing the coefficient matrix A to triangular form by applying only
E-row transformations on it. Applying R2 → R2 − 3 R1, R3 → R3 − 7 R1, the given
system of equations is equivalent to
1 2 3 x
0 −2 −5 y = O.
0 −4 −9 z
Here we find that the determinant of the matrix on the left hand side of this equation is
not equal to zero. Therefore the rank of this matrix is 3. So there is no need of further
applying E-row transformations on the coefficient matrix. The rank of the coefficient
matrix A is 3, i. e., equal to the number of unknowns. Therefore the given system of
equations does not possess any linearly independent solution. The zero solution, i. e.,
x = y = z = 0 is the only solution of the given system of equations.
x + 3 y − 2 z = 0 , 2 x − y + 4 z = 0 , x − 11 y + 14 z = 0 . (Meerut 2005B)
Solution : The given system of equations is equivalent to the single matrix equation
1 3 −2 x
AX = 2 −1 4 y = O.
1 −11 14 z
We shall reduce the coefficient matrix A to Echelon form by applying only E-row
operations on it. Performing R2 → R2 − 2 R1, R3 → R3 − R1, we have
[Link]
M-101
1 3 −2 x
0 −7 8 y = O.
0 −14 1 z
Performing R3 → R3 − 2 R2 , we have
1 3 −2 x
0 −7 8 y = O.
0 0 0 z
The coefficient matrix is now triangular. The coefficient matrix being of rank 2, the
given system of equations possess 3 − 2 = 1 linearly independent solution. We shall
assign arbitrary values to n − r = 3 − 2 = 1 variable and the remaining r = 2 variables
shall be found in terms of these. The given system of equations is equivalent to
x + 3 y − 2 z = 0, − 7 y + 8 z = 0 .
8 10
Thus y= z, x = − z.
7 7
Choose z = c.
8 10
Then y= c, x = − c.
7 7
10 8
Hence x=− c, y = c, z = c
7 7
constitute the general solution of the given system, where c is an arbitrary parameter.
10 10
− c − 7
x 7
8 8
y = c =c , where c is an arbitrary number.
z 7 7
c 1
Example 7: Does the following system of equations possess a common non-zero solution?
x + y + z = 0 , 2 x − y − 3 z = 0 , 3 x − 5 y + 4 z = 0 , x + 17 y + 4 z = 0 .
Solution: The given system of equations is equivalent to the single matrix equation
[Link]
M-102
1 1 1
x
2 −1 −3
AX = y = O.
3 −5 4 z
1 17 4
We shall first find the rank of the coefficient matrix A by reducing it to Echelon form
by applying elementary row transformations only.
1 1 1 1 1 1
0 −3 −5 0 −3 −5
A~ ~ by R3 → 3 R3 , R4 → 3 R4
0 −8 1 0 −24 3
0 16 3 0 48 9
1 1 1
0 −3 −5
~ by R3 → R3 − 8 R2 , R4 → R4 + 16 R2
0 0 43
0 0 −71
1 1 1
0 −3 −5 71
~ by R4 → R4 + R3 .
0 0 43 43
0 0 0
Above is the Echelon form of the coefficient matrix A. We have rank A = the number
of non-zero rows in this Echelon form = 3. The number of unknowns is also 3. Since
rank A is equal to the number of unknowns, therefore the given system of equations
does not possess any linearly independent solution. Thus the given system of
equations possesses no non-zero solution. Hence the zero solution i. e., x = y = z = 0 is
the only solution of the given system of equations.
3 x + 4 y − z − 6w = 0, 2 x + 3 y + 2z − 3w = 0,
2 x + y − 14 z − 9 w = 0 , x + 3 y + 13 z + 3 w = 0 .
Solution: The given system of equations is equivalent to the single matrix equation
[Link]
M-103
3 4 −1 −6 x
2 3 2 −3 y
AX = z = O.
2 1 −14 −9
1 3 13 3 w
We shall first find the rank of the coefficient matrix A by reducing it to Echelon form
by applying E-row transformations only.
1 3 13 3
2 3 2 −3
A~
2 1 −14 −9
3 4 −1 −6
1 3 13 3
0 −3 −20 −9
~
0 −5 −40 −15
0 −5 −40 −15
By R2 → R2 − 2 R1, R3 → R3 − 2 R1, R4 → R4 − 3 R1
1 3 13 3
0 1 8 3
~
0 1 8 3
0 1 8 3
1 1 1
by R2 → − R2 , R3 → − R3 , R4 → − R4
3 5 5
1 3 13 3
0 1 8 3
~ by R3 → R3 − R2 , R4 → R4 − R2 .
0 0 0 0
0 0 0 0
The rank of A is obviously 2 which is less than the number of unknowns 4. Therefore
the given system of equations possesses 4 − 2, i. e., 2 linearly independent solutions.
The given system of equations is equivalent to the equation
[Link]
M-104
1 3 13 3 x 0
0 1 8 3 y 0
0 0 0 0 z = 0 ⋅
0 0 0 0 w 0
Thus the given system of four equations is equivalent to the system of two equations,
i. e.,
x + 3 y + 13 z + 4 w = 0
⋅
y + 8z + 3w = 0
y = − 8 z − 3 w, x = − 3 (− 8 z − 3 w) − 13 z − 3 w
i. e., y = − 8 z − 3 w, x = 11z + 6 w.
Remark: In matrix form, the general solution of the given system of equations can
be expressed as
x 11c1 + 6 c2 11 6
y − 8 c1 − 3 c2 − 8 − 3
z = 1c + 0 c = c1 1 + c2 0 .
1 2
w 0 c1 + 1c2 0 1
x 11 x −6
y − 8 y −3
Here =
z 1 and z = 0
w 0 w 1
are two linearly independent solutions of the given system of equations and all their
linear combinations will also be the solutions of the given system of equations.
Example 9: Show that the only real value of λ for which the following equations have non-zero
solutions is 6 :
x + 2 y + 3 z = λx, 3 x + y + 2 z = λy, 2 x + 3 y + z = λz .
Solution: The given system of equations is equivalent to the single matrix equation
1 − λ 2 3 x
AX = 3 1− λ 2 y = O.
2 3 1 − λ z
1 − λ 2 3
3 1− λ 2 =0
2 3 1 − λ
6 − λ 6−λ 6 − λ
or 3 1− λ 2 = 0,
2 3 1 − λ
1 1 1
or (6 − λ ) 3 1− λ 2 =0
2 3 1 − λ
1 1 0
or (6 − λ ) 3 −λ − 2 −1 = 0 , C2 − C1, C3 − C1
1 1 − λ − 1
or (6 − λ ) [(λ + 2) (λ + 1) + 1] = 0
or (6 − λ ) [λ 2 + 3 λ + 3] = 0 .
− 3 ± (9 − 12)
The roots of the equation λ 2 + 3 λ + 3 = 0 are λ = i. e.,are imaginary.
2
Hence the only real value of λ for which the system of equations is to have a non-zero
solution is 6.
[Link]
M-106
Comprehensive E xercise 1
Find all the solutions of the following system of linear homogeneous equations :
1. 2 x − 3 y + z = 0 , x + 2 y − 3 z = 0 , 4 x − y − 2 z = 0 .
2. x + y − 3 z + 2 w = 0 , 2 x − y + 2 z − 3 w = 0 , 3 x − 2 y + z − 4 w = 0 ,
− 4 x + y − 3z + w = 0.
3. x + y + z = 0 , 2 x + 5 y + 7 z = 0 , 2 x − 5 y + 3 z = 0 .
4. x + 2 y + 3 z = 0 , 2 x + 3 y + 4 z = 0 , 7 x + 13 y + 19 z = 0 .
5. 4 x + 2 y + z + 3 u = 0 , 6 x + 3 y + 4 z + 7 u = 0 , 2 x + y + u = 0 .
6. 2 x − 2 y + 5 z + 3 w = 0 , 4 x − y + z + w = 0 , 3 x − 2 y + 3 z + 4 w = 0 ,
x − 3 y + 7 z + 6 w = 0.
7. x − 2 y + z − w = 0 , x + y − 2 z + 3 w = 0 , 4 x + y − 5 z + 8 w = 0 ,
5 x − 7 y + 2z − w = 0
A nswers 1
1. x = 0 , y = 0 , z = 0 2. x = 0 , y = 0 , z = 0 , w = 0
3. x = 0 , y = 0 , z = 0 4. x = 2 c , y = − 2 c , z = c
5. x = c1, u = c2 , y = − 2 c1 − c2 , z = − c2
5 7
6. x = c , y = 4 c , z = c , w = c
9 9
5 4
7. x = c1 − c2 , y = c1 − c2 , z = c1, w = c2
3 3
There is no set of values of x and y which satisfies both these equations. Such
equations are said to be inconsistent.
[Link]
M-107
3x + 4 y = 5
.
6 x + 8 y = 10
These equations are consistent since there exist values of x and y which satisfy both of
4 5
these equations. We see that x = − c + , y = c constitute a solution of these
3 3
equations, where c is arbitrary. Thus these equations possess an infinite number of
solutions.
If we write
Any set of values of x1, x2 , ..., xn which simultaneously satisfy all these equations is called a
solution of the system (1). When the system of equations has one or more solutions, the equations
are said to be consistent otherwise they are said to be inconsistent.
The matrix
Proof. Let C1, C2 , ..., C n denote the column vectors of the matrix A. The equation
AX = B is then equivalent to
x1
x2
[C1, C2 , ...., Cn] … = B i. e., x1C1 + x2C2 + … + xnCn = B. …(1)
…
xn
Let now r be the rank of the matrix A. The matrix A has then r linearly independent
columns and without loss of generality, we can suppose that the first r columns
C1, C2 , ..., Cr form a linearly independent set so that each of the remaining n − r
columns is a linear combination of these r columns.
The condition is sufficient. Now suppose that the matrices A and [A B] are of the
same rank r. The maximum number of linearly independent columns of the matrix
[A B] is then r. But the first r columns C1, C2 , ..., Cr of the matrix [A B] already form a
linearly independent set. Therefore the column B should be expressed as a linear
combination of the columns C1, C2 , ...., Cr .
To show that the solution is unique, let us suppose that X1 and X2 be two solutions of
AX = B.
⇒ IX 1 = IX 2 ⇒ X 1 = X 2.
In this case the equations AX = B are inconsistent i. e., they have no solution.
In this case the equations AX = B are consistent i. e., they possess a solution. If r < m,
then in the process of reducing the matrix [A B] to Echelon form, (m − r) equations
will be eliminated. The given system of m equations will then be replaced by an
equivalent system of r equations. From these r equations we shall be able to express
the values of some r unknowns in terms of the remaining n − r unknowns which can be
given any arbitrarily chosen values.
If r < n, then n − r variables can be assigned arbitrary values. So in this case there will be
an infinite number of solutions. Only n − r + 1 solutions will be linearly independent
and the rest of the solutions will be linear combinations of them.
If m < n, then r ≤ m < n. Thus in this case n − r > 0. Therefore when the number of
equations is less than the number of unknowns, the equations will always have an
infinite number of solutions, provided they are consistent.
x + y + z = − 3, 3 x + y − 2 z = − 2, 2 x + 4 y + 7 z = 7
Solution : The given system of equations is equivalent to the single matrix equation
1 1 1 x −3
AX = 3 1 −2 y = −2 = B.
2 4 7 z 7
1 1 1 : −3
The augmented matrix [A B] = 3 1 −2 : −2.
2 4 7 : 7
1 1 1: −3
[A B] ~ 0 −2 −5 : 7
0 2 5 : 13
1 1 1: −3
~ 0 −2 −5 : 7, aplying R3 → R3 + R2 .
0 0 0 : 20
Above is the Echelon form of the matrix [A B]. We have rank [A B] = the number of
non-zero rows in this Echelon form = 3
1 1 1
A ~ 0 −2 −5.
0 0 0
Obviously rank A = 2.
Since Rank A ≠ Rank [A B], therefore the given equations are inconsistent i. e., they
have no solution.
Remark: The inconsistency of the given equations can also be shown as below :
1 1 1 x −3
0 −2 −5 y = 7.
0 0 0 z 20
x + y + z = −3
0 x − 2 y − 5z = 7 .
0 x + 0 y + 0 z = 20
The last equation shows that 0 = 20 , which is not possible. Hence the given equations
are inconsistent.
x + y + z = 6, x + 2 y + 3 z = 14, x + 4 y + 7 z = 30
Solution : The given system of equations is equivalent to the single matrix equation
1 1 1 x 6
AX = 1 2 3 y = 14 = B.
1 4 7 z 30
1 1 1 M 6
The augmented matrix [A B] = 1 2 3 M 14 .
1 4 7 M 30
1 1 1M 6 1 1 1M 6
[A B] ~ 0 1 2M 8 ~ 0 1 2M 8, by R3 → R3 − 3 R2 .
0 3 6M 24 0 0 0 M 0
Above is the Echelon form of the matrix [A B]. We have rank [A B] = the number of
non-zero rows in this Echelon form = 2 .
1 1 1
By the same elementary transformations, we get A ~ 0 1 2.
0 0 0
Obviously rank A = 2. Since rank A = rank [A B], therefore the given equations are
consistent. Here the number of unknowns is 3. Since rank A is less than the number of
unknowns therefore the given system will have an infinite number of solutions. We
see that the given system of equations is equivalent to the matrix equation
1 1 1 x 6
0 1 2 y = 8.
0 0 0 z 0
x + y + z = 6
.
y + 2z = 8
∴ y = 8 − 2 z , x = 6 − y − z = 6 − (8 − 2 z ) − z = z − 2.
Example 12: Apply the test of rank to examine if the following equations are consistent :
2 x − y + 3 z = 8, − x + 2 y + z = 4, 3 x + y − 4 z = 0
Solution : The given system of equations is equivalent to the single matrix equation
2 −1 3 x 8
AX = −1 2 1 y = 4 = B.
3 1 −4 z 0
2 −1 3 M 8
[A B] = −1 2 1M 4.
3 1 −4 M 0
We shall reduce the augmented matrix to Echelon form by applying elementary row
transformations only. Applying R1 ↔ R2 , we get
−1 2 1M 4
[A B] ~ 2 −1 3 M 8
3 1 −4 M 0
−1 2 1M 4
~ 0 3 5 M 16, by R2 → R2 + 2 R1, R3 → R3 + 3 R1
0 7 −1 M 12
−1 2 1 M 4
0 3 5 M 16, by R3 → 3 R3
0 21 −3 M 36
−1 2 1 M 4
~ 0 3 5 M 16, by R3 → R3 − 7 R2
0 0 −38 M −76
−1 2 1 M 4
1
~ 0 3 5 M 16, by R3 → − R3 .
0 38
0 1 M 2
[Link]
M-114
Above is the Echelon form of the matrix [A B]. We have the rank [A B] = the number
of non-zero rows in this Echelon form = 3.
−1 2 1
By the same transformations, we get A ~ 0 3 5 ⋅
0 0 1
Obviously rank A = 3. Since rank A = rank [A B], therefore the given equations are
[Link] the number of unknowns is 3. Since rank A is equal to the number of
unknowns, therefore the given equations have a unique solution. We see that the
given equations are equivalent to the matrix equation
−1 2 1 x 4
0 3 5 y = 16.
0 0 1 z 2
− x + 2 y + z = 4, 3 y + 5 z = 16, z = 2.
These give z = 2, y = 2, x = 2.
x + 2 y − z = 3, 3 x − y + 2 z = 1, 2 x − 2 y + 3 z = 2, x − y + z = − 1
are consistent and solve them. (Meerut 2006B, 09; Rohilkhand 06)
Solution: The given system of equations is equivalent to the single matrix equation
1 2 −1 3
x
3 −1 2 1
AX = y = = B.
2 −2 3 2
z
1 −1 1 −1
1 2 −1 &: 3
3 −1 2 &: 1
[A B] = .
2 −2 3 &: 2
1 −1 1 &: −1
1 2 −1 M 3
0 −7 5 M −8
[A B] ~
0 −6 5 M −4
0 −3 2 M −4
1 2 −1 M 3
0 −1 0 M −4
~ , by R2 → R2 − R3
0 −6 5 M −4
0 −3 2 M −4
1 2 −1 M 3
0 −1 0 M −4
~ , by R3 → R3 − 6 R2 , R4 → R4 − 3 R2
0 0 5 M 20
0 0 2 M 8
1 2 −1 M 3
0 −1 0 M −4 1 1
~ , by R3 → R3 , R4 → R4
0 0 1 M 4 5 2
0 0 1 M 4
1 2 −1 M 3
0 −1 0 M −4
~ , by R4 → R4 − R3 .
0 0 1 M 4
0 0 0 M 0
Thus the matrix [A B] has been reduced to Echelon form. We have rank [A B] = the
number of non-zero rows in this Echelon form = 3. Also
1 2 −1
0 −1 0
A ~ .
0 0 1
0 0 0
We have rank A = 3. Since rank [A B] = rank A, therefore the given equations are
consistent. Since rank A = 3 = the number of unknowns, therefore the given equations
have unique solution. The given equations are equivalent to the equations
x + 2 y − z = 3, − y = − 4, z = 4.
These give z = 4, y = 4, x = − 1.
[Link]
M-116
Example 14: State the conditions under which a system of non-homogeneous equations will have
(i) no solution (ii) a unique solution (iii) infinity of solutions. (Lucknow 2008)
(i) These equations will have no solution if the coefficient matrix A and the
augmented matrix [A B] are not of the same rank.
(ii) These equations will possess a unique solution if the matrices A and [A B] are of
the same rank and the rank is equal to the number of variables. In particular if A is
a square matrix, these equations will possess a unique solution if and only if the
matrix A is non-singular.
(iii) These equations will have infinity of solutions if the matrices A and [A B] are of
the same rank and the rank is less than the number of variables.
x + y + z = 9, 2 x + 5 y + 7 z = 52, 2 x + y − z = 0 .
Solution : The given system of equations is equivalent to the single matrix equation
1 1 1 x 9
AX = 2 5 7 y = 52 = B.
2 1 −1 z 0
1 1 1 M 9
[A B] = 2 5 7 M 52
2 1 −1 M 0
1 1 1 M 9
~ 0 3 5 M 34, by R2 → R2 − 2 R1, R3 → R3 − 2 R1
0 −1 −3 M −18
1 1 1 M 9
~ 0 −1 −3 M −18 , by R2 ←→ R3
0 3 5 M 34
[Link]
M-117
1 1 1 M 9
~ 0 −1 −3 M −18 , by R3 → R3 + 3 R2 .
0 0 −4 M −20
Above is the Echelon form of the matrix [A B]. We have rank [A B] = the number of
non-zero rows in this Echelon form = 3.
1 1 1
A ~ 0 −1 −3 .
0 0 −4
∴ rank A = 3.
Since rank A = rank [A B], therefore the given equations are consistent. Also rank
A = 3 and the number of unknowns is also 3. Hence the given equations will have a
unique solution. To find the solution we see that the given system of equations is
equivalent to the matric equation
1 1 1 x 9
0 −1 −3 y = −18.
0 0 −4 z −20
x + y + z = 9, − y − 3 z = − 18, − 4 z = − 20 .
x + y + z = 6, x + 2 y + 3 z = 10 , x + 2 y + λz = µ
have (i) no solution, (ii) a unique solution, (iii) an infinite number of solutions.
1 1 1 x 6
AX = 1 2 3 y = 10 = B .
1 2 λ z µ
[Link]
M-118
1 1 1 M 6
The augmented matrix [A B] = 1 2 3 M 10
1 2 λ M µ
1 1 1 M 6
~ 0 1 2 M 4 by R2 → R2 − R1, R3 → R3 − R1
0 1 λ −1 M µ − 6
1 1 1 M 6
~ 0 1 2 M 4 by R3 → R3 − R2 .
0 0 λ −3 M µ − 10
So in this case the given system of equations is consistent. Since rank A = the number
of unknowns, therefore the given system of equations possesses a unique solution.
Thus if λ ≠ 3,the given system of equations possesses a unique solution for any value of
µ.
If λ = 3 and µ ≠ 10, we have rank [A B] = 3 and rank A = 2. Thus in this case rank
[A B] ≠ rank A and so the given system of equations is inconsistent i. e., possesses no
solution.
So in this case the given system of equations is again consistent. Since rank A < the
number of unknowns, therefore in this case the given system of equations possesses an
infinite number of solutions.
x + y + z = 1, x + 2 y + 4 z = η, x + 4 y + 10 z = η2 ,
1 1 1 x 1
1 2 4 y = η.
1 4 10 z η2
[Link]
M-119
1 1 1 x 1
0 1 3 y = η − 1 .
0 3 9 z η2 − 1
Performing R3 → R3 − 3 R2 , we have
1 1 1 x 1
0 1 3 y = η−1 …(1)
0 0 0 z η2 − 3 η + 2
1 1 1 x 1
0 1 3 y = 1 .
0 0 z
0 0
y + 3 z = 1, x + y + z = 1.
∴ y = 1 − 3z , x = 2z .
1 1 1 x 1
0 1 3 y = 0 .
0 0 0 z 0
y + 3 z = 0 , x + y + z = 1.
∴ y = − 3z , x = 1 + 2z .
Comprehensive Exercise 2
1. Use the test of rank to show that the following equations are not consistent :
2 x − y + z = 4, 3 x − y + z = 6, 4 x − y + 2 z = 7, − x + y − z = 9.
(Rohilkhand 2009)
A nswers 2
3. x = − 7, y = 22, z = − 9
4. x = 1, y = 2, z = 3
5. x = 1, y = 1, z = 1
1 3 5
6. x= , y = ,z =
2 2 2
7. x = − 1 + 2 c , y = 3 − 2 c , z = c , where c is arbitrary
35 29 5
8. x= , y= ,z =
18 18 18
9. x = c − 2, y = 3 − 2 c , z = c
10. Consistent; x = − 1, y = − 2, z = 4
1
11. Consistent ; x = 2, y = 2, z =
2
7 7
13. λ ≠ , the solution is unique; λ = − , no solution
2 2
14. In case λ = 2, the general solution of the given system of equations is given by
1
x = − c, y = c, z = 0
2
[Link]
M-122
7 7
15. λ ≠ , the solution is unique; λ = − , no solution
10 10
7 16 3 1
16. x = − c, y = + c, z = c
11 11 11 11
17. x = 3, y = 4, z = 6
5 3 3 1
18. x = − c, y = − + c, z = c
2 2 2 2
−8 5 25 13
19. x = + c, y = − + c, z = c
7 7 7 7
(a) k1 + k2 + ......... + k r = 0
(b) k1 = k2 = ......... = k r = 0
(c) k1 + k2 + ......... + k r = 1
(c) no solution
(a) k ≠ 0
[Link]
M-123
(d) k = 0
4. The system of equations x + 2 y + 3 z = 1, 2 x + y + 3 z = 2, 5 x + 5 y + 9 z = 4 has
(c) no solution
4. When the system of equations has one or more solutions, the equations are said
to be ....., otherwise they are said to be .......
7. If the number of equations is less than the number of unknowns, the equations
will aways have an ....... number of solutions, provided they are consistent.
True or False
Write ‘T’ for true and ‘F’ for false statement.
2. If a set of vectors is linearly independent, then at least one member of the set can
be expressed as a linear combination of the remaining members.
[Link]
M-124
3. The zero solution i. e., x = y = z = 0 is the only solution of the system of equations
x + 2 y + 3 z = 0, 3 x + 4 y + 4 z = 0 , 7 x + 10 y + 12 z = 0 .
A nswers
True or False
1. T 2. F 3. T
4. T 5. F
o
[Link]
M- 125
4
Eigenvalues and
Eigenvectors
F (λ ) = A0 + A1λ + A2 λ2 + … + A m − 1λ m − 1 + A m λ m ,
where A0 , A1, A2 , ..., A m are all square matrices of the same order, is called a Matric
polynomial of degree m provided A m is not a null matrix. The symbol λ is called
indeterminate. If the order of each of the matric coefficients A0 , A1, …, A m is n, then we
say that the matric polynomial is n-rowed. According to this definition of a matric
polynomial, each square matrix can be expressed as a matric polynomial with zero
degree. For example, if A be any square matrix, we can write A = λ0 A.
Equality of Polynomials : Two matric polynomials are equal iff (if and only if), the
coefficients of the like powers of λ are the same.
Theorem: Every square matrix whose elements are ordinary polynomials in λ, can essentially be
expressed as a matric polynomial in λ of degree m, where m is the highest power of λ occurring in
any element of the matrix. We shall illustrate this theorem by the following example.
1 + 2 λ + 3 λ2 λ2 4 − 6λ
A = 1 + λ3 3 + 4λ 2
1 − 2λ + 4λ 3
3
2 − 3 λ + 2 λ 5 6
in which the highest power of λ occurring in any element is 3. Rewriting each element
as a cubic in λ, supplying missing coefficients with zeros, we get
1 + 2 . λ + 3 . λ2 + 0 . λ3 0 + 0 . λ + 1 . λ2 + 0 . λ3 4 − 6 . λ + 0 . λ2 + 0 . λ3
A = 1 + 0 . λ + 0 . λ2 + 1 . λ3 3 + 0 . λ + 4. λ2 + 0 . λ3 1 − 2 . λ + 0 . λ2 + 4 . λ3
2 3
2 − 3 . λ + 0 . λ + 2 . λ 5 + 0 . λ + 0 . λ2 + 0 . λ3 6 + 0 . λ + 0 . λ2 + 0 . λ3
1 0 4 2 0 −6 3 1 0 0 0 0
2 3
A = 1 3
1 + λ 0 0 −2 + λ 0 4 0 + λ 1 0 4.
2 5 6 −3 0 0 0 0 0 2 0 0
x1
x
2
A = …
…
xn
AX = λX. ..(1)
It is obvious that the zero vector X = O is a solution of (1) for any value of λ. Now let us
see whether there exist scalars λ and non-zero vectors X which satisfy (1).
If I denotes the unit matrix of order n, then the equation (1) may be written as
AX = λIX
or (A − λI) X = O. ...(2)
[Link]
M- 127
The matrix equation (2) represents the following system of n homogeneous equations
in n unknowns :
The coefficient matrix of the equations (3) is A − λI. The necessary and sufficient
condition for equations (3) to possess a non-zero solution (X ≠ O) is that the
coefficient matrix A − λI should be of rank less than the number of unknowns n. But
this will be so if and only if the matrix A − λI is singular i. e., if and only if| A − λI| = 0.
Thus the scalars λ for which
| A − λI| = 0
Let A = [aij]n × n be any n-rowed square matrix and λ an indeterminate. The matrix
A − λI is called the characteristic matrix of A where I is the unit matrix of order n.
| A − λI| = 0
and the matrix A − λI is singular. Therefore there exists a non-zero vector X such that
(A − λI) X = O or AX = λX.
[Link]
M-128
Proof. Suppose λ is a characteristic root of the matrix A. Then | A − λI| = 0 and the
matrix A − λI is singular. Therefore, the matrix equation (A − λI) X = O possesses a
non-zero solution i. e., there exists a non-zero vector X such that (A − λI)X = O or
AX = λX.
λ1 X = λ 2 X
⇒ (λ1 − λ 2 ) X = O
[Link]
M- 129
⇒ λ1 − λ 2 = 0 [∵ X ≠ O]
⇒ λ1 = λ 2 .
AX = λX. ...(1)
X θ AX = λX θ X. ...(2)
or X θ A θ (X θ )θ = λX θ (X θ )θ
or X θ AX = λX θ X ...(3)
λX θ X = λX θ X
or (λ − λ )X θ X = O.
Corollary 1. The characteristic roots of a real symmetric matrix are all real.
[Link]
M-130
If the elements of a Hermitian matrix A are all real, then A is a real symmetric matrix.
Thus a real symmetric matrix is Hermitian and therefore the result follows.
Corollary 2. The characteristic roots of a skew-Hermitian matrix are either pure imaginary or
zero. (Lucknow 2010)
AX = λX
Corollary 3. The characteristic roots of a real skew-symmetric matrix are either pure imaginary
or zero, for every such matrix is skew-Hermitian.
AX = λX. ...(1)
(AX) θ = (λX)θ
or X θ A θ = λX θ . ...(2)
( X θ A θ )(AX) = λλX θ X
or X θ (A θ A) X = λλX θ X [∵ A θ A = I]
or X θIX = λλX θ X
or X θ X = λλX θ X
or X θ X(λλ − 1) = O. ...(3)
λλ − 1 = 0
[Link]
M- 131
or λ λ =1
or | λ |2 = 1.
We know that if the elements of a unitary matrix A are all real, then A is said to be an
orthogonal matrix. Hence the result follows.
or (A − λ1I) X = O.
0 1 2
A = 1 0 −1.
2 −1 0 (Meerut 2010)
= A − λI
0 1 2 1 0 0
= 1 0 −1 − λ 0 1 0
2 −1 0 0 0 1
[Link]
M-132
0 − λ 1 2
= 1 0−λ −1 .
2 −1 0 − λ
= | A − λI|
− λ 1 2
= 1 −λ −1
2 −1 −λ
= − λ3 + λ + λ − 2 − 2 + 4 λ
= − λ3 + 6 λ − 4.
| A − λI| = 0
i. e., λ3 − 6 λ + 4 = 0
i. e., (λ − 2)(λ2 + 2 λ − 2) = 0 .
a h g
A = 0 b 0 .
0 c c (Meerut 2010B; Kanpur 09)
a − λ h g
Solution: Here | A − λI| = 0 b−λ 0
0 c c − λ
= (a − λ )(b − λ ) (c − λ ) .
[Link]
M- 133
| A − λI| = 0
5 4
A= .
1 2 (Bundelkhand 2006)
| A − λI| = 0
5 − λ 4
i. e., = 0
1 2 − λ
i. e., (5 − λ ) (2 − λ ) − 4 = 0
i. e., λ2 − 7 λ + 6 = 0 .
x1
The eigenvectors X = of A corresponding to the eigenvalue 6 are given by the
x2
non-zero solutions of the equation
(A − 6 I) X = O
5 − 6 4 x1 0
or 1 =
2 − 6 x2 0
−1 4 x1 0
or =
1 −4 x2 0
−1 4 x1 0
or 0 0 x = 0 , applying R2 → R2 + R1.
2
The coefficient matrix of these equations is of rank 1. Therefore these equations have
2 − 1 i. e., 1 linearly independent solution. These equations reduce to the single
equation − x1 + 4 x2 = 0. Obviously x1 = 4, x2 = 1 is a solution of this equation.
4
Therefore X1 = is an eigenvector of A corresponding to the eigenvalue 6. The set of
1
[Link]
M-134
(A − 1I) X = O
4 4 x1 0
or 1 1 x = 0
2
or 4 x1 + 4 x2 = 0 , x1 + x2 = 0 .
1
From these x1 = − x2 . Let us take x1 = 1, x2 = − 1. Then X2 = is an eigenvector of A
−1
corresponding to the eigenvalue 1. Every non-zero multiple of the vector X2 is an
eigenvector of A corresponding to the eigenvalue 1.
Example 4: Determine the characteristic roots and the corresponding characteristic vectors of the
matrix
8 −6 2
A = −6 7 −4.
2 −4 3
(Purvanchal 2010; Bundelkhand 08; Rohilkhand 05; Agra 07; Kanpur 09; Avadh 05)
| A − λI| = 0
8 − λ −6 2
i. e., −6 7−λ −4 = 0
2 −4 3 − λ
or λ3 − 18 λ2 + 45 λ = 0
or λ (λ − 3) (λ − 15) = 0.
(A − 0I)X = O
[Link]
M- 135
8 −6 2 x1 0
or −6 7 −4 x = 0
2
2 −4 3 x3 0
2 −4 3 x1 0
or −6 7 −4 x = 0 , by R ←→ R
2 1 3
8 −6 2 x3 0
2 −4 3 x1 0
or 0 −5 5 x = 0 , by R → R + 3 R , R → R − 4 R
2 2 2 1 3 3 1
0 10 −10 x3 0
2 −4 3 x1 0
or 0 −5 5 x = 0 , by R → R + 2 R .
2 3 3 2
0 0 0 x3 0
The coefficient matrix of these equations is of rank 2. Therefore these equations have
3 − 2 = 1 linearly independent solution. Thus there is only one linearly independent
eigenvector corresponding to the eigenvalue 0. These equations can be written as
2 x1 − 4 x2 + 3 x3 = 0 , − 5 x2 + 5 x3 = 0 .
From the last equation, we get x2 = x3 . Let us take x2 = 1, x3 = 1. Then the first equation
1 1
gives x1 = . Therefore X1 = 1 1 ′ is an eigenvector of A corresponding to the
2 2
eigenvalue 0. If c1 is any non-zero scalar, then c1X1 is also an eigenvector of A
corresponding to the eigenvalue 0.
(A − 3 I)X = O
5 −6 2 x1 0
or −6 4 −4 x = 0
2
2 −4 0 x3 0
−1 −2 −2 x1 0
or −6 4 −4 x = 0 , by R → R + R
2 1 1 2
2 −4 0 x3 0
[Link]
M-136
−1 −2 −2 x1 0
or 0 16 8 x2 = 0 , by R2 → R2 − 6 R1, R3 → R3 + 2 R1
0 −8 −4 x3 0
−1 −2 −2 x1 0
1
or 0 16 8 x2 = 0 , by R3 → R3 + R2 .
0 2
0 0 x3 0
The coefficient matrix of these equations is of rank 2. Therefore these equations have
3 − 2 = 1 linearly independent solution. These equations can be written as
− x1 − 2 x2 − 2 x3 = 0 , 16 x2 + 8 x3 = 0 .
1
From the second equation we get x2 = − x3 . Let us take x3 = 4, x2 = − 2. Then the first
2
equation gives x1 = − 4. Therefore X2 = [−4 − 2 4]′
(A − 15 I) X = O
8 − 15 −6 2 x1 0
or −6 7 − 15 −4 x = 0
2
2 −4 3 − 15 x3 0
−7 −6 2 x1 0
or −6 −8 −4 x = 0
2
2 −4 −12 x3 0
−1 2 6 x1 0
or −6 −8 −4 x = 0 , by R → R − R
2 1 1 2
2 −4 −12 x3 0
−1 2 6 x1 0
or 0 −20 −40 x = 0 , by R → R − 6 R , R → R + 2 R .
2 2 2 1 3 3 1
0 0 0 x3 0
The coefficient matrix of these equations is of rank 2. Therefore these equations have
3 − 2 = 1 linearly independent solution. These equations can be written as
[Link]
M- 137
− x1 + 2 x2 + 6 x3 = 0 , − 20 x2 − 40 x3 = 0 .
The last equation gives x2 = − 2 x3 . Let us take x3 = 1, x2 = − 2. Then the first equation
gives x1 = 2. Therefore
X3 = [2 − 2 1]′
Example 5: Determine the characteristic roots and the corresponding characteristic vectors of the
matrix
6 −2 2
A = −2 3 −1.
2 −1 3 (Meerut 2006B, 09; Purvanchal 07)
|A − λI| = 0
6 − λ −2 2
or −2 3−λ −1 = 0
2 −1 3 − λ
6 − λ −2 0
or −2 3−λ 2 − λ = 0 , by C3 → C3 + C2
2 −1 2 − λ
6 − λ −2 0
or (2 − λ ) −2 3−λ 1 = 0
2 −1 1
6 − λ −2 0
or (2 − λ ) −4 4−λ 0 = 0 , by R2 → R2 − R3
2 −1 1
or (2 − λ ) [(6 − λ ) (4 − λ ) − 8] = 0
or (2 − λ ) (λ2 − 10 λ + 16) = 0
or (2 − λ ) (λ − 2) (λ − 8) = 0 .
(A − 8 I) X = O
6 − 8 −2 2 x1 0
or −2 3 − 8 −1 x = 0
2
2 −1 3 − 8 x3 0
−2 −2 2 x1 0
or −2 −5 −1 x = 0
2
2 −1 −5 x3 0
−2 −2 2 x1 0
or 0 −3 −3 x = 0 , by R → R − R , R → R + R
2 2 2 1 3 3 1
0 −3 −3 x3 0
−2 −2 2 x1 0
or 0 −3 −3 x = 0 , by R → R − R .
2 3 3 2
0 0 0 x3 0
−2 x1 − 2 x2 + 2 x3 = 0 ,
−3 x2 − 3 x3 = 0 .
The last equation gives x2 = − x3 . Let us take x3 = 1, x2 = − 1. Then the first equation
2
gives x1 = 2. Therefore X1 = −1 is an eigenvector of A corresponding to the eigenvalue
1
8. Every non-zero multiple of X1 is an eigenvector of A corresponding to the
eigenvalue 8.
(A − 2 I) X = O
4 −2 2 x1 0
or −2 1 −1 x = 0
2
2 −1 1 x3 0
[Link]
M- 139
−2 1 −1 x1 0
or 4 −2 2 x = 0 , by R ←→ R
2 1 2
2 −1 1 x3 0
−2 1 −1 x1 0
or 0 0 0 x = 0 , by R → R + 2 R , R → R + R .
2 2 2 1 3 3 1
0 0 0 x3 0
−2 x1 + x2 − x3 = 0 .
−1 1
Obviously X2 = 0 , X3 = 2
2 0
are two linearly independent solutions of this equation. Therefore X2 and X3 are two
linearly independent eigenvectors of A corresponding to the eigenvalue 2. If c1, c2 are
scalars not both equal to zero, then c1X2 + c2 X3 gives all the eigenvectors of A
corresponding to the eigenvalue 2.
Example 6: Show that 0 is a characteristic root of a matrix if and only if the matrix is singular.
| A − λI| = 0 ⇒| A| = 0 ⇒ A is singular.
Conversely, A is singular ⇒| A| = 0
Example 7: If λ1, ........., λ n are the eigenvalues of A, then show that kλ1, ..., kλ n are the
eigenvalues of kA.
Solution: If k = 0, then kA = O and each eigenvalue of O is 0. Thus 0 λ1, ...., 0 λ n are the
eigenvalues of kA if λ1, ..., λ n are the eigen-values of A.
Thus kλ1, ...., kλ n are the eigenvalues of kA if λ1, ..., λ n are eigenvalues of A.
Example 8: If A is non-singular, prove that the eigenvalues of A −1 are the reciprocals of the
eigenvalues of A.
AX = λX
⇒ X = A −1 (λX) = λ (A −1X)
1
⇒ X = A − 1X [∵ A is non-singular ⇒ λ ≠ 0]
λ
1
A − 1X = X
λ
1
⇒ is an eigenvalue of A −1 and X is a corresponding eigenvector.
λ
Hence the eigenvalues of A −1 are nothing but the reciprocals of the eigenvalues of A.
Example 9: If the characteristic roots of A are λ1, λ 2 ,… λ n , then the characteristic roots of
A2 are λ12, λ 22 , …, λ n2 . (Kumaun 2008)
Solution : Let λ be a characteristic root of the matrix A. Then there exists a non-zero
vector X such that
AX = λX
⇒ A (AX) = A (λX)
⇒ A2 X = λ (AX)
⇒ A 2 X = λ (λX) [∵ AX = λ X]
⇒ A2 X = λ2 X. …(1)
[Link]
M- 141
Since X is a non-zero vector, therefore from the relation (1) it is obvious that λ2 is a
characteristic root of the matrix A2 . Therefore if λ1, …, λ n are the characteristic roots
2 2
of A, then λ1 , … , λ n are the characteristic roots of A2 .
Example 10: The characteristic roots of an idempotent matrix are either zero or unity.
AX = λX …(1)
⇒ A (AX) = A (λX)
⇒ A2 X = λ (AX)
⇒ AX = λ2 X. ...(2)
or (λ2 − λ ) X = O
or λ2 − λ = 0 [∵ X ≠ O]
or λ (λ − 1) = 0 .
∴ λ = 0 or λ = 1.
Hence the characteristic roots of an idempotent matrix are either zero or unity.
Example 11: The product of the characteristic roots of a square matrix of order n is equal to the
determinant of the matrix.
If λ1, λ 2 , …, λ n are the characteristic roots of A, then λ1, λ 2 , …, λ n are the roots of the
equation | A − λI| = 0.
Hence λ1 λ 2 … λ n = | A|.
Example 12: Any two characteristic vectors corresponding to two distinct characteristic roots of a:
(i) Hermitian, (ii) Real symmetric, (iii) Unitary matrix are orthogonal.
We have A θ = A.
θ θ θ θ
or λ1 X1 (X2 )θ = λ 2 X1 X2 ; [λ1 is real ⇒ λ1 = λ1]
θ θ
or λ1X1 X2 = λ 2 X 1 X2
θ
or (λ1 − λ 2 ) X1 X2 = O.
θ
Since λ1 − λ 2 ≠ 0 , we have X1 X2 = O.
(ii) A is real symmetric. The real symmetric matrix is always Hermitian, so the result
follows at once from (i).
where λ1, λ 2 are characteristic roots of a unitary matrix which must be uni-modular
i. e., λ1λ1 = 1, λ 2 λ 2 = 1.
[Link]
M- 143
θ θ
Now, AX2 = λ 2 X2 ⇒ (AX2 )θ = (λ 2 X2 )θ ⇒ X2 A θ = λ 2 A2
θ θ θ
⇒ X2 A θ AX1 = λ 2 X2 λ1 X1 = λ 2 λ1 X2 X1
θ θ
X2 X1 = λ 2 λ1 X2 X1
θ
⇒ (1 − λ 2 λ1) X2 X1 = O.
But λ 2 λ1 ≠ 1 ; so 1 − λ 2 λ1 ≠ 0 .
θ
This implies that X2 X1 = O ⇒ X1 and X2 are orthogonal.
Example 13: Show that the two matrices A, C−1AC have the same characteristic roots.
Then B − λI = C−1AC − λI
= C−1(A − λI) C.
= | A − λI|| I| = | A − λI|.
Thus the two matrices A and B have the same characteristic determinants and hence
the same characteristic equations and the same characteristic roots.
(Meerut 2000, 03, 05B, 06B, 09B, 10B; Rohilkhand 05, 08; Agra 07; Avadh 05;
Purvanchal 08; Lucknow 08, 09)
Proof. Since the elements of A − λI are at most of the first degree in λ, the elements of
Adj (A − λI) are ordinary polynomials in λ of degree n − 1 or less. Therefore Adj
(A − λI) can be written as a matrix polynomial in λ, given by
where B0 , B1, ..., B n − 1 are matrices of the type n × n whose elements are functions of
aij’s.
− IB0 = (−1)n I,
...........................
...........................
AB n − 1 = (−1)n anI.
A n − 1 + a1A n − 2 + a2 A n − 3 + … + an − 1 I + anA −1 = O
Corollary. 2. If m be a positive integer such that m ≥ n, then multiplying the result (1)
by A m − n, we get
A m + a1A m − 1 + … + anA m − n = O,
2 −1 1
Example 14: Find the characteristic equation of the matrix A = −1 2 −1 and verify that
1 −1 2
it is satisfied by A and hence obtain A −1. (Meerut 2002, 05; Bundelkhand 05, 08, 10, 11;
Kanpur 10; Lucknow 05,11)
2 − λ −1 1
Solution : We have | A − λI| = −1 2 − λ −1
1 −1 2 − λ
= (2 − λ ) (3 − 4 λ + λ2 ) + (λ − 1) + (λ − 1)
= − λ3 + 6 λ2 − 9 λ + 4.
We have
1 0 0 2 −1 1 6 −5 5
2
I = 0 1 0 , A = −1 2 −1, A = A × A = −5 6 −5,
0 0 1 1 −1 2 5 −5 6
[Link]
M-146
22 −21 21
3 2
A = A A = −21 22 −21.
21 −21 22
22 −21 21 6 −5 5
= −21 22 −21 − 6 −5 6 −5
21 −21 22 6
5 −5
2 −1 1 1 0 0
+ 9 −1 2 −1 − 4 0 1 0
1 −1 2 0 0 1
0 0 0
= 0 0 0 .
0 0 0
1 2
∴ A −1 = (A − 6 A + 9 I)
4
6 −5 5 −12 6 −6 9 0 0
2
Now A − 6 A + 9 I = −5 6 −5 + 6 −12 6 + 0 9 0
5 −5 6 −6 6 −12 0 0 9
3 1 −1
= 1 3 1 .
−1 1 3
3 1 −1
−1 1
∴ A = 1 3 1.
4
−1 1 3
1 0 2
Example 15: Obtain the characteristic equation of the matrix A = 0 2 1 and verify that
2 0 3
it is satisfied by A and hence find its inverse. (Meerut 2001, 10B; Bundelkhand 09, 11;
Kanpur 07; Purvanchal 10; Lucknow 07, 10)
[Link]
M- 147
Solution : We have
1 − λ 0 2
| A − λI| = 0 2−λ 1
2 0 3 − λ
= (1 − λ )(2 − λ ) (3 − λ ) + 2 [0 − 2 (2 − λ )]
= (2 − λ ) [(1 − λ ) (3 − λ ) − 4]
= (2 − λ ) [λ2 − 4 λ − 1]
= − (λ3 − 6 λ2 + 7 λ + 2).
λ3 − 6 λ2 + 7 λ + 2 = 0 . …(1)
A3 − 6 A2 + 7 A + 2 I = O. ...(2)
1 0 2 1 0 2 5 0 8
2
A = 0 2 1 × 0 2 1 = 2 4 5 .
2 0 3 2 0 3 8 0 13
1 0 2 5 0 8 21 0 34
3 2
Also A = A . A = 0 2 1 × 2 4 5 = 12 8 23.
2 0 3 8 0 13 34 0 55
Now A 3 − 6 A 2 + 7 A + 2I
21 0 34 5 0 8 1 0 2 1 0 0
= 12 8 23 − 6 2 4 5 + 7 0 2 1 + 2 0 1 0
34 0 55 8 0 13 2 0 3 0 0 1
−3 0 2
−1 1 2 1 1
∴ A = − (A − 6 A + 2I ) = −1 .
2 2 2 2
0 −1
0 c − b
Example 16: Show that the matrix A = − c 0 a satisfies Cayley-Hamiltion theorem.
b −a 0
(Meerut 2006B)
0 − λ c −b
Solution : We have | A − λI | = − c 0 −λ a
b −a 0 − λ
We have
0 c − b 0 c − b
2
A = − c 0 a × − c 0 a
b −a 0 b −a 0
− c 2 − b2 ab ac
= ab − c 2 − a2 bc .
ac bc − b2 − a2
− c 2 − b2 ab ac 0 c − b
2 2
∴ A 3 = A 2A = ab −c − a bc × − c 0 a
ac bc − b2 − a2 b −a 0
[Link]
M- 149
0 − c 3 − b2 c − a2 c bc 2 + b3 + a2 b
3
= c + a c + b2 c
2
0 − ab2 − ac 2 − a3
2 3 2
− bc − b − a b ac + ab2 + a3
2
0
0 c − b
2 2 2
= − (a + b + c )− c 0 a = − (a2 + b2 + c 2 )A.
b −a 0
1 2 0
Example 17: Find the characteristic equation of the matrix A = 2 −1 0 and hence find
0 0 −1
−1
A . (Rohilkhand 2009, 10)
| A − λI| = 0
1 − λ 2 0
or 2 −1 − λ 0 =0
0 0 −1 − λ
or (1 − λ ) (1 + λ )2 + 4 (1 + λ ) = 0
or (1 − λ )(1 + 2 λ + λ2 ) + 4 + 4 λ = 0
or 1 + 2 λ + λ2 − λ − 2 λ2 − λ3 + 4 + 4 λ = 0
or − λ3 − λ2 + 5 λ + 5 = 0
or λ3 + λ2 − 5 λ − 5 = 0 . ...(1)
A 3 + A2 − 5A − 5I = O
[Link]
M-150
or 5I = A3 + A2 − 5 A ...(2)
or 5 A −1 = A2 + A − 5 I
1 2
or A −1 = (A + A − 5 I ) …(3)
5
1 2 0 1 2 0 5 0 0
2
Now A = 2 −1 0 2 −1 0 = 0 5 0 .
0 0 −1 0 0 −1 0 0 1
5 0 0 1 2 0 5 0 0
2
∴ A + A − 5 I = 0 5 0 + 2 −1 0 − 0 5 0
0 0 1 0 0 −1 0 0 5
1 2 0
= 2 −1 0 .
0 0 −5
1 2 0
−1 1
Hence from (3), A = 2 −1 0 .
5
0 0 −5
3−λ 1
| A − λI| =
−1 2−λ
= (3 − λ ) (2 − λ ) + 1 = λ2 − 5 λ + 7.
[Link]
M- 151
λ2 − 5 λ + 7 = 0 ...(1)
A2 − 5 A + 7 I = O. …(2)
A2 = 5 A − 7 I … (3)
A3 = 5 A2 − 7 A …(4)
∴ A4 = 5 A3 − 7 A2 …(5)
and A5 = 5 A4 − 7 A3 …(6)
5
Now 2A − 3 A4 + A 2 − 4 I = 2 (5 A4 − 7 A3 ) − 3 A4 + A2 − 4 I
= 7 A4 − 14 A3 + A2 − 4 I = 7 (5 A3 − 7 A2 ) − 14 A3 + A2 − 4 I
[by (5)]
B = P–1AP.
Theorem 3: Similar matrices have the same characteristic polynomial and hence the same
eigenvalues. If X is an eigenvector of A corresponding to the eigenvalue λ , then P–1X is an
eigenvector of B corresponding to the eigenvalue λ, where
B = P–1AP.
Corollary. If A is similar to a diagonal matrix D , the diagonal elements of D are the eigenvalues
of A.
P−1AP = D
i. e., AP = PD
= [λ1X1, λ 2 X2 ,…, λ nX n]
Since the column vectors X1 , X2 ,…, X n of the matrix P are linearly independent,
therefore X is invertible and X −1 exists.
⇒ P−1AP = D
⇒ A is diagonalizable.
Remark: In the proof of the above theorem we have shown that if A is diagonalizable
and P diagonalizes A, then
λ1 0 … 0
0 λ2 … 0
P−1AP = =D
… … … …
0 0 … λ n
Theorem 2: If the eigenvalues of an n × n matrix are all distinct then it is always similar to a
diagonal matrix.
Proof: Let A be a square matrix of order n and suppose it has n distinct eigenvalues
λ1, λ 2 ,.…, λ n. We know that eigenvectors of a matrix corresponding to distinct
eigenvalues are linearly independent. Therefore A has n linearly independent
eigenvectors and so it is similar to a diagonal matrix
Corollary: Two n × n matrices with the same set of n distinct eigenvalues are similar.
Proof: Suppose A and B are two n × n matrices with the same set of n distinct
eigenvalues λ1, λ 2 ,…, λ n. Let D = dia. [λ1, λ 2 ,…, λ n]. Then both A and B are similar to
D. Now A is similar to D and D is similar to B implies that A is similar to B.
Theorem 3: The necessary and sufficient condition for a square matrix to be similar to a
diagonal matrix is that the geometric multiplicity of each of its eigenvalues coincides with the
algebraic multiplicity.
P−1AP = D.
(A − αI) X = O
r1 + … + rp = n.
X1 + X2 + … + X p = O, ...(3)
where X1, X2 ,…, X p denote the vectors written within brackets in (2) i. e., X1
= a11C11 + … + a1r C1r , and so on.
1 1
In case some one of X1,…, X p is not zero, then the relation (3) implies that a system of
eigenvectors of A corresponding to distinct eigenvalues of A is linearly dependent. But
this is not possible. Hence each of the vectors X1, X2…, X p must be zero.
Since C11, C12 ,…, C1r is a set of linearly independent vectors, therefore
1
O = X1 = a11 C11 + … + a1r C1r implies that
1 1
Similarly we can show that each of the scalars in relation (2) is zero. Therefore the n
vectors give in (1) are linearly independent. Thus A has n linearly independent
eigenvectors. So it is similar to a diagonal matrix.
Example 19: Show that the rank of every matrix similar to A is the same as that of A.
Solution: Let B be a matrix similar to A. Then there exists a non-singular matrix P such
that B = P−1AP. We know that the rank of a matrix does not change on multiplication
by a non-singular matrix.
[Link]
M-156
Therefore
Example 20: If U be a unitary matrix such that Uθ AU = diag [λ1,…, λ n], show that λ1,…, λ n
are the eigenvalues of A.
Uθ AU = D ⇒ U−1AU = D.
Thus A is similar to the diagonal matrix D. But similar matrices have the same
eigenvalues and eigenvalues of D are its diagonal elements. Therefore λ1,…, λ n are the
eigenvalues of A.
P−1 AP = D
⇒ A = P D P−1
⇒ AT = (P D P−1)T = (P−1)T DT PT
⇒ AT is similar to D
⇒ D is similar to AT .
Example 22: Show that a non-zero matrix is nilpotent if and only if all its eigenvalues are equal
to zero.
⇒ characteristic equation of A is λ n = 0
⇒ A is nilpotent.
1 2 −2
A= 1 2 1.
−1 −1 0
1− λ 2 −2
1 2−λ 1 =0
−1 −1 0−λ
or (1 − λ )(−2 λ + λ2 + 1) − 2 (− λ + 1) − 2(− 1 + 2 − λ ) = 0
or (− λ + 1)(λ − 1)2 + (4 λ − 4) = 0
or (− λ + 1)(λ + 1)(λ − 3) = 0
(A − 1 I) X = O or (A − I) X = O
[Link]
M-158
0 2 −2 x 0
i. e., 1 1 1 y = 0
−1 −1 −1 z 0
0 2 −2 x 0
or 1 1 1 y = 0 , by R3 → R3 + R1
0 0 0 z 0
0 x + 2 y − 2z = 0 , x + y + z = 0
−2
Therefore X1 = 1 is an eigenvector of A corresponding to the eigenvalue 1.
1
2 2 −2 x 0
i. e., 1 3 1 y = 0
−1 −1 1 z 0
2 2 −2 x 0
or 1 3 1 y = 0 , by R3 → 2 R3 + R1
0 0 0 z 0
2 x + 2 y − 2z = 0 , x + 3 y + z = 0
−2
Let us take z = − 1, then y = 1and x = − 2. Therefore X2 = 1 is an eigenvector of A
−1
−2 2 −2 x 0
i. e., 1 −1 1 y = 0
−1 −1 −3 z 0
−2 2 −2 x 0
or −1 −1 −3 y = 0 , R2 ↔ R3
1 −1 1 z 0
Similarly, we have x = 2, y = 1, z = − 1.
2
Therefore X3 = 1 is an eigenvector of A
−1
−2 −2 2
Let modal matrix P = [X1, X2 , X3 ] = 1 1 1
1 −1 1
0 −4 −4
−1 Adj P 1
∴ P = =− 2 0 4 .
|P | 8
−2 −4 0
The matrix P will transform A to diagonal form D which is given by the relation
1 0 0
−1
P A P = 0 −1 0 = D
0 0 3
8 −8 −2
A = 4 −3 −2
3 −4 1
8−λ −8 −2
4 −3 − λ −2 =0
3 −4 1− λ
1− λ −1 + λ −1 + λ
or 4 −3 − λ −2 = 0 , applying R1 − ( R2 + R3 )
3 −4 1− λ
1 −1 −1
or (1 − λ ) 4 −3 − λ −2 =0
3 −4 1− λ
1 0 0
or (1 − λ ) 4 1− λ 2 = 0 , applying C2 + C1, C3 + C1
3 −1 4−λ
or (1 − λ )[(1 − λ )(4 − λ ) + 2] = 0
or (1 − λ )(λ2 − 5 λ + 6) = 0
or (1 − λ )(λ − 2)(λ − 3) = 0
Since the eigenvalues of the matrix A are all distinct, therefore A is similar to a diagonal
matrix. Since the algebraic multiplicity of each eigenvalue of A is 1, therefore there will
be one and only one linearly independent eigenvector of A corresponding to each
eigenvalue of A.
7 −8 −2 x1 0
or 4 −4 −2 x2 = 0
3 −4 0 x3 0
7 −8 −2 x1 0
or −3 4 0 x2 = 0 , by R2 → R2 − R1
3 −4 0 x3 0
[Link]
M- 161
7 −8 −2 x1 0
or −3 4 0 x2 = 0 , by R3 → R3 + R2.
0 0 0 x3 0
The matrix of coefficients of these equations has rank 2. Therefore these equations
have only one linearly independent solution as it should have been because the
algebraic multiplicity of the eigenvalue 1 is 1. Note that the geometric multiplicity
cannot exceed the algebraic multiplicity. The above equations can be written as
7 x1 − 8 x2 − 2 x3 = 0 , − 3 x1 + 4 x2 = 0 . From the last equation, we get x1 = 4, x2 = 3.
4
Then the first gives x3 = 2. Therefore X1 = 3 is an eigenvector of A corresponding to
2
the eigenvalue 1.
6 −8 −2 x1 0
or 4 −5 −2 x2 = 0
3 −4 −1 x3 0
6 −8 −2 x1 0
or −2 3 0 x2 = 0 ,
0 0 0 x3 0
1
applying R2 → R3 − R1, R3 → R3 − R1.
2
3
Therefore X2 = 2 is an eigenvector of A corresponding to the eigenvalue 2.
1
5 −8 −2 x1 0
or 4 −6 −2 x2 = 0
3 −4 −2 x3 0
[Link]
M-162
5 −8 −2 x1 0
or −1 2 0 x2 = 0 ,
0 0 0 x3 0
applying R2 → R2 − R1, R3 → R3 + R1 − 2 R2 .
5 x1 − 8 x2 − 2 x3 = 0 , − x1 + 2 x2 = 0 .
2
∴ X3 = 1 is an eigenvector of A corresponding to the eigenvalue 3.
1
4 3 2
Let P = [X1 X2 X3 ] = 3 2 1.
2 1 1
1 0 0
−1
P A P = 0 2 0 = D.
0 0 3
Comprehensive Exercise 1
(Meerut 2009 B)
[Link]
M- 163
a c b
2. (i) If a + b + c = 0, find the characteristic roots of the matrix A = b b a.
c a c
(Kumaun 2008; Kanpur 07, 08)
(ii) Prove that the matrices :
0 a b 0 b a
A = a 0 c and B = b 0 c have
b c 0 a c 0
same characteristic equation.
(Rohilkhand 2007; Purvanchal 06; Agra 05)
1 1
3. (i) Verify Cayley-Hamilton theorem for matrix A, where A = .
1 1
(Rohilkhand 2011)
(ii) Verify that the matrix A satisfies its characteristic equation, where
1 2
A= .
−1 3
2 2 1
8. Show that the matrix A = 1 3 1 satisfies Calyley-Hamilton theorem.
1 2 2
Also determine the characteristic roots (i. e., latent roots) and the corresponding
characteristic vectors of the matrix A. (Meerut 2007)
0 0 1
9. Verify Cayley-Hamilton theorem for the matrix A = 3 1 0 .
−2 1 4
equation. Is it true of every square matrix? State the theorem that applies here.
12. Verify Cayley-Hamilton theorem for the following matrix :
1 0 2
A = 0 −1 1 .
0 1 0 (Avadh 2008)
13. Find the characteristic roots and the characteristic spaces of the matrix
1 2 3
0 2 3 .
0 0 2 (Meerut 2005; Rohilkhand 07)
14. Show that the characteristic roots of a triangular matrix are just the diagonal
elements of the matrix.
15. Let A and B be n-rowed square matrices and let A be non-singular. Show that
the matrices A −1B and BA −1 have the same eigenvalues.
16. If A and B are non-singular matrices of order n, show that the matrices AB and
BA are similar.
17. A and B are two n × n matrices with the same set of n distinct eigenvalues. Show
that there exist two matrices P and Q (one of them non-singular) such that
A = PQ, B = QP.
18. Prove that a non-zero nilpotent matrix cannot be similar to a diogonal matrix.
4 1
19. Find a matrix P which diagonalizes the matrix A = . Verify that
2 3
P−1 AP = D, where D is the diagonal matrix.
[Link]
M- 165
1 −1 2
20. Reduce the matrix A = 0 2 −1 to diagonal form.
0 0 3
A nswers 1
1. (ii) 1, –4, 7
1 /2
3
2. (i) λ = 0 , ± (a2 + b2 + c 2 )
2
3. (ii) −4 A + 5 I.
1 −3 4
4. 5, − 1;
5 2 −1
5. Characteristic equation is λ3 − 3 λ2 − λ + 9 = 0
0 3 3
1
A −1
= 3 2 −1
9
3 −7 −1
10. λ3 − 4 λ2 − 13 λ − 40 = 0 ;
−4 11 −5
1
A −1 = −4 1 25
40
8 −2 −10
19. 2,5. Corresponding to the characteristic root 2 a characteristic vector is [1, − 2]′.
Corresponding to the characteristic root 5 a characteristic vector is [1 , 1]′ .
1 1
P=
−2 1
20. 1, 2, 3 corresponding to the characteristic root 1 a characteristic vector is [1, 0 , 0 ]′ .
Corresponding to the characteristic root 2 a characteristic vector is [1, − 1, 0 ]′.
Corresponding to the characteristic root 3 a characteristic vector is [3, − 2, 2]′.
(a) 1 / λ (b) λ
(c) λ2 (d) 1 / λ2
(Bundelkhand 2001)
a h g
2. The eigenvalues of the matrix A = 0 b 0 are
0 c c
(a) a, h, g (b) a, h, c
(c) a, g, c (d) a, b, c
(Kanpur 2009, 11)
3. If λ is a characteristic root of the matrix A, then a characteristic root of the matrix
A + kI is
(a) λ (b) k + λ
3 1
4. The characteristic equation of A = is
−1 2
(a) λ2 − 5 λ + 7 = 0 (b) λ2 − 3 λ + 7 = 0
6. If λ1, ........., λ n are the eigenvalues of A, then kλ1, ........., kλ n are the
eigenvalues of ......... (Lucknow 2011)
7. If the characteristic roots of A are λ1, λ 2 , ........., λ n, then the characteristic roots
2 2 2
of ......... are λ1 , λ 2 , ........., λ n .
1 0 2
9. The characteristic equation of the matrix A = 0 −1 1 is ......... .
0 1 0
True or False
Write ‘T’ for true and ‘F’ for false statement.
1. The characteristic roots of a real symmetric matrix are all real.
2. The characteristic roots of a real skew-symmetric matrix are all pure imaginary.
3. The characteristic roots of a unitary matrix are of unit modulus.
4. The characteristic roots of a diagonal matrix are just the diagonal elements of the
matrix.
5. Two matrices A and C−1AC do not have the same characteristic roots.
A nswers
4. same 5. kA 7. A2
8. satisfies 9. λ3 − 2 λ + 1 = 0
True or False
1. T 2. F 3. T
4. T 5. F 6. T
7. F
o
[Link]
M- 169
5
Quadratic Forms
n n
∑ ∑ aij xi x j,
i =1 j =1
where aij's are all real numbers, is called a real quadratic form in the n variables x1, x2 ,…, xn.
For example
(i) 2 x2 + 7 xy + 5 y2 is a real quadratic form in the two variables x and y.
2 2 2 2
(iii) x1 − 2 x2 + 4 x3 − 4 x4 − 2 x1 x2 + 3 x1 x4 + 4 x2 x3 − 5 x3 x4 is a real quadratic form
in the four variables x1, x2 , x3 and x4 .
Theorem : Every Quadratic form over a field F in n variables x1, x2 ,…, xn can be expressed in
the form X ′ BX where
n n
Proof : Let ∑ ∑ aij xi x j, …(1)
i =1 j =1
be a quadratic form over the field F in the n variables x1, x2 ,……, xn.
In (1) it is assumed that xi x j = x j xi. Then the total coefficient of xi x j in (1) is aij + a ji. Let
us assign half of this coefficient to xij and half to x ji. Thus we define another set of
1
scalars bij, such that bii = aii and bij = b ji = (aij + a ji), i ≠ [Link] we have
2
n n n n
∑ ∑ aij xi x j = ∑ ∑ bij xi x j.
i =1 j =1 i =1 j =1
Let B = [bij]n × [Link] B is a symmetric matrix of order n over the field F since bij = b ji.
x1
x2
Let X = …. Then XT or X′ = [ x1 x2 …… xn].
…
xn
Now XT BX is a matrix of the type 1 × 1. It can be easily seen that the single element of
n n
this matrix is ∑ ∑ bij xi x j. If we identify a1 × 1matrix with its single element i. e., if we
i =1 j =1
n n n n
XT BX = ∑ ∑ bij xi x j = ∑ ∑ aij xi x j.
i =1 j =1 i =1 j =1
Since every quadratic form can always be so written that matrix of its coefficients is a
symmetric matrix, therefore we shall be considering quadratic forms which are so
adjusted that the coefficient matrix is symmetric.
Let A = [aij]n × n be symmetric matrix over the field F and let X = [ x1 x2 …… xn]T be a
n n
column vector. Then XT AX determines a unique quadratic form ∑ ∑ aij xi x j in n
i =1 j =1
Thus we have seen that there exists a one-to-one correspondence between the set of all
quadratic forms in n variables over a field F and the set of all n-rowed symmetric
matrices over F.
Example 1: Write down the matrix of each of the following quadratic forms and verify that they
can be written as matrix products XT AX :
2 2
(i) x1 − 18 x1 x2 + 5 x2 .
2 2 2
(ii) x1 + 2 x2 − 5 x3 − x1 x2 + 4 x2 x3 − 3 x3 x1.
1 −9
Let A be the matrix of this quadratic form. Then A = .
−9 5
x1
Let X= . Then X′ = [ x1 x2 ].
x2
1 −9
We have X ′ A = [ x1 x2 ] = [ x1 − 9 x2 − 9 x1 + 5 x2 ].
−9 5
x1
∴ X ′ AX = [ x1 − 9 x2 − 9 x1 + 5 x2 ]
x2
= x1( x1 − 9 x2 ) + x2 (− 9 x1 + 5 x2 )
2 2
= x1 − 9 x1 x2 − 9 x2 x1 + 5 x2
2 2
= x1 − 18 x1 x2 + 5 x2 .
1 3 1 3
x1 x1 − x1 x2 − x1 x3 − x2 x1 + 2 x2 x2 + 2 x2 x3 − x3 x1 + 2 x3 x2 − 5 x3 x3 .
2 2 2 2
1 −1 / 2 −3 / 2
A = −1 / 2 2 2
−3 / 2 2 −5
x1
Let X = x2 . Then X′ = [ x1 x2 x3 ].
x
3
1 −1 / 2 −3 / 2
We have X ′ A = [ x1 x2 x3 ] −1 / 2 2 2
−3 / 2 2 −5
1 3 1 3
= [x1 − x2 − x3 − x1 + 2 x2 + 2 x3 − x1 + 2 x2 − 5 x3 ].
2 2 2 2
[Link]
M- 173
x1
1 3 1 3
∴ X ′ AX = [x1 − x2 − x3 − x1 + 2 x2 + 2 x3 − x1 + 2 x2 − 5 x3 ] x2
2 2 2 2 x
3
1 3 1 3
= x1 ( x1 − x2 − x3 ) + x2 ( − x1 + 2 x2 + 2 x3 ) + x3 ( − x1 + 2 x2 − 5 x3 )
2 2 2 2
2 1 3 1 2 3 2
= x1 − x1 x2 − x1 x3 − x2 x1 + 2 x2 + 2 x2 x3 − x3 x1 + 2 x3 x2 − 5 x3
2 2 2 2
2 2 2
= x1 + 2 x2 − 5 x3 − x1 x2 + 4 x2 x3 − 3 x3 x1.
Solution: (i) The given quadratic form can be written as ax2 + hxy + hyx + by2 .
a h
A= , which is a symmetric matrix of order 2.
h b
2 2 2
0 x1 + x1 x2 + 3 x1 x3 + x2 x1 + 0 x2 − 2 x2 x3 +3 x3 x1 − 2 x3 x2 + 0 x3 .
0 1 3
A = 1 0 −2.
3 −2 0
2 2 2
x1 + 0 x1 x2 + 0 x1 x3 + 0 x2 x1 + 5 x2 + 0 x2 x3 + 0 x3 x1 + 0 x3 x2 − 7 x3 .
1 0 0
∴ if A is the matrix of this quadratic form, then A = 0 5 0 .
0 0 −7
[Link]
M-174
2 2
2 x1 + 2 x1 x2 + 0 x1 x3 + 2 x2 x1 + 0 x22 − 3 x2 x3 + 0 x3 x1 − 3 x3 x2 − 7 x3 .
2 2 0
∴ if A is the matrix of this quadratic form, then A = 2 0 −3.
0 −3 −7
2 2
x12 − 2 x2 − 3 x3 + 4 x1 x2 + 6 x1 x3 − 8 x2 x3
2 2 2
x1 + 2 x1 x2 + 3 x1 x3 + 2 x2 x1 − 2 x2 − 4 x2 x3 + 3 x3 x1 − 4 x3 x2 − 3 x3 .
1 2 3
A = 2 −2 −4.
3 −4 −3
x1
Let X = x2 . Then X′ = [ x1 x2 x3 ].
x
3
1 2 3
We have X ′ A = [ x1 x2 x3 ] 2 −2 −4
3 −4 −3
= [x1 + 2 x2 + 3 x3 2 x1 − 2 x2 − 4 x3 3 x1 − 4 x2 − 3 x3 ].
x1
∴ X ′ AX = [x1 + 2 x2 + 3 x3 2 x1 − 2 x2 − 4 x3 3 x1 − 4 x2 − 3 x3 ] x2
x
3
= ( x1 + 2 x2 + 3 x3 ) x1 + (2 x1 − 2 x2 − 4 x3 ) x2 + (3 x1 − 4 x2 − 3 x3 ) x3
2 2 2
= x1 − 2 x2 − 3 x3 + 4 x1 x2 − 8 x2 x3 + 6 x3 x1.
[Link]
M- 175
Example 4: Write down the quadratic forms corresponding to the following matrices :
0 1 2 3
0 5 −1
1 2 3 4
(i) 5 1 6 (ii) .
−1 2 3 4 5
6 2
3 4 5 6
Solution: (i) Let X = [ x1 x2 x2 ]T and A denote the given symmetric matrix. Then
0 5 −1
T
X A = [ x1 x2 x3 ] 5 1 6
−1 6 2
= [5 x2 − x3 5 x1 + x2 + 6 x3 − x1 + 6 x2 + 2 x3 ].
∴ XT AX = x1(5 x2 − x3 ) + x2 (5 x1 + x2 + 6 x3 ) + x3 ( − x1 + 6 x2 + 2 x3 )
2 2
= x2 + 2 x3 + 10 x1 x2 − 2 x1 x3 + 12 x2 x3 .
0 1 2 3
T 1 2 3 4
X A = [ x1 x2 x3 x4 ]
2 3 4 5
3 4 5 6
= [x2 + 2 x3 + 3 x4 x1 + 2 x2 + 3 x3 + 4 x4
2 x1 + 3 x2 + 4 x3 + 5 x4 3 x1 + 4 x2 + 5 x3 + 6 x4 ]
∴ XT AX = x1( x2 + 2 x3 + 3 x4 ) + x2 ( x1 + 2 x2 + 3 x3 + 4 x4 )
+ x3 (2 x1 + 3 x2 + 4 x3 + 5 x4 ) + x4 (3 x1 + 4 x2 + 5 x3 + 6 x4 )
2 2 2
= 2 x2 + 4 x3 + 6 x4 + 2 x1 x2 + 4 x1 x3 + 6 x1 x4 + 6 x2 x3 + 8 x2 x4 + 10 x3 x4 .
[Link]
M-176
Comprehensive Exercise 1
(i) x2 + 2 y2 + 3 z 2 + 4 xy + 5 yz + 6 zx.
2 2 2
(iii) a11 x1 + a22 x2 + a33 x3 + 2 a12 x1 x2 + 2 a23 x2 x3 + 2 a31 x3 x1.
2 2 2 2
(iv) x1 − 2 x2 + 4 x3 − 4 x4 − 2 x1 x2 + 3 x1 x4 + 4 x2 x3 − 5 x3 x4.
(v) x1 x2 + x2 x3 + x3 x1 + x1 x4 + x2 x4 + x3 x4.
2
(vi) x1 − 2 x2 x3 − x3 x4.
2 2 2 2 2
(vii) d1 x1 + d2 x2 + d3 x3 + d4 x4 + d5 x5 .
2. Write down the quadratic forms corresponding to the following symmetric matrices :
0 a b c
1 2 3
a 0 l m
(i) 2 0 3 (ii) .
3 b l 0 p
3 1
c m p 0
A nswers
1 2 3 a h g
1. (i) 2 2 5 / 2 (ii) h b f
3 5/2 3 g f c
3
1 −1 0
2
a11 a12 a31 −1
−2 2 0
(iii) a12 a22 a23 (iv) 5
0 2 4 −
a a23 a33 2
31
3 0 −
5
−4
2 2
[Link]
M- 177
1 1 1
0 2 2 2 1 0 0 0
1 1 1
0 0 0 −1 0
2 2 2 (vi) 0 1
(v) −1 0 −
1 1
0
1 2
2 2 2 1
1 1 1 0 0 −
2
0
2 0
2 2
2 2
2. (i) x1 + x3 + 4 x1 x2 + 6 x1 x3 + 6 x2 x3
2 2 2
(iii) λ1 x1 + λ 2 x2 + ...... + λ n xn
2. There exists a one-to-one correspondence between the set of all quadratic forms
in n variables over a field F and the set of all n-rowed …… matrices over F.
2 2 2 2 2
d1 x1 + d2 x2 + d3 x3 + d4 x4 + d5 x5 is A = .............
4. The quadratic form corresponding to the symmetric matrix diag. [λ1, λ 2 ,……, λ n]
is … .
2 1 5
5. The quadratic form corresponding to the matrix 1 3 −2 is ….
5 −2 4
[Link]
M-178
True or False
1. The matrix corresponding to the quadratic form d1 x12 + d2 x22 + d3 x32 + d4 x42 is a
diagonal matrix.
2. Every quadratic form over a field F in n variables x1, x2 ,……, xn can be expressed in
the form X ′ BX where X = [ x1, x2 ,……, xn]T is a column vector and B is a skew
A nswers
2 2 2
4. λ1 x1 + λ 2 x2 + …… + λ n xn
2 2 2
5. 2 x1 + 3 x2 + 4 x3 + 2 x1 x2 + 10 x1 x3 − 4 x2 x3
True or False
1. T 2. F
The rank of a matrix determines the dimension of the solution space of a system of equations it represents. A rank equal to the number of unknowns suggests a unique solution, while a lower rank indicates multiple solutions. Matrix transformations preserve rank and hence the characteristics of the solution space, allowing for reduced forms without solution loss .
Every m×n matrix of rank r can be reduced to the form \( \begin{bmatrix} I_r & O \end{bmatrix} \) by a finite chain of elementary transformations, which do not alter the rank as they involve adding multiples of one row to another, swapping rows, or multiplying a row by a non-zero scalar .
The characteristic roots (eigenvalues) of a matrix A are found by solving the characteristic equation |A - λI| = 0. For each eigenvalue, the eigenvectors are determined by solving (A - λI)x = 0 for non-zero x. If a matrix's equation leads to solutions where x = 0 is the only solution, no non-zero eigenvectors exist for that λ .
Two matrices are row equivalent if one can be transformed into the other via a finite number of elementary row transformations, and similarly, they are column equivalent if one can be transformed into the other via elementary column transformations .
A matrix is said to be in Echelon form if all zero rows occur at the bottom, and the number of zeros before the first non-zero element in a row is less than in the next row. The rank of a matrix in Echelon form is determined by the number of non-zero rows .
A set of vectors is linearly independent if no vector in the set can be represented as a linear combination of the others. For example, the vectors \( \begin{bmatrix} 1 \end{bmatrix}, \begin{bmatrix} 0 \end{bmatrix}, \begin{bmatrix} 0 \end{bmatrix} \) are linearly independent because the only solution to \( k_1 \begin{bmatrix} 1 \end{bmatrix} + k_2 \begin{bmatrix} 0 \end{bmatrix} + k_3 \begin{bmatrix} 0 \end{bmatrix} = \begin{bmatrix} 0 \end{bmatrix} \) is \( k_1 = k_2 = k_3 = 0 \).
A quadratic form \( \phi \) in variables is represented by a symmetric matrix B such that \( \phi = X^T BX \), where X is a vector of the variables. The matrix is symmetric because all quadratic forms can be adjusted so that their coefficient matrix reflects this property, reflecting the requirement that mixed terms have a consistent coefficient .
To obtain the modal and diagonal matrices of a matrix A, solve for its eigenvalues and eigenvectors. The modal matrix P consists of eigenvectors as columns, and the diagonal matrix D has corresponding eigenvalues on its diagonal. A matrix is diagonalizable if P exists, meaning there are enough independent eigenvectors to span the space .
A system of linear equations has a unique solution if the rank of the coefficient matrix equals the rank of the augmented matrix and equals the number of variables. It has infinite solutions if the rank of both matrices is less than the number of variables but equal, and it has no solution if the rank of the coefficient matrix is less than the rank of the augmented matrix .
The unique correspondence between quadratic forms and symmetric matrices allows for consistent and efficient manipulation of multivariable functions, facilitating analysis in optimization, physics, and statistics. Symmetric matrices also simplify computations, such as finding eigenvalues and diagonalization, critical in these applications .