2-Math (II) - Ms and PDF
2-Math (II) - Ms and PDF
Faculty of Science
Department of Mathematics
For
Faculty of Computers & Information
By
Professor
OSAMA RASHED SAYED
-0-
Table of Contents
CHAPTER I: MATRICES
Matrices 6
Exercise Set (1.1) 19
Elementary Row Operations 21
Exercise Set (1.2) 26
Inverse matrix 27
Exercise Set (1.3) 37
Special matrices 36
Exercise Set (1.4) 46
CHAPTER II: SYSTEMS OF LINEAR EQUATIONS
Introduction of systems of linear equations 47
Exercise Set (2.1) 67
Homogeneous system of linear equations 70
Exercise Set (2.2) 76
CHAPTER III: DETERMINATES
Determinants 78
Exercise Set (3.1) 84
Cramer' Rule 85
Exercise Set (3.2) 87
CHAPTER VII: LINEAR SPACE
Euclidean n-space 89
Linear Space 91
Exercise Set (4.2) 103
Subspaces 106
Exercise Set (4.3) 113
Linear combination and spanning sets 115
Exercise Set (4.4) 125
Linear Independence 127
Exercise Set (4.5) 135
Basis and Dimension 136
Exercise Set (4.6) 150
-1-
CHAPTER 8: INNER PRODUCT SPACES
Inner product 154
Exercise Set (5.1) 162
Orthogonally 164
Exercise Set (5.2) 173
Orthonormal Basis 176
Exercise Set (5.3) 187
EigenValues and EigenVectors 188
Exercise Set (5.4) 200
CHAPTER VI: PARTIAL FRACTIONS & SEQUENCES
Partial Fractions Decomposition 202
Exercise Set (6.1) 210
Sequence 211
Exercise Set (6.2) 223
Monotone Sequences 227
Exercise Set (6.3) 236
CHAPTER IV: INFINITE SERIES
Infinite Series 239
Exercise Set (7.1) 250
Convergence; The Integral Test 253
Exercise Set (7.2) 262
Additional Convergence Tests 264
Exercise Set (7.3) 269
Applying the Comparison Test 272
Exercise Set (7.4) 281
Alternating Series; Conditional Convergence 284
Exercise Set (7.5) 297
Power Series 300
Exercise Set (7.6) 309
Taylor and Maclaurin Series 311
Exercise Set (7.7) 323
-2-
CHAPTER V: DIFFERENTIAL EQUATION
Introduction 326
Definitions and Terminology 326
Initial-Value and Boundary-Value Problems 339
Solutions of First Order Differential Equations 347
Exercise Set (8.1) 360
MISCELLANEOUS PROBLEMS 363
REFERENCES
-3-
CHAPTER (I)
MATRICES
-4-
CHAPTER (I)
MATRICES
1. MATRICES
-5-
●We say that 𝐴 is a matrix of size (order) 𝑚 × 𝑛 (or m by
n matrix) where 𝑚 is the number of the rows of 𝐴 and n
is the number of the columns of 𝐴. So, in Example 1 the
matrices are of size 3 × 2 , 1 × 4, 3 × 3, 2 × 1, and
1 × 1, respectively.
● We will use R to denote for row and C for column.
●We will use capital letters A, B, C, … to denote for
matrices.
●We write 𝐴 = [𝑎𝑖𝑗 ] or 𝐴 = [𝑎𝑖𝑗 ] if the size is
𝑚×𝑛
known.
●We say that 𝐴 is a square matrix of size n, if 𝑛 = 𝑚.
●An 1 × 𝑛 matrix is called a row matrix (or, row vector)
and written as [𝑎1 𝑎2 … 𝑎𝑛 ] and 𝑛 × 1 matrix is called a
-6-
𝑎1
𝑎2
column matrix (or, vertical vector) and written as [ ⋮ ].
𝑎𝑛
In Example 1 the 2 × 1 matrix is a column matrix, the
1 × 4 matrix is a row matrix, and 1 × 1 matrix is both a
row matrix and a column matrix.
Definition 2.
Let 𝐴 = [𝑎𝑖𝑗 ] and 𝐵 = [𝑏𝑖𝑗 ] be two matrices of the same
size. Then 𝐴 = 𝐵 if and only if 𝑎𝑖𝑗 = 𝑏𝑖𝑗 for all 𝑖 and 𝑗.
Example 2.
𝑎 𝑏 1 1 3
The matrices 𝐴 = [ ] and 𝐵 = [ ] are not
𝑐 𝑑 −2 0 5
equal because they are not of the same size.
5 −1
But, 𝐶 = [ ] is equal to 𝐴 if and only if 𝑎 = 5,
7 0
𝑏 = −1, 𝑐 = 7 and 𝑑 = 0.◄
Definition 3.
Let 𝐴 = [𝑎𝑖𝑗 ] and 𝐵 = [𝑏𝑖𝑗 ] be two matrices of size
𝑚 × 𝑛. Then 𝐴 + 𝐵 is of size 𝑚 × 𝑛 and is defined as
𝐴 + 𝐵 = [𝑎𝑖𝑗 + 𝑏𝑖𝑗 ].
-7-
Example 3.
1 3 −3 −3 −2 0
[−1 4 ]+[ 0 4 ] = [−1 8].
0 −5 5 6 5 1
Example 4.
𝑎 3 4 8 1 11
Let [ ]+[ ]=[ ]. Find 𝑎, 𝑏 and 𝑐.
1 𝑏 𝑐 −1 5 −4
Solution:
𝑎 3 4 8 𝑎+4 11
Since [ ]+[ ]=[ ],
1 𝑏 𝑐 −1 1+𝑐 𝑏−1
𝑎 + 4 = 1, 1 + 𝑐 = 5, 𝑏 − 1 = −4. Therefore, we
obtain 𝑎 = 𝑏 = −3 and 𝑐 = 4.◄
Definition 4.
A matrix whose elements are zeros is called a zero-
matrix and is denoted by 0 (or [0]).
Definition 5.
Let 𝐴 = [𝑎𝑖𝑗 ] be a matrix and 𝑘 ∈ ℝ. The matrix 𝑘𝐴 is
defined as 𝑘𝐴 = [𝑘𝑎𝑖𝑗 ].
Notes:
●If 𝑘 = −1 , we write – 𝐴 instead of (−1)𝐴.
●𝐴 − 𝐵 = 𝐴 + (−𝐵).
-8-
Theorem 1.
Let [𝑎𝑖𝑗 ] , 𝐵 = [𝑏𝑖𝑗 ] and 𝐶 = [𝑐𝑖𝑗 ] be three 𝑚 × 𝑛
matrices; and 𝑟, 𝑠 ∈ ℝ . Then
(1) 𝐴 + 𝐵 = 𝐵 + 𝐴;
(2) 𝐴 + (𝐵 + 𝐶 ) = (𝐴 + 𝐵) + 𝐶;
(3) 𝐴 + 𝟎 = 𝐴;
(4) 𝐴 + (−𝐴) = 𝟎;
(5) 𝑟(𝐴 + 𝐵 ) = 𝑟𝐴 + 𝑟𝐵;
(6) (𝑟 + 𝑠)𝐴 = 𝑟𝐴 + 𝑠𝐴;
(7) (𝑟𝑠)𝐴 = 𝑟(𝑠𝐴);
(8) 1𝐴 = 𝐴.
Proof.
(1) Note that:
𝐴 + 𝐵 = [𝑎𝑖𝑗 ] + [𝑏𝑖𝑗 ] = [𝑎𝑖𝑗 + 𝑏𝑖𝑗 ]
= [𝑏𝑖𝑗 + 𝑎𝑖𝑗 ] = 𝐵 + 𝐴.
Therefore 𝐴 + 𝐵 = 𝐵 + 𝐴.
(2) 𝐴 + (𝐵 + 𝐶 ) = [𝑎𝑖𝑗 ] + ([𝑏𝑖𝑗 ] + [𝑐𝑖𝑗 ])
= [𝑎𝑖𝑗 ] + [𝑏𝑖𝑗 + 𝑐𝑖𝑗 ]
= [𝑎𝑖𝑗 + (𝑏𝑖𝑗 + 𝑐𝑖𝑗 )]
-9-
= [(𝑎𝑖𝑗 + 𝑏𝑖𝑗 ) + 𝑐𝑖𝑗 ]
= [𝑎𝑖𝑗 + 𝑏𝑖𝑗 ] + [𝑐𝑖𝑗 ]
= ([𝑎𝑖𝑗 ] + [𝑏𝑖𝑗 ]) + [𝑐𝑖𝑗 ]
= (𝐴 + 𝐵) + 𝐶.
Therefore 𝐴 + (𝐵 + 𝐶 ) = (𝐴 + 𝐵) + 𝐶.
(3) 𝐴 + 𝟎 = [𝑎𝑖𝑗 ] + [0] = [𝑎𝑖𝑗 + 0] = [𝑎𝑖𝑗 ] = 𝐴.
Therefore 𝐴 + 𝟎 = 𝐴.
(4) 𝐴 + (−𝐴) = [𝑎𝑖𝑗 ] + [−𝑎𝑖𝑗 ] = [𝑎𝑖𝑗 − 𝑎𝑖𝑗 ]
= [0] = 𝟎.
Therefore 𝐴 + (−𝐴) = 𝟎.
(5) 𝑟(𝐴 + 𝐵) = 𝑟([𝑎𝑖𝑗 ] + [𝑏𝑖𝑗 ]) = 𝑟[𝑎𝑖𝑗 + 𝑏𝑖𝑗 ]
= [𝑟(𝑎𝑖𝑗 + 𝑏𝑖𝑗 )] = [𝑟𝑎𝑖𝑗 + 𝑟𝑏𝑖𝑗 ]
= [𝑟𝑎𝑖𝑗 ] + [𝑟𝑏𝑖𝑗 ] = 𝑟[𝑎𝑖𝑗 ] + 𝑟[𝑏𝑖𝑗 ]
= 𝑟𝐴 + 𝑟𝐵.
Therefore 𝑟(𝐴 + 𝐵) = 𝑟𝐴 + 𝑟𝐵.
(6) (𝑟 + 𝑠)𝐴 = (𝑟 + 𝑠)[𝑎𝑖𝑗 ] = [(𝑟 + 𝑠)𝑎𝑖𝑗 ]
= [𝑟𝑎𝑖𝑗 + 𝑠𝑎𝑖𝑗 ] = [𝑟𝑎𝑖𝑗 ] + [𝑠𝑎𝑖𝑗 ]
= 𝑟[𝑎𝑖𝑗 ] + 𝑠[𝑎𝑖𝑗 ] = 𝑟𝐴 + 𝑠𝐴.
Therefore (𝑟 + 𝑠)𝐴 = 𝑟𝐴 + 𝑠𝐴.
- 10 -
(7) (𝑟𝑠)𝐴 = (𝑟𝑠)[𝑎𝑖𝑗 ] = [((𝑟𝑠)𝑎𝑖𝑗 )]
= 𝑟(𝑠𝐴).
Therefore (𝑟𝑠)𝐴 = 𝑟(𝑠𝐴).
(8) Obvious. ◄
Example 5.
0 −1 −1 0
Let 𝐴 = [2 5 ] and 𝐵 = [ 5 −1]. Then
1 3 3 2
0 −1 −1 0 1 −3
3𝐴 − 𝐵 = 3 [2 5 ] − [ 5 −1] = [1 16 ].
1 3 3 2 0 7
Example 6.
Solve the following matrix equation.
0 −2 2 −1
𝐴 + 3[ ]=[ ].
1 −1 4 0
Solution:
Note that the size of the matrix 𝐴 should be 2 × 2. So, we
assume that.
𝑎 𝑏 2 −1 0 −6 2 5
𝐴=[ ]=[ ]−[ ]=[ ].◄
𝑐 𝑑 4 0 3 −3 1 3
- 11 -
Definition 6.
If 𝐴 = [𝑎𝑖𝑗 ] is any 𝑚 × 𝑛 matrix, then the transpose of
𝐴, denoted by 𝐴𝑇 , is the 𝑛 × 𝑚 matrix obtained from 𝐴 by
interchanging the rows and columns of 𝐴, 𝑖. 𝑒. 𝐴𝑇 = [𝑎𝑗𝑖 ].
Theorem 2.
Let 𝐴 = [𝑎𝑖𝑗 ] and 𝐵 = [𝑏𝑖𝑗 ] be an 𝑚 × 𝑛 matrices. The
following statements are true.
(1) (𝐴𝑇 )𝑇 = 𝐴.
(2) (𝑘𝐴)𝑇 = 𝑘𝐴𝑇 ,where𝑘 ∈ ℝ.
(3) (𝐴 + 𝐵)𝑇 = 𝐴𝑇 + 𝐵𝑇 .
𝑇
Proof. (𝐴𝑇 )𝑇 = [𝑎𝑗𝑖 ] = [𝑎𝑖𝑗 ] = 𝐴;
𝑇
(1) (𝑘𝐴)𝑇 = [𝑘𝑎𝑖𝑗 ] = [𝑘𝑎𝑗𝑖 ] = 𝑘[𝑎𝑗𝑖 ] = 𝑘𝐴𝑇 ;
𝑇
(2) (𝐴 + 𝐵)𝑇 = [𝑎𝑖𝑗 + 𝑏𝑖𝑗 ] = [𝑎𝑗𝑖 + 𝑏𝑗𝑖 ]
= [𝑎𝑗𝑖 ] + [𝑏𝑗𝑖 ] = 𝐴𝑇 + 𝐵𝑇 .
♣ We noted that sum of matrices is nearly like the
addition of real numbers. But the product of matrices is
- 12 -
not like the product of real numbers. Before we define the
product of matrices, we define the Euclidean product.
Definition 7.
𝑏1
𝑏
Let 𝐴 = [𝑎1 𝑎2 … 𝑎𝑛 ] be a row vector and 𝐵 = [ 2 ] be a
:
𝑏𝑛
column vector. We define the Euclidean product of 𝐴 and
B as follows:
𝐴𝐵 = 𝑎1 𝑏1 + 𝑎2 𝑏2 + ⋯ + 𝑎𝑛 𝑏𝑛 .
Definition 8.
Let 𝐴 = [𝑎𝑖𝑗 ] be an 𝑚 × 𝑛 matrix and 𝐵 = [𝑏𝑖𝑗 ] be an
𝑛 × 𝑝 matrix.
The product of 𝐴 and 𝐵, written as 𝐴𝐵, is the 𝑚 × 𝑝
matrix 𝐶 = [𝑐𝑖𝑗 ] whose 𝑖𝑗 - component is
𝑛
- 13 -
Example 6.
Example 7.
1 2
0 −1 2
Let 𝐴 = [−1 0] and 𝐵 = [ ].
−4 1 0
0 3
Compute 𝐴𝐵 and 𝐵𝐴, if it is possible?
Solution:
Since 𝐴 is of size 3 × 2 and 𝐵 is of size 2 × 3, then 𝐴𝐵
is of size 3 × 3 and 𝐵𝐴 is of size 2 × 2. Now
1 2 −8 1 2
0 −1 2
𝐴𝐵 = [−1 0] [ ] = [ 0 1 −2].
−4 1 0
0 3 −12 3 0
1 2
0 −1 2 1 6
Also 𝐵𝐴 = [ ] [−1 0] = [ ].◄
−4 1 0 −5 −8
0 3
- 14 -
Notes.
(1) 𝐴𝐵 ≠ 𝐵𝐴, 𝑖. 𝑒., The product of matrices is not
commutative.
(2) It is possible that that 𝐴 ≠ 𝟎 and 𝐵 ≠ 𝟎 such that
𝐴𝐵 = 𝟎.
1 0 0 0
For example, 𝐴 = [ ],𝐵 = [ ] and
0 0 0 1
0 0
𝐴𝐵 = [ ].
0 0
Theorem 3.
Let 𝐴 = [𝑎𝑖𝑗 ], 𝐵 = [𝑏 𝑖𝑗 ] and 𝐶 = [𝑐𝑖𝑗 ] be matrices with
suitable size such that the following operations are
defined and 𝑘 ∈ ℝ. Then:
(1) 𝐴(𝐵𝐶 ) = (𝐴𝐵 )𝐶
(2) 𝐴(𝐵 + 𝐶 ) = 𝐴𝐵 + 𝐴𝐶
(3) (𝐵 + 𝐶 )𝐴 = 𝐵𝐴 + 𝐶𝐴
(4) 𝑘 (𝐴𝐵) = (𝑘𝐴)𝐵 = 𝐴(𝑘𝐵 )
(5) (𝐴𝐵 )𝑇 = 𝐵 𝑇 𝐴𝑇
Proof.
(1) Let 𝐴, 𝐵, 𝐶 have the size 𝑚 × 𝑟, 𝑟 × 𝑠, 𝑠 × 𝑛,
respectively.
- 15 -
Therefore
𝑟 𝑠 𝑟
= 𝐴𝐵 + 𝐴𝐶.
- 16 -
(3) Similar to (2)
(4) Let 𝐴 be of size 𝑚 × 𝑛 and 𝐵 of size 𝑛 × 𝑝. Therefore
𝑛 𝑛
𝑡𝑟 (𝐴) = ∑ 𝑎𝑖𝑖
𝑖=1
- 17 -
Example 8.
1 3 −2
Let 𝐴 = [0 5 7 ].
1 2 −3
Then 𝑡𝑟(𝐴) = 1 + 5 + (−3) = 3.◄
Theorem 4.
Let 𝐴 and 𝐵 be an 𝑛 × 𝑛 matrices and 𝑘 ∈ ℝ. Then the
following statements are true.
(1) 𝑡𝑟(𝐴 + 𝐵 ) = 𝑡𝑟(𝐴) + 𝑡𝑟(𝐵);
(2) 𝑡𝑟(𝐴𝑇 ) = 𝑡𝑟(𝐴);
(3) 𝑡𝑟(𝐴𝐵 ) = 𝑡𝑟(𝐵𝐴);
(4) 𝑡𝑟(𝑘𝐴) = 𝑘 𝑡𝑟(𝐴);
2
(5) 𝑡𝑟(𝐴𝐴𝑇 ) = ∑𝑖≠𝑗 𝑎𝑖𝑗 .
Proof. Left as exercise.
- 18 -
Exercise Set (1.1)
3 0
4 −1 1 4 2
Let 𝐴 = [−1 2] , 𝐵 = [ ],𝐶 =[ ]
0 2 3 1 5
1 1
1 5 2 6 1 3
𝐷 = [−1 0 1]and 𝐸 = [−1 1 2]
3 2 4 4 1 3
In Exercise (1) to (30) Find:
(1) 𝐷 + 𝐸 (2) 𝐷 − 𝐸
(3) 5𝐴 (4) −7𝐶
(5) 2𝐵 − 𝐶 (6) 4𝐸 − 2𝐷
(7) −3(𝐷 + 2𝐸) (8) 𝐶 − 𝐶 𝑇
(9) 𝑡𝑟(𝐷) (10) 𝑡𝑟(𝐷 − 3𝐸 )
(11) 4𝑡𝑟(7𝐵) (12) 𝑡𝑟(𝐴)
(13) 2𝐴𝑇 + 𝐶 (14) 𝐷𝑇 − 𝐸 𝑇
(15) (𝐷 − 𝐸 )𝑇 (16) 𝐵𝑇 + 5𝐶 𝑇
1 1
(17) 𝐶𝑇 𝐴 (18) 𝐶 𝑇 − 𝐶 𝑇
4 2
- 19 -
(29) 𝑡𝑟(𝐷𝐸 + 𝐸𝐷) (30) 𝑡𝑟(𝐷𝐸 − 𝐸𝐷)
(31) Find 𝐴 of 𝐵 when 2𝐴 − 3𝐵 = 6(𝐴 + 3𝐵)
(32) Let 𝐵 be an 𝑚 × 𝑛 matrix such that 𝐵 + 𝐴 = 𝐴 for
every 𝑚 × 𝑛 matrix 𝐴. Prove that 𝐵 = 𝟎.
(33) Find all 2 × 2 matrices such that 𝐴2 = 𝐼.
(34) Prove that there is no an 2 × 2 matrices 𝐴 and 𝐵
such that 𝐴𝐵 − 𝐵𝐴 = 𝐼.
- 20 -
2-ELEMENTARY ROW OPERATIONS
The following operations, called elementary row
operations, on the rows of a matrix:
1- Interchange two rows;
2- Multiply a row by a non-zero constant;
3- Replace one row by the sum of itself and a constant
multiple of another row.
- 21 -
●𝑘𝑅𝑖𝑗 will indicate that 𝑘 times the 𝑖𝑡ℎ row is added to
the 𝑗𝑡ℎ row and the sum becomes the new 𝑗𝑡ℎ row.
●If we obtained B from A by elementary row operation,
then we can obtain B from A by the inverse of the
elementary row operation as follows:
1
(a') 𝑅𝑗𝑖 ; (b') 𝑅𝑖 ; (c') −𝑘𝑅𝑖𝑗 .
𝑘
- 22 -
column occurs to the right of the first column that
contains a nonzero (𝑘 − 1)𝑡ℎ entry.
13 5 7
1 4 5
01 1 0
In particular, [0 1 4] and [ ] are in row-
00 1 3
0 0 0
00 0 1
2 3 5
echelon form, while [4 0 1] is not.
0 0 0
♣ A matrix in row-echelon form that satisfy the following
two properties is said to be in reduced row-echelon
form:
● If a row does not consist entirely of zeros then the
leading entry of that row, is 1.
● Any column containing the leading entry of some row
consists entirely of zeros except for that single leading
entry.
For example,
1 0 2
1 0 0 1 4 7 0
0 1 0
[0 1 0], [ ] and [0 0 0 1]
0 0 0
0 0 1 0 0 0 0
0 0 0
are in reduced row-echelon form,
while,
- 23 -
1 0 1
1 1 0 1 2 6 6
0 0 0
[0 1 3], [ ] and [0 0 2 1]are not.
0 1 0
0 0 1 0 0 0 0
0 0 0
♣ A systematic process that transforms any matrix to a
matrix in row-echelon form is as follows:
(1) Interchanging rows (if necessary) to replace a
nonzero number in the first entry of the first row.
(2) Use the second elementary row operation as many
times necessary to replace a zero as the first entry in
every row below the first.
(3) Hold the first row fixed, and interchange other
rows (if necessary) to give the second row a nonzero
entry as for as left as possible.
(4) Hold the first two rows fixed; use the second
elementary row operation to replace a zero in every row
after the second row directly beneath the first nonzero
entry in the second row.
(5) Continue to hold the first two rows fixed and
interchange other rows (if necessary) to give the third
row a nonzero entry as for left as possible.
- 24 -
(6) Hold the first three rows fixed; use the second
elementary row operation to replace a zero in every row
after the third row directly beneath the first nonzero
entry in the third row.
(7) Continue in this fashion row-echelon forms
obtained.
22 2 4
Example 1. Transform the matrix [7 3 4 4] to a matrix
61 2 1
in row- echelon form.
Solution:
2 2 2 4 1 𝑅1 1 1 1 2 −7𝑅12 ,−6𝑅13 1 1 1 2
2
[7 3 4 4 ] → [ 7 3 4 4] → [0 −4 −3 −10]
61 2 1 61 2 1 0 −5 −4 −11
1
− 𝑅2 1 1 1 2 5𝑅23 1 1 1 2
4
→ [0 1 3/4 5/2 ] → [0 1 3/4 5/2]
0−5 −4 −11 0 0 −1/4 3/2
−4𝑅3 1 1 1 2
→ [0 1 3/4 5/2]
0 0 1 −6
- 25 -
1 1 1 2
[0 1 3/4 5/2]
0 0 1 −6
Example 3.
Put the matrix
5 2 2
𝐴 = [7 3 4]
6 1 2
in reduced row echolen-form.
Solution:
1
𝑅 1 2/5 2/5 −7𝑅12,−6𝑅13 1 2/5 2/5
5 1
𝐴→ [7 3 4 ]→ [0 1/5 6/5 ]
6 1 2 0 −7/5 −2/5
7
5𝑅21 2/5 2/5 𝑅23 1 2/5 2/5
5
→ [0 1 6 ]→ [0 1 6 ]
0 −7/5 −2/5 0 0 8
1
𝑅 1 2/5 2/5 −2 𝑅21 1 0 −2
8 3 5
→ [0 1 6 ]→ [0 1 6 ]
0 0 1 0 0 1
2𝑅31, −6𝑅32 1 0 0
→ [0 1 0]
0 0 1
- 26 -
Exercise Set (1.2)
Transform the following matrices to the reduced row-
echelon form: -
1 2 1 1 1 3 1 7
1 −1 4 2 (2) [3 −2 4 1]
(1) [ ]
2 1 3 1 4 1 9 6
−3 1 1 4
1 2 1 0 2 4
1 3 5 (4) [1 −3 5 ]
(3) [ ]
2 5 6 3 −1 −1
3 7 7
1 3 −2 0 2 0 1 1
2 6 −5 −2 4 −3 (6) [2 −1]
(5) [ ]
0 0 5 10 0 15 1 2
2 6 0 8 4 18
1 3 −2 −1 0
6 3
4 −5 (8) [ 2 1 1
(7) [6 ] 0 0 1 1 0
]
2 −3 2
2 6 4 2 −2
1 3 1 1
(9) [−1 −2 1 1]
3 7 −1 1
- 27 -
3. Inverse of Matrix
Definition 1.
Let A be an 𝑛 × 𝑛 matrix. The matrix B is an inverse
matrix of A if B is an 𝑛 × 𝑛 matrix and 𝐴𝐵 = 𝐵𝐴 = 𝐼. In
this case, we say that A is invertible or nonsingular.
Example 1.
7 3
Prove that 𝐵 = [ ] is an inverse of
2 1
1 −3
𝐴=[ ].
−2 7
Solution. Since
1 −3 7 3 1 0
𝐴𝐵 = [ ][ ]=[ ]=𝐼
−2 7 2 1 0 1
And
7 3 1 −3 1 0
𝐵𝐴 = [ ][ ]=[ ] = 𝐼.
2 1 −2 7 0 1
Then B is the inverse of A.◄
0 1
Example 1. Prove that the matrix 𝐴 = [ ] has no
0 1
inverse (singular).
𝑎 𝑏
Solution. Let 𝐵 = [ ] be the inverse of A. Then
𝑐 𝑑
0 1 𝑎 𝑏 1 0
𝐴𝐵 = [ ][ ]=[ ].
0 1 𝑐 𝑑 0 1
- 28 -
Therefore 𝑐 = 0 and 𝑐 = 1, impossible. Then A has no
inverse. ◄
Theorem 1.
The inverse of a square matrix A is unique.
Proof.
Let B and C are inverses of A, then 𝐴𝐶 = 𝐶𝐴 = 𝐼 and
𝐴𝐵 = 𝐵𝐴 = 𝐼. Therefore
𝐶 = 𝐶𝐼 = 𝐶 (𝐴𝐵 ) = (𝐶𝐴)𝐵 = 𝐼𝐵 = 𝐵.◄
Note.
We will write 𝐴−1 for the inverse of 𝐴
Definition 2.
Let A be an 𝑛 × 𝑛 matrix and k be positive integer. We
define 𝐴𝑘 inductively as follows: 𝐴𝑘+1 = 𝐴𝑘 𝐴 and 𝐴0 =
𝐼 , 𝑘 ≥ 0. Also, 𝐴−𝑘 = (𝐴−1 )𝑘 .
Theorem 2.
(1) 𝐼 −1 = 𝐼;
(2) (𝐴−1 )−1 = 𝐴
(3) If both A and B have an inverse, then AB has an
inverse and (𝐴𝐵 )−1 = 𝐵−1 𝐴−1 ;
(4) If 𝐴1 , 𝐴2 , … . , 𝐴𝑘 have an inverse, then 𝐴1 𝐴2 … 𝐴𝑘
−1 −1
has an inverse (𝐴1 𝐴2 … 𝐴𝑘 )−1 = 𝐴−1
𝑛 … 𝐴2 𝐴1 .
- 29 -
(5) If A has an inverse, then 𝐴𝑘 has an inverse for all
𝑘 ≥ 1 and (𝐴𝑘 )−1 = (𝐴−1 )𝑘
(6) If A has an inverse and 0 ≠ 𝑟 ∈ ℝ, then 𝑟𝐴 has an
1
inverse and (𝑟𝐴)−1 = 𝐴−1
𝑟
Proof.
(1) Since 𝐼𝐼 = 𝐼, then 𝐼 −1 = 𝐼.
(2) Since 𝐴−1 exists and 𝐴𝐴−1 = 𝐼 = 𝐴−1 𝐴, by
uniqueness of the inverse of 𝐴, 𝐴 is the inverse of 𝐴−1 .
Therefore (𝐴−1 )−1 = 𝐴.
(3) Since(𝐴𝐵)(𝐵 −1 𝐴−1 ) = 𝐴(𝐵𝐵−1 )𝐴−1
= 𝐴𝐼𝐴−1 = 𝐴𝐴−1 = 𝐼
and
(𝐵−1 𝐴−1 )(𝐴𝐵) = 𝐵−1 (𝐴−1 𝐴)𝐵 = 𝐵−1 𝐼𝐵
= 𝐵𝐵−1 = 𝐼.
Then (𝐴𝐵 )−1 = 𝐵−1 𝐴−1 .
(4) We use induction.
= 𝐴−1 −1 −1 −1
𝑚+1 𝐴𝑚 … 𝐴2 𝐴1 .
- 31 -
Example 3.
1 0
The matrix 𝐸 = [ ] is an elementary matrix, because
0 5
we obtained it from 𝐼2 by only one elementary row
operation (5𝑅2 ). Also,
1 0 5
𝐹 = [0 1 0]
0 0 1
is an elementary matrix, because we obtained it from 𝐼3 by
only one elementary row operation (5𝑅31 ).
1 0 2 3
Now let 𝐴 = [2 −1 3 6]. If 𝐵 is the matrix obtained
1 4 4 0
from 𝐴 by the elementary row operation (5𝑅13 ), 𝑖. 𝑒.,
1 0 2 3
𝐵 = [2 −1 3 6 ], then 𝐸𝐴 = 𝐵, where𝐸 is the
6 4 14 15
elementary matrix obtained from 𝐼3 by(5𝑅13 ).
Thus
100 1 0 23 1 0 2 3
𝐸𝐴 = [0 1 0] [2 −1 3 6] [2 −1 3 6 ] = 𝐵.◄
5 0 1 1 4 4 0 6 4 14 15
- 32 -
Theorem 3.
Let 𝐴 be an 𝑛 × 𝑛 matrix and 𝐸 be elementary matrix
obtained from 𝐼𝑚 . Then 𝐸𝐴 is a matrix obtained from 𝐴
by the same elementary row operation which done in 𝐼𝑚
to obtained 𝐸.
Proof.
We prove the theorem for one elementary row operation.
For example, multiplying𝑐𝑅𝑘 . That is, suppose that 𝐴 =
𝑎11 𝑎12 … 𝑎1𝑛
𝑎21 𝑎22 … 𝑎2𝑛
[ ⋮ ⋮ ⋱ ⋮ ]and 𝐸 is the elementary matrix
𝑎𝑚1 𝑎𝑚2 … 𝑎𝑚𝑛
obtained from 𝐼𝑚 . By 𝑐𝑅𝑘 , 𝑖. 𝑒.,
10 … 0 𝑎11 𝑎12 … 𝑎1𝑛
01 … 0 𝑎21 𝑎22 … 𝑎2𝑛
⋮⋮ : ⋮ ⋮ ⋮ ⋮ ⋮
𝐸= . Then𝐸𝐴 = 𝑐𝑎𝑘𝑛 and
00… 𝑐 …0 𝑐𝑎𝑘1 𝑐𝑎𝑘2 …
⋮⋮ ⋮ ⋮ ⋮ ⋮ ⋮ ⋮
[ 00 … 1] [ 𝑎𝑚1 𝑎𝑚2 … 𝑎𝑚𝑛 ]
𝑎11 𝑎12 … 𝑎1𝑛
𝑎21 𝑎22 … 𝑎2𝑛
𝑐𝑅𝐾 ⋮ ⋮ ⋮ ⋮
𝐴 → 𝑐𝑎 𝑐𝑎 …𝑐𝑎 = 𝐸𝐴 .
𝑘1 𝑘2 𝑘𝑛
⋮ ⋮ ⋮ ⋮
[ 𝑎𝑚1 𝑎𝑚2 … 𝑎𝑚𝑛 ]
Similarly, we can prove the other two statements.
- 33 -
Theorem 4.
Elementary matrices are nonsingular, and the inverse of
an elementary matrix is another elementary matrix.
Proof.
Let 𝐸 be the elementary matrix obtained from 𝐼𝑛 by
multiplying its 𝑖 𝑡ℎ row by a nonzero constant c.
Let 𝐹 be the elementary matrix obtained from 𝐼𝑛 by
1
multiplying its 𝑖 𝑡ℎ row by . Then 𝐸𝐹 is the matrix
𝑐
1
obtained from 𝐹 by multiplying its 𝑖𝑡ℎ row by .
𝑐
- 35 -
𝑅231 1 1 1 0 0
→ [0 −1 −1 |−2 0 1]
0 4 3 1 1 0
−1𝑅2 1 1 1 1 0 0
→ [0 𝟏 1 |2 0 −1]
0 4 3 1 1 0
−4𝑅23 1 1 1 1 0 0
→ [0 𝟏 1 | 2 0 −1]
0 0 −1 −7 1 4
−1𝑅3 1 1 1 1 0 0
→ [0 1 1 |2 0 −1]
0 0 𝟏 7 −1 −4
−1𝑅23 ,−1𝑅31 1 1 0 −6 1 4
→ [0 𝟏 0 |−5 1 3]
0 0 𝟏 7 −1 −4
−1𝑅21 1 0 0 −1 0 1
→ [0 1 0 |−5 1 3 ].
0 0 1 7 −1 −4
Therefore
−1 0 1
−1
𝐴 = [−5 1 3]
7 −1 −4
Example 5.
Find the inverse of the matrix (if it exists)
- 36 -
1 2 3
𝐴 = [4 5 6]
7 8 9
Solution:
1 2 3 1 0 0
[4 5 6 |0 1 0]
7 8 9 0 0 1
−4𝑅12 ,−7𝑅13 1 2 3 1 0 0
→ [0 −3 −6 |−4 1 0]
0 −6 −12 −7 0 1
1
− 𝑅23 1 2 3 1 0 0
3
→ [0 1 2 |4/3 −1/3 0]
0 −6 −12 −7 −2 1
6𝑅23 1 2 3 1 0 0
→ [0 1 2 |4/3 −1/3 0]
0 0 0 1 −2 1
Since the last row in the left matrix is zero, the matrix 𝐴
has no inverse.
- 37 -
Exercise Set (1.3)
𝑎
𝑏
(1) Let 𝐴 = [ ] and 𝑎𝑑 − 𝑏𝑐 ≠ 0. Prove that 𝐴 has
𝑐
𝑑
1 𝑎 −𝑏
an inverse and 𝐴−1 = [ ]
𝑎𝑑−𝑏𝑐 −𝑐 𝑎
(2) Use Exercise (1) to find the inverse of the following
matrix (if it exists).
1 2 −1 3 1/2 0 3 6
(a)[ ] (b) [ ](c)[ ] (d) [ ].
1 3 2 5 0 1/3 2 4
(3) Compute the inverse of the following matrices (if it
exists).
1 7 12 1 2 1
(a)[5 −1 2 ] (b) [3 5 0]
2 3 5 1 7 1
1 1 2 1 4 2
(c) [1 1 1] (d) [2 3 3]
1 2 4 4 1 4
−1 4 5 2 1 2 3 0 0
0 0 0 −1 1 1 1 0 0
(e) [ ]
1 −2−2 0 (f) 0 1 5 1 0
0 −1−1 0 0 0 1 2 2
[0 0 0 0 1]
- 38 -
(4) Write the following matrices as a product of
elementary matrices (if it is possible).
102
2 1 5 0
(a)[ ], (b) [ ], (c)[011],
3 1 3 2
216
1 0 −3
(d) [ 0 1 4 ]
−2 2 15
(5) Prove that 𝐸 𝑡 is an elementary matrix.
(6) Let 𝐴 be an invertible matrix and 𝐴𝐵 = 𝐴𝐶. prove
that 𝐵 = 𝐶.
(7) Let 𝐴 be an invertible matrix and 𝐴2 − 3𝐴 + 𝐼 = 0.
Prove that 𝐴−1 = (3𝐼 − 𝐴)
(8) Let 𝐴 and 𝐵 be matrices of size.prove that:-
(a)𝐴(𝐼 + 𝐵𝐴) = (𝐼 + 𝐴𝐵)𝐴
(b) (𝐼 + 𝐵𝐴)𝐵 = 𝐵(𝐼 + 𝐴𝐵).
- 39 -
4. SPECIAL MATRICES
Definition 1.
An 𝑛 × 𝑛 matrix 𝐴 = [𝑎𝑖𝑗 ] is called upper triangular if
𝑎𝑖𝑗 = 0 whenever 𝑖 > 𝑗 and lower triangular if 𝑎𝑖𝑗 = 0
whenever 𝑖 < 𝑗. A matrix is called triangular if it is
either upper or lower triangular.
Definition 2.
Let 𝐴 be a square matrix:
(1) 𝐴 is symmetric if 𝐴𝑇 = 𝐴.
(2) 𝐴is anti(skew)-symmetric if 𝐴𝑇 = −𝐴.
Note.
We considered only the matrices of real numbers.
Sometimes we need to study the matrices of complex
number, i. e., its elements is complex numbers.
- 40 -
Example 1.
1 2 −1 𝑖 𝑖 1+𝑖
Both 𝐴 = [ 2 3 7 ] and 𝐵 = [ 𝑖 2𝑖 2 − 𝑖]
−1 7 5 1+𝑖 2−𝑖 1
1 −1
are symmetric matrices. But 𝐶 = [ ] is not
3 2
symmetric.
Example 2. Both the matrices
0 2 −1 0 𝑖 1+𝑖
𝐴 = [−2 0 7 ] and 𝐵 = [ −𝑖 0 2 − 𝑖]
1 −7 0 −1 − 𝑖 −2 + 𝑖 0
are anti-symmetric.
Theorem 1. Let 𝐴 be a square matrix and 𝑘 ∈ ℝ. Then
(1) 𝐴𝐴𝑇 is symmetric;
(2) 𝐴 + 𝐴𝑇 is symmetric;
(3) 𝐴 − 𝐴𝑇 is anti-symmetric;
(4) If 𝐴 is symmetric, then 𝑘𝐴 is symmetric.
(5) If 𝐴 is anti-symmetric, then 𝑘𝐴 is anti-symmetric.
(6) There exist a unique matrices 𝐵 and 𝐶, where 𝐵 is
symmetric and 𝐶 is anti-symmetric such that
𝐴 = 𝐵 + 𝐶.
- 41 -
Proof.
(1) Since(𝐴𝐴𝑇 )𝑇 = (𝐴𝑇 )𝑇 𝐴𝑇 = 𝐴𝐴𝑇 , 𝐴𝐴𝑇 is
symmetric.
(2) Since (𝐴+𝐴𝑇 )𝑇 = 𝐴𝑇 + (𝐴𝑇 )𝑇 = 𝐴𝑇 + 𝐴 =
𝐴 + 𝐴𝑇 , 𝐴 + 𝐴𝑇 is symmetric.
(3) Since (𝐴−𝐴𝑇 )𝑇 = 𝐴𝑇 + (−𝐴𝑇 )𝑇 = 𝐴𝑇 − (𝐴𝑇 )𝑇 =
𝐴𝑇 − 𝐴 = −(𝐴−𝐴𝑇 ), 𝐴−𝐴𝑇 is anti-symmetric.
(4) Since A is symmetric, 𝐴𝑇 = 𝐴.
Therefore (𝑘𝐴)𝑇 = 𝑘𝐴𝑇 = 𝑘𝐴. Hence 𝑘𝐴 is
symmetric.
(5) Since 𝐴 is anti-symmetric, then 𝐴𝑇 = −𝐴
Therefore (𝑘𝐴)𝑇 = 𝑘𝐴𝑇 = 𝑘(−𝐴) = −𝑘𝐴. Hence
𝑘𝐴 is anti-symmetric.
1
(6) Note that 𝐵 = (𝐴 + 𝐴𝑇 ) is symmetric and 𝐶 =
2
1
(𝐴 − 𝐴𝑇 ) is anti-symmetric.
2
1 1
Then 𝐵 + 𝐶 = (𝐴 + 𝐴𝑇 ) + (𝐴 − 𝐴𝑇 ) = 𝐴.
2 2
Definition 3.
Let 𝐴 = [𝑎𝑖𝑗 ] be a complex matrix. The 𝐴 = [𝑎𝑖𝑗 ] is
called the conjugate matrix for 𝐴, where 𝑎𝑖𝑗 is conjugate
of the complex number 𝑎𝑖𝑗 .
There following theorem gives the properties of the
conjugate matrix. It easily be proved by using the
properties of the complex numbers. The proof is left to
the reader as exercise.
- 43 -
Theorem 2.
Let 𝐴 = [𝑎𝑖𝑗 ] and 𝐵 = [𝑏𝑖𝑗 ] be a complex matrices.
(1) ̿𝐴 = 𝐴;
(2) (𝑘𝐴) = 𝑘 𝐴, 𝑘 ∈ ℂ;
Definition 4.
Let 𝐴 be a complex matrix. The conjugate transpose of 𝐴
is denoted by 𝐴∗ and is defined as follows:
𝑇
𝐴∗ = (𝐴) = 𝐴𝑇 .
Example 4.
0 1+𝑖
𝑖 0 1−𝑖 −𝑖
If 𝐴 = [ ], then 𝐴 = [ ]
2 3 − 2𝑖
−𝑖 2 3 + 2𝑖 𝑖
𝑇
0 2
∗
And 𝐴 = (𝐴) = [1 − 𝑖 3 + 2𝑖 ].
−𝑖 𝑖
Theorem 3.
(1) (𝐴∗ )∗ = 𝐴;
(2) (𝐴 + 𝐵)∗ = 𝐴∗ +𝐵 ∗ ;
(3) (𝑘𝐴)∗ = 𝑘𝐴∗ , 𝑘 ∈ ℂ ;
- 44 -
(4) (𝐴𝐵)∗ = 𝐵 ∗ 𝐴∗ .
Proof.
𝑇
𝑇 ∗ 𝑇 𝑇 𝑇
(1) (𝐴∗ )∗
= [(𝐴) ] = ((𝐴) ) = [(𝐴̿) ] = 𝐴;
𝑇 𝑇
(2) (𝐴 + 𝐵)∗ = (𝐴 + 𝐵) = (𝐴 + 𝐵) .
𝑡 𝑡
= (𝐴) + (𝐵) = 𝐴∗ + 𝐵 ∗ ;
𝑇 𝑇
(3) (𝑘𝐴)∗ = (𝑘 𝐴) = 𝑘(𝐴) = 𝑘𝐴∗
𝑇 𝑇 𝑇
(4) (𝐴𝐵)∗ = (𝐴𝐵) = (𝐴̅𝐵̅)𝑇 = (𝐵) (𝐴) = 𝐵∗ 𝐴∗ .
Definition 5.
Let 𝐴 be a nonsingular complex matrix. 𝐴 is said to be
unitary if 𝐴−1 = 𝐴∗ .
Example 5.
3 4
𝑖
5 5
Prove that the matrix 𝐴 = [−4 3 ] is unitary.
𝑖
5 5
Solution:
3/5 −4/5
𝐴∗ = 𝐴𝑇 = [ ].
−4/5𝑖 −3/5𝑖
- 45 -
Hence 𝐴−1 = 𝐴∗ .
Remark:
If 𝐴 is real matrix, then 𝐴 = 𝐴. Therefore 𝐴∗ = 𝐴𝑇 .
Hence 𝐴 is unitary if and only if 𝐴−1 = 𝐴𝑇 . The real
unitary matrix is called orthogonal matrix.
Example 6.
3/7 2/7 6/7
The matrix 𝐴 = [−6/7 3/7 2/7 ] is orthogonal
2/7 6/7 −3/7
matrix because 𝐴𝑇 𝐴 = 𝐼, i.e., 𝐴−1 = 𝐴𝑇 .
Definition 6.
A square complex matrix 𝐴 is said to be Hermitian
matrix if 𝐴∗ = 𝐴 and anti-Hermitian matrix if 𝐴∗ = −𝐴.
Example 7.
−2 1−𝑖 −1 + 𝑖
The matrix 𝐴 = [ 1 + 𝑖 0 3 ] is
−1 − 𝑖 3 5
𝑖 2 1 + 2𝑖
Hermitian matrix, while 𝐵 = [ −2 5𝑖 −2 + 𝑖]
−1 + 2𝑖 2 + 𝑖 0
is anti-Hermitian matrix.
- 46 -
Exercise Set (1.4)
(1) Let𝐴 and 𝐵 be symmetric matrices of size 𝑛.
Prove that
(a) 𝐴𝐵 + 𝐵𝐴 is symmetric;
(b) 𝐴𝐵 − 𝐵𝐴 is anti-symmetric;
(c) 𝐴𝐵 is symmetric if and only if 𝐴𝐵 = 𝐵𝐴.
- 47 -
𝑖𝑒
(a) [ 𝑒 𝑖𝑒
1 𝑒 −𝑖𝑒 ]
√2 𝑖𝑒 −𝑖𝑒 −𝑖𝑒
3 4𝑖
(b) [ ]
−4 3𝑖
1 1 1
(c) [−1 + 𝑖 1 + 𝑖 ]
√2
√2 √2
3+𝑖 1 − 𝑖 √3
[√
1
(d) ]
2√2 1 + 𝑖 √3 1 − √3
- 48 -
(11) Let 𝐴 be a square matrix. Prove that 𝐴 + 𝐴∗ is
Hermitian.
(12) Which of the following matrices is Hermitian and
which anti- Hermitian:
0
𝑖 𝑖 𝑖 1 1+𝑖
[ ], [ ], [ ],
2 −𝑖 𝑖
𝑖 1 − 𝑖 −3
−2 1 − 𝑖 −1 + 𝑖
[ 1+𝑖 0 3 ]
−1 − 𝑖 3 5
- 49 -
CHAPTER (II)
SYSTEMS OF LINEAR EQUATIONS
- 50 -
CHAPTER (II)
SYSTEMS OF LINEAR EQUATIONS
1. Introduction of systems of linear equations
The study of systems of linear equations and their
solutions is one of the major topics in Mathematic (II). In
this section we shall introduce same basic terminology
and discuss a method for solving such systems. We
suppose that the student is familiar with a simple of linear
system and its solution. In general, an arbitrary system of
𝑚 linear equations in 𝑛 unknowns can be written as.
𝑎11 𝑥1 + 𝑎12 𝑥2 + ⋯ + 𝑎1𝑛 𝑥𝑛 = 𝑏1
𝑎21 𝑥1 + 𝑎22 𝑥2 + ⋯ + 𝑎2𝑛 𝑥𝑛 = 𝑏2
⋮⋮⋮⋮
𝑎𝑚1 𝑥1 + 𝑎𝑚2 𝑥2 + ⋯ + 𝑎𝑚𝑛 𝑥𝑛 = 𝑏𝑚
Where 𝑥1 , 𝑥2 , … … , 𝑥𝑛 are the unknowns and the
subscripted a's and b's denote constants.
If we mentally keep track of the location of +'s, the x's,
and the = 's, a system of 𝑚 linear equations in 𝑛
unknowns can be abbreviated by writing only the matrix
equation 𝐴𝑥 = 𝑏, where
- 51 -
𝑎11 𝑎12 . . 𝑎1𝑛 𝑥1 𝑏1
𝑎21 𝑎22 . . 𝑎2𝑛 𝑥2 𝑏2
𝐴=[ : : .. : ] , 𝑥 = [ : ] 𝑏 = [ ]
:
𝑎𝑚1 𝑎𝑚2 . . 𝑎𝑚𝑛 𝑥𝑛 𝑏𝑚
𝐴 is the coefficients matrix, 𝑏 is the constants matrix 𝑥 is
the unknown's matrix, and [𝐴|𝑏] is the augmented matrix.
Note that the number of the rows in 𝐴 is equal the number
of the equations in the system and the number of the
columns is equal the number of the unknowns.
For example, the augmented matrix for the system of
equations
𝑥1 + 𝑥2 + 2𝑥3 = 9
2𝑥1 + 4𝑥2 − 3𝑥3 = 1
3𝑥1 + 6𝑥2 − 5𝑥3 = 0
is
1 1 2 9
[2 4 −3| 1]
3 6 −5 0
Remark. When constructing an augmented matrix, the
unknown must be written in the same order in each
equation.
The basic method for solving a system of linear equations
is to replace the given system by a new system that has
- 52 -
the same solution set but is easier to solve. This new
system is generally obtained by the elementary row
operations. Now, we introduce two methods using the
augmented matrix to solve a system of linear equations.
(1) Gaussian elimination.
To solve the given system (1) by Gaussian elimination we
put the augmented matrix for the system (1) in row-
echelon form, then the solution set of the system will be
evident by inspection or after a few simple steps.
(2) Gauss-Jordan-elimination:
- 53 -
Example 1.
Solve the following system by Gauss and Gauss-Jordan
elimination.
2𝑥2 + 4𝑥3 = 3
𝑥1 − 3𝑥2 + 5𝑥3 = 1
3𝑥1 − 𝑥2 − 𝑥3 = 1
Solution.
(a) Gaussian-elimination
- 55 -
3𝑅21 1 0 0 11/16
→ [0 1 0| 5/8 ]
0 0 1 7/16
The corresponding system of equation is
11 5
𝑥1 = , 𝑥2 = , 𝑥3 = 7/16.
16 8
Therefore, the unique solution of the system is (11/16,
5/8, 7/16) which we obtained in (a). ◄
Example 2.
Use Gauss and Gauss-Jordan to solve the following: -
𝑥1 − 3𝑥2 + 2𝑥3 + 𝑥4 = 2
3𝑥1 − 9𝑥2 + 10𝑥3 + 2𝑥4 = 9
2𝑥1 − 6𝑥2 + 4𝑥3 + 2𝑥4 = 4
2𝑥1 − 6𝑥2 + 8𝑥3 + 𝑥4 = 7
Solution.
(a) Gauss-elimination
1 −3 2 1 2
[𝐴|𝑏] = [ 3 −9 10 2 | 9]
2 −6 4 2 4
2 −6 8 1 7
1 −3 2 1 2
−3𝑅12 − 2𝑅13
=[0 0 4 −1 |3]
−2𝑅14 0 0 0 0 0
0 0 4 −1 3
- 56 -
1 −3 2 1 2
1/4𝑅2 1 −1/4 3/4
→ [0 0 | ]
0 0 0 0 0
0 0 4 −1 3
1 −3 2 1 2
−4𝑅24 0 0 1 −1/4 3/4
→ [ | ]
0 0 0 0 0
0 0 0 0 0
The entire matrix is now in row-echelon form. The
corresponding system of equations is
𝑥1 − 3𝑥2 + 2𝑥3 + 𝑥4 = 2
1
𝑥3 − 𝑥4 = 3/4
4
Since 𝑥1 and 𝑥3 correspond to leading 1's in the
augmented matrix, we call them leading variables. The
no leading variables (in this ease 𝑥2 and 𝑥4 ) are called
free variables.
Solving for the leading variables in terms of the free
variable gives
𝑥1 = 2 + 3𝑥2 − 2𝑥3 − 𝑥4
3 1
𝑥3 = + 𝑥
4 4 4
3 1
Substituting 𝑥3 = + 𝑥4 into first equation yields
4 4
- 57 -
1 3
𝑥1 = + 3𝑥2 − 𝑥4 ,
2 2
3 1
𝑥3 = + 𝑥
4 4 4
Assign arbitrary values to the free variables, we obtain
𝑥4 = 𝑡 and 𝑥2 = 𝑠.
Therefore, the general solution is given by the formulas
1 3 3 1
𝑥1 = + 3𝑠 − 𝑡, 𝑥2 = 𝑠, 𝑥3 = + 𝑡, 𝑥4 = 𝑡.
2 2 4 4
Remark.
The arbitrary values that are assigned to the free
variables are often called parameters. Although we shall
generally use the letters 𝑟, 𝑠, 𝑡, … for the parameters, any
letters that do not conflict with the variable names may
be used.
(b) Gauss-Jordan elimination.
We start from the end of (a). We put the augmented
matrix in reduced row-echelon form.
1 −3 2 1 2
1 −1/4 3/4
[0 0 | ]
0 0 0 0 0
0 0 0 0 0
1 −3 0 3/2 1/2
−2𝑅21 0 0 1 −1/4 3/4
→ [ | ]
0 0 0 0 0
0 0 0 0 0
- 58 -
The last augmented matrix is in reduced row-echelon
form and the corresponding system is
3 1
𝑥1 − 3𝑥2 + 𝑥4 = ,
2 2
1 3
𝑥3 − 𝑥4 =
4 4
Putting 𝑥2 = 𝑠 and 𝑥4 = 𝑡 we obtain the set of solutions
3 1 3 1
𝑆 = {(3𝑠 − 𝑡 + , 𝑠, + 𝑡, 𝑡): 𝑡, 𝑠 ∈ ℝ}.◄
2 2 4 4
Example 3.
Solve
𝑥+𝑧=1
𝑦 − 𝑧 = −1
2𝑥 + 𝑦 + 𝑧 = 2
by Gauss and Gauss-Jordan elimination.
Solution.
(a)Gauss-Jordan elimination.
The corresponding augmented matrix is:
1 0 1 1 −2𝑅13 1 0 1 1
[𝐴|𝑏]=[ 0 1 −1| −1] → [ 0 1 −1| −1]
2 1 1 2 0 1 −1 0
−1𝑅23 1 0 1 1
→ [ 0 1 −1| −1]
0 0 0 1
- 59 -
The corresponding system is
𝑥 + 𝑧 = 1, 𝑦 − 𝑧 = −1, 𝟎=𝟏
Which is impossible. Therefore, the given system of
equations has no solution. Hence it is inconsistent.
(b) Gauss-Jordan elimination.
Note that the augmented matrix is in reduced row-
echelon form. Therefore, we obtain the same results as
in (a). ◄
Remark.
Every system of linear equations has either no solutions,
exactly one solution, or infinitely many solutions.
Theorem 1.
If 𝐴𝑥 = 𝑏 is a consistent system of equations, then it has
exact one solution, or infinitely many solutions.
Proof.
Let 𝑥1 and 𝑥2 be two solutions of 𝐴𝑥 = 𝑏 has infinitely
solution, we prove that 𝑥1 + 𝑘(𝑥1 − 𝑥2 ), 𝑘 ∈ ℝ is also a
solution of 𝐴𝑥 = 𝑏. Now,
𝐴(𝑥1 + 𝑘 (𝑥1 − 𝑥2 )) = 𝐴𝑥1 + 𝑘 (𝐴𝑥1 ) − 𝑘 (𝐴𝑥1 )
= 𝑏 + 𝑘𝑏 − 𝑘𝑏 = 𝑏.
- 60 -
Therefore 𝑥1 + 𝑘 (𝑥1 − 𝑥2 ), 𝑘 ∈ ℝ is a solution of 𝐴𝑥 =
𝑏.◄
The above three examples are for system of equations
with number equal the number of unknowns.
Now, we give examples for system of equations with
number does not equal the number of unknowns.
Example 3. Use Gauss-Jordon to solve
3𝑥1 − 2𝑥2 = 4, 5𝑥1 + 𝑥2 = 1, 9𝑥1 + 7𝑥2 = −5
Solution.
3 −2 4 1/3𝑅1 1 −2/3 4/3
[𝐴|𝑏]=[5 1 | 1 ] → [5 1 | 1 ]
9 7 −5 9 7 −5
−5𝑅12 1 −2⁄3 4⁄3
(0 13⁄3 |−17⁄3)
−9𝑅13
0 13 −17
3⁄13𝑅2 1 −2/3
4/3
→ [0 1 | −17⁄13]
0 13 −17
−13𝑅23 1 −2/3
4/3 2⁄3𝑅21 1 0
6/13
→ [0 1 | −17⁄13] → [0 1| −17⁄13]
0 0 0 0 0 0
The corresponding system of equations is 𝑥1 = 6⁄13 and
𝑥2 = −17⁄13.◄
- 61 -
Exercise.
Use Gauss elimination to solve
3𝑥1 + 2𝑥2 = 4
9𝑥1 + 6𝑥2 = 12
−36𝑥1 − 24𝑥2 = −48
Example 3. Use Gauss-Jordon to solve
𝑥1 − 2𝑥2 + 𝑥3 = 3
𝑥1 − 2𝑥2 + 𝑥3 = 4
Solution.
[𝐴|𝑏]=[1 −2 1 3 −1𝑅12 1 −2 1 3
| ]→ [ | ].
1 −2 1 4 0 0 0 1
The corresponding system is
𝑥1 − 2𝑥2 + 𝑥3 = 3
0 = 1.
Therefore, the system has no solution. ◄
Remark.
If the number of unknowns is greater than the number of
equations, then the system either has infinitely many
solutions or it is inconsistent.
- 62 -
Example 4.
For which values of a will the following system have no
solutions? Exactly one solution? Infinitely many
solutions?
𝑥1 + 𝑥2 + 𝑥3 + 𝑥4 = 4
𝑥1 + 𝑎𝑥2 + 𝑥3 + 𝑥4 = 4
𝑥1 + 𝑥2 + 𝑎𝑥3 + (3 − 𝑎)𝑥4 = 6
2𝑥1 + 2𝑥2 + 2𝑥3 + 𝑎𝑥4 = 6.
Solution.
We use Gauss elimination to put the augmented matrix in
row-echelon form.
1 1 1 1 4
[𝐴|𝑏] = [1 𝑎 1 1 | 4] =
1 1 𝑎 3−𝑎 6
2 2 2 𝑎 6
1 1 1 1 4
−1𝑅12 , −1𝑅13 0 𝑎−1 0 0
[ |0]
−2𝑅14 0 0 𝑎−1 2−𝑎 2
0 0 0 𝑎−2 −2
… … . . (∗)
If 𝑎 ≠ 1, 2 , then
- 63 -
(*)
1 1 1 1 4
1 1 1
𝑅 , 𝑅 , 𝑅
𝑎−1 2 𝑎−1 3 𝑎−2 4 0 1 0 0 0
→ 2−𝑎 | 2/(𝑎 − 1)
0 0 1
𝑎−1
[0 0 0 1 −2/(𝑎 − 2)]
2−𝑎 1 1 1 1 4
−( )𝑅
𝑎−1 43 0
→ [0 1 0 0 | 4/(𝑎 − 1) ]
0 0 1 0
0 0 0 1 −2/(𝑎 − 2)
1 1 1 0 4 + 2/(𝑎 − 2)
−1𝑅41 0 1 0 0 0
→ [ | 4/(𝑎 − 1) ]
0 0 1 0
0 0 0 1 −2/(𝑎 − 2)
2 4
1 1 0 0 4+ −
(𝑎 − 2) (𝑎 − 1)
−1𝑅31 0 1 0 0 |
→ 0
0 0 1 0 |
4/(𝑎 − 1)
0 0 0 1
[ −2/(𝑎 − 2) ]
2 4
1 0 0 0 4+ −
(𝑎 − 2) (𝑎 − 1)
−1𝑅21 0 1 0 0 |
→ 0
0 0 1 0 |
4/(𝑎 − 1)
0 0 0 1
[ −2/(𝑎 − 2) ]
- 64 -
Therefore, the unique solution is
2 4 4 −2
(4 + − , 0, , )
(𝑎 − 2) (𝑎 − 1) (𝑎 − 1) 𝑎−2
Now, if 𝑎 = 2 , then the matrix (*) becomes
1 1 1 1 4
0 1 0 0 0
[ | ]
0 0 1 0 2
0 0 0 0 −2
which implies 0 = −2, impossible. Then the system has
no solution. Finally, if 𝑎 = 1, then (*) becomes
1 1 1 1 4
0 0 0 0 0
[ | ]
0 0 0 1 2
0 0 0 −1 −2
which is equivalent to the following reduced row-
echelon form.
1 1 1 1 4
0 0 0 1 2
[ | ]
0 0 0 0 0
0 0 0 0 0
The corresponding system is
𝑥1 + 𝑥2 + 𝑥3 = 2,
𝑥4 = 2.
- 65 -
Let 𝑥3 = 𝑡 and 𝑥2 = 𝑠. Then the solution set is 𝑆 =
many solutions. ◄
Theorem 2.
If A is invertible 𝑛 × 𝑛 matrix, for each 𝑛 × 1 matrix 𝑏,
the system of equation 𝐴𝑥 = 𝑏 has exactly one solution,
namely 𝑥 = 𝐴−1 𝑏.
Proof.
Since 𝐴(𝐴−1 𝑏) = 𝑏, it follows that 𝑥 = 𝐴−1 𝑏 is a
solution of 𝐴𝑥 = 𝑏. To show that this is the only
solution, we will assume that 𝑥0 is an arbitrary solution
and then we show that 𝑥0 must be the solution 𝐴−1 𝑏.
If 𝑥0 is any solution, then 𝐴𝑥0 = 𝑏. Multiplying both
sides by 𝐴−1 , we obtain 𝑥0 = 𝐴−1 𝑏.◄
Example 5.
Consider the system of linear equations
𝑥1 − 2𝑥2 + 2𝑥3 = 3
2𝑥1 + 𝑥2 + 𝑥3 = 0
𝑥1 + 𝑥3 = −2
- 66 -
The matrix form of this system can be written as 𝐴𝑥 =
𝑏, where
1 −2 2 𝑥1 3
𝐴 = [2 1 1] , 𝑥 = [𝑥2 ] 𝑏 = [ 0 ].
1 0 1 𝑥3 −2
By elementary row operation, we obtain that
1 2 −4
−1
𝐴 = [−1 −1 3 ].
−1 −2 5
Therefore, the unique solution is
𝑥1 1 2 −4 3 11
𝑥 = [𝑥2 ] = 𝐴 𝑏 = [−1 −1 3 ] [ 0 ] = [ −9 ].◄
−1
𝑥3 −1 −2 5 −2 −13
Corollary 1.
If is A is 𝑛 × 𝑛 matrix, then the following are equivalent.
(a) A is invertible.
(b) The reduced row-echelon form of A is In.
(c) A is expressible as a product of elementary
matrices.
(d) 𝐴𝑥 = 𝑏 is consistent for every 𝑛 × 1 matrix b.
(e) 𝐴𝑥 = 𝑏 has exactly one solution for every 𝑛 × 1
matrix b.
(f) Det 𝐴 ≠ 0.
- 67 -
Example 6.
What conditions must 𝑏1 , 𝑏2 , and 𝑏3 satisfy for the
system of equations
𝑥1 − 2𝑥2 + 5𝑥3 = 𝑏1
4𝑥1 − 5𝑥2 + 8𝑥3 = 𝑏2
−3𝑥1 + 3𝑥2 − 3𝑥3 = 𝑏3
to be consistent?
Solution.
The augmented matrix is
1 −2 5 𝑏1
[𝐴|𝑏]=[ 4 −5 8 | 𝑏2 ]
−3 3 −3 𝑏3
which can be reduced to row-echelon form as follows.
𝑏1
−4𝑅12 1 −2 5
[𝐴|𝑏] [0 3 −12| 𝑏2 − 4𝑏1 ]
3𝑅13
0 −3 12 𝑏3 + 3𝑏1
1/3𝑅2 1 −2 5 𝑏1
→ [0 1 −4| (𝑏2 − 4𝑏1 )/3]
0 −3 12 𝑏3 + 3𝑏1
3𝑅23 1 −2 5 𝑏1
→ [0 1 −4| (𝑏2 − 4𝑏1 )/3].
0 0 0 𝑏3 + 𝑏2 −𝑏1
- 68 -
It is now evident from the third row in the matrix that
the system has a solution if and only if 𝑏1 , 𝑏2 , and 𝑏3
satisfy the condition
𝑏3 + 𝑏2 − 𝑏1 = 0 or 𝑏3 = 𝑏1 − 𝑏2 .
To express this condition another way, 𝐴𝑥 = 𝑏 is
consistent if and b is a matrix of the form
𝑏1
𝑏 = [ 𝑏2 ],
𝑏1 − 𝑏2
where 𝑏1 and 𝑏2 are arbitrary.
- 69 -
Exercise Set (2.1)
In exercise (1)-(15), Find all solutions of the given
systems using Gauss and Gauss-Jordan.
(1) 𝑥 + 𝑦 = 1 (2) 𝑥 + 2𝑦 = 2
𝑥−𝑦 =0 𝑥 − 4𝑦 = −1
(3) 3𝑥 + 7𝑦 − 3𝑧 = 2 (4) 3𝑥 + 7𝑦 − 3𝑧 = 2
2𝑥 + 5𝑦 + 𝑧 = −4 2𝑥 + 5𝑦 + 𝑧 = −4
2𝑥 + 6𝑦 + 10𝑧 = 3 2𝑥 + 6𝑦 + 10𝑧 = −20
(5) 𝑥 + 𝑦 + 𝑧 = 4 (6) 𝑥 − 2𝑦 − 𝑧 = 1
2𝑥 + 5𝑦 − 2𝑧 = 3 𝑥+𝑦−𝑧 =2
𝑥 + 7𝑦 − 7𝑧 = 5 𝑥 + 2𝑦 − 2𝑧 = 2
(7) 𝑥1 − 𝑥2 + 2𝑥3 − 𝑥4 = −1 (8) 𝑥1 − 2𝑥2 + 𝑥3 − 4𝑥4 = 1
2𝑥1 + 𝑥2 − 2𝑥3 − 2𝑥4 = −2 𝑥1 + 3𝑥2 + 7𝑥3 + 2𝑥4 = 2
−𝑥1 − 2𝑥2 − 4𝑥3 + 𝑥4 = 1 𝑥1 − 12𝑥2 − 11𝑥3 − 16𝑥4 = 5
3𝑥1 − 3𝑥4 = −3 2𝑥1 + 3𝑥2 + 3𝑥3 + 5𝑥4 = 6
(9) 2𝑥1 + 𝑥2 + 𝑥3 + 4𝑥4 = 2 (10) 𝑥 + 𝑦 = 0
𝑥1 + 𝑥2 + 𝑥3 − 2𝑥4 = 0 2𝑥 − 𝑦 = 1
2𝑥2 + 𝑥3 − 4𝑥4 = 1 𝑥 + 2𝑦 = −1
- 70 -
(11)𝑥1 + 𝑥2 + 𝑥3 + 𝑥4 = 2 (12) 2𝑥1 + 𝑥2 − 3𝑥3 = 1
2𝑥1 + 𝑥2 + 3𝑥3 − 𝑥4 = 1 𝑥1 − 𝑥2 + 2𝑥3 = 2
(13) 3𝑥 − 2𝑦 = 4 (14)𝑥2 + 𝑥3 = 1
5𝑥 + 𝑦 = 1 𝑥1 − 𝑥3 = 2
9 𝑥 + 7𝑦 = −5 −2 𝑥1 − 𝑥2 = 3
(16) Find conditions that b's must satisfy for the system to be
consistent
(𝑎) 2𝑥 + 3𝑦 + 5𝑧 = 𝑏1 (𝑏)𝑥1 + 𝑥2 + 2𝑥3 = 𝑏1
4𝑥 − 5𝑦 + 8𝑧 = 𝑏2 𝑥1 + 𝑥3 = 𝑏2
3𝑥 + 3𝑦 + 2𝑧 = 𝑏3 2𝑥1 + 𝑥2 + 3𝑥3 = 𝑏3
- 71 -
(𝑐) 𝑥1 − 𝑥2 + 3𝑥3 + 2𝑥4 = 𝑏1
−2𝑥1 + 𝑥2 + 5𝑥3 + 𝑥4 = 𝑏2
−3𝑥1 + 2𝑥2 + 2𝑥4 − 𝑥4 = 𝑏3
4𝑥1 − 3𝑥2 + 𝑥3 + 3𝑥4 = 𝑏4
(17) Solve the following systems:
- 72 -
2. HOMOGENEOUS SYSTEMS OF LINEAR EQUATIONS
A system of linear equation 𝐴𝑥 = 𝑏, where 𝐴 is a
matrix of size 𝑚 × 𝑛, 𝑥 is a matrix of size 𝑛 × 1, and b a
matrix of size 𝑚 × 1, is said to be homogeneous if the
matrix 𝑏 is a zero matrix, 𝑖. 𝑒. , 𝑏 = 𝟎. If 𝑏 ≠ 𝟎. then
𝐴𝑥 = 𝑏 is a non- homogeneous system.
Every homogeneous system of linear equations is
consistent, since (0, 0, … ,0) is a solution of this system.
This solution is called the trivial solution; if there are
other solutions, they are called nontrivial solutions,
nontrivial solutions.
Because a homogeneous linear system always has the
trivial solution, there are only two possibilities for its
solutions.
• The system has only the trivial solution.
• The system has infinitely many solutions in addition
to the trivial solution.
- 73 -
Theorem 1.
If 𝑥 and 𝑦 are solutions for the homogeneous system
𝐴𝑥 = 𝟎, then both 𝑥 + 𝑦 and 𝑘𝑥 are solutions for the
system.
Proof.
Since 𝑥, 𝑦 are solutions for the system, then 𝐴𝑥 = 𝟎 and
𝐴𝑦 = 𝟎. Therefore 𝐴(𝑥 + 𝑦) = 𝐴𝑥 + 𝐴𝑦 = 𝟎 + 𝟎 = 𝟎
and 𝐴(𝑘𝑥 ) = 𝑘(𝐴𝑥 ) = 𝑘. 𝟎 = 𝟎. Hence both 𝑥 + 𝑦 and
𝑘𝑥 are solution of 𝐴𝑥 = 𝟎.◄
Definition 2.
Let 𝑈1 , 𝑈2 , … … , 𝑈𝑘 be an 𝑛 × 1 (or 1 × 𝑛) matrices.
Then an 1 × 𝑛 matrix is a linear combination of
𝑈1 , 𝑈2 , … … , 𝑈𝑘 if there exist reals 𝛼1 , 𝛼2 , … … , 𝛼𝑘 such
that 𝑈 = 𝛼1 𝑈1 + 𝛼2 𝑈2 + ⋯ + 𝛼𝑘 𝑈𝑘 .
Note that by the above theorem, any linear combination
of solutions of 𝐴𝑥 = 𝟎 is also a solution. We use Gauss
and Gauss-Jordan in solving homogeneous system. Also,
we write this solution as a linear combination of other
solutions.
- 74 -
Example 1.
Solve the following homogeneous system of linear
equations by Gauss-Jordan elimination:
𝑥1 − 2𝑥2 + 𝑥3 + 𝑥4 = 0
−𝑥1 + 2𝑥2 + 𝑥4 = 0
2𝑥1 − 4𝑥2 + 𝑥3 = 0
Write the solution as a linear combination of the two
solutions (1,0, −2, 1) and (2, 1, 0, 0).
Solution.
The augmented matrix for the system is
1 −2 1 1 0
[−1 2 0 1| 0].
2 −4 1 0 0
Reducing this matrix to reduced row-echelon form, we
obtain
1 −2 0 1 0
[ 0 0 1 2 | 0]
0 0 0 0 0
The corresponding system of equations is
𝑥1 − 2𝑥2 − 𝑥4 = 0
𝑥3 + 2𝑥4 = 0
Solving for the leading variables yields
𝑥1 = 2𝑥2 + 𝑥4
- 75 -
𝑥3 = −2𝑥4
Thus, the general solution is
𝑥1 = 2𝑠 + 𝑡, 𝑥2 = 𝑠, 𝑥3 = −2𝑡, 𝑥4 = 𝑡.
Hence the set of solutions 𝑆 = {(2𝑠 + 𝑡, 𝑠, −2𝑡, 𝑡) ∶
𝑆, 𝑡 ∈ ℝ}. Therefore
(2𝑠 + 𝑡, 𝑠, −2𝑡, 𝑡) = (2𝑠, 𝑠, 0,0) + (𝑡, 0, −2𝑡, 𝑡)
= 𝑠(2,1,0,0) + 𝑡(0,0, −2,1).
Which is a linear combination of (2,1,0,0) and
(0,0, −2,1).◄
Theorem 2.
A homogeneous system of linear equations with more
unknowns than equations has infinitely many solutions.
Remark. Note that the above theorem applies only to
homogeneous system. A non-homogeneous system with
more unknowns than equations need not be consistent;
however, if the system is consistent, it will have infinitely
many solutions.
Theorem 3.
If 𝑥∘ is a solution of 𝐴𝑥 = 𝑏, then any solution of 𝐴𝑥 =
𝑏 is in the form 𝑥∘ + 𝑥1 , where 𝑥1 is a solution of 𝐴𝑥 =
𝟎.
- 76 -
Proof.
Since 𝐴(𝑥∘ + 𝑥1 ) = 𝐴𝑥∘ + 𝐴𝑥1 = 𝑏 + 𝟎 = 𝑏, then any
solution of 𝐴𝑥 = 𝑏 is in the form 𝑥∘ + 𝑥1 . Now suppose
that 𝑦 is a solution of 𝐴𝑥 = 𝑏. We prove that 𝑦 = 𝑥∘ +
𝑥1 . Put 𝑥1 = 𝑦 − 𝑥∘ . Then
𝐴𝑥1 = 𝐴(𝑦 − 𝑥∘ ) = 𝐴𝑦 − 𝐴𝑥∘ = 𝑏 − 𝑏 = 𝟎.◄
Example 2.
If 𝑥∘ = (1, −1, 0, 0) is a solution of the system
𝑥1 + 2𝑥4 = 1, 𝑥2 − 𝑥4 = −1, 𝑥3 + 𝑥4 = 0, then find all
solutions of the system.
Solution. By Gauss-Jordan, (−2𝑡, 𝑡, −𝑡, 𝑡) is a solution
of the homogenous system. Then the solutions of the
given system are (−2𝑡, 𝑡, −𝑡, 𝑡) + (1, −1, 0, 0), 𝑡 ∈
ℝ.◄
Corollary 1.
If A is a square matrix, then 𝐴𝑥 = 𝟎 has the trivial
solution if and only if A is invertible.
Example 3.
Prove that the following system has only the trivial
solution.
𝑥1 − 2𝑥2 + 2𝑥3 = 0
- 77 -
2𝑥1 + 𝑥2 + 𝑥3 = 0
𝑥1 + 𝑥3 = 0
Solution.
1 −2 2
Since det 𝐴 = |2 1 1| = 1 ≠ 0, then 𝐴 is
1 0 1
invertible. Therefore, the given system has only the trivial
solution
We end this section by the following theorem.
Theorem 4.
(1) 𝐴 is in veritable.
matrix 𝑏.
- 78 -
Exercise Set (2.2)
(1) Find a such that the following two systems has
infinitely many solutions?
(a) 𝑥 − 2𝑦 + 𝑧 = 0,
𝑥 + 𝑎𝑦 − 3𝑧 = 0,
−𝑥 + 𝑏𝑦 − 5𝑧 = 0
(b) 𝑥 − 𝑦 − 𝑧 = 0, 𝑥𝑦 − 𝑧 = 0, 𝑥 + 𝑦 + 𝑎𝑧 = 0
(3) Write the general solution of the following system
as a linear combination of particular solutions.
𝑥1 + 2𝑥2 + 𝑥3 − 𝑥4 + 3𝑥5 = 0
𝑥1 + 2𝑥2 + 2𝑥3 + 𝑥4 + 2𝑥5 = 0
2𝑥1 + 4𝑥2 + 2𝑥3 − 𝑥4 + 7𝑥5 = 0
(3) Prove that the following system has only the trivial
solution.
𝑥 + 𝑦 = 0, 𝑥 − 𝑦 = 0, 2𝑥 + 3𝑦 = 0
- 79 -
CHAPTER (III)
DETERMINATES
- 80 -
CHAPTER (III)
DETERMINATES
1. DETERMINATES
For any 𝑛 × 𝑛 matrix 𝐴 we can define the determinant of
𝐴 as function. Its domain is the set of 𝑛 × 𝑛 matrix and its
co-domain is the set of real numbers. The determinant
function is very important. It gives us some information
about the matrix, such as its inverse and how we compute
it. Also, we can solve a system of linear equation as we
will see. There are many methods to define the
determinant of the matrix. We choose the inductive
definition, 𝑖. 𝑒., we define the determinat for a matrix of
degree 1 and 2, and we define the determinat of an 𝑛 × 𝑛
matrix.
Let 𝐴 be an 𝑛 × 𝑛 matrix. The (𝑛 − 1) × (𝑛 − 1) sub-
matrix 𝑀𝑖𝑗 is obtained by deleting the 𝑖𝑡ℎ row and
𝑗𝑡ℎ column from 𝐴.
1 1 0
2 1
For example, if 𝐴 = [ 2 1 1], then 𝑀12 = [ ]
−1 3
−1 0 3
1 0
and 𝑀22 = [ ].
−1 3
- 81 -
Definition 1.
Let 𝐴 = [𝑎𝑖𝑗 ] be an 𝑛 × 𝑛 matrix the determinat of 𝐴 is
denoted by det 𝐴 or |𝐴|and is defined by inductive as:
(1) If 𝑛 = 1, then det 𝐴 = 𝑎11
(2) If 𝑛 = 2, then det 𝐴 = 𝑎11 𝑎22 − 𝑎12 𝑎21
(3) If 𝑛 > 2, then
det 𝐴 = 𝑎11 det 𝑀11 − 𝑎12 det 𝑀12 + ⋯ +
𝑛
Example 1.
1 3 −2
If 𝐴 = [−1 2 4 ], then
3 1 0
2 4 −1 4 −1 2
det 𝐴 = 1 | | − 3| | + (−2) | |
1 0 3 0 3 1
= (0 − 4) − 3(0 − 12) − 2(−1 − 6)
= 46.◄
Example 2.
2 1 −3 1
−3−2 0 2
Let 𝐴 be the matrix [ ].
2 1 0 −1
1 0 1 2
- 82 -
Then
det 𝐴 = 2 det 𝑀11 − det 𝑀12 − det 𝑀13 − det 𝑀14 ,where
−2 0 2 −3 0 2
𝑀11 = [ 1 0 −1] , 𝑀12 = [ 2 0 −1],
1 0 2 1 1 2
−3 −2 −2 −3 −2 0
𝑀13 = [ 2 1 −1] , 𝑀14 = [ 2 1 0].
1 0 2 1 0 1
Therefore,
det 𝑀11 = 0, det 𝑀12 = 1, det 𝑀13 = 2, det 𝑀14 = 1.
Hence det 𝐴 = 2(0) − 1 − 3(2) − 1 = −8.◄
●Note that we defined the determinant by inductive using
the first row of the matrix. But we use any row or any
column.
Definition 2.
Let 𝐴 = [𝑎𝑖𝑗 ] be can 𝑛 × 𝑛 matrix. The minor of 𝑎𝑖𝑗 is
det 𝑀𝑖𝑗 and the cofactor 𝐶𝑖𝑗 of 𝑎𝑖𝑗 is (−1)𝑖+𝑗 det 𝑀𝑖𝑗 .
Example 3.
2 5 −21
1−2 0 1
Let 𝐴 = [ ].Then
33 40
4 1 −16
- 83 -
2 −2 1
4+2
𝐶42 = (−1) det 𝑀42 = [1 0 1] = −10.◄
3 4 0
● Note that det 𝐴 is the sum of the product of the
have
- 84 -
Example 4.
2 5 −2 1
10 0 1
If 𝐴 = [ ], then we choose the second column
30 4 0
4 0 −1 2
to compute det 𝐴 as it contains many zeros. Therefore
det 𝐴 = 𝑎12 𝑐12 + 𝑎22 𝑐22 + 𝑎32 𝑐32 + 𝑎42 𝑐42
= 𝑎12 𝑐12 + 0 + 0 + 0
1 0 1
= −5 |3 4 0|
4 −1 2
= −5[1(8 + 0) − 0(6 − 0) + 1(−3 − 16)] = 55.◄
- 85 -
−11 0 1 −1 4 5 −2
311 0 3 −12 7 −3
(7) [ ] (8) [ ]
0 1 2 −3 2 −8 8 4
1 2−3 2 1 −4 −1 2
𝑘−1 1
(9) Find 𝑘 such that | | = 0.
2 𝑘
1 −1 1
(10) Find 𝑘 such that |−1 𝑘 1| = 0.
−1 −1 1
𝑘−4 0 0
(11) Find 𝑘 such that | 0 𝑘 2 | = 0.
0 3 𝑘−1
1 0 −3
𝜆 −1
(12) Find 𝑘 such that | | = |2 𝜆 −6 |.
3 1−𝜆
1 3 𝜆−5
sin 𝜃 cos 𝜃 0
(13) Find | − cos 𝜃 sin 𝜃 0|.
sin 𝜃 − cos 𝜃 sin 𝜃 + cos 𝜃 1
- 86 -
3.2 Cramer' Rule
Theorem 1. (Cramer' Rule).
If 𝐴𝑥 = 𝑏 is a system of 𝑛 linear equations in 𝑛
unknowns such that det (𝐴) ≠ 0. Then the system has
a unique solution. This solution is
det (𝐴1 ) det (𝐴2 ) det (𝐴𝑛 )
𝑥1 = , 𝑥2 = , … , 𝑥𝑛 =
det (𝐴) det (𝐴) det (𝐴)
Where 𝐴𝑗 is the matrix obtained by replacing the
entries in the 𝑗𝑡ℎ column of 𝐴 by the entries in the
𝑏1
𝑏
matrix 𝑏 = [ 2 ]
⋮
𝑏𝑛
Proof.
Since det (𝐴) ≠ 0, then 𝐴 is invertible and𝑥 = 𝐴−1 𝑏
is the unique solution of 𝐴𝑥 = 𝑏.
Therefore
1
𝑥 = 𝐴−1 𝑏 = 𝑎𝑑j (𝐴). 𝑏
det(𝐴)
𝑐11 𝑐21 …𝑐𝑛1 𝑏1
1 𝑐12 𝑐22 …𝑐𝑛2 𝑏
= [ ⋮ ⋮ … ⋮ ] [ 2]
det (𝐴) ⋮
𝑐1𝑛 𝑐2𝑛 …𝑐𝑛𝑛 𝑏𝑛
- 87 -
1
Therefore𝑥𝑖 = (𝑏1 𝑐1𝑖 + 𝑏2 𝑐2𝑖 + ⋯ + 𝑏𝑛 𝑐𝑛𝑖 ),
det(𝐴)
𝑖 = 1,2, … , 𝑛.
Also, computing det𝐴𝑖 using column 𝑖 we obtain
det 𝐴𝑖 = 𝑏1 𝑐1𝑖 + 𝑏2 𝑐2𝑖 + ⋯ + 𝑏𝑛 𝑐𝑛𝑖
det 𝐴𝑖
Therefore 𝑥𝑖 = , 𝑖 = 1,2, … , 𝑛.◄
det 𝐴
Example 1.
Use Cramer's Rule to solve
𝑥1 + 2𝑥3 = 6
−3𝑥1 + 4𝑥2 + 6𝑥3 = 30
−𝑥1 − 2𝑥2 + 3𝑥3 = 8
Solution.
1 0 2 6 0 2
𝐴 = [−3 4 6] , 𝐴1 = [30 4 6]
−1 −2 3 8 −2 3
1 6 2 1 0 6
𝐴2 = [−3 30 6] , 𝐴3 = [−3 4 30]
−1 8 3 −1 2 8
Therefore
det(𝐴1 ) −10 det (𝐴2 ) 72 18
𝑥1 = = , 𝑥2 = = =
det(𝐴) 11 det (𝐴) 44 11
𝑑𝑒𝑡 (𝐴3 ) 152 38
𝑥3 = − = .◄
det (𝐴) 44 11
- 88 -
Exercise Set (3.2)
(1) Use Cramer's Rule to solve the following systems
(a) 2𝑥 − 3𝑦 = 7 (b) 8𝑥 − 6𝑦 = −4
8𝑥 + 𝑦 = −2 3𝑥 + 2𝑦 = 6
(c) 2𝑥 − 6𝑦 + 𝑍 = 2 (d) 𝑥 − 3𝑦 − 𝑍 = −7
𝑦+𝑍 =1 𝑥 − 𝑦 − 𝑍 = −2
𝑥−𝑦−𝑍 =0 𝑥 − 6𝑦 − 2𝑍 = −3
(e) 𝑥1 + 𝑥2 + 𝑥3 = 𝑎 (f) 6𝑥 − 𝑦 + 3𝑍 = −3
𝑥1 + (1 + 𝑎)𝑥2 + 𝑥3 = 2𝑎 9𝑥 + 5𝑦 + 2𝑍 = 7
𝑥1 + 𝑥2 + (1 + 𝑎)𝑥3 = 0 5𝑥 + 𝑦 − 8𝑍 = −2
(5) Is it possible to use Cramer's Ruler in solving the
following System?
2𝑥 + 3𝑦 + 𝑍 = 9
𝑥 + 2𝑦 + 3𝑍 = 6
𝑥 + 𝑦 + 2𝑍 = 3
Why? Is there a solution for the system?
- 89 -
CHAPTER IV
LINEAR SPACE
- 90 -
CHAPTER IV
LINEAR SPACE
1. Euclidean n-space
Definition 1.
Let 𝑛 ≥ 1 . The set of all ordered n- tuples of real
numbers is called Euclidean space of dimension n and
will be denoted by ℝ𝑛 . That is:
ℝ𝑛 = {(𝑎1 , 𝑎2 , … , 𝑎𝑛 ) ∶ 𝑎𝑖 ∈ ℝ, 1 ≤ 𝑖 ≤ 𝑛}
Note.
We will use the row or the column to express the
elements of ℝ𝑛 and will be called n- vector. The addition
and scalar multiplication on ℝ𝑛 are defined as follows.
Definition 2.
Let 𝒖 = (𝑢1 , 𝑢2 , … , 𝑢𝑛 ), 𝒗 = (𝑣1 , 𝑣2 , … , 𝑣𝑛 ) ∈ ℝ𝑛 and
𝛼 ∈ ℝ , then 𝒖 + 𝒗 = (𝑢1 + 𝑣1 , 𝑢2 + 𝑣2 , … , 𝑢𝑛 + 𝑣𝑛 ) and
𝛼𝒖 = (𝛼𝑢1, 𝛼𝑢2 , … , 𝛼𝑢𝑛 ).
The zero element of ℝ𝑛 and the inverse of 𝒖 (𝑖. 𝑒, −𝒖)
are 𝟎 = (0,0, … ,0) and −𝒖 = (−𝑢1 , −𝑢2 , … , −𝑢𝑛 ),
respectively, 𝒖 − 𝒗 = 𝒖 + (−𝒗)
= (𝑢1 − 𝑣1 , 𝑢2 − 𝑣2 , … , 𝑢𝑛 − 𝑣𝑛 )
- 91 -
Example 1.
Let 𝒖 = (2, 6, −4) and 𝒗 = (−1, 3, 5) be elements of
ℝ3 . Find 𝒖 + 𝒗, 𝒖 − 𝒗, 3𝒖 and −2𝒗 .
Solution.
𝒖 + 𝒗 = (2, 6, −4) + (−1, 3, 5) = (1, 9, 1);
𝒖 − 𝒗 = 𝒖 + (−𝒗) = (2,6, −4) + (1, −3, −5)
= (3,3, −9);
3𝒖 = 3(2, 6, −4) = (6, 18, −12);
−2𝒗 = −2(−1, 3, 5) = (2, −6, −10).◄
For every positive integer n the set of ℝ𝑛 satisfies some
properties listed in the following theorem. Since these
properties are easy to establish, the proof of this theorem
is left as an exercise.
Theorem 1.
If 𝒖, 𝒗 and w are any vectors in ℝ𝑛 and if 𝛼, 𝛽 are any
real numbers, then
1. 𝒖 + 𝒗 is an n-vector;
2. 𝒖 + 𝒗 = 𝒗 + 𝒖 ;
3. (𝒖 + 𝒗) + 𝒘 = 𝒖 + (𝒗 + 𝒘);
4. There is a zero element 0 in ℝ𝑛 such that
𝒖 + 𝟎 = 𝟎 + 𝒖 = 𝒖 for every u in ℝ𝑛 ;
- 92 -
5. There is an n-vector –u such that 𝒖 + (−𝒖) = 𝟎 ;
6. 𝛼𝒖 is an n- vector;
7. (𝛼𝛽 )𝒖 = 𝛼 (𝛽𝒖);
8. (𝛼 + 𝛽 )𝒖 = 𝛼𝒖 + 𝛽𝒖 ;
9. 𝛼 (𝒖 + 𝒗) = 𝛼𝒖 + 𝛼𝒗 ;
10. 1. 𝒗 = 𝒗.
2. Linear spaces
In Section 1 above, we found that ℝ𝑛 possesses several
arithmetic properties that do not depend on the positive
integer. The operation of addition is both commutative
and associative, while the operation of scalar
multiplication satisfies several distributive properties. In
addition, ℝ𝑛 contains a zero element and every vector in
ℝ𝑛 has an additive inverse. We now define an abstract
mathematical system that embodies the arithmetic
properties of ℝ𝑛 .
Definition1.
A linear space V over the set of real numbers is a
nonempty set with two laws of combination, called
addition and scalar multiplication, satisfying the
- 93 -
following conditions for all 𝒖, 𝒗 and w in V and all real
numbers 𝛼 and 𝛽.
1. To every pair 𝒖, 𝒗 in V there is only one element in V,
called the sum of u and 𝒗, denoted 𝒖 + 𝒗 (closure
under addition).
2. 𝒖 + 𝒗 = 𝒗 + 𝒖 (Commutative property of addition).
3. (𝒖 + 𝒗) + 𝒘 = 𝒖 + (𝒗 + 𝒘) (Associate property of
addition).
4. There is a zero element 0 in V such that
𝒖 + 𝟎 = 𝟎 + 𝒖 = 𝒖 for all u in V (Existence of
additive identity).
5. To every element u in V there corresponds an additive
inverse element –u in V such that 𝒖 + (−𝒖) = 𝟎
(Existence of additive inverse).
6. To every real number 𝛼 and every element u in V
there is associated a unique element in V, called the
scalar product of u and 𝛼, which denoted by 𝛼𝒖
(closure under scalar multiplication).
7. (𝛼𝛽 )𝒖 = 𝛼(𝛽𝒖) (Associate property of scalar
multiplication).
- 94 -
8. (𝛼 + 𝛽 )𝒖 = 𝛼𝒖 + 𝛽𝒖 (Distributive property of scalar
multiplication).
9. 𝛼 (𝒖 + 𝒗) = 𝛼𝒖 + 𝛼𝒗 (Distributive property of scalar
multiplication).
10. 1. 𝒖 = 𝒖 (Scalar identity property).
- 95 -
Example1.
By Theorem 1 of Section 1 in this chapter, ℝ𝑛 of all
ordered n-tuples of real numbers is a linear space over ℝ
for 𝑛 ≥ 1.◄
Remark.
We can express the elements of ℝ2 as vectors in the
plane. For example,𝒖 = (a1 , a1 ) ∈ ℝ2 is the vector line
with initial point(0, 0)and end point (a1 , a1 ) as shown in
the figure.
- 96 -
If 𝒗 = (𝑎, 𝑏) ∈ ℝ2 ,then 𝛼𝒗 = α(𝑎, 𝑏) = (𝛼𝑎, 𝛼𝑏) is the
vector with initial (0, 0) and end (𝛼𝑎, 𝛼𝑏) . If 𝛼 > 0, then
the direction of 𝛼𝒗 is the same direction of 𝒗. If 𝛼 < 0,
the direction of 𝛼𝒗 is opposite to the direction of 𝒗 as
shown in the following figure.
Example 2.
From Theorem 1 in Section 1 in Chapter I the set of all
𝑚 × 𝑛 matrices, with real entries, with addition and
scalar multiplication of matrices (𝑀𝑚×𝑛 , + , ∙) is a
linear space. ◄
- 97 -
Example 3.
Let V be the set of all 2- vectors whose components total
zero. That is 𝑉 = {(𝑥, 𝑦): 𝑥 + 𝑦 = 0}. The set V is
nonempty. Since (0, 0) is an element of V. The set V also
satisfied 2,3,7, 8,9 and 10 of Definition 1 in this section,
because all 2- vectors satisfies them and V is the set of 2-
vectors. We now verify that the remaining properties are
satisfied. Let 𝒖 = (𝑥1 , 𝑦1 ) and 𝒗 = (𝑥2 , 𝑦2 ) be any two
elements of V so that 𝑥1 + 𝑦1 = 0, 𝑥2 + 𝑦2 = 0. Then
𝒖 + 𝒗 = (𝑥1 + 𝑥2 , 𝑦1 + 𝑦2 ) and (𝑥1 + 𝑥2 ) +
(𝑦1 + 𝑦2 ) = (𝑥1 + 𝑦1 ) + (𝑥2 + 𝑦2 ) = 0 + 0 = 0. Thus
𝒖 + 𝒗 is an element of V, and property 1 is satisfied.
Since 𝟎 = (0, 0) is an element of V and 𝒖 + 𝟎 = 𝒖 for
every vector u in V, property 4 is satisfied. It is clear that
– 𝒖 = (−𝑥, −𝑦) is an additive inverse for 𝒖 = (𝑥, 𝑦)
since 𝒖 + (−𝒖) = (𝑥, 𝑦) + (−𝑥, −𝑦) = (0, 0) = 𝟎 .
- 98 -
is an element of V, and property 6 is satisfied. We have
shown that V satisfied all the properties of Definition 1.
Therefore, V is a linear space. ◄
Example 4.
Let V be the set of all 2-vectors whose components total
one. That is 𝑉 = {(𝑥, 𝑦): 𝑥 + 𝑦 = 1} . Prove that 𝑉 is not
a linear space.
Solution:
Properties 1,4,5 and 6 of Definition 1 in this section are
not satisfied. In particular (0, 1) and (1, 0) are elements
of 𝑉, but (0, 1) + (1, 0) = (1, 1) is not. So, property 1 is
not satisfied. ◄
The important example of linear spaces is the set
of all polynomials. So, we remember the basic properties
of polynomials functions.
We define the polynomial function of degree n in the
variable 𝑥 as follows:
𝑃(𝑥 ) = 𝑎0 + 𝑎1 𝑥 + 𝑎2 𝑥 2 + ⋯ + 𝑎𝑛 𝑥 𝑛 ,
where 𝑎0 , 𝑎1 , 𝑎2 , … , 𝑎𝑛 ∈ ℝ are the real coefficients and
𝑎𝑛 ≠ 0. The zero polynomial is a polynomial with all
- 99 -
coefficients are zeros and its degree is undefined. Let P
be the set of all polynomials and
𝑝(𝑥 ) = 𝑎0 + 𝑎1 𝑥 + 𝑎2 𝑥 2 + ⋯ and
𝑞 (𝑥 ) = 𝑏0 + 𝑏1 𝑥 + 𝑏2 𝑥 2 + ⋯
be two elements of P (may be with different degrees). We
say that 𝑝(𝑥 ) = 𝑞(𝑥) if and only if 𝑎𝑖 = 𝑏𝑖 for all i. We
define addition and scalar multiplication as follows.
𝑝(𝑥) + 𝑞(𝑥) = (𝑎0 + 𝑏0 ) + (𝑎1 + 𝑏1 )𝑥 + (𝑎2 + 𝑏2 )𝑥 2 + ⋯
𝛼𝑝(𝑥 ) = 𝛼𝑎0 + (𝛼𝑎1 )𝑥 + (𝛼𝑎2 )𝑥 2 + ⋯
It is obvious that if 𝑝(𝑥 ), 𝑞(𝑥) ∈ 𝑃 and 𝛼 ∈ ℝ , then
𝑝(𝑥 ) + 𝑞(𝑥) ∈ 𝑃 and 𝛼𝑝(𝑥) ∈ 𝑃. Also, It is easy to
verify that P satisfies all properties of Definition 1 above.
However, if 𝑉 denotes the set of all polynomial functions
with real coefficients of exactly degree 2. The operations
of addition and scalar multiplication are defined above.
Then 𝑉 is not a linear space, because the sum of two
second – degree polynomials need not be a second –
degree polynomial. For example, the sum of 𝑝(𝑥 ) = 𝑥 2
and 𝑞 (𝑥 ) = −𝑥 2 + 𝑥 is a first-degree polynomial.
Hence property 1 of Definition 1 above is not satisfied.
The polynomial that is zero for all x is not a second –
- 100 -
degree polynomial, so property 4 is not satisfied. There
are important subsets of P which is a linear space. For
example, if 𝑃𝑛 is the set of all polynomial functions of
degree less than or equal to n with the zero polynomial.
Then 𝑃𝑛 is a linear space for all 𝑛 ≥ 0.◄
Example 5.
Let 𝐹 [𝑎, 𝑏] be the set of all real valued functions defined
on closed interval [𝑎, 𝑏], i.e.,
𝐹 [𝑎, 𝑏] = {𝑓: 𝑓: [𝑎, 𝑏] → ℝ}
It is easy to verify that 𝐹 [𝑎, 𝑏] is a linear space, where
the addition and scalar multiplication is defined as
follows:
(𝑓 + 𝑔)(𝑥 ) = 𝑓 (𝑥 ) + 𝑔(𝑥 ) and (𝛼𝑓)(𝑥 ) = 𝛼 [𝑓(𝑥 )]for
all 𝑓, 𝑔 ∈ 𝐹 [𝑎, 𝑏] , 𝛼 ∈ ℝ 𝑎𝑛𝑑 𝑥 ∈ [𝑎, 𝑏].◄
Example 6.
Let 𝑉 be a set consists of only one element is 0. Then 𝑉 is
a linear space which is called a zero space and is denoted
by 0, where the addition and scalar multiplication is
defined as 𝟎 + 𝟎 = 𝟎, 𝛼𝟎 = 𝟎 for all 𝛼 ∈ ℝ.
All examples above on linear space are expected and
normal. Now, we introduce a stranger example.
- 101 -
Example 7.
Prove that the set 𝑉 = {𝑥 ∈ ℝ: 𝑥 > 0} with the addition
𝑥⨁𝑦 = 𝑥𝑦 and the scalar multiplication defined by
𝛼⨀𝑥 = 𝑥 𝛼 for all 𝑥, 𝑦 ∈ 𝑉 and α ∈ ℝ is a linear space.
Solution.
1- Since the product of two positive real is positive real
then 𝑥⨁𝑦 ∈ 𝑉.
2- 𝑥⨁𝑦 = 𝑥𝑦 = 𝑦𝑥 = 𝑦⨁𝑥.
3- 𝑥⨁(𝑦⨁𝑧) = 𝑥⨁(𝑦𝑧) = 𝑥(𝑦𝑧).
= (𝑥𝑦)𝑧 = (𝑥⨁𝑦)𝑧 = (𝑥⨁𝑦)⨁𝑧.
4- Since 1⨁𝑥 = 1𝑥 = 𝑥, then 1 is the additive identity.
1 1 1
5- Since 𝑥⨁ 𝑥 = 𝑥 𝑥 = 1 , then 𝑥 is the additive inverse
of 𝑥.
6- Since for all 𝑥 > 0 and α ∈ ℝ , we have 𝑥 𝛼 > 0 then
𝛼 ⊙ 𝑥 ∈ 𝑉.
7- 𝛼⨀(𝑥⨁𝑦) = 𝛼⨀(𝑥𝑦) = (𝑥𝑦)𝛼 = 𝑥 𝛼 𝑦 𝛼 =
𝑥 𝛼 ⨁𝑦 𝛼 = (𝛼⨀𝑥 )⨁(𝛼⨀𝑦).
8- (𝛼 + 𝛽)⨀𝑥 = 𝑥 𝛼+𝛽 = 𝑥 𝛼 𝑥 𝛽 = 𝑥 𝛼 ⨁𝑥 𝛽 =
(𝛼⨀𝑥 )⨁(𝛽⨀𝑥 ).
𝛼
9- (𝛼𝛽)⨀𝑥 = 𝑥 𝛼𝛽 = (𝑥 𝛽 ) = 𝛼⨀(𝛽⨀𝑥).
- 102 -
10- 1⨀𝑥 = 𝑥1 = 𝑥.
Therefore, all properties of Definition 1 above are
satisfied and 𝑉 is a linear space. ◄
Example 8.
Prove that 𝑉 = ℝ2 with addition defined as
(𝑎, 𝑏) + (𝑐, 𝑑) = (𝑎 + 𝑐, 𝑏 + 𝑑) and scalar
multiplication defined as 𝛼 (𝑎, 𝑏) = (𝛼𝑎, 𝑏) is not a linear
space.
Solution.
Property 8 (for example) of Definition 1 above is not
satisfied because
𝛼 (𝑎, 𝑏) + 𝛽 (𝑎, 𝑏) = (𝛼𝑎, 𝑏) + (𝛽𝑎, 𝑏) = ((𝛼 + 𝛽)𝑎, 2𝑏)
while (𝛼 + 𝛽 )(𝑎, 𝑏) = ((𝛼 + 𝛽)𝑎, 𝑏) . Then 𝛼 (𝑎, 𝑏) +
𝛽 (𝑎, 𝑏) ≠ (𝛼 + 𝛽 )(𝑎, 𝑏).
Therefore 𝑉 is not a linear space. ◄
We conclude this section with a theorem that lists several
useful properties of linear spaces.
- 103 -
Theorem 1.
Let 𝑉 be a linear space, 𝒖, 𝒗, 𝒘 be elements of 𝑉 and 𝛼 is
a scalar. Then
1-There is precisely one zero element of 𝑉.
2- If 𝒖 + 𝒗 = 𝒖, then 𝒗 = 𝟎.
3-If 𝒖 + 𝒗 = 𝒖 + 𝒘, then 𝒗 = 𝒘.
4-There is exactly one element w of such that 𝒖 + 𝒘 = 𝟎.
5-0𝒗 = 𝟎.
6-𝛼𝟎 = 𝟎
7-If 𝛼𝒗 = 𝟎, then 𝛼 = 0 or 𝒗 = 𝟎.
8-(−𝛼 )𝒗 = −(𝛼𝒗) = 𝛼 (−𝒗)
9-(−1)𝒖 = −𝒖
Proof.
1-Assume that there are two elements 𝟎 and 𝟎′ such that
𝒖 + 𝟎 = 𝒖 and 𝒖 + 𝟎′ = 𝒖 for every u in V. Setting 𝒖 =
𝟎′ in the first identity and 𝒖 = 𝟎 in the second, we obtain
𝟎′ = 𝟎′ + 𝟎 = 𝟎 + 𝟎′ = 𝟎.
2-Suppose that 𝒖 + 𝒗 = 𝒖. By Property 5 of Definition 1
in this section, there is an inverse – 𝒖 for 𝒖. Adding – 𝒖
to each side of 𝒖 + 𝒗 = 𝒖 we have(−𝒖) + (𝒖 + 𝒗) =
- 104 -
(−𝒖) + 𝒖, or [(−𝒖) + 𝒖] + 𝒗 = (−𝒖) + 𝒖, or 𝟎 + 𝒗 =
𝟎, or 𝒗 = 𝟎.
3-𝒖 + 𝒗 = 𝒖 + 𝒘 ⇒
(−𝒖) + (𝒖 + 𝒗) = (−𝒖) + (𝒖 + 𝒘) ⇒
(−𝒖 + 𝒖) + 𝒗 = (−𝒖 + 𝒖) + 𝒘 ⇒
𝟎 + 𝒗 = 𝟎 + 𝒘 ⇒ 𝒗 = 𝒘.
4- Let 𝒘 and 𝒘′ be such that 𝒖 + 𝒘 = 𝟎 and 𝒖 + 𝒘′ = 𝟎
𝒘 = 𝒘 + 𝟎 = 𝒘 + (𝒖 + 𝒘′) = (𝒘 + 𝒖) + 𝒘′
= (𝒖 + 𝒘) + 𝒘′ = 𝟎 + 𝒘′ = 𝒘′.
5- Since 0𝒗 = (0 + 0)𝒗 = 0𝒗 + 0𝒗, then by adding
(−0𝒗) to both sides we have
0𝒗 + (−0𝒗) = (0𝒗 + 0𝒗) + (−0𝒗)
or
0𝒗 + (−0𝒗) = 0𝒗 + (0𝒗 + (−0𝒗))
or
𝟎 = 0𝒗 + 𝟎 = 0𝒗
6- Since 𝛼𝟎 = 𝛼 (𝟎 + 𝟎) = 𝛼𝟎 + 𝛼𝟎, then by adding
(−𝛼𝟎) to both sides we have
𝛼𝟎 + (−𝛼𝟎) = (𝛼𝟎 + 𝛼𝟎) + (−𝛼𝟎), or
𝟎 = 𝛼𝟎 + (𝛼𝟎 + (−𝛼𝟎)) = 𝛼𝟎 + 𝟎. Hence 𝛼𝟎 = 𝟎.
- 105 -
1 1
7- Let 𝛼𝒗 = 𝟎 and 𝛼 ≠ 0 . Then 𝛼𝒗 = 𝟎.
𝛼 𝛼
Therefore,1. 𝒗 = 𝟎. So, 𝒗 = 𝟎.
8- Note that (−𝛼 )𝒗 + (𝛼𝒗) = (−𝛼 + 𝛼 )𝒗 = 0𝒗 = 𝟎.
By adding (−𝛼𝒗) or both sides we here (−𝛼 )𝒗 =
−(𝛼𝒗) . Similarly,𝛼 (−𝒗) = −𝛼𝒗.
9- Let 𝛼 = 1 in (8). ◄
- 106 -
Exercise Set (4.2)
In Exercise (1) – (20) determine whether the given set
with the two operations are linear space.
1. 𝑉 = ℝ2 , (𝑥1 , 𝑥2 ) + (𝑦1 , 𝑦2 ) = (𝑥2 + 𝑦2 , 𝑥1 +
𝑦1 ), 𝛼 (𝑥1 , 𝑥2 ) = (𝛼𝑥1 , 𝛼𝑥2 ).
2. 𝑉 = {(𝑥1 , 0, 𝑥3 ) ∶ 𝑥1 , 𝑥3 ∈ ℝ},
(𝑥1 , 0, 𝑥3 ) + (𝑦1 , 0, 𝑦3 )
= (𝑥1 + 𝑦1 , 0, 𝑥3 + 𝑦3 ), 𝛼 (𝑥1 , 0, 𝑥3 )
= (𝛼𝑥1 , 0, 𝑥3 )
0 𝑏
3.𝑉 = {[ ] ∶ 𝑎, 𝑏, 𝑐 ∈ ℝ} with summation and scalar
𝑎 𝑐
multiplication of matrices.
𝑎 𝑏
4. 𝑉 = {[ ] ∶ 𝑎 ≠ 0} with summation and scalar
𝑐 𝑑
multiplication of matrices.
5.𝑉 = {𝐴: 𝐴 ∈ 𝑀2×3 , ∑ 𝑎𝑖𝑗 = 0} with summation and
scalar multiplication of matrices.
6.𝑉 = {𝑃 ∈ 𝑃𝑛 ∶ 𝑃(1) + 𝑃(6) − 𝑃(2) = 0} with
summation andscalar multiplication of polynomials.
7. 𝑉 = {𝑃 ∈ 𝑃𝑛 ∶ 𝑃(1) + 𝑃(6) − 𝑃(2) = 1} with
summation andscalarmultiplication of polynomials.
- 107 -
8. 𝑉 = ℝ𝟑 , (𝑥1 , 𝑦1 , 𝑧1 ) + (𝑥2 , 𝑦2 , 𝑧2 ) = (𝑥1 + 𝑥2 , 𝑦1 +
𝑦2 , 𝑧1 + 𝑧2 ), 𝛼 (𝑥, 𝑦, 𝑧) = (0,0,0)).
9.𝑉 = ℝ2 , (𝑥1 , 𝑦1 ) + (𝑥2 , 𝑦2 ) = (𝑥1 + 𝑥2 , 𝑦1 + 𝑦2 ),
(𝑥, 𝑦) = (2𝑥, 2𝑦).
10.𝑉 = {(𝑥, 0): 𝑥 ∈ ℝ} with the usual addition and scalar
multiplication onℝ2 .
11. 𝑉 = {(𝑥, 0): 𝑥 ≥ 0} with the usual addition and scalar
multiplication onℝ2 .
12. 𝑉 = ℝ2 , (𝑥, 𝑦) + (𝑥1 , 𝑦1 ) = (𝑥 + 𝑥1 + 1, 𝑦 + 𝑦1 +
1), (𝑥, 𝑦) = (𝑥, 𝑦).
𝑎 1
13.𝑉 = {[ ] : 𝑎, 𝑏 ∈ ℝ} with summation and scalar
1 𝑏
multiplication of matrices.
14.𝑉 = {𝑓 ∈ F[0,2]: 𝑓(1) = 0}with addition and scalar
multiplication in defined in Example 5 Section 2 above.
15.𝑉 = {(𝑎, 𝑎, −𝑎): 𝑎 ∈ ℝ} with usual addition and
scalar multiplication on ℝ𝟑 .
16. 𝑉 = {(𝑎, 1, −𝑎): 𝑎 ∈ ℝ} with usual addition and
scalar multiplication on ℝ𝟑 .
17.𝑉 = {𝑎0 + 𝑎1 (𝑥 + 1): 𝑎0 , 𝑎1 ∈ ℝ} with addition and
scalar multiplication of polynomials.
- 108 -
18. 𝑉 = {𝑎0 + 𝑥 + 1: 𝑎0 ∈ ℝ}with addition and scalar
multiplication of polynomials.
19. 𝑉 = {𝐴 ∈ 𝑀3×3 : 𝐴 = 𝐴𝑡 }with addition and scalar
multiplication of matrices.
20. 𝑉 = {(𝑎, 𝑏, 𝑐 ): ∈ ℝ3 }with addition and scalar
multiplication on ℝ𝟑 .
𝑎1 𝑎2 𝑎3
21. Find c such that 𝑉 = {[𝑏 𝑏2 𝑏3 ] : 𝑎1 + 𝑎2 +
1
- 109 -
3. Subspaces
It is possible for one linear space to be contained within a
larger space. For example, planes through the origin are
linear spaces that are contained within the larger linear
space ℝ𝟑 . In this section we shall study this important
concept in more detail.
Definition 1.
A subset W of a linear space V is called a subspace of V if
W is itself a linear space under the addition and scalar
multiplication defined on V.
In general, one must verify the ten linear space axioms to
show that a set W with addition and scalar multiplication
forms a linear space. However, if W is part of a larger set
V that is already known to be a linear space, then certain
axioms needn’t be verified for W because they are
inherited from V. For example, there is no need to check
that 𝒖 + 𝒗 = 𝒗 + 𝒖 (Axiom 2 ) for W because this holds
for all vectors in V and consequently for all vectors in W.
other axioms inherited by W from V are 3, 7, 8, 9 and 10.
Thus, to show that a set W is a subspace of a linear space
V, we need only verify Axioms 1,4,5 and 6. The
- 110 -
following theorem shows that even Axioms 4 and 5 can
be dispensed with.
Theorem 1.
If W is a set of one or more vectors from a linear space V,
then W is a subspace of V if and only if the following
conditions hold:
(a) If u and v are vectors in W, then 𝒖 + 𝒗 is in W.
(b) If k is any scalar and u is any vector in W, then ku is
in W.
Proof.
If W is a subspace of V, then all the linear space axioms
are satisfied; in particular, Axioms 1 and 6 hold but these
are Precisely conditions (a) and (b).
Conversely, assume conditions (a) and (b) hold since
these conditions are linear space axioms1 and 6, we need
only show that W satisfies the remaining eight axioms.
Axioms 2,3,7,8,9, and 10 are automatically satisfied by
the vectors in W since they are satisfied by all vectors in
V. Therefore, to complete the proof, we need only verify
that Axioms 4 and 5 are satisfied by vectors in W. Let u
be any vector in W. By condition (b), ku is in W for every
- 111 -
scalar k. Setting 𝑘 = 0, it follows from Theorem 1 in
Section 2 of of this chapter that 0𝒖 = 𝟎 is in W, and
setting 𝑘 = −1, it follows that (−1)𝒖 = −𝒖 is in W.◄
Remark.
A set W of one or more vectors from a linear space V is
said to be closed under a addition if condition (a) in
Theorem 1 above holds and closed under scalar
multiplication if condition (b) holds. Thus, Theorem 1
above states that W is a subspace of V if and only if W is
closed under addition and under scalar multiplication.
Note.
We can use one condition equivalent as both conditions
(a) and (b) in Theorem 1 above as follows:
W is a subspace of V if and only if 𝛼𝒖 + 𝛽𝒗 ∈ 𝑾 for all
𝒖, 𝒗 ∈ 𝑊and all 𝛼, 𝛽 ∈ ℝ (why?).
Example 1.
Every nonzero linear space V has at least two subspaces:
V itself is a subspace, and the set {𝟎} consisting of just
the zero in V is a subspace called the zero subspace. ◄
- 112 -
Example 2.
Let 𝑋 = {(𝑥, 0): 𝑥 ∈ ℝ} ⊆ ℝ2 . Prove that X is a subspace
of ℝ2 .
Solution.
(1) (0,0) ∈ 𝑋, 𝑖. 𝑒, 𝑋 ≠ 𝜙.
(2) Let (𝑥1 , 0), (𝑥2 , 0) ∈ 𝑋. Then
(𝑥1 , 0) + (𝑥2 , 0) = (𝑥1 + 𝑥2 , 0) ∈ 𝑋
(3) 𝛼 (𝑥, 0) = (𝛼𝑥, 0) ∈ 𝑋 for all (𝑥, 0) ∈ 𝑋 and all
α ∈ ℝ. From (1) , (2) and (3), X is a subspace of ℝ2 .
Similarly
𝑌 = {(0, 𝑦): 𝑦 ∈ ℝ} is a subspace of ℝ2 .◄
Example 3.
Let 𝑊 = {(𝑥, 𝑦) ∈ ℝ2 : 𝑎𝑥 + 𝑏𝑦 = 0} ⊆ ℝ2 . Prove that
W is a subspace of ℝ2 .
Solution.
(1) Since 𝑎. 0 + 𝑏. 0 = 0, then (0,0) ∈ 𝑊and𝑊 ≠ ϕ.
(2) Let (𝑥, 𝑦), (𝑥1 , 𝑦1 ) ∈ 𝑊. Then 𝑎𝑥 + 𝑏𝑦 = 0 and
𝑎𝑥1 + 𝑏𝑦1 = 0. Hence 𝑎(𝑥 + 𝑥1 ) + 𝑏(𝑦 + 𝑦1 ) =
(𝑎𝑥 + 𝑏𝑦) + (𝑎𝑥1 + 𝑏𝑦1 ) = 0 + 0 = 0.
Therefore (𝑥, 𝑦) + (𝑥1 , 𝑦1 ) = (𝑥 + 𝑥1 , 𝑦 + 𝑦1 ) ∈ 𝑊.
- 113 -
(3) Let (𝑥, 𝑦) ∈ 𝑊 and 𝛼 ∈ ℝ . Then 𝑎𝑥 + 𝑏𝑦 = 0.
Therefore 𝛼 (𝑎𝑥 + 𝑏𝑦) = 0. Thus 𝛼 (𝑥, 𝑦) ∈ 𝑊. From (1),
(2) and (3), we have W is a subspace of ℝ2 . Note that the
subspace in Example 3 above of ℝ2 is the lines through
the origin. So, the subspaces of ℝ2 are {𝟎}, lines through
the origin and ℝ2 itself. Later, we will show that these are
the only subspaces of ℝ2 .◄
Example 4.
Let 𝑊 = {(𝑥, 𝑦, 𝑧) ∈ ℝ3 : 𝑎𝑥 + 𝑏𝑦 + 𝑐𝑧 = 0}. Prove W is
a subspace of ℝ3 .
Solution.
(1) Since 𝑎0 + 𝑏0 + 𝑐0 = 0, (0,0,0) ∈ 𝑊 and 𝑊 ≠ ∅.
(2) Let (𝑥, 𝑦, 𝑧), (𝑥1 , 𝑦1 , 𝑧1 ) ∈ 𝑊. Then 𝑎𝑥 + 𝑏𝑦 + 𝑐𝑧 =
0and 𝑎𝑥1 + 𝑏𝑦1 + 𝑐𝑧1 = 0. Therefore,
𝑎(𝑥 + 𝑥1 ) + 𝑏(𝑦 + 𝑦1 ) + 𝑐 (𝑧 + 𝑧1 ) = (𝑎𝑥 + 𝑏𝑦 + 𝑐𝑧) +
(𝑎𝑥1 + 𝑏𝑦1 + 𝑐𝑧1 ) = 0.
Thus (𝑥, 𝑦, 𝑧) + (𝑥1 , 𝑦1 , 𝑧1 ) ∈ 𝑊.
(3) Let (𝑥, 𝑦, 𝑧) ∈ 𝑊 and α ∈ ℝ. Then 𝑎𝑥 + 𝑏𝑦 + 𝑐𝑧 =
0. Therefore 𝛼 (𝑎𝑥 + 𝑏𝑦 + 𝑐𝑧) = 0. Thus 𝛼 (𝑥, 𝑦, 𝑧) ∈ 𝑊.
Hence W is a subspace of ℝ3 . Note that the subspace W is
the planes through the origin. Later, we will show that the
- 114 -
only subspaces of ℝ3 are {𝟎}, lines through the origin,
planes through the origin, and ℝ3 itself. ◄
Example 5.
𝑎 𝑏
Let 𝑊 = {[ ] ∶ 𝑎, 𝑏 ∈ ℝ} ⊆ 𝑀2×2 . Prove that W is
0 −𝑎
a subspace of 𝑀2×2 .
Solution.
0
0
[ ] ∈ 𝑊 and W ≠ ϕ.
0
0
𝑎 𝑏 𝑐
𝑑
(2)Let [ ], [ ] ∈ 𝑊. Then
0 −𝑎 0
−𝑐
𝑎 𝑏 𝑐 𝑑 𝑎+𝑐 𝑏+𝑑
[ ]+[ ]=[ ] ∈ 𝑊.
0 −𝑎 0 −𝑐 0 −(𝑎 + 𝑐)
𝑎 𝑏
(3) Let [ ] ∈ 𝑊 and α ∈ ℝ.
0 −𝑎
𝑎 𝑏 𝛼𝑎 𝛼𝑏
Then 𝛼 [ ]=[ ] ∈ 𝑊.
0 −𝑎 0 −𝛼𝑎
Therefore, W is a subspace of 𝑀2×2 .
Example 6.
Let P be the set of all polynomials and
𝑊 = {𝑝(𝑥 ) ∈ 𝑃: 𝑝(3) = 0} ⊆ 𝑃. Prove that W is a
subspace of P.
Solution.
(1) Since 𝟎(3) = 0, then 𝟎 ∈ 𝑊.
- 115 -
(2) Let 𝑝(𝑥 ), 𝑞 (𝑥 ) ∈ 𝑊. then 𝑝(3) = 𝑞 (3) = 0.
Therefore (𝑝 + 𝑞 )(3) = 𝑝(3) + 𝑞 (3) = 0 + 0 = 0. Thus
𝑝(𝑥 ) + 𝑞(𝑥) ∈ 𝑊.
(3) Let 𝑝(𝑥 ) ∈ 𝑊 and α ∈ ℝ. Then 𝑝(3) = 0 and
α𝑝(3) = 0. Hence 𝛼𝑝(𝑥 ) ∈ 𝑊.◄
Example 7.
The subset 𝑊 = {(𝑥, 𝑥 + 1): 𝑥 ∈ ℝ} is not a subspace of
ℝ2 since (0,0) ∉ 𝑊.
Example 8.
The subset 𝑊 = {𝑎0 + 𝑎1 𝑥 + 𝑎2 𝑥 2 : 𝑎0 is integer} is not
1
a subspace of 𝑃2 , since, for example, 2 + 3𝑥 + 𝑥 2 ∈ 𝑊
3
1 1 1 2 1
and ∈ ℝ but (2 + 3𝑥 + 𝑥 2 ) = + 𝑥 + 𝑥 2 ∉ 𝑊.◄
3 3 3 3 9
Example 9.
The subset 𝑊 = {(𝑎, 𝑏) ∈ ℝ2 : 𝑎 ≠ 0, 𝑏 ≠ 0}is not a
subspace of ℝ2 , because, for example, (2,3), (−2,5) ∈ 𝑊
but (2,3) + (−2,5) = (0,8) ∉ 𝑊.
Definition 2.
If 𝐴𝒙 = 𝑏 is a system of linear equations, then each
vector 𝒙 that satisfies this equation is a solution vector of
the system. The following theorem shows that the
- 116 -
solution vectors of a homogeneous linear system form a
linear space, which we shall call the solution space of
the system.
Theorem 2.
If 𝐴𝒙 = 𝟎 is a homogeneous linear system of m
equations in n unknowns, then the set of solution vectors
is a subspace of ℝ𝑛 .
Proof.
Let W be the solution vectors set. There is at least one
vector in W, namely 0. To show that W is closed under
addition and scalar multiplication, we must show that if
𝒙 and 𝒙′ are any solution vectors and k is any scalar, then
𝒙 + 𝒙′ and 𝑘𝒙 are also solution vectors. But if 𝒙 and 𝒙′
are solution vectors, then 𝐴𝒙 = 𝟎 and 𝐴𝒙′ = 𝟎. It
follows that 𝐴(𝒙 + 𝒙′) = 𝐴𝒙 + 𝐴𝒙′ = 𝟎 + 𝟎 = 𝟎 and
𝐴(𝑘𝒙) = 𝑘(𝐴𝒙) = 𝑘𝟎 = 𝟎. This proves that 𝒙 +
𝒙′ and 𝑘𝒙 are solution vectors. Therefore, W is a
subspaceℝ𝑛 .◄
- 117 -
Exercise set (4.3)
In Exercise (1) – (19) determine which of the following
subsets are subspaces of the giver linear spaces.
1.𝑊 = {𝑓: [a, b] ⟶ ℝ, f is diffrentiable on [a, b]} ⊆
𝐹 [𝑎, 𝑏]
2.W= {𝑓: [𝑎, 𝑏] → ℝ, 𝑓 is diffrentiable twice on [a, b]}
⊆ 𝐹 [𝑎, 𝑏] such that 𝑓 ′′ + 𝑓 = 0}
3.𝑊 = {𝑎, 𝑏, 𝑎 − 2𝑏: 𝑎, 𝑏 ∈ ℝ} ⊆ ℝ3 .
𝑎 0
4. 𝑊 = {[ ] , 𝑎, 𝑏, 𝑐 ∈ ℝ} ⊆ 𝑀2×2 .
𝑏 𝑐
5.𝑊 = {𝑎0 + 𝑎1 𝑥 + 𝑎2 𝑥 2 ∈ 𝑃2 : 𝑎0 + 𝑎1 + 𝑎2 = 0} ⊆ 𝑃2 .
6.𝑊 = {𝑓 ∈ F[0,2]: 𝑓(1) = 0} ⊆ 𝐹 [0,2].
7.𝑊 = {(𝑥, 𝑦) ∈ ℝ2 : 2𝑥 + 3𝑦 = 0} ⊆ ℝ2 .
8.𝑊 = {𝐴 ∈ 𝑀2×2 : det 𝐴 = 0} ⊆ 𝑀2×2 .
(9)𝑊 = {𝐴 ∈ 𝑀2×2 : 𝐴2 = 𝐴} ⊆ 𝑀2×2 .
10.𝑊 = {(𝑥, 𝑦) ∈ ℝ2 : 𝑥 + 𝑦 = 1} ⊆ ℝ2 .
11.𝑊 = {𝑎0 + 𝑎1 𝑥 + 𝑎2 𝑥 2 ∈ 𝑃2 : 𝑎1 = 1 𝑎0 = 𝑎2 } ⊆ 𝑃2 .
12.𝑊 = {(𝑎, 𝑏, 𝑐 ) ∈ ℝ3 : 𝑎 + 𝑏 + 𝑐 = 0} ⊆ ℝ3 .
13.𝑊 = {(𝑎, 𝑏, 𝑐 ) ∈ ℝ3 : 𝑎 − 𝑏 − 𝑐 = 0} ⊆ ℝ3 .
(14)𝑊 = {(𝑎, 𝑏, 𝑐) ∈ ℝ3 : 𝑎 + 𝑏 = 1, 𝑐 = 0} ⊆ ℝ3 .
15.𝑊 = {(𝑎, 𝑏, 𝑐 ) ⊂ ℝ3 : 𝑎𝑏 = 0} ⊆ ℝ3 .
- 118 -
3 𝑎 3
16.𝑊 = {[ ] : 𝑎, 𝑏 ∈ ℝ} ⊆ 𝑀2×3 .
𝑎 𝑏 𝑎
17.𝑊 = {𝐴 ∈ 𝑀2×2 : 𝐴 = −𝐴𝑡 } ⊆ 𝑀2×2 .
1
18.𝑊 = {𝑓 ∈ F[0,1] : ∫0 𝑓(x)dx = 0} ⊆ 𝐹 [0,1].
- 122 -
𝑎11 𝑎12 …𝑎1𝑛 𝑥1
𝑎21 𝑎22 …𝑎2𝑛 𝑥2
𝐴𝑥 = [ ⋮ ⋮ ⋮ ⋮ ][ ⋮ ]
𝑎𝑚1 𝑎𝑚2 …𝑎𝑚𝑛 𝑥𝑛
𝑎11 𝑥1 + 𝑎12 𝑥2 + ⋯ + 𝑎1𝑛 𝑥𝑛
𝑎 𝑥 + 𝑎22 𝑥2 + ⋯ + 𝑎2𝑛 𝑥𝑛
= [ 21 1 ⋮ ]
𝑎𝑚1 𝑥1 + 𝑎𝑚2 𝑥2 + ⋯ + 𝑎𝑚𝑛 𝑥𝑛
= 𝑥1 𝐴1 + 𝑥2 𝐴2 + ⋯ + 𝑥𝑛 𝐴𝑛 . ◄
Corollary1.
Let A be an 𝑚 × 𝑛 matrix the system 𝐴𝒙 = 𝑏 is
consistent if and only if b is a linear combination of
columns of A.
Example 5.
Find all vectors 𝒃 = [𝑏1 𝑏2 𝑏3 𝑏4 ]𝑡 such that b is a linear
combination of 𝒗1 = [1 2 3 4]𝑡
and𝒗2 = [4 3 2 1]t .
Solution.
Let = 𝛼1 𝒗1 + 𝛼2 𝒗2 , 𝛼1 , 𝛼2 ∈ ℝ . Then𝐴𝒙 = 𝒃, where
1 4
𝐴 = [2 3] , 𝑥 = [𝛼1 ]. From Corollary 1above, it is
3 2 𝛼2
4 1
- 123 -
sufficient to find 𝑏1 , 𝑏2 , 𝑏3 , 𝑏4 such that𝐴𝒙 = 𝒃 is
consistent. Therefore
1 4 𝑏1 1 4 𝑏1
[𝐴 | 𝑏 ] = [ 2 3] 𝑏2 ] ~ [0 −5] 𝑏2 − 2𝑏1
]
3 2 𝑏3 0 0 𝑏3 + 𝑏1 − 2𝑏2
4 1 𝑏4 0 0 𝑏4 + 2𝑏1 − 3𝑏2
This system is consistent if 𝑏3 + 𝑏1 − 2𝑏2 = 0 and
𝑏4 + 2𝑏1 − 3𝑏2 = 0 . Hence
𝒃 = [𝑏1 𝑏2 2𝑏2 − 𝑏1 3𝑏2 − 2𝑏1 ]𝑡 . ◄
If𝒗1 , 𝒗2 , … , 𝒗𝑟 are in vector space V, then generally some
vectors in V may be a linear combinations of
𝒗1 , 𝒗2 , … , 𝒗𝑟 and others may not. The following theorem
shows that if we construct a set W consisting of all those
vectors that are expressible as linear combinations
of𝒗1 , 𝒗2 , … , 𝒗𝑟 , then W forms a subspace of V.
Theorem 2.
If 𝒗1 , 𝒗2 , … , 𝒗𝑟 are vectors in a vector space V, then
(a)The set W of all linear combinations of 𝒗1 , 𝒗2 , … , 𝒗𝑟 is
a subspace of V.
(b)W is the smallest subspace of V that
contains𝒗1 , 𝒗2 , … , 𝒗𝑟 . In the since that every other
subspace of 𝒗1 , 𝒗2 , … , 𝒗𝑟 must contain W.
- 124 -
Proof.
(a)To show that W is a subspace of V, we prove it is
closed under addition and scalar multiplication.
There is at least one vector in W, namely, 0, since
𝟎 = 0𝒗1 + 0𝒗2 + ⋯ + 0𝒗𝑟 . If u and v are vectors in W,
then𝒖 = 𝑐1 𝒗1 + 𝑐2 𝒗2 + ⋯ + 𝑐𝑟 𝒗𝑟 and 𝒗 = 𝑘1 𝒗1 +
𝑘2 𝒗2 + ⋯ + 𝑘𝑟 𝒗𝑟 ,where 𝑐1 , 𝑐2 , … , 𝑐𝑟 , 𝑘1 , 𝑘2 , … , 𝑘𝑟 are
scalars. Therefore
𝒖 + 𝒗 = (𝑐1 + 𝑘1 )𝒗1 + (𝑐2 + 𝑘2 )𝒗2 + ⋯ + (𝑐𝑟 + 𝑘𝑟 )𝒗𝑟
and, for any scalar k,𝑘𝑢 = (𝑘𝑐1 )𝒗1 + (𝑘𝑐2 )𝒗2 + ⋯ +
(𝑘𝑐𝑟 )𝒗𝑟 . Thus, 𝒖 + 𝒗 and k u are linear combinations of
𝒗1 , 𝒗2 , … , 𝒗𝑟 and consequently elements in W. Therefore,
W is closed under addition and scalar multiplication.
(b) Each vector 𝒗𝑖 is a linear combination of
𝒗1 , 𝒗2 , … , 𝒗𝑟 since we can write
𝒗𝑖 = 0𝒗1 + 0𝒗2 + ⋯ + 1𝒗𝑖 + ⋯ + 0𝒗𝑟
Therefore, the subspace W contains each of the vectors
𝒗1 , 𝒗2 , … , 𝒗𝑟 . LetW' be any other subspace that contains
𝒗1 , 𝒗2 , … , 𝒗𝑟 . SinceW' is closed under addition and scalar
multiplication, it must contain all linear combinations of
𝒗1 , 𝒗2 , … , 𝒗𝑟 . Thus, W' contains each vector of W. ◄
- 125 -
We have the following definition.
Definition 2.
If 𝑆 = {𝒗1 , 𝒗2 , … , 𝒗𝑟 } is a set of vectors in a vector space
V, then the subspace W of V consisting of all linear
combinations of vectors of S is called the space spanned
by the vectors 𝒗1 , 𝒗2 , … , 𝒗𝑟 and we say that the vectors
𝒗1 , 𝒗2 , … , 𝒗𝑟 span W.
To indicate that W is the space spanned by the vectors in
the set 𝑆 = {𝒗1 , 𝒗2 , … , 𝒗𝑟 }, we write 𝑊 = span (𝑆) or
𝑊 = span{𝒗1 , 𝒗2 , … , 𝒗𝑟 } .
Theorem 3.
Let 𝑆 = {𝒗1 , 𝒗2 , … , 𝒗𝑟 } ⊆ ℝ𝑛 and A be an 𝑛 × 𝑘 matrix
whose columns are 𝑣1 , , … , 𝒗𝑟 . Then 𝑆 spansℝ𝑛 if and
only if the system 𝐴𝑥 = 𝑏 is consistent for every𝑏 ∈ ℝ𝑛 .
Example 6.
Determine whether 𝒗1 = (1,1,2), 𝒗2 = (1,0,1), and𝒗3 =
(2,1,3) span the vector space ℝ3 , or not.?
Solution.
We must determine whether an arbitrary vector 𝒃 =
(𝑏1 , 𝑏2 , 𝑏3 )in ℝ3 can be expressed as a linear
combination 𝒃 = 𝑘1 𝒗1 + 𝑘2 𝒗2 + 𝑘3 𝒗3 of the vectors
- 126 -
𝒗1 , 𝒗2 and 𝒗3 . Expressing this equation in terms of
components gives
(𝑏1 , 𝑏2 , 𝑏3 ) = 𝑘1 (1,1,2) + 𝑘2 (1,0,1) + 𝑘3 (2,1,3)
or
𝑘1 + 𝑘2 + 2𝑘3 = 𝑏1
𝑘1 + 𝑘3 = 𝑏2
2𝑘1 + 𝑘2 + 3𝑘3 = 𝑏3
This system is consistent for all 𝑏1 , 𝑏2 and 𝑏3 if and only if
1 1 2
the coefficient matrix 𝐴 = [1 0 1] is invertible.
2 1 3
Butd𝑒𝑡 (𝐴) = 0 (verify), so that A is not invertible.
Consequently, 𝑣1 , 𝑣2 and𝑣3 do not span ℝ3 .
We leave the proof of the following useful theorem as an
exercise.
Theorem 4.
If𝑆 = {𝒗1 , … , 𝒗𝑟 } and 𝑆′ = {𝒘1 , … , 𝒘𝑟 }are two sets of
vectors in a vector space V, then span{𝒗1 , … , 𝒗𝑟 } =
span{𝒘1 , … , 𝒘𝑟 }if ad only if each vector in 𝑆 is a linear
combination of those in 𝑆′. And conversely each vector in
𝑆′ is a linear combination of those in 𝑆.
Example 7.
- 127 -
Prove that 𝑝(𝑥 ) = 𝑥 2 + 1, 𝑞(𝑥 ) = 𝑥 − 2, and r(𝑥 ) = 𝑥 +
3span 𝑃2
Solution.
Let 𝑚(𝑥 ) = 𝑎𝑥 2 + 𝑏𝑥 + 𝑐 ∈ 𝑃2 such that 𝑚(𝑥 ) =
𝛼1 𝑝(𝑥 ) + 𝛼2 𝑞(𝑥 ) + 𝛼3 𝑟(𝑥) for 𝛼1 , 𝛼2 , 𝛼3 ∈ ℝ.
Therefore,
𝛼1 = 𝑎
𝛼2 + 𝛼3 = 𝑏
𝛼1 − 2𝛼2 + 3𝛼3 = 𝑐
Hence the coefficient matrix is
1 0 0
𝐴 = [0 1 1]
1 −2 3
Is invertible (verify?).
Thus, the system above is consistent. So, 𝑚(𝑥) is a linear
combination of 𝑝, 𝑞, 𝑟 and then span 𝑃𝑛 .◄
●The subspace W in Theorem 2 above is called the
subspace spanned by 𝑆 and is denoted by 〈𝑆〉.
Remark:
(1) If 𝑆 spans V, then 𝑉 = 〈𝑆〉.
(2) If = ϕ or𝑆 = {𝟎} , then 〈𝑆〉 = {𝟎}.
(3) If 𝑆 is a subspace of 𝑉, then 〈𝑆〉 = 𝑆
- 128 -
Example 8.
Find the vectors that spans the subspace W of ℝ3 , where
𝑊 = {(𝑥, 𝑦, 𝑧) ∈ ℝ3 : 𝑥 + 𝑦 + 𝑧 = 0}.
Solution: Since+𝑦 + 𝑧 = 0, then𝑧 = −𝑥 − 𝑦.Therefore
if (𝑥, 𝑦, 𝑧) ∈ 𝑊,then
(𝑥, 𝑦, 𝑧) = (𝑥, 𝑦, −𝑥 − 𝑦)
= (𝑥, 0, −𝑥 ) + (0, 𝑦, −𝑦)
= 𝑥 (1,0, −1) + 𝑦(0,1, −1)
Hence 𝑆 = {(1,0, −1), (0,1, −1)} spans W. ◄
Exercise Set (4.4)
In Exercise (1) – (9), express the first vector as a linear
combination of the other vectors.
(1)(1, −7,5) ; (1, −1,1) , (1,0,1) , (1,1,0);
(2) (−9, −7, −15) ; (2,1,4) , (1, −1,3) , (3,2,5);
(3) (0,0,0) ; (2, 1, 4) , (1, −1, 3) , (3, 2, 5);
(4) (2,3, −7,3) ; (2,1,0,3) , (3, −1,5,2) , (−1,0,2,1)
(5) 𝑥 2 + 1 ; −𝑥 + 1 , 3𝑥 2 + 𝑥 + 2;
(6) 2𝑥 2 − 3𝑥 + 1 ; 𝑥 + 1 , 𝑥 2 + 𝑥 , 𝑥 2 + 2;
(7) 9𝑥 2 + 8𝑥 + 7 ; 4𝑥 2 + 𝑥 + 2 , 3𝑥 2 − 𝑥 + 1 , 5𝑥 2 +
2𝑥 + 3;
- 129 -
7 19 2 4 3 1
(8)𝐴 = [ ] ; 𝐴1 = [ ] , 𝐴2 = [ ] , 𝐴3 =
−3 8 0 2 3 2
0 10
[ ];
−6 2
6 −8 4 2
(9) 𝐴 = [ ] ; 𝐴1 = [ ] , 𝐴2 =
−1 −8 −2 −2
1 −1 0 2
[ ] , 𝐴3 = [ ];
2 3 1 4
(10) Is 𝑤 = (−1,4,15) ∈ 〈𝑆〉, where
𝑆 = {𝒗1 = (1,2,8) , 𝒗2 = (3,0,1)} ⊆ ℝ3 .
(11) Is 𝑝(𝑥 ) = 6𝑥 2 + 4𝑥 ∈ 〈𝑆〉, where
𝑆 = {𝑃1 (𝑥 ) = 𝑥 2 + 𝑥 + 1, 𝑃2 (𝑥 ) = 3𝑥 2 + 2𝑥 − 2,
𝑃3 (𝑥 ) = 6𝑥 3 + 5𝑥 2 + 5𝑥 + 1} ⊆ 𝑃2
0 5
(12) Is 𝐴 = [ ] ∈ 〈𝑆〉, where
−4−1
1 2 3 1
𝑆 = {𝐴1 = [ ], 𝐴2 = [ ] ⊆ 𝑀2×2
−1 0 1 1
(13) Which of the following subsets of ℝ3 spansℝ3 .
(a) 𝑆1 = {(2,0,2), (3,1,1), (−3,5,5).
(b) 𝑆2 = {(1,5,1), (2,6,1), (3,3,0), (4,6,2)}.
(14) Which of the following subsets of 𝑃2 spans 𝑃2
(a) 𝑆1 = {−𝑥 2 + 1, 𝑥 2 + 1, 6𝑥 2 + 5𝑥 + 2}.
(b) 𝑆2 = {3𝑥, 𝑥 + 1, 2𝑥 2 + 1}.
- 130 -
(c) 𝑆2 = {−8𝑥 2 + 15𝑥 + 3, −5𝑥 2 + 3𝑥, 3𝑥 2 + 5𝑥 +
2, −𝑥 2 + 4𝑥 + 1}.
(15) Which of the following subsets of 𝑀2×2 spans 𝑀2×2
1 0 1 0 0 1 1 1
(a) 𝑆1 = {[ ], [ ], [ ], [ ]}
0 0 0 1 1 0 0 1
2 1 3 1 2 2 2 3
(b) 𝑆2 = {[ ], [ ], [ ], [ ]}
−1 0 1 1 1 1 −4 −1
5. linear Independence
In the preceding section we learned that asset of vectors
𝑆 = {𝒗1 , 𝒗2 , … , 𝒗𝑟 } spans a linear space V if every vector
in V is expressible as a linear combination of the vectors
in S. In general, there may be more than one way to
express a vector in V as a linear combination of vectors in
a spanning set. In this section we will study conditions
under which
each vector in V is expressible as a linear combination of
the spanning vectors in exactly one way. Spanning sets
with this property play a fundamental role in the study of
vector spaces.
Definition 1.
If𝑆 = {𝒗1 , 𝒗2 , … , 𝒗𝑟 }is nonempty set of vectors, then the
vector equation
- 131 -
𝑘1 𝒗1 + 𝑘2 𝒗2 + ⋯ + 𝑘𝑟 𝒗𝑟 = 𝟎
has at least one solution, namely
𝑘1 = 0, 𝑘2 = 0, … , 𝑘𝑟 = 0
If this is the only solution, then S is called a linearly
independent set. If there are other solutions, then S is
called linearly dependent set.
Example 1.
If𝒗1 = (2, −1, 0, 3), 𝒗2 = (1, 2, 5, −1) and 𝒗3 =
(7, −1, 5, 8), then the set vectors𝑆 = {𝑣1 , 𝒗2 , 𝒗3 } is
linealy dependent, since 3𝒗1 + 𝒗2 − 𝒗3 = 𝟎 .◄
Example 2.
The polynomials 𝑃1 = 1 − 𝑥, 𝑃2 = 5 + 3𝑥 − 2𝑥 2 , and
𝑃3 = 1 + 3𝑥 − 𝑥 2 form a linearly dependent set in 𝑃2
since 3𝑃1 − 𝑃2 + 2𝑃3 = 𝟎.◄
Example 3.
Consider the vectors 𝒊 = (1,0,0), 𝒋 = (0,1,0), and 𝒌 =
(0,0,1)in ℝ3 . In terms of components the vector
equation𝑘1 𝒊 + 𝑘2 𝒋 + 𝑘3 𝒌 = 𝟎 becomes
𝑘1 (1,0,0) + 𝑘2 (0,1,0) + 𝑘3 (0,0,1) = (0,0,0)
Or equivalently, (𝑘1 , 𝑘2 , 𝑘3 ) = (0,0,0)
- 132 -
This implies that 𝑘1 = 0, 𝑘2 = 0 and 𝑘3 = 0.So the set
𝑆 = {𝒊, 𝒋, 𝒌} is linearly independent.
●A similar argument can be used to show that the vectors
𝒆1 = (1,0,0, … ,0), 𝒆2 = (0,1,0, … ,0), …,
𝒆𝑛 = (0,0, … ,1)form a linearly independent set in ℝ𝑛 .◄
Example 4.
Determine whether the vectors
𝑣1 = (1, −2, 3), 𝑣2 = (5, 6, 1), 𝑣3 = (3, 2, 1)
from a linearly dependent set or linearly independent set.
Solution.
In terms of components the vector equation
𝑘1 𝒗1 + 𝑘2 𝒗2 + 𝑘3 𝒗3 = 0
becomes
𝑘1 (1, −2,3) + 𝑘2 (5,6, −1) + 𝑘3 (3,2,1) = (0,0,0)
Or equivalently
(𝑘1 + 5𝑘2 + 3𝑘3 , −2𝑘1 + 6𝑘2 + 2𝑘3 , 3𝑘1 − 𝑘2 + 𝑘3 )
= (0,0,0)
Equating corresponding components gives.
𝑘1 + 5𝑘2 + 3𝑘3 = 0
−2𝑘1 + 6𝑘2 + 2𝑘3 = 0
3𝑘1 − 𝑘2 + 𝑘3 = 0
- 133 -
Thus, 𝒗1 , 𝒗2 and 𝒗3 form a linearly dependent set if this
system has a nontrivial solution or a linearly independent
set if it has only the trivial solution. Solving this system
gives
1 1
𝑘1 = − 𝑡, 𝑘2 = − 𝑡, 𝑘3 = 𝑡, 𝑡 ∈ 𝑘
2 2
Thus, the system has nontrivial solutions and𝒗1 , 𝒗2 and
𝒗3 form a linearly dependent set.◄
- 134 -
Proof:
(a) Let 𝑆 = {𝒗1 , 𝒗2 , … , 𝒗𝑟 }be aset with two or more
vectors. If we assume that 𝑆is linearly dependent, then
there are scalars 𝑘1 , 𝑘2 , … , 𝑘𝑟 , not all zero, such that
𝑘1 𝒗1 + 𝑘2 𝒗2 + ⋯ + 𝑘𝑟 𝒗𝑟 = 𝟎 (∗)
To be specific, suppose that 𝑘𝑗 ≠ 0, 1 ≤ 𝑗 ≤ 𝑟then (*)
can be rewritten as
𝑘1 𝑘2 𝑘𝑗−1
𝒗𝑗 = (− ) 𝒗1 + (− ) 𝒗2 + ⋯ + (− ) 𝒗𝑗−1
𝑘𝑗 𝑘𝑗 𝑘𝑗
𝑘𝑗+1 𝑘
+ (− ) 𝒗𝑗+1 + ⋯ + (− 𝑟 ) 𝒗𝑟 .
𝑘𝑗 𝑘𝑗
- 135 -
by𝑘1 = 𝑐1 , 𝑘2 = 𝑐2 , … , 𝑘𝑗 = −1, 𝑘𝑟 = 𝑐𝑟 which are all
not zero.
(b) Similar to (a).◄
Example 5.
In Example 1 above we saw that the vectors 𝒗1 =
(2, −1, 0, 3), 𝒗2 = (1, 2, 5, −1), and 𝒗3 = (7, −1, 5, 8)
form a linearly dependent set. It follows from Theorem 1
above that at least one of these vectors is expressible as a
linear combination of the other two. In this example each
vector is expressible as a linear combination of the other
two since it follows from the equation 3𝒗1 + 𝒗2 − 𝒗3 =
1 1
𝟎 that 𝒗1 = − 𝒗2 + 𝒗3 , 𝒗2 = −3𝒗1 + 𝒗3 , and 𝒗3 =
3 3
3𝒗1 + 𝒗2 .◄
Example 6.
In Example 3 in this section we saw that the vectors 𝒊 =
(1,0,0), 𝒋 = (0,1,0), and 𝒌 = (0,0,1) form a linearly
independent set. Thus, it follows from Theorem 1 above
that none of these vectors is expressible as a linear
combination of the other two. To see directly that,
- 136 -
suppose that k is expressible as𝒌 = 𝛼1 𝒊 + 𝛼2 𝒋.Then, in
terms of components, (0,0,1) = 𝛼1 (1,0,0) + 𝛼2 (0,1,0).
Or(0,0,1) = (𝛼1 , 𝛼2 , 0). But this equation is not satisfied
by any values of 𝛼1 and𝛼1 .So k cannot be expressed as a
linear combination of i and j. Similarly, i is not
expressible as a linear combination of j and k, and j is not
expressible as a linear combination of i and k.◄
Theorem 2.
(a) A finite set of vectors that contains the zero vector is
linearly dependent.
(b) A set with exactly two vectors is linearly independent
if and only if neither of the vectors is a scalar multiple of
the other.
Theorem 3.
Let 𝑆 = {𝒗1 , 𝒗2 , … , 𝒗𝑟 } be a set of vectors in ℝ𝑛 . If > 𝑛 ,
then𝑆 is linearly dependent.
Remark.
The preceding theorem tells us that a set in ℝ2 with more
than two vectors is linearly dependent, and a set in ℝ3
with more than three vectors is linearly dependent.
- 137 -
Example 7.
Determine whether the following set of vectors in 𝑃2 is
linearly dependent
1 + 𝑥, 3𝑥 + 𝑥 2 , 2 + 𝑥 − 𝑥 2 .
Solution.
Let 𝛼1 (1 + 𝑥 ) + 𝛼2 (3𝑥 + 𝑥 2 ) + 𝛼3 (2 + 𝑥 − 𝑥 2 ) = 0.
Then
𝛼1 + 2𝛼3 = 0
𝛼1 + 3𝛼2 + 𝛼3 = 0
𝛼2 − 𝛼3 = 0
The determinant of the coefficient's matrix A for this
system of equation is
1 0 2
det 𝐴 = |1 3 1 | = −2 ≠ 0.
0 1 −1
Therefore A is Invertible, and the only solution is the
linearly dependent. ◄
- 138 -
Exercise Set (4.5)
In Exercise (1) – (5), determine whether the given set of
vectors is linearly dependent or independent in the given
vectors spaces:
(1) 𝑆 = {(1, −1), (2, 0)}, 𝑉 = ℝ2 ;
(2) 𝑆 = {(1, 1, 2), (1, 4, 5), (1, 2, 7), (−1, 8, 3)},
𝑉 = ℝ4 ;
(3) 𝑆 = {4𝑥 2 − 𝑥 + 2, 2𝑥 2 + 6𝑥 + 3, −4𝑥 2 + 10𝑥 +
2}, 𝑉 = 𝑃2 ;
1 1 1 0 1 0
(4) 𝑆 = {[ ], [ ], [ ]} , 𝑉 = 𝑀2×2 ;
0 1 1 1 0 1
(5) Find x such that 𝑆 is linearly independent in ℝ3 ,
where𝑆 = {(2, 𝑥, 1), (1,0,1), (0,1,3)}
(7)Find 𝑎, 𝑏 such that the vectors
(1, 𝑎, 2, 𝑏), (1, −1,2,0), (1,2,1,3) are linearly dependent
inℝ4 .
(8)Prove that any nonempty subset of linearly
independent set is linearly independent.
(9)Let 𝑆1 , 𝑆2 be subsets of a vector space V such that 𝑆1 ⊆
𝑆2 and 𝑆1 is linearly dependent.Prove that 𝑆2 is linearly
dependent.
- 139 -
6. Basis and Dimension
Definition 1.
If V is any vector space and 𝑆 = {𝒗1 , 𝒗2 , … , 𝒗𝑛 } is a set
of vectors in V, then 𝑆 is called a basis for V if the
following two conditions hold:
(a) 𝑆 is linearly independent
(b) S spans V.
- 140 -
Theorem 1.
If 𝑆 = {𝒗1 , 𝒗2 , … , 𝒗𝑛 } is a basis for a vector spaceV, then
every vector v in V can be expressed in the form 𝒗 =
𝑐1 𝒗1 + 𝑐2 𝒗2 + ⋯ + 𝑐𝑛 𝒗𝑛 in exactly one way.
Proof.
Since 𝑆 spans V, it follows from the definition of the
spanning set that every vector in V is expressible as a
linear combination of the vectors in 𝑆. To see that there is
only one way to express a vector as a linear combination
of the vectors in 𝑆,supposethat some vector v can be
written as
𝒗 = 𝑐1 𝒗1 + 𝑐2 𝒗2 + ⋯ + 𝑐𝑛 𝒗𝑛
and also, as
𝒗 = 𝑘1 𝒗1 + 𝑘2 𝒗2 + ⋯ + 𝑘𝑛 𝒗𝑛
Subtracting the second equation from the first gives
0 = (𝑐1 − 𝑘1 )𝒗1 + (𝑐2 − 𝑘2 )𝒗2 + ⋯ + (𝑐𝑛 − 𝑘𝑛 )𝒗𝑛
The linear independence of 𝑆 implies that
𝑐1 − 𝑘1 = 0 , 𝑐2 − 𝑘2 = 0 … , 𝑐𝑛 − 𝑘𝑛 = 0.
That is, 𝑐1 = 𝑘1 , 𝑐2 = 𝑘2 , … , 𝑐𝑛 = 𝑘𝑛 .
Thus, the two expressions for V are the same.
- 141 -
If 𝑆 = {𝒗1 , 𝒗2 , … , 𝒗𝑛 } is a basis for a vector space V, and
𝒗 = 𝑐1 𝒗1 + 𝑐2 𝒗2 + ⋯ + 𝑐𝑛 𝒗𝑛
is the expression for a vector v in terms of the basis 𝑆,
then the scalars 𝑐1 , 𝑐2 , … , 𝑐𝑛 are called the coordinates of
v relative to the basis 𝑆. The vector (𝑐1 , 𝑐2 , … , 𝑐𝑛 ) in ℝ𝑛
constructed from these coordinates is called the
coordinates vector of V relative to ; it is denoted by
(𝒗)𝑆 = (𝑐1 , 𝑐2 , … , 𝑐𝑛 ). ◄
Remark: It should be noted that coordinate vectors
depend not only on the basis 𝑆 but also on the order in
which the basis vectors are written; a change in the order
of the basis vectors results in a corresponding change of
order for the entries in the coordinate vectors.
Example 1.
If 𝒊 = (1,0,0), 𝒋 = (0,1,0), and = (0,0,1) ,then it is easy
to show that 𝑆 = {𝒊, 𝒋, 𝒌} is a linearly independent set in
ℝ3 . This set also spansℝ3 since any vector 𝒗 = (𝑎, 𝑏, 𝑐)
in ℝ3 can be written as
𝒗 = (𝑎, 𝑏, 𝑐 ) = 𝑎(1,0,0) + 𝑏(0,1,0) + 𝑐(0,0,1)
= 𝑎𝒊 + 𝑏𝒋 + 𝑐𝒌 (1)
Thus, 𝑆 is a basis for ℝ3 ; it is called the standard basis
forℝ3 . Looking at the coefficients of i, j and k in (1), it
- 142 -
follows that the standard basis are 𝑎, 𝑏, and c. So (𝒗)𝑠 =
(𝑎, 𝑏, 𝑐). Comparing this result we see that 𝒗 = (𝒗)𝑠 . ◄
●The result in the preceding example is a special case of
those in the next example.
Example 2.
As shown in the preceding section = {𝒆1 , 𝒆2 , … , 𝒆𝑛 } ,
where 𝒆1 = (1,0, … ,0), 𝒆2 = (0,1,0, … ,0) , … , 𝒆𝑛 =
(0,0, … ,1), is a linearly independent set in ℝ𝑛 . This set
also spans ℝ𝑛 . Since any vector 𝒗 = (𝒗1 , 𝒗2 , … , 𝒗𝑛 ) in
ℝ𝑛 can be written as
𝒗 = 𝒗1 𝒆1 + 𝒗2 𝒆2 + ⋯ + 𝒗𝑛 𝒆𝑛 (2)
Thus 𝑆 is a basis for ℝ𝑛 ; it is called the standard basis
for ℝ𝑛 . It follows from (2) that the coordinates of v
relative to the standard basis are
(𝒗)𝑆 = (𝒗1 , 𝒗2 , … , 𝒗𝑛 ) = 𝒗.◄
Remark.
We will see in a subsequent example that a vector and its
coordinate vector will not be the same; the equality that
we observed in the two preceding examples is a special
situation that occurs only with the standard basis for ℝ𝑛 .
- 143 -
Example 3.
Let 𝒗1 = (1, 2, 1), 𝒗2 = (2, 4, 0),and𝒗3 = (3, 3, 4). Show
that the set 𝑆 = {𝒗1 , 𝒗2 , 𝒗3 } is a basis for ℝ3 .
Solution.
To show that the set 𝑆spansℝ3 , we must show that any
arbitrary vector 𝒃 = (𝑏1 , 𝑏2 , 𝑏3 ) can be expressed as a
linear combination 𝑏 = 𝑐1 𝒗1 + 𝑐2 𝒗2 + 𝑐3 𝒗3 of vectors
in 𝑆. Expressing this equation in terms of components
gives
(𝑏1 , 𝑏2 , 𝑏3 ) = 𝑐1 (1, 2, 1) + 𝑐2 (2, 4, 0) + 𝑐3 (3, 3, 4)
or on equating corresponding components
𝑐1 + 2𝑐2 + 3𝑐3 = 𝑏1
2𝑐1 + 9𝑐2 + 3𝑐3 = 𝑏2 (3)
𝑐1 + 4𝑐3 = 𝑏3
Thus, to show that 𝑆 spasmℝ3 , we must demonstrate the
system (3) has a solution for all choice of = (𝑏1 , 𝑏2 , 𝑏3 ) .
To prove that 𝑆 is linearly independent, we must show
that the only solution of
𝑐1 𝒗1 + 𝑐2 𝒗2 + 𝑐3 𝒗3 = 𝟎 (4)
Is 𝑐1 = 𝑐2 = 𝑐3 = 0.
- 144 -
As above, if (4) is expressed in terms of components, the
verification of independence reduces to showing that the
homogeneous system
𝑐1 + 2𝑐2 + 3𝑐3 = 0
2𝑐1 + 9𝑐2 + 3𝑐3 = 0 (5)
𝑐1 + 4𝑐3 = 0
has only the trivial solution. Observe that the systems (3)
and (5) have the same coefficient matrix. Thus from
Theorem 4, in Section 2 of Chapter (III) we can
simultaneously prove that 𝑆is linearly independent and
spans ℝ3 by demonstrating that in systems (3) and (5) the
matrix of coefficients
1 2 3
𝐴 = [2 9 3]
1 0 4
has anon zero determinant. But det (𝐴) = −1. So, 𝑆 is a
basis for ℝ3 . ◄
Example 4.
Let 𝑆 = {𝒗1 , 𝒗2 , 𝒗3 }be the basis for ℝ3 in the preceding
example.
(a)Find the coordinate vector of𝒗 = (5, −1, 9) with
respect to S.
- 145 -
(b)Find the vector v inℝ3 whose coordinate vector with
respect to the basis 𝑆 is (𝒗)𝑆 = (−1, 3, 2) .
Solution.
(a)We must find scalars 𝑐1 , 𝑐2 , 𝑐3 such that
𝒗 = 𝑐1 𝒗1 + 𝑐2 𝒗2 + 𝑐3 𝒗3 or, in terms of components,
(5, −1, 9) = 𝑐1 (1, 2, 1) + 𝑐2 (2, 9, 0) + 𝑐3 (3, 3, 4)
Equating corresponding components gives
𝑐1 + 2𝑐2 + 3𝑐3 = 5
2𝑐1 + 9𝑐2 + 3𝑐3 = −1
𝑐1 + 4𝑐3 = 9
Solving this system, we obtain 𝑐1 = 1, 𝑐2 = −1, 𝑐3 = 2
(verify). Therefore, (𝒗)𝑆 = (1, −1, 2).
(b)Using the definition of the coordinate vector (𝒗)𝑆 ,we
obtain
𝒗 = (−1)𝒗1 + 3𝒗2 + 2𝒗3
= (−1)(1, 2, 1) + 3 (2, 9, 0) + 2(3, 3, 4)
= (11, 31, 7). ◄
Example 5.
(a) Show that 𝑆 = {1, 𝑥, 𝑥 2 , … , 𝑥 𝑛 } is a basis for the
vector space 𝑃𝑛 of polynomials of the form 𝑎0 + 𝑎1 𝑥 +
⋯ + 𝑎𝑛 𝑥 𝑛 .
- 146 -
(b) Find the coordinate vector of the polynomial
𝑝 = 𝑎0 + 𝑎1 𝑥 + 𝑎2 𝑥 2 relative to basis 𝑆 = {1, 𝑥, 𝑥 2 } for
𝑃2 .
Solution.
(a)The student can easily show that 𝑆 spans 𝑃𝑛 and 𝑆is
linearly independent set. Thus, 𝑆 is a basis for 𝑃𝑛 ; it is
called the standard basis for 𝑃𝑛 .
(b)The coordinates of 𝑝 = 𝑎0 + 𝑎1 𝑥 + 𝑎2 𝑥 2 are the
scalar coefficients of the basis vectors 1, 𝑥, and 𝑥 2 so
(𝑝)𝑆 = (𝑎0 , 𝑎1 , 𝑎2 ).◄
Example 6.
1 0 0 1 0 0
Let 𝑴1 = [ ] , 𝑴2 = [ ] , 𝑴3 = [ ],
0 1 0 0 1 0
0 0
𝑴4 = [ ]. The set 𝑆 = {𝑴1 , 𝑴2 , 𝑴3 , 𝑴4 } is a basis
0 1
for the vector space𝑀2×2 of 2 × 2 Matrices.
Solution.
To see that 𝑆 spans 𝑀2×2 , note that an arbitrary vector
𝑎 𝑏
(matrix) [ ] can be written as
𝑐 𝑑
𝑎 𝑏 1 0 0 1 0 0 0 0
[ ] = 𝑎[ ]+𝑏[ ]+𝑐[ ]+𝑑[ ]
𝑐 𝑑 0 1 0 0 1 0 0 1
= 𝑎𝑴1 + 𝑏𝑴2 + 𝑐𝑴3 + 𝑑𝑴4
- 147 -
To see that 𝑆 is linearly independent, assume that
𝑎𝑴1 + 𝑏𝑴2 + 𝑐𝑴3 + 𝑑𝑴4 = 𝟎
That is,
1 0 0 1 0 0 0 0 0 0
𝑎[ ]+𝑏[ ]+𝑐[ ]+𝑑[ ]=[ ].
0 0 0 0 1 0 0 1 0 0
It follows that
𝑎 𝑏 0 0
[ ]=[ ]
𝑐 𝑑 0 0
Thus, 𝑎 = 𝑏 = 𝑐 = 𝑑 = 0.So𝑆 is linearly independent.
The basis 𝑆 in this example is called the standard basis
for𝑀2×2 . ◄
●More generally, the standard basis for 𝑀𝑚×𝑛 consists
of the 𝑚 × 𝑛 different Matrices with a single l and zeros
for theremaining entries.
Definition 2.
A nonzero vector space V is called finite – dimensional
if it contains a finite set of vectors {𝒗1 , 𝒗2 , … , 𝒗𝑛 } that
forms abasis. If no such set exists, V is called infinite –
dimensional. In addition, we shall regard the zero vector
space to be finite – dimensional.
Example 7.
The vector spacesℝ𝑛 ,𝑃𝑛 , and 𝑀𝑚×𝑛 are finite –
dimensional. The vector spaces𝐹 (−∞, ∞),
𝐶 (−∞, ∞), 𝐶 𝑚 (−∞, ∞) and 𝐶 ∞ (−∞, ∞) are infinite –
dimensional. The next theorem will provide the key to the
concept of dimensions.
- 148 -
Theorem 2.
If V is a finite – dimensional vector space and
{𝒗1 , 𝒗2 , … , 𝒗𝑛 }is any basis. Then
(a) Every set with more than n vectors is linearly
dependent.
(b) No set with fewer than n vectors spans V.
Proof.
(a) Let 𝑆′ = {𝒘1 , 𝒘2 , … , 𝒘𝑚 } be any set of m vectors in
V, where 𝑚 > 𝑛.
We want to Show that 𝑆′ is linearly dependent. Since
𝑆 = {𝒗1 , 𝒗2 , … , 𝒗𝑛 } is a basis, each𝒘𝑖 can be
expressedas a linear combination of the vectors in𝑆,say
𝒘1 = 𝑎11 𝒗1 + 𝑎21 𝒗2 + ⋯ + 𝑎𝑛1 𝒗𝑛
𝒘2 = 𝑎12 𝒗1 + 𝑎22 𝒗2 + ⋯ + 𝑎𝑛2 𝒗2
⋮⋮⋮⋮
𝒘𝑚 = 𝑎1𝑚 𝒗1 + 𝑎2𝑚 𝒗2 + ⋯ + 𝑎𝑛𝑚 𝒗𝑛
(6)
To show that 𝑆′ is linearly dependent, we must find
scalars𝑘1 , 𝑘2 , … , 𝑘𝑚 , not all zero, such that
𝑘1 𝒘1 + 𝑘2 𝒘2 + ⋯ + 𝑘𝑚 𝒘𝑚 = 0 (7)
Using the equations in (6), we can rewrite (7) as
- 149 -
(𝑘1 𝑎11 + 𝑘2 𝑎12 + ⋯ + 𝑘𝑚 𝑎1𝑚 )𝒗1 +
(𝑘1 𝑎21 + 𝑘2 𝑎22 + ⋯ + 𝑘𝑚 𝑎2𝑚 )𝒗2 + ⋯
+(𝑘1 𝑎𝑛1 + 𝑘2 𝑎𝑛2 + ⋯ + 𝑘𝑚 𝑎𝑛𝑚 )𝒗𝑛 = 0
Thus, from the linear independence of S, the problem of
proving that𝑆 ′ is linearly dependent set reduces to
showing there are scalars 𝑘1 , 𝑘2 , … , 𝑘𝑚 , not all zero, that
satisfy
𝑎11 𝑘1 + 𝑎12 𝑘2 + ⋯ + 𝑎1𝑚 𝑘𝑚 = 0
𝑎21 𝑘1 + 𝑎22 𝑘2 + ⋯ + 𝑎2𝑚 𝑘𝑚 = 0
⋮⋮⋮⋮
𝑎𝑛1 𝑘1 + 𝑎𝑛2 𝑘2 + ⋯ + 𝑎𝑛𝑚 𝑘𝑚 = 0
(8)
But (8) has more unknowns than equations, then the
system has nontrivial solution. So the proof is complete.
(b) Let 𝑆 ′ = {𝒘1 , 𝒘2 , … , 𝒘𝑚 } be any set of m vectors in
V, where𝑚 < 𝑛.
We want to show that 𝑆′does not span V. The proof will
be by contradiction. We will show that assuming 𝑆′spans
V leads to a contradiction of the linear independence of
{𝒗1 , 𝒗2 , … , 𝒗𝑛 } .
- 150 -
If 𝑆′ spans V, then every vector in V is a linear
combination of the vectors in 𝑆 ′ .In particular each basis
vector 𝒗𝑖 is a linear combination of the vectors in 𝑆′ , say
𝒗1 = 𝑎11 𝒘1 + 𝑎21 𝒘2 + ⋯ + 𝑎𝑚1 𝒘𝑚
𝒗2 = 𝑎12 𝒘1 + 𝑎22 𝒘2 + ⋯ + 𝑎𝑚2 𝒘𝑚
⋮⋮⋮⋮
𝒗𝑛 = 𝑎1𝑛 𝒘1 + 𝑎2𝑛 𝒘2 + ⋯ + 𝑎𝑚𝑛 𝒘𝑚
(9)
To obtain our contradiction we will show that there are
scalars 𝑘1 , 𝑘2 , … , 𝑘𝑛 not all zero, such that
𝑘1 𝒗1 + 𝑘2 𝒗2 + ⋯ + 𝑘𝑛 𝒗𝑛 = 0 (10)
But observe that (9) and (10) have the same form as (6)
and (7) except that m and n are interchanged and the w’s
and v’s are interchanged. Thus, the computation that led
to(8) now yields
𝑎11 𝑘1 + 𝑎12 𝑘2 + ⋯ + 𝑎1𝑛 𝑘𝑛 = 0
𝑎21 𝑘1 + 𝑎22 𝑘2 + ⋯ + 𝑎2𝑛 𝑘𝑛 = 0
⋮⋮⋮⋮
𝑎𝑚1 𝑘1 + 𝑎𝑚2 𝑘2 + ⋯ + 𝑎𝑚𝑛 𝑘𝑛 = 0
This linear system has more unknowns than equations,
and therefore has nontrivial solution.◄
- 151 -
Theorem 3.
All basis for a finite – dimensional vector space has the
same number of vectors.
Definition 3.
The dimension of a finite – dimensional vector space V,
denoted by dim (V), is defined to be the number of
vectors in a basis for V. In addition, we define the zero-
vector space to have dimension zero.
Example 8.
dim (ℝ𝑛 ) = 𝑛 [The standard basis has n vectors].
dim (𝑃𝑛 ) = 𝑛 + 1[The standard basis has 𝑛 + 1 vectors].
dim (𝑀𝑚×𝑛 ) = 𝑚𝑛[The standard basis has 𝑚𝑛
vectors].◄
Example 9.
Determine a basis for and the dimension of the solution
space of the homogenous system
2𝑥1 + 2𝑥2 − 𝑥3 + 𝑥5 = 0
−𝑥1 − 𝑥2 + 2𝑥3 − 3𝑥4 + 𝑥5 = 0
𝑥1 + 𝑥2 − 2𝑥3 − 𝑥5 = 0
𝑥3 + 𝑥4 + 𝑥5 = 0
- 152 -
Solution.
The general solution of the given system is
𝑥1 = −𝑠 − 𝑡 , 𝑥2 = 𝑠 , 𝑥3 = −𝑡, 𝑥4 = 0 , 𝑥5 = 𝑡
(Verify). Therefore, the solution vectors can be written
as
𝑥1 −𝑠 − 𝑡 −𝑠 −𝑡 −1 −1
𝑥2 𝑠 𝑠 0 1 0
𝑥3 = −𝑡 = 0 + −𝑡 = 𝑠 0 + 𝑡 −1
𝑥4 0 0 0 0 0
[𝑥5 ] [ 𝑡 ] [ 0 ] [𝑡 ] [0] [1]
Which shows that the vectors
𝒗1 = (−1, 1, 0, 0, 0) and 𝒗2 = (−1, 0, −1, 0, 1)
span the solution space. Since they are also linearly
independent (verify),{𝒗1 , 𝒗2 } is a basis and the solution
space is two- dimensional. ◄
Theorem 4.
- 153 -
Exercise Set (4.6)
(1) Which of the following sets of vectors are basis for
ℝ2 ?
(a) {(1, 3) , (3, 5)}(𝑏){(7, 1) , (10, 1)}
3 4 −4 3
(c) {(7, 1) , (21, 3)}(𝑑) {( , ) , ( , )}
5 5 5 5
- 154 -
(c) {𝑥 2 + 3𝑥 + 5 , 5𝑥 2 + 3𝑥 + 1 , 𝑥 2 + 𝑥 + 1}
(d) {𝑥 + 5 , −𝑥 2 − 𝑥 + 1 , 2𝑥 2 + 𝑥 + 3}
(5) Which of the following sets of vectors are basis for
𝑀2×2 ?
1 0 0 −1 2 2 0 3
(a) = {[ ] ,[ ],[ ] ,[ ]}
0 1 −1 0 2 0 3 3
0 1 0 5 0 0 1 7
(b) = {[ ] ,[ ] ,[ ] ,[ ]}
4 4 5 0 2 −3 0 0
1 1 1 1 1 0 0 1
(c) = {[ ] ,[ ] ,[ ] ,[ ]}
1 0 0 1 1 1 1 1
(6) Determine basis for and dimensions of the given
subspaces (≤ denotes subspace).
(a)𝑊 = {(𝑎, 𝑏, 𝑐 ): 𝑎 + 𝑏 + 𝑐 = 0} ≤ ℝ3 .
(b)𝑊 = {(𝑎, 𝑏, 𝑐 ): 𝑎 + 𝑏 = 𝑏 + 𝑐 = 0} ≤ ℝ3 .
(c)𝑊 = {(𝑎 − 𝑏 , 𝑏 + 𝑐 , 𝑏, 𝑏 − 𝑐): 𝑎, 𝑏, 𝑐 ∈ ℝ} ≤ ℝ4 .
(d) 𝑊 = {𝐴 ∈ 𝑀2×2 ∶ 𝐴𝑡 + 𝐴 = 0} ≤ 𝑀2×2 .
(e) 𝑊 = {𝐴 ∈ 𝑀2×2 ∶ 2𝑎11 + 𝑎22 = 𝑎12 + 𝑎21 = 0} ≤ 𝑀2×2
(f) 𝑊 = {𝑎(𝑥 + 1) + 𝑏(𝑥 2 + 𝑥 ): 𝑎, 𝑏 ∈ ℝ} ≤ 𝑃2
(7) Determine the dimension of and a basis for the
solution space of the systems:
(a)𝑥1 + 𝑥2 − 𝑥3 = 0
−2𝑥1 − 𝑥2 + 2𝑥3 = 0
- 155 -
−𝑥1 + 𝑥3 = 0.
(b)3𝑥1 + 𝑥2 + 𝑥3 + 𝑥4 = 0
5𝑥1 − 𝑥2 + 𝑥3 − 𝑥4 = 0.
(8) Find the coordinate vector of w relative to the basis
𝑆 = {𝒖1 , 𝒖2 } for ℝ2 .
(a)𝒖1 = (1, 0), 𝒖2 = (0, 1); 𝒘 = (3, −7);
(b) 𝒖1 = (1, 1), 𝒖2 = (0, 2) ; 𝒘 = (𝑎, 𝑏).
(10) Find the coordinate vector of p relative to the basis
𝑆 = {𝒑1 , 𝒑2 , 𝒑3 }, where
𝒑 = 4 − 3𝑥 + 𝑥 2 ; 𝒑1 = 1, 𝒑2 = 𝑥 , 𝒑3 = 𝑥 2
(11) Find the coordinate vector of A relative to the basis
= {𝐴1 , 𝐴2 , 𝐴3 , 𝐴4 } , where
2 0 −1 1 1 1
𝑨=[ ] , 𝑨1 = [ ] , 𝑨2 = [ ],
−1 3 0 0 0 0
0 0 0 0
𝑨3 = [ ] , 𝑨4 = [ ]
1 0 0 1
- 156 -
CHAPTER (V)
INNER PRODUCT SPACES
- 157 -
CHAPTER (V)
INNER PRODUCT SPACES
1. Inner product
Definition 1.
An inner product on a linear space V having the real
numbers as scalars is a function 〈 , 〉: 𝑉 × 𝑉 → ℝthat
associates with each pair of elements u and v real
number〈𝒖, 𝒗〉 in such a way that the following axioms are
satisfied for all elements u , v and w of V and𝛼 ∈ ℝ:
(1) 〈𝒖, 𝒗〉 = 〈𝒗, 𝒖〉; [Symmetry axiom]
(2) 〈𝒖 + 𝒗, 𝒘〉 = 〈𝒖, 𝒘〉 + 〈𝒗, 𝒘〉; [Additively axiom]
(3) 〈𝛼𝒖, 𝒗〉 = 𝛼 〈𝒖, 𝒗〉; [Homogeneity axiom]
(4) 〈𝒗, 𝒗〉 ≥ 0 [Positivity axiom]
(5) 〈𝒗, 𝒗〉 = 0 if and only if 𝒗 = 𝟎.
Areal linear space with an inner product is called an inner
product space.
Example 1.
Let 𝑉 = ℝ𝑛 and 𝒖 = (𝑎1 , … , 𝑎𝑛 ) and 𝒗 = (𝑏1 , … , 𝑏𝑛 )be
elements in V. Then the function defined by :
〈𝒖, 𝒗〉 = 𝒖 ∙ 𝒗 = 𝑎1 𝑏1 + ⋯ + 𝑎𝑛 𝑏𝑛
- 158 -
is an inner product (called Euclidean inner product). In
this case, V is called Euclidean Product Space.
Solution.
Let 𝒖 = (𝑎1 , … , 𝑎𝑛 ) , 𝒗 = (𝑏1 , … , 𝑏𝑛 ),and 𝒘 =
(𝑐1 , … , 𝑐𝑛 )be elements in ℝ𝑛 and 𝛼𝜖ℝ. Then
(1) 〈𝒖, 𝒗〉 = 𝑎1 𝑏1 + ⋯ + 𝑎𝑛 𝑏𝑛
= 𝑏1 𝑎1 + ⋯ + 𝑏𝑛 𝑎𝑛
= 〈𝒗, 𝒖〉.
(2) 〈𝒖 + 𝒗, 𝒘〉 = (𝑎1 + 𝑏1 )𝑐1 + ⋯ + (𝑎𝑛 + 𝑏𝑛 )𝑐𝑛
= 𝑎1 𝑐1 + 𝑏1 𝑐1 + ⋯ + 𝑎𝑛 𝑐𝑛 + 𝑏𝑛 𝑐𝑛
= (𝑎1 𝑐1 + ⋯ + 𝑎𝑛 𝑐𝑛 ) + (𝑏1 𝑐1 + ⋯ + 𝑏𝑛 𝑐𝑛 )
𝑎1 = ⋯ = 𝑎𝑛 = 0 ⟺ 𝒖 = 𝟎.
- 159 -
Remark.
〈𝒖, 𝒗〉 = 𝛼1 𝑢1 𝑣1 + ⋯ + 𝛼𝑛 𝑢𝑛 𝑣𝑛 (∗)
Example 2.
- 160 -
Solution.
and ∈ ℝ . Then
= 𝛼 (7𝑎1 𝑏1 + 5𝑎2 𝑏2 )
= 𝛼 〈𝒖, 𝒗〉.
⟺ 𝒖 = 𝟎.◄
Example 3.
Let𝑝(𝑥 ) = 𝑎0 + 𝑎1 𝑥 + 𝑎2 𝑥 2 , 𝑞 (𝑥 ) = 𝑏0 + 𝑏1 𝑥 + 𝑏2 𝑥 2 ∈
𝑃2 ,where 〈𝑝(𝑥 ), 𝑞 (𝑥 )〉 = 𝑎0 𝑏0 + 𝑎1 𝑏1 + 𝑎2 𝑏2 . Prove
that 𝑃2 is an inner product space.
- 161 -
Solution.
Let 𝛼 ∈ ℝ and𝑝(𝑥 ) = 𝑎0 + 𝑎1 𝑥 + 𝑎2 𝑥 2 , 𝑞 (𝑥 ) = 𝑏0 +
𝑏1 𝑥 + 𝑏2 𝑥 2 and 𝑟(𝑥 ) = 𝑐0 + 𝑐1 𝑥 + 𝑐2 𝑥 2 ∈ 𝑃2
(1)〈𝑝, 𝑞 〉 = 𝑎0 𝑏0 + 𝑎1 𝑏1 + 𝑎2 𝑏2
= 𝑏0 𝑎0 + 𝑏1 𝑎1 + 𝑏2 𝑎2
= 〈𝑞, 𝑝〉.
(𝑎2 + 𝑏2 )𝑐2
= (𝑎0 𝑐0 + 𝑎1 𝑐1 + 𝑎2 𝑐2 ) + (𝑏0 𝑐0 + 𝑏2 𝑐1 + 𝑏2 𝑐2 )
= 〈𝑝, 𝑟〉 + 〈𝑞, 𝑟〉 .
= 𝛼 (𝑎0 𝑏0 + 𝑎1 𝑏1 + 𝑎2 𝑏2 ) = 𝛼 〈𝑝, 𝑞 〉 .
⟺ 𝑎0 = 𝑎1 = 𝑎2 = 0
⟺ 𝑝 = 𝟎.
- 162 -
Example 4.
𝑏
(2)〈𝑓 + 𝑔, ℎ〉 = ∫𝑎 [𝑓(𝑥 ) + 𝑔(𝑥 )] ℎ(𝑥 )𝑑𝑥
𝑏 𝑏
= 𝛼 〈𝑓, 𝑔〉.
- 163 -
(4) Since f is continuous on [𝑎, 𝑏], then [𝑓(𝑥)]2 ≥ 0 for
all 𝑥 ∈ [𝑎, 𝑏] Therefore 〈𝑓, 𝑓〉 ≥ 0.
𝑏
(5)〈𝑓, 𝑓〉 = 0 ⇔ ∫𝑎 [𝑓(𝑥)]2 𝑑𝑥 = 0 ⟺ 𝑓 (𝑥 ) = 𝟎.
Theorem 4.
𝛼 ∈ ℝ. Then
Proof.
- 164 -
(3) 〈𝒖, 𝛼𝒗〉 = 〈𝛼𝒗, 𝒖〉 = 𝛼 〈𝒗, 𝒖〉 = 𝛼〈𝒖, 𝒗〉.
(4) 〈𝒖 − 𝒗, 𝒘〉 = 〈𝒖 + (−𝒗), 𝒘〉
= 〈𝒖, 𝒘〉 + 〈(−𝒗), 𝒘〉
(5) 〈𝒖, 𝒗 − 𝒘〉 = 〈𝒗 − 𝒘, 𝒖〉
= 〈𝒗, 𝒖〉 − 〈𝒘, 𝒖〉
= 〈𝒖, 𝒗〉 − 〈𝒖, 𝒘〉 .◄
- 165 -
Exercise Set (5.1)
- 166 -
〈𝒖 + 𝒗, 𝒗 + 𝒘〉, 〈2𝒗 − 𝒘, 3𝒖 + 2𝒘〉, 〈𝒖 − 𝒗 + 4𝒘, 3𝒖 − 𝒗〉.
〈𝐴, 𝐵〉 = 𝑎1 𝑏1 + 𝑎2 𝑏2 + 𝑎3 𝑏3 + 𝑎4 𝑏4 .
- 167 -
2.Orthogonally
Definition 1.
𝑑(𝒖, 𝒘) = ‖𝒖 − 𝒘‖ = √〈𝒖 − 𝒘, 𝒖 − 𝒘〉
Example 1.
- 168 -
In particular, if 𝑛 = 3, 𝒖 = (1, 2, 3) and 𝒘 =
(−1, −2 ,4), then‖𝒖‖ = √(1)2 + (2)2 + (3)2 =
Example 2.
- 169 -
Example 4.
1
𝜋 2
Also,𝑑(𝑓(𝑥 ), 𝑔(𝑥 )) = (∫0 (cos 𝑥 − sin 𝑥) 𝑑𝑥 ) = √𝜋.◄
2
Proof.
〈𝒗, 𝒖〉
𝒘=𝒗− 𝒖.
‖𝒖‖2
Therefore,
〈𝒗, 𝒖〉 〈𝒗, 𝒖〉
𝟎 ≤ ‖𝒘‖2 = 〈𝒘, 𝒘〉 = 〈𝒗 − 𝒖, 𝒗 − 𝒖〉
‖𝒖‖2 ‖𝒖‖2
- 170 -
〈𝒗, 𝒖〉 〈𝒗, 𝒖〉 〈𝒗, 𝒖〉2
= 〈𝒗, 𝒗〉 − 〈𝒗, 𝒖〉 − 〈 𝒖, 𝒗〉 + 〈𝒖, 𝒖〉
‖𝒖‖2 ‖𝒖‖2 ‖𝒖‖4
Remarks.
(1) Applying the Caushy-Schwarz inequality to ℝ𝑛 we
obtain the inequality:
(𝑎1 𝑏1 + ⋯ + 𝑎𝑛 𝑏𝑛 )2 ≤ (𝑎12 + ⋯ + 𝑎𝑛2 )(𝑏12 + ⋯ + 𝑏𝑛2 )
(2) Applying the Caushy-Schwarz inequality to 𝐶 [𝑎, 𝑏]
we obtain the inequality:
𝑏 2
(∫ 𝑓 (𝑥 )𝑔(𝑥 )𝑑𝑥 )
𝑎
𝑏 𝑏
2 2
≤ (∫ (𝑓(𝑥 )) 𝑑𝑥 ) (∫ (𝑔(𝑥 )) 𝑑𝑥 )
𝑎 𝑎
- 171 -
Lemma 1.
Proof.
〈𝒗,𝒖〉
Let |〈𝒖, 𝒗〉| = ‖𝒖‖‖𝒗‖ and 𝒘 = 𝒗 − ‖𝒖‖2 𝒖. Then
〈𝒗,𝒖〉
〈𝒘, 𝒘〉 = 𝟎. Therefore 𝒘 = 𝟎. Hence𝒗 = 𝒖.
‖𝒖‖2
- 172 -
(2)‖𝒖‖ = 0 if and only if 𝒖 = 𝟎;
(3)‖𝛼𝒖‖ = |𝛼 |‖𝒖‖;
(4)‖𝒖 + 𝒗‖ ≤ ‖𝒖‖ + ‖𝒗‖ (Triangular inequality).
Proof.
(1) Since 〈𝒖, 𝒖〉 ≥ 0 ,‖𝒖‖ = √〈𝒖, 𝒖〉 ≥ 0.
(2) Since 〈𝒖, 𝒖〉 = 0 if and only if 𝒖 = 𝟎 . Then‖𝒖‖ =
𝟎if and only if 𝒖 = 𝟎.
(3) ‖𝛼𝒖‖2 = 〈𝛼𝒖, 𝛼𝒖〉
= 𝛼 2 〈𝒖, 𝒖〉 = 𝛼 2 ‖𝒖‖2 .
Therefore ‖𝛼𝒖‖ = 𝛼 ‖𝒖‖.
(4) ‖𝒖 + 𝒗‖2 = 〈𝒖 + 𝒗, 𝒖 + 𝒗〉
= 〈𝒖, 𝒖〉 + 2〈𝒖, 𝒗〉 + 〈𝒗, 𝒗〉
≤ 〈𝒖, 𝒖〉 + 2|〈𝒖, 𝒗〉| + 〈𝒗, 𝒗〉
≤ ‖𝒖‖2 + 2‖𝒖‖‖𝒗‖ + ‖𝒗‖2
= (‖𝒖‖ + ‖𝒗‖)2 .
Therefore‖𝒖 + 𝒗‖ ≤ ‖𝒖‖ + ‖𝒗‖. ◄
Lemma 2.
Let V be an inner product space,𝒖, 𝒗, 𝒘 ∈ 𝑉 and
𝛼 ∈ ℝ.Then
(1) 𝑑(𝒖, 𝒗) ≥ 0 ;
- 173 -
(2) 𝑑 (𝒖, 𝒗) = 0 if and only if 𝒖 = 𝒗 ;
(3) 𝑑 (𝒖, 𝒗) = 𝑑(𝒗, 𝒖)
(4) 𝑑 (𝒖, 𝒗) ≤ 𝑑(𝒖, 𝒘) + 𝑑(𝒘, 𝒗)(Triangular
inequality).
Proof.
(1) 𝑑 (𝒖, 𝒗) = ‖𝒖 − 𝒗‖ ≥ 0
(2) 𝑑 (𝒖, 𝒗) = 0 ⇔ ‖𝒖 − 𝒗‖ = 0 ⟺ 𝒖 − 𝒗 = 𝟎 ⟺
𝒖 = 𝒗.
(3) 𝑑 (𝒖, 𝒗) = ‖𝒖 − 𝒗‖ = ‖𝒗 − 𝒖‖ = 𝑑(𝒗, 𝒖).
(4) 𝑑 (𝒖, 𝒗) = ‖𝒖 − 𝒗‖ = ‖𝒖 − 𝒘 + 𝒘 − 𝒗‖
≤ ‖𝒖 − 𝒘‖ + ‖𝒘 − 𝒗‖
= 𝑑 (𝒖, 𝒘) + 𝑑(𝒘, 𝒗).◄
♠Now we show how to use Cauchy – Schwarz inequality
in defining the angle between two vectors in an inner
product space. Let V be an inner product space and 𝒖, 𝒗 ∈
𝑉. From Cauchy Schwarz inequality we have
|〈𝒖, 𝒗〉| ≤ ‖𝒖‖‖𝒗‖ ⟺
|〈𝒖, 𝒗〉|
≤1⟺
‖𝒖‖‖𝒗‖
- 174 -
〈𝒖, 𝒗〉
−1 ≤ ≤1
‖𝒖‖‖𝒗‖
Therefore, there exists unique angle 𝜃 such that
〈𝒖, 𝒗〉
cos 𝜃 =
‖𝒖‖‖𝒗‖
where 0 ≤ 𝜃 ≤ 𝜋.
Definition 2.
Let V be an inner product space and 𝒖and𝒗 be anonzero
vectors in V. Then
(1) The cosine of 𝜃 between u and v is
〈𝒖, 𝒗〉
cos 𝜃 =
‖𝒖‖‖𝒗‖
(2) The vectors u and v are orthogonal if 〈𝒖, 𝒗〉 = 0.
Example 5.
Let ℝ2 be the Euclidean inner product space. Then the
angle between 𝒖 = (1, 2)and 𝒗 = (−1, 3)satisfies the
(1)(−1)+(2)(3) 1
equationcos 𝜃 = = .
√1+4√1+9 √2
- 175 -
But 𝒖 = (1, 2) and 𝒗 = (−1, 1) are not orthogonal if the
inner product on ℝ2 is the inner product defined in
Example 2,in Section 2 of Chapter (IV), where
〈𝒖, 𝒗〉 = 7(1)(−2) + 5(2)(1) ≠ 0.◄
Example 6.
Let 𝐴 = [20 −4
0 0 1
] , 𝐵 = [−7 0
] ∈ 𝑀2×2 and the inner product
is as defined in Exercise (5.1)-(18). Then A and B are
orthogonal as
〈𝐴, 𝐵〉 = (2)(0) + (0)(1) + 0(−7) + (−4)(0) = 0.◄
Example 7.
The vectors sin 𝑥 , cos 𝑥 ∈ 𝐶 [0, 𝜋]is orthogonal, where
the inner product on 𝐶 [0, 𝜋] is defined in Example 4 of
Section 1 in this Chapter as
𝜋 1 𝜋
〈sin 𝑥 , cos 𝑥 〉 = 2
∫0 sin 𝑥 cos 𝑥 𝑑𝑥 = [ sin 𝑥] .◄
2 0
Example 8.
Let 𝑝(𝑥 ) = 2 − 𝑥 + 4𝑥 2 , 𝑞 (𝑥 ) = 3 + 5𝑥 − 𝑥 2 ∈ 𝑃2 ,
where theinner product is defined in Example 3 of
Section 1 in this Chapter. The angle 𝜃 between𝑝(𝑥) and
𝑞(𝑥) satisfies the following equation:
6−5−4 −3
cos 𝜃 = = .◄
√4+1+16√9+25+1 √735
- 176 -
Theorem 2.
Let u and v be any vectors of an inner product space V.
Then u and v are orthogonal if and only if
(‖𝒖 + 𝒗‖2 = ‖𝒖‖2 + ‖𝒗‖2 ) (*)
Proof.
‖𝒖 + 𝒗‖2 = 〈𝒖 + 𝒗, 𝒖 + 𝒗〉
= 〈𝒖, 𝒖〉 + 2〈𝒖, 𝒗〉 + 〈𝒗, 𝒗〉
= ‖𝒖‖2 + 2〈𝒖, 𝒗〉 + ‖𝒗‖2
Thus the identity (*) holds if and only if 〈𝒖, 𝒗〉 = 0, i.e.,
ifand only if u and v are orthogonal.◄
Exercise Set (5.2)
In Exercise (1) – (5) determine the angle between the
given vectors and whose vectors orthogonal with respect
to the Euclidean inner produce on ℝ3 .
(1)𝒖 = (−1, 3, 2), 𝒗 = (4, 2, −1);
(2) 𝒖 = (−2, −2, −2), 𝒗 = (−1, 1, −1);
(3) 𝒖 = (−1, 1, 0), 𝒗 = (4, 0, 9);
(4) 𝒖 = (−1, 5, 2), 𝒗 = (2, 4, −9);
(5)𝒖 = (8, −4, −2), 𝒗 = (−4, 2, 1);
(6) Do the Exercise (1) – (5) if the inner product on ℝ3
is:〈𝒖, 𝒗〉 = 7𝑎1 𝑏1 + 3𝑎2 𝑏2 + 4𝑎3 𝑏3 .
- 177 -
In Exercise (7) – (11) determine the angle between the
given elements and whose are orthogonal with respect to
the Euclidean inner product ℝ4 .
(7)𝒖 = (−4, 0, −10, 1), 𝒗 = (2, 1, −2, 9);
(8)𝒖 = (−1, 1, 0, 2), 𝒗 = (1, −1, 3, 0);
(9)𝒖 = (0, −2, 2, 1), 𝒗 = (−1, −1, 1, 1);
(10)𝒖 = (1, 2, 0, 1), 𝒗 = (−2, 3, 1, 2);
(11)𝒖 = (−2, 3, 1, 4), 𝒗 = (−3, 1, 4, 6).
(12) Do the Exercise (7) – (11) if the inner product on
ℝ4 isdefined as
〈𝒖, 𝒗〉 = 𝑎1 𝑏1 + 𝑎2 𝑏2 + 2𝑎3 𝑏3 + 3𝑎4 𝑏4 .
In Exercise (13) – (17) Determine the angle 𝜃between
the given vectors and whose are orthogonal where
theinner product on 𝑃2 is defined in Example 3of Section
1 in this Chapter
(13) 𝑝(𝑥 ) = 𝑥 2 + 2𝑥 + 3 , 𝑞 (𝑥 ) = 𝑥 2 + 1 ;
(14) 𝑝(𝑥 ) = 2𝑥 2 + 𝑥 − 3 , 𝑞 (𝑥 ) = 4𝑥 2 − 𝑥 ;
(15) 𝑝(𝑥 ) = 2𝑥 2 + 5𝑥 − 1 , 𝑞 (𝑥 ) = 9𝑥 2 + 2𝑥 + 1 ;
(16) 𝑝(𝑥 ) = 2𝑥 2 − 𝑥 + 1 , 𝑞 (𝑥 ) = 𝑥 2 + 2𝑥 ;
(17) 𝑝(𝑥 ) = 2𝑥 2 − 𝑥 + 1 , 𝑞 (𝑥 ) = 𝑥 2 + 2𝑥 ;
- 178 -
In Exercise (18) – (22) Find the angle 𝜃 between the given
vectors and determine whose are orthogonal, where the inner
product on 𝑀2×2 is defined in Exercise (5.1)-(18).
(18)= [21 −3
6
] , 𝐵 = [01 23] ;
(19) 𝐴 = [−1
6 1
2
] , 𝐵 = [33 01] ;
(21) = [52 −4
6
] , 𝐵 = [−2 −1
1 −5
];
- 179 -
3. Orthonormal Basis
In many problems involving vector spaces, the problem
solver is free to choose any basis for the vector space that
seems appropriate. In inner product spaces the solution
of a problem is often greatly simplified by choosing a
basis in which the vectors are orthogonal to one another.
In this section we shall show how such bases can be
obtained.
Definition 1.
A set of vectors in an inner product space is called an
orthogonal set if all pairs of distinct vectors in the set are
orthogonal. An orthogonal set in which each vector has
norm 1 is called orthonormal.
Example 1.
1 1
Let 𝒖1 = (1, 1, 0, 1) , 𝒖2 = (1, −2, 0, 1)and
√3 √6
1
𝒖3 = (−1,0,2,1)be three vectors in the Euclidean
√6
- 180 -
Example 2.
1 3 3 2 1
Let𝑆 = { , √ 𝑥, √ (𝑥 2 − )} ⊆ 𝐶 [−1,1], where the
√ 2 2 2 5 3
Remark.
If V is a nonzero vector in an inner product space, then
1 1 1 1
‖𝒗‖
𝒗 has norm 1, as ‖‖𝒗‖ 𝒗‖ = |‖𝒗‖| ‖𝒗‖ = ‖𝒗‖ ‖𝒗‖ = 1.
- 181 -
called normalizing v. An orthogonal set of nonzero
vectors can always be converted to an orthonormal set by
normalizing each of its vectors.
Example 3.
Let 𝒖1 = (0,1,0), 𝒖2 = (1,0,1), 𝒖3 = (1,0, −1)and
assume that ℝ3 has the Euclidean inner product. It
follows that the set of vectors 𝑆 = {𝒖1 , 𝒖2 , 𝒖3 } is
orthogonal since 〈𝒖1 , 𝒖2 〉 = 〈𝒖1 , 𝒖3 〉 = 〈𝒖2 , 𝒖3 〉 =
0.Also the Euclidean norms of the vectors are
‖𝒖1 ‖ = 1, ‖𝒖2 ‖ = √2, ‖𝒖3 ‖ = √2
Consequently, normalizing 𝒖1 , 𝒖2 , and 𝒖3 yields
𝒖 𝒖 1 1
𝒗1 = ‖𝒖1 ‖ = (0,1,0) , 𝒗2 = ‖𝒖2 ‖ = ( , 0, ),
1 2 √2 √2
𝒖 1 −1
𝒗3 = ‖𝒖3 ‖ = ( , 0, ). We leave it for you to verify that
3 √2 √2
- 183 -
Theorem 2.
If 𝑆 = {𝒗1 , … , 𝒗𝑛 }is an orthonormal basis for an inner
product space V, and u is any vector in V, then
𝒖 = 〈𝒖, 𝒗1 〉𝒗1 + 〈𝒖, 𝒗2 〉𝒗2 + ⋯ + 〈𝒖, 𝒗𝑛 〉𝒗𝑛 .
Proof.
Since 𝑆 = {𝒗1 , … , 𝒗𝑛 } is a basis, a vector u can be
expressed in the form𝒖 = k1 𝒗1 + ⋯ + k 𝑛 𝒗𝑛 .
We shall complete the proof by showing that k 𝑖 =
〈𝒖, 𝒗𝑖 〉for = 1, … , 𝑛 . For each vector 𝒗𝑖 in S we have
〈𝒖, 𝒗𝑖 〉 = 〈k1 𝒗1 + ⋯ + k 𝑛 𝒗𝑛 , 𝒗𝑖 〉
= k1 〈𝒗1 , 𝒗𝑖 〉 + ⋯ + k 𝑛 〈𝒗𝑛 , 𝒗𝑖 〉.
Since 𝑆 = {𝒗1 , … , 𝒗𝑛 } is an orthonormal set, we have
〈𝒗𝑖 , 𝒗𝑖 〉 = ‖𝒗𝑖 ‖2 = 1 and 〈𝒗𝑗 , 𝒗𝑖 〉 = 0 if 𝑗 ≠ 𝑖.
Therefore, the above expression for 〈𝒖, 𝒗𝑖 〉 simplifies to
〈𝒖, 𝒗𝑖 〉 = k 𝑖 . The scalars 〈𝒖, 𝒗1 〉, 〈𝒖, 𝒗2 〉, … , 〈𝒖, 𝒗𝑛 〉 in the
above theorem are the coordinates of u relative to the
orthonormal basis 𝑆 = {𝒗1 , … , 𝒗𝑛 } and
(𝒖)𝑆 = (〈𝒖, 𝒗1 〉, 〈𝒖, 𝒗2 〉, … , 〈𝒖, 𝒗𝑛 〉)
is the coordinate vector of u relative to this basis.◄
- 184 -
Example 4.
4 3 3 4
Let 𝒗1 = (0,1,0), 𝒗2 = (− , 0, ) , 𝒗3 = ( , 0, ). It is
5 5 5 5
- 185 -
Theorem 3 (Gram – Schmidt algorithm).
Every nonzero finite – dimensional inner product space
has an orthonormal basis.
Proof.
Let V be any nonzero finite – dimensional inner product
space, and let {𝒖1 , … , 𝒖𝑛 } be any basis for V. It suffice to
show that V has an orthogonal basis, since the vectors in
the orthogonal basis can by normalized to produce an
orthonormal basis for V. The following sequence of steps
will produce an orthogonal basis {𝒗1 , … , 𝒗𝑛 } for V. Let
Step 1.𝒗1 = 𝒖1
〈𝒖2 ,𝒗1 〉
Step 2.𝒗2 = 𝑢2 − ‖𝒗1 ‖2
𝒗1
〈𝒖3 ,𝒗1 〉 〈𝒖3 ,𝒗2 〉
Step 3.𝒗3 = 𝒖3 − ‖𝒗1 ‖2
𝒗1 − ‖𝒗2 ‖2
𝒗2
⋮
〈𝒖𝑛 , 𝒗𝑛−1 〉 〈𝒖𝑛 , 𝒗1 〉
𝒗 𝑛 = 𝒖𝑛 − 𝒗 − ⋯ − 𝒗
‖𝒗𝑛−1 ‖2 𝑛−1 ‖𝒗1 ‖2 1
Since {𝒖1 , … , 𝒖𝑛 } is linearly independence, then
{𝒗1 , … , 𝒗𝑛 } is nonzero.
Now, we proof by induction that{𝒗1 , … , 𝒗𝑛 } is
orthogonal.
- 186 -
Let = 2 , then
〈𝒖2 , 𝒗1 〉
〈𝒗2 , 𝒗1 〉 = 〈𝒖2 − 𝒗 ,𝒗 〉
‖𝒗1 ‖2 1 1
〈𝒖2 , 𝒗1 〉
= 〈𝒖2 , 𝒗1 〉 − 〈𝒗 , 𝒗 〉
‖𝒗1 ‖2 1 1
= 〈𝒖2 , 𝒗1 〉 − 〈𝒖2 , 𝒗1 〉 = 0.
Therefore 𝒗1 , 𝒗2 are orthogonal.
Now assume{𝒗1 , 𝒗2 , … , 𝒗𝑛−1 }is orthogonal, where 𝑛 > 2
. We prove that {𝒗1 , 𝒗2 , … , 𝒗𝑛 } is orthogonal.
For 𝑖 = 1,2, … , 𝑛 − 1,we have
〈𝒖𝑛 , 𝒗𝑛−1 〉
〈 𝒗 𝑛 , 𝒗 𝑖 〉 = 〈 𝒖𝑛 , 𝒗 𝑖 〉 − 〈𝒗𝑛−1 , 𝒗𝑖 〉 −
‖𝒗𝑛−1 ‖2
〈𝒖𝑛 , 𝒗1 〉
…− 〈𝒗1 , 𝒗𝑖 〉
‖𝒗1 ‖2
〈 𝒖𝑛 , 𝒗 𝑖 〉
= 〈 𝒖𝑛 , 𝒗 𝑖 〉 − 〈𝒗𝑖 , 𝒗𝑖 〉 = 0
‖𝒗𝑖 ‖2
Therefore {𝒗1 , … , 𝒗𝑛 } is orthogonal. From Theorem 1 in
Section 3 of Chapter (IV) {𝒗1 , … , 𝒗𝑛 } is linearly
independent. Hence {𝒗1 , … , 𝒗𝑛 } is an orthogonal basis
𝒗
for the inner product space V. Finally if 𝒘𝑖 = ‖𝒗𝑖‖ for 𝑖 =
𝑖
- 187 -
1, … , 𝑛, we obtain an orthonormal basis {𝒘1 , … , 𝒘𝑛 } for
the inner product space V.◄
Example 5.
Consider the vector space ℝ3 with the Euclidean inner
product. Apply the Gram – Schmidt process to transform
the basis vectors 𝒖1 = (1,1,1) , 𝒖2 = (0,1,1) , 𝒖3 =
(0,0,1)into an orthogonal basis {𝒗1 , 𝒗2 , 𝒗3 } ; then
normalize the orthogonalbasis vectors to obtain an
orthonormal basis {𝒒1 , 𝒒2 , 𝒒3 } .
Solution:
Step 1. 𝒗1 = 𝒖1 = (1,1,1)
〈𝒖2 ,𝒗1 〉
Step 2.𝒗2 = 𝒖2 − ‖𝒗1 ‖2
𝒗1
2 2 1 1
= (0,1,1) − (1,1,1) = (− , , ).
3 3 3 3
〈𝒖3 ,𝒗1 〉 〈𝒖3 ,𝒗2 〉
Step 3. 𝒗3 = 𝒖3 − ‖𝑣1 ‖2
𝒗1 − ‖𝒗2 ‖2
𝒗2
1 1 ⁄3 2 1 1
= (0,0,1) − (1,1,1) − (− , , )
3 2 ⁄3 3 3 3
1 1
= (0, − , ).
2 2
2 1 1 1 1
Thus 𝒗1 = (1,1,1) , 𝒗2 = (− , , ) , 𝒗3 = (0, − , )
3 3 3 2 2
- 188 -
√6 1
vectors are ‖𝒗1 ‖ = √3, ‖𝒗2 ‖ = , ‖𝒗3 ‖ = .So an
3 √2
Example 6.
Consider the subspace W of ℝ5 spanned by {𝒗1 , 𝒗2 , 𝒗3 } ,
where𝒗1 = (−1, −1,1,0,0), 𝒗2 = (0, −1,0,0,1) , 𝒗3 =
(1, −1,0,1,0). Find an orthonormal basis for W, where the
inner product is the Euclidean product.
Solution.
Step 1:𝒖1 = 𝒗1 = (−1, −1,1,0,0).
〈𝒗2 ,𝒖1 〉
Step 2:𝒖2 = 𝒗2 − ‖𝒖1 ‖2
𝒖1 = (0, −1,0,0,1) −
1 1 −2 1
(−1, −1,1,0,0) = ( , , − , 0,1)
3 3 3 3
〈𝒗3 ,𝒗𝒖1 〉 〈𝒗3 ,𝒖2 〉 4 −3 1 3
Step 3.𝒖3 = 𝒗3 − ‖𝑢1 ‖2
𝒖1 − ‖𝒖2 ‖2
𝒖2 = ( , , , 1, )
5 5 5 5
- 189 -
1 1 1 1 −2 1 √15
(− ,− , , 0,0) , ( , ,− , 0, )
√3 √3 √3 √15 √15 √15 5
2 √15 √15 √15 √15
( ,− , , , − ). ◄
√15 10 30 6 10
Example 7.
Let 𝑝, 𝑞 ∈ 𝑃4 and the inner product defined on 𝑃4 is as
follows
〈𝑝, 𝑞 〉 = 𝑝(−2)𝑞 (−2) + 𝑝(−1)𝑞 (−1) + 𝑝(0)𝑞 (0) +
𝑝(1)𝑞 (1) + 𝑝(2)𝑞(2) and W be a subspace of 𝑃4
spanned by {𝑝1 (𝑥 ) = 1, 𝑝2 (𝑥 ) = 𝑥, 𝑝3 (𝑥) = 𝑥 2 }.
Find an orthogonal basis for W.
Solution.
Step 1. 𝑞1 = 𝑝1 = 1
Step 2.
〈𝑝2 , 𝑞1 〉 = 𝑝2 (−2)𝑞1 (−2) + 𝑝2 (−1)𝑞1 (−1)
+ 𝑝2 (0)𝑞1 (0) + 𝑝2 (1)𝑞1 (1) + 𝑝2 (2)𝑞1 (2)
=0
〈𝑝2 ,𝑞1 〉
Hence 𝑞2 = 𝑝2 − ‖𝑞1 ‖2
𝑞1 = 𝑥 − 0 = 𝑥
〈𝑝3 ,𝑞2 〉 〈𝑝3 ,𝑞1 〉
Step 3:𝑞3 = 𝑝3 − ‖𝑞2 ‖2
𝑞2 − ‖𝑞1 ‖2
𝑞1 ,
- 190 -
where 〈𝑝3 , 𝑞1 〉 = 10, ‖𝑞1 ‖2 = 〈𝑞1 , 𝑞1 〉 = 5, and
10
〈𝑝3 , 𝑞2 〉 = 0. Therefore, 𝑞3 = 𝑥 3 − 0 − (1) = 𝑥 2 − 2
5
- 191 -
Definition.
Let A be an 𝑛 × 𝑚 matrix. The subspace of ℝ𝑛 spanned
by the rows (columns) of A is called the row (column)
space.
(4) Apply the Gram – Schmidt process to find an
orthonormal basis for the column space of the matrix A,
where the inner product is the Euclidean
3 0 5 1 0 0
(𝑎) 𝐴 = [0 1 −2] , (𝑏) 𝐴 = [1 1 0]
4 1 1 1 1 1
(5) Find an orthonormal basis for the row space for the
matrix A in Exercise (4).
- 192 -
The equation can be rearranged as follows:
Rearrange: 𝐴𝒙 − 𝜆𝒙 = 0
Factorize: (𝐴 − 𝜆)𝒙 = 0
Make 𝜆 into a matrix:(𝐴 − 𝜆𝐼)𝒙 = 0
This form is useful for finding eigenvalues and
eigenvectors.
We can see from the last version of the equation that 𝒙 =
0 is a solution (trivial).
To find the eigenvalues (𝜆), matrix theory tells us that we
must set the determinant of 𝐴 − 𝜆𝐼 equal to 0.
i.e.|𝐴 − 𝜆𝐼| = 0
To find the eigenvectors, put the eigenvalues back into
the original equation and solve.
The characteristic polynomial for the matrix A is:
𝑝(𝜆) = |𝐴 − 𝜆𝐼|.
Some texts used the form (𝐴 − 𝜆𝐼) which implies ±𝜆𝑛 .
Example 1.
Find the eigenvalues and eigenvectors for the matrix𝐴 =
6 3
[ ].
−2 1
- 193 -
Solution.
Finding the characteristic polynomial
𝜆 0
We know that 𝜆𝐼 = [ ]. Thus
0 𝜆
6 3 𝜆 0 6−𝜆 3
𝐴 − 𝜆𝐼 = [ ]−[ ]=[ ]
−2 1 0 𝜆 −2 1−𝜆
Therefore
𝑝(𝜆) = |𝐴 − 𝜆𝐼|
= (6 − 𝜆)(1 − 𝜆) − 3(−2) =6−
7𝜆 + 𝜆2 + 6
= 𝜆2 − 7𝜆 + 12.
- 195 -
Example 2.
Find the characteristic polynomial, eigenvalues and
1 2
eigenvectors for the matrix𝐴 = [ ].
4 3
Solution.
𝜆 0 1 2 𝜆−1 −2
𝜆𝐼 − 𝐴 = [ ]−[ ]=[ ].
𝜆 0 4 3 −4 𝜆−3
The characteristic polynomial is
𝜆−1 −2
𝑝(𝜆) = | | = (𝜆 − 1)(𝜆 − 3) − 8
−4 𝜆−3
= 𝜆2 − 4𝜆 − 5.
And the eigenvalues are 𝜆 = 5 𝑎𝑛𝑑 𝜆 = −1.
1 2 𝑥 𝑥
The eigenvector for 𝜆 = 5: [ ] [𝑦] = 5 [𝑦]yields the
4 3
equations 𝑥 + 2𝑦 = 5𝑥 and 4𝑥 + 3𝑦 = 𝑦.
Solving, we find that 𝑦 = 2𝑥; therefore the eigenvector is
1
[ ]. Using the same method, we find the eigenvector for
2
1
𝜆 = −1 is [ ].◄
−1
- 196 -
♣The eigenvalues for a matrix that is upper or lower
diagonal come from the diagonal. For the upper diagonal
1 1 1 1
0 1 1 1
matrix 𝐶 = [ ],
0 0 1 1
0 0 0 1
𝜆−1 1 1 1
0 𝜆−1 1 1
where (𝜆𝐼 − 𝐴) = [ ],
0 0 𝜆−1 1
0 0 0 𝜆−1
the determinant is (𝜆 − 1)4 , which yields the eigenvalue
𝜆 = 1.
Example 3.
0 2 1
Find the eigenvalues for the matrix 𝐴 = [1 0 0].
0 1 0
Solution.
we found that the characteristic polynomial 𝑝(𝜆) = 𝜆3 −
2𝜆 − 1 could not be factored. We discussed using either
synthetic division or long division to solve. Two of the
1+√5
eigenvalues looked VERY familiar, 𝜆 = = 𝜌 and
2
1
𝜆 = 1 − 𝜌 = − , and the third 𝜆 = −1.◄
𝜌
- 197 -
Theorem 1.
If 𝜆1 , … 𝜆𝑚 are mutually distinct eigenvalues of A with
eigenvectors 𝒙1 , … 𝒙𝑚 , then {𝒙1 , … 𝒙𝑚 } is a linearly
independent set.
Corollary 1.
If a 𝑛 × 𝑛 matrix A has n distinct eigenvalues, then A
has n linearly independent eigenvectors.
Theorem 2.
Let Abe an𝑛 × 𝑛matrix. Then A is invertible if and only
if the number 0 is not an eigenvalue of A.
Example 4:
6 8 4
𝐴 = [−2 −2 −2]
−1 −2 1
What are the eigenvalues and eigenvectors?
Solution. Outlines
1) Find the characteristic polynomial.
When it is zero the solution to it are the only possible
eigenvalues:
𝜆−6 −8 −4
|𝜆𝐼 − 𝐴| = | 2 𝜆−2 2 | = (𝜆 − 1)2 (𝜆 − 2)1
1 2 𝜆−1
- 198 -
has eigenvalues λ = 1, 1, 2.
Solve: 𝐴𝒙 = 1𝒙 and A𝒙=2𝒙
𝐴𝒙 = 1𝒙 has eigenspace of dimension 1.
A𝒙=2𝒙 has eigenspace of dimension 2.
Example 5.
Find the characteristic polynomial, eigenvalues and
eigenvectors for the matrix A, where
−𝟑 𝟏 −𝟏
𝑨 = (−𝟕 𝟓 −𝟏).
−𝟔 𝟔 −𝟐
Solution.
We have
−𝟑 − 𝝀 𝟏 −𝟏
𝑨 − 𝝀𝑰𝟑 = ( −𝟕 𝟓−𝝀 −𝟏 ).
−𝟔 𝟔 −𝟐 − 𝝀
and
|𝑨 − 𝝀𝑰𝟑 | = −𝝀𝟑 + 𝟏𝟐𝝀 + 𝟏𝟔.
In general, it is not easy to find the roots of a polynomial
of degree 3. We try some simple values and hope we will
be lucky. For 𝝀 = −𝟐, you get
|𝑨 − 𝝀𝑰𝟑 | = −(−𝟐)𝟑 + 𝟏𝟐(−𝟐) + 𝟏𝟔 = 𝟎,
so that 𝝀 = −𝟐 is a root of |𝑨 − 𝝀𝑰𝟑 |.
Hence (𝝀 + 𝟐) is a factor of |𝑨 − 𝝀𝑰𝟑 | and |𝑨 − 𝝀𝑰𝟑 | can
be written in the form
- 199 -
|𝑨 − 𝝀𝑰𝟑 | = (𝝀 + 𝟐)(𝒂𝝀𝟐 + 𝒃𝝀 + 𝒄)
= 𝒂𝝀𝟑 + 𝝀𝟐 (𝒃 + 𝟐𝒂) + 𝝀(𝒄 + 𝟐𝒃) + 𝟐𝒄.
We now have a, band c must satisfy
𝒂 = −𝟏
𝒂 = −𝟏
𝒃 + 𝟐𝒂 = 𝟎
{ , which yields { 𝒃 = 𝟐 .
𝒄 + 𝟐𝒃 = 𝟏𝟐
𝒄=𝟖
𝟐𝒄 = 𝟏𝟔
We are now in position to find all the factors of |𝑨 −
𝝀𝑰𝟑 |:
|𝑨 − 𝝀𝑰𝟑 | = (𝝀 + 𝟐)(−𝝀𝟐 + 𝟐𝝀 + 𝟖)
= −(𝝀 + 𝟐)𝟐 (𝝀 − 𝟒).
Therefore, A has two eigenvalues: 𝝀𝟎 = −𝟐 and 𝝀𝟏 = 𝟒.
To get the eigenvectors associated with 𝝀𝟎 = −𝟐, we
solve (𝑨 + 𝟐𝑰𝟑 )𝒖 = 𝟎.
−𝟏 𝟏 −𝟏 𝒙𝟏 𝟎
(𝑨 + 𝟐𝑰𝟑 )𝒖 = (−𝟕 𝟕 −𝟏) (𝒙𝟐 ) = (𝟎).
−𝟔 𝟔 𝟎 𝒙𝟑 𝟎
−𝒙𝟏 +𝒙𝟐 −𝒙𝟑 = 𝟎
𝒙𝟏 = 𝒙 𝟐
⇔ {−𝟕𝒙𝟏 +𝟕𝒙𝟐 −𝒙𝟑 = 𝟎 ⇔ { 𝒙 = 𝟎 .
𝟑
−𝟔𝒙𝟏 +𝟔𝒙𝟐 =𝟎
Hence the eigenvectors associated with the eigenvalue
𝟏
𝝀𝟎 = −𝟐 are of the form 𝜶 (𝟏).
𝟎
- 200 -
To get the eigenvectors associated with 𝝀𝟏 = 𝟒, we solve
(𝑨 − 𝟒𝑰𝟑 )𝒖 = 𝟎:
−𝟕 𝟏 −𝟏 𝒙𝟏 𝟎
(𝑨 − 𝟒𝑰𝟑 )𝒖 = (−𝟕 𝟏 −𝟏) (𝒙𝟐 ) = (𝟎)
−𝟔 𝟔 −𝟔 𝒙𝟑 𝟎
−𝟕𝒙𝟏 +𝒙𝟐 −𝒙𝟑 = 𝟎
𝒙 =𝟎
⇔ {−𝟕𝒙𝟏 +𝒙𝟐 −𝒙𝟑 = 𝟎 ⇔{ 𝟏 .
𝒙𝟐 = 𝒙𝟑
−𝟔𝒙𝟏 +𝟔𝒙𝟐 −𝟔𝒙𝟑 = 𝟎
Hence the eigenvectors associated with the eigenvalue
𝟎
𝝀𝟏 = 𝟒 are of the form 𝜶 (𝟏).◄
𝟏
Example 5.
Find the characteristic polynomial, eigenvalues and
eigenvectors of the matrix A, where
5 4 2
𝐴 = [4 5 2].
2 2 2
Solution.
The characteristic polynomial is
𝜆 − 5 −4 −2
𝑃(𝜆) = 𝑑𝑒𝑡( 𝜆𝐼 − 𝐴) = | −4 𝜆 − 5 −2 |
−2 −2 𝜆 − 2
( )2( )
= 𝜆 − 1 𝜆 − 10 = 0.⇒ 𝜆 = 1, 1, and 10.
1. As𝜆 = 1,
−4 −4 −2 𝑥1
(1 ⋅ 𝐼 − 𝐴)𝑥 = [−4 −4 −2] [𝑥2 ] = 0.
−2 −2 −1 𝑥3
⇔ 𝑥1 = −𝑠 − 𝑡, 𝑥2 = 𝑠, 𝑥3 = 2𝑡
- 201 -
𝑥1 −𝑠 − 𝑡 −1 −1
⇔ 𝑥 = [𝑥2 ] = [ 𝑠 ] = 𝑠 [ 1 ] + 𝑡 [ 0 ] , 𝑠, 𝑡 ∈ 𝑅.
𝑥3 2𝑡 0 2
Thus,
−1 −1
= 𝑠 [ 1 ] + 𝑡 [ 0 ] , 𝑠, 𝑡 ∈ 𝑅, 𝑠 ≠ 0 or 𝑡 ≠ 0,
0 2
are the eigenvectors associated with eigenvalue𝜆 = 1.
2. As𝜆 = 10,
5 −4 −2 𝑥1
(10 ⋅ 𝐼 − 𝐴)𝑥 = [−4 5 −2] [𝑥2 ] = 0.
−2 −2 8 𝑥3
⇔ 𝑥1 = 2𝑟, 𝑥2 = 2𝑟, 𝑥3 = 𝑟
𝑥1 2𝑟 2
⇔ 𝑥 = [𝑥2 ] = [2𝑟] = 𝑟 [2] , 𝑟 ∈ 𝑅. Thus,
𝑥3 𝑟 1
2
𝑟 [2] , 𝑟 ∈ 𝑅, 𝑟 ≠ 0,
1
are the eigenvectors associated with eigenvalue𝜆 = 10.◄
Example 5.
Find the eigenvectors of the matrix A, where
0 1 2
𝐴 = [2 3 0].
0 4 5
Solution.
𝜆 −1 −2
𝑓(𝜆) = 𝑑𝑒𝑡( 𝜆𝐼 − 𝐴) = |−2 𝜆 − 3 0 |
0 −4 𝜆−5
2
= (𝜆 − 1) (𝜆 − 6) = 0
⇒ 𝜆 = 1, 1, and 6.
- 202 -
−1 1 2 𝑥1
3. As 𝜆 = 1,(𝐴 − 1 ⋅ 𝐼)𝑥 = [ 2 2 0] [𝑥2 ] = 0.
0 4 4 𝑥3
𝑥1 1 1
𝑥 = [𝑥2 ] = 𝑡 [−1] , 𝑡 ∈ 𝑅. Thus 𝑡 [−1] , 𝑡 ∈ 𝑅, 𝑡 ≠
𝑥3 1 1
4. As𝜆 = 6,
−6 1 2 𝑥1
(𝐴 − 6 ⋅ 𝐼)𝑥 = [ 2 −3 0 ] [𝑥2 ] = 0.
0 4 −1 𝑥3
𝑥1 3 3
𝑥 = [𝑥2 ] = 𝑟 [2] , 𝑟 ∈ 𝑅 or 𝑟 [2] , 𝑟 ∈ 𝑅, 𝑟 ≠ 0.
𝑥3 8 8
- 203 -
Exercise Set (5.4)
Find the eigenvalues of the matrices in problems 1 - 9.
2 7 3 −2 5 3
1. 𝐴 = ( ) 2. 𝐴 = ( ) 3. 𝐴 = ( )
7 2 1 −1 −4 4
0 0 0 4 0 0
4. 𝐴 = (0 2 5 ) 5. 𝐴 = (0 0 0 )
0 0 −1 1 0 −3
4 −7 0 2
4 0 0
0 3 −4 6
6. 𝐴 = (0 0 0 ) 7. 𝐴 = ( )
0 0 3 −8
1 0 −3
0 0 0 1
5 0 0 0
8 −4 0 0
8. 𝐴 = ( )
0 7 1 0
1 −5 2 1
3 0 0 0 0
−5 1 0 0 0
9. 𝐴 = 3 8 0 0 0 .
0 −7 2 1 0
(−4 1 9 −2 3)
- 204 -
CHAPTER VI
PARTIAL FRACTIONS &
SEQUENCES
- 205 -
CHAPTER VI
- 206 -
𝑓(𝑥) 𝐴 𝐵
(𝑥−𝑎)(𝑥−𝑏)
= + , (1)
𝑥−𝑎 𝑥−𝑏
evaluated at𝑥 = 𝑎.
𝑓(𝑥)
This is the same as covering-up (𝑥 − 𝑎) in (𝑥−𝑎)(𝑥−𝑏) and
- 207 -
Example 1.
𝑥+1
Find the partial fraction decomposition of .
𝑥(𝑥−1)(𝑥+2)
Solution.
𝑥+1 𝐴 𝐵 𝐶
= + +
𝑥 (𝑥 − 1)(𝑥 + 2) 𝑥 𝑥−1 𝑥+2
𝑥+1
By the cover up method, 𝐴 = (𝑥−1)(𝑥+2) evaluated at𝑥 =
𝑓(𝑥) 𝐴 𝐵 𝐶
(𝑥−𝑎)2 (𝑥−𝑏)
= 2 + + ,
(𝑥−𝑎) 𝑥−𝑎 𝑥−𝑏
- 209 -
Solution.
𝑥+1 𝐴 𝐵 𝐶 𝐷
= + (𝑥−1)3 + (𝑥−1)2 + (𝑥−1) (2)
𝑥(𝑥−1)3 𝑥
so 𝐴 = −1
𝑥+1
By the cover up method, 𝐵 = evaluated at𝑥 = 1,
𝑥
so𝐵 = 2.
To find C and D, we have to solve a system of equations.
Since we know A and B, if we multiply (2) times𝑥 (𝑥 −
1)3 , we obtain
𝑥 + 1 = −(𝑥 − 1)3 + 2𝑥 + 𝐶𝑥(𝑥 − 1) + 𝐷𝑥(𝑥 − 1)2 (3)
- 210 -
Note: If you solve this problem using the book’s method,
you will need to solve a system of 4 equations.
Case III Quadratic Roots:
𝑓(𝑥)
Consider the proper rational function (𝑥−𝑎)(𝑥 2 +𝑏𝑥+𝑐)
,
must be determined.
In this case we can use the cover-up method in the linear
term, and a system of equations find𝐵𝑥 + 𝐶.
Example 3.
𝑥+1
Find the partial fraction decomposition of .
𝑥(𝑥 2 −2𝑥−2)
Solution.
𝑥+1 𝐴 𝐵𝑥+𝐶
= + (4)
𝑥(𝑥 2 −2𝑥−2) 𝑥 𝑥 2 −2𝑥−2
𝑥+1
By the cover up method, 𝐴 = (𝑥 2 evaluated at 𝑥 =
−2𝑥−2)
1
0, so 𝐴 = − .
2
- 211 -
1
Since we know 𝐴 = − by cover-up, if we multiply (4)
2
- 212 -
𝑥+1
By the cover up method, 𝐵𝑥 + 𝐶 = evaluated at𝑥 =
𝑥
- 213 -
Exercise Set (6.3)
1) Find the partial fraction decomposition of
2𝑥
4𝑥 2 + 12𝑥 + 9
.
2) Find the partial fraction decomposition of
−10𝑥 2 + 27𝑥 − 14
(𝑥 − 1)3 (𝑥 + 2)
.
3) Find the partial fraction decomposition of
2𝑥 4 +4𝑥 3 −2𝑥 2 +𝑥+7
.
𝑥 3 +2𝑥 2 −𝑥−2
x3 + 1
g( x ) :=
(x −1) (x −2) g(x) :=
1 11 1 4
h( x ) := h( x ) := − + +
2 x3 + 3 x2 − 2 x 2 x 10 ( x + 2 ) 5 ( 2 x − 1 )
1 1 x
i( x ) := i( x ) := − 2
(x +1) x
2 x x +1
- 214 -
2. SEQUENCES
The "sequence" in mathematics is used to describe
an unending succession of numbers. Some possibilities
are
1 ,2 ,3 ,4 ,…
2 ,4 ,6 ,8 ,…
1 1 1
1, , , , …
2 3 4
1 , −1, 1, −1, …
In each case, the dots are used to suggest that the
sequence continues indefinitely, following the obvious
pattern. The numbers in a sequence are called the terms
of the sequence. The terms may be described according to
the positions they occupy. Thus, a sequence has a first
term, a second term, a third term, and so forth. Because
a sequence continues indefinitely, there is no last term.
The most common way to specify a sequence is to
give a formula for the terms. To illustrate the idea, we
have listed in the table below the terms in the sequence
2, 4, 6, 8, … together with their term numbers:
Term number:1 2 3 4 …
Term 2 4 6 8 …
- 215 -
There is a clear relationship here: each term is twice its
term number. Thus, for each positive integer 𝑛 the nth
term in the sequence 2, 4, 6, 8, … is given by the formula
2 𝑛.This is denoted by writing
2 , 4 , 6 , 8 , … ,2 𝑛 , …
or more compactly in bracket notation as
{2𝑛}+∞
𝑛=1
Solution.
Substituting 𝑛 = 1, 2, 3, 4, 5 into the formula2𝑛 yields
21 , 22 , 23 , 24 , 25
or, equivalently 2, 4, 8, 16, 32.◄
Example 2.
Express the following sequences in bracket notation
1 2 3 4
(a) , , , ,…
2 3 4 5
1 1 1 1
(b) , , , ,…
2 4 8 16
- 216 -
(c) 1, −1, 1, −1, …
1 2 3 4
(d) ,− , ,− ,…
2 3 4 5
(e) 1, 3, 5, 7, …
Solution.
(a) Begin by comparing terms and term numbers:
Term number 1 2 3 4 …
1 2 3 4
Term …
2 3 4 5
be written
𝑛 +∞
{ }
𝑛 + 1 𝑛=1
(b)Observe that the sequence can be rewritten as
1 1 1 1
, , , ,…
2 22 23 24
and construct a table comparing terms and term numbers:
Term number 1 2 3 4…
1 1 1 1
Term …
2 22 23 24
- 217 -
1
From the table we see that the nth term is so the
2𝑛
- 218 -
{2 𝑛 − 1}+∞
𝑛=1
- 219 -
Definition 1.
A sequence (or infinite sequence) is a function whose
domain is the set of positive integers.
Because sequences are functions, we may inquire about
the graph of sequence. For example, the graph of the
𝑛+1 +∞
sequence{ }
2𝑛2 𝑛=1
(Figure 1 a)
- 220 -
y
𝑥+1
𝑦= ,𝑥≥1
2𝑥 2
x
(Figure 1 b).
In Figure 2 we have sketched the graphs of 3sequences:
𝑛 − 1 +∞ 𝑛 − 1 +∞
{ } , {(−1)𝑛+1
( )} , {3}+∞
𝑛=1
𝑛 𝑛=1 𝑛 𝑛=1
Figure 2 (a)
Figure 2 (b)
- 221 -
Figure 2 (c)
Figure 3
- 222 -
The following definition expresses this idea precisely.
Definition 2.
A sequence{𝑎𝑛 }+∞
𝑛=1 is said to have the limit L if given
𝑛−1 +∞
diverge, while the sequences { } and{3}+∞
𝑛=1
𝑛 𝑛=1
Theorem 3.
Suppose that the sequences {𝑎𝑛 }and {𝑏𝑛 }converge to
limits 𝐿1 and 𝐿2 , respectively, and c is a constant. Then
(a) lim 𝑐 = 𝑐
𝑛→+∞
- 223 -
(b) lim 𝑐𝑎𝑛 = 𝑐 lim 𝑎𝑛 = 𝑐𝐿1
𝑛→+∞ 𝑛→+∞
𝑎 lim 𝑎𝑛 𝐿
(f) lim ( 𝑛 ) = 𝑛→+∞ = 1 (𝑖𝑓 𝐿2 ≠ 0).
𝑏
𝑛→+∞ lim 𝑏
𝑛 𝐿
𝑛→+∞ 𝑛 2
Example 4.
In each part, determine if the given sequence converges
or diverges. If it converges, find the limit.
𝑛 +∞ 𝑛 +∞
(a) { } (b){(−1)𝑛+1 }
2𝑛+1 𝑛=1 2 𝑛+1 𝑛=1
1 +∞
(c) {(−1)𝑛+1 } (d) {8 − 2 𝑛}+∞
𝑛=1
𝑛 𝑛=1
𝑛 +∞
(e) { 𝑛} .
𝑒 𝑛=1
Solution.
(a) Dividing numerator and denominator by n yields
𝑛 1 1 1
lim = lim 1 = = .
𝑛→+∞ 2 𝑛+1 𝑛→+∞ (2+ ) 2+0 2
𝑛
𝑛 +∞ 1
Thus{ } converges to .
2 𝑛+1 𝑛=1 2
𝑛 1
(b) From part (a), lim =
𝑛→+∞ 2 𝑛+1 2
- 224 -
Thus, since(−1)𝑛+1 between + 1 and -1, the product
𝑛
(−1)𝑛+1 oscillates between positive and negative
2 𝑛+1
1
values, with the odd-numbered terms approaching and
2
1
the even-numbered terms approaching.− . Therefore,
2
𝑛 +∞
the sequence {(−1)𝑛+1 } approaches no limit. It
2 𝑛+1 𝑛=1
diverges.
1 1
(c) Since lim = 0, the product(−1)𝑛+1 ( )
𝑛→+∞ 𝑛 𝑛
- 225 -
However, we can apply L'Hopital's rule to the related
𝑥
problem lim to obtain
𝑥→+∞ 𝑒 𝑥
𝑥 1
lim = lim =0
𝑥→+∞ 𝑒 𝑥 𝑥→+∞ 𝑒 𝑥
𝑛
We conclude from this that lim = 0since the values
𝑛→+∞ 𝑒 𝑛
𝑛 𝑥
of and are the same when x is a positive integer.◄
𝑒𝑛 𝑒𝑥
Example 5.
𝑛
Show that lim √𝑛 = 1.
𝑛→+∞
Solution.
1 𝑛
By L'Hopital's rule, lim ln 𝑛 = 0.Thus lim √𝑛 =
𝑛→+∞ 𝑛 𝑛→+∞
1
lim 𝑛 𝑛
𝑛→+∞
- 226 -
Exercise Set (6.1)
Show the first five terms of the sequence, determine if the
sequence converges, and if so find the limit. (When
writing out the terms of the sequence, you need not find
numerical values: leave the forms in the first form you
obtain).
+∞ +∞
𝑛 𝑛2
(1) { } (2) { }
𝑛+2 𝑛=1 2 𝑛+1 𝑛=1
1 +∞
(3) {2}+∞
𝑛=1 (4) {ln ( )}
𝑛 𝑛=1
ln 𝑛 +∞ 𝜋 +∞
(5) { } (6) {𝑛 sin }
𝑛 𝑛=1 𝑛 𝑛=1
+∞
(−1)𝑛+1
(7) {1 + (−1)𝑛 }+∞
𝑛=1 (8) { }
𝑛2 𝑛=1
+∞
2 𝑛3 𝑛 +∞
(9) {(−1)𝑛 } (10) { 𝑛 }
𝑛3 +1 𝑛=1 2 𝑛=1
(𝑛+1)(𝑛+2) +∞ 𝜋𝑛 +∞
(11) { } (12){ 𝑛 }
2 𝑛2 𝑛=1 4 𝑛=1
+∞ +∞
1
2 𝑛
(13) {𝑛 } 𝑛 (14) {(1 − ) }
𝑛=1 𝑛 𝑛=1
- 227 -
1 3 5 7 1 2 3
(15) , , , , … (16) 0 , , , ,…
2 4 6 8 22 32 42
1 1 1 1
(17) , , , ,… (18) −1 , 2 , −3 ,4 , −5 , …
3 9 27 81
1 1 1 1 1 1 1
(19)(1 − ) , ( − ) , ( − ) , ( − ),…
2 2 3 3 4 4 5
3 3 3
(20) 3 , , , ,…
2 22 23
- 228 -
separately, find a formula for the nth term of the given
sequence
1 1 1
(a) 1 , ,3 , ,5 , ,…
23 24 26
1 1 1 1 1 1 1 1
(b)1 , , , , , , , , , …
3 3 5 5 7 7 9 9
- 229 -
(a) 𝜀 = 0.5 (b) 𝜀 = 0.1 (c) 𝜀 = 0.001
(28) I
𝑛 +∞
f we accept the fact that the sequence{ } converge
𝑛+1 𝑛=1
𝑛 +∞
(b) The sequence{ } converge to 1.
𝑛+1 𝑛=1
- 230 -
3. MONOTONE SEQUENCES
Definition 1.
A sequence{𝑎𝑛 } is called
increasing if 𝑎1 < 𝑎2 < 𝑎3 < ⋯ < 𝑎𝑛 < ⋯
nondecreasing if 𝑎1 ≤ 𝑎2 ≤ 𝑎3 ≤ ⋯ ≤ 𝑎𝑛 ≤ ⋯
decreasing if 𝑎1 > 𝑎2 > 𝑎3 > ⋯ > 𝑎𝑛 >
⋯nonincreasing if 𝑎1 ≥ 𝑎2 ≥ 𝑎3 ≥ ⋯ ≥ 𝑎𝑛 ≥ ⋯
A sequence that is either nondecreasing or nonincreasing
is called monotone, and a sequence that is increasing or
decreasing is called strictly monotone. Observe that a
strictly monotone sequence is monotone, but not
conversely.
Example 1.
1 2 3 𝑛
, , ,…, ,… is increasing
2 3 4 𝑛+1
1 1 1
1 , , ,…, ,… is decreasing
2 3 𝑛
1, 1, 2, 2, 3, 3, … is nondecreasing
1 1 1 1
1 ,1 , , , , , … is nonincreasing
2 2 3 3
- 231 -
1 1 1 1
1,− , , − , … , (−1)𝑛+1 , …
2 3 4 𝑛
is not. The first and second sequences are strictly
monotone.
Frequently, one can guess whether a sequence is
increasing, decreasing, nondecreasing, or nonincreasing
after writing out some of the initial terms. However, to be
certain that the guess is correct, the kind of analysis
illustrated in the next few examples is needed.
Example 2.
Show that
1 2 3 𝑛
, , ,…, ,…
2 3 4 𝑛+1
is an increasing sequence.
Solution.
Although it is intuitively clear that the sequence is
increasing, intuition cannot replace a definitive proof. To
prove that the sequence is increasing, let
𝑥
𝑓(𝑥 ) =
𝑥+1
- 232 -
So, the nth term in the given sequence is 𝑎𝑛 = 𝑓(𝑛). The
function f is increasing for 𝑥 ≥ 1 since
(𝑥+1)(1)−𝑥(1) 1
𝑓′(𝑥 ) = (𝑥+1)2
= (𝑥+1)2. So 𝑓′(𝑥) > 0 for 𝑥 ≥ 1.
Thus, for 𝑛 ≥ 1
𝑛 𝑛 + 1 𝑛2 + 2𝑛 − 𝑛2 − 2 𝑛
𝑎𝑛 − 𝑎𝑛+1 = − =
𝑛+1 𝑛+2 (𝑛 + 1)(𝑛 + 2)
1
= − (𝑛+1)(𝑛+2) < 0.
- 233 -
𝑎𝑛+1
>1 (1a)
𝑎𝑛
for 𝑛 ≥ 1.
Example 3.
Use (1a) to prove that the sequence in Example 2 above is
increasing.
Solution.
As shown in the alternate solution of Example 2,
𝑛 𝑛+1
𝑎𝑛 = 𝑎𝑛𝑑 𝑎𝑛+1 =
𝑛+1 𝑛+2
Thus
(𝑛+1)
𝑎𝑛+1 (𝑛+2) 𝑛2 + 2 𝑛 + 1
= 𝑛 = (2)
𝑎𝑛 𝑛2 + 2 𝑛
(𝑛+1)
𝑎𝑛+1
That is > 1 for 𝑛 ≥ 1.This proves that the sequence
𝑎𝑛
is increasing.◄
●In our subsequent work n! (n factorial)is the product of
0 ! = 1.
- 234 -
Example 4.
Show that the sequence
𝑒 𝑒2 𝑒3 𝑒𝑛
, , ,…, ,…
2! 3! 4! (𝑛 + 1)!
is decreasing
Solution.
𝑒𝑛
We will use (1b). Since 𝑎𝑛 = (𝑛+1)!, it follows on
replacing n by 𝑛 + 1 that
𝑒 𝑛+1 𝑒 𝑛+1
𝑎𝑛+1 = =
[(𝑛 + 1) + 1]! (𝑛 + 2)!
Thus
𝑒 𝑛+1
𝑎𝑛+1 (𝑛+2)! 𝑒 𝑛+1 (𝑛 + 1)! 𝑒
= 𝑒𝑛
= . =
𝑎𝑛 𝑎𝑛 (𝑛 + 2)! 𝑛 + 2
(𝑛+1)!
𝑎𝑛+1
For 𝑛 ≥ 1, 𝑛 + 2 ≥ 3 > 𝑒(≈ 2 .718 … ). So, =
𝑎𝑛
𝑒
< 1 for 𝑛 ≥ 1. This proves that the sequence is
𝑛+2
decreasing. ◄
- 235 -
Theorem 2.
For a nondecreasing sequence 𝑎1 ≤ 𝑎2 ≤ ⋯ ≤ 𝑎𝑛 ≤ ⋯
there are two possibilities:
(a) There is a constant M such that𝑎𝑛 ≤ 𝑀 for all n, in
which case the sequence converges to a limit L
satisfying 𝐿 ≤ 𝑀.
(b) No such constant exists, in which case
lim 𝑎𝑛 = +∞
𝑛→+∞
Theorem 3.
For a nonincreasing sequence 𝑎1 ≥ 𝑎2 ≥ ⋯ ≥ 𝑎𝑛 ≥ ⋯
there are two possibilities:
(a) There is a constant M such that 𝑎𝑛 ≥ 𝑀 for all n, in
which case the sequence converges to a limit L
satisfying 𝐿 ≤ 𝑀.
(b) No such constant exists, in which case
lim 𝑎𝑛 = −∞.
𝑛→+∞
Axiom 1.
- 236 -
number in S, then there is a smallest real number that is
greater than or equal to every number in S.
Axioms 2.
- 237 -
Example 5.
𝑛 +∞
As shown in Examples 2 and 3, the sequence{ } is
𝑛+1 𝑛=1
𝑛
nondecreasing. Since𝑎𝑛 = < 1 , 𝑛 = 1 , 2 , …, the
𝑛+1
converges.
- 238 -
Solution.
It is tedious to determine convergence directly from the
limit
5𝑛
lim
𝑛→+∞ 𝑛!
- 239 -
Thus, if we discard the first four terms of the given
sequence (which won't affect convergence) the resulting
sequence will be decreasing. Moreover, each term in the
sequence is positive. So that by Theorem 3, the sequence
converges to some limit that is ≥ 0.
Exercise Set (6.2)
In Exercise 1-6, use differentiation to show that the
sequence is strictly monotone and classify it as increasing
or decreasing
𝑛 +∞ 1 +∞
(1){ } (2) {3 − }
2 𝑛+1 𝑛=1 𝑛 𝑛=1
1 +∞
(3) { } (4) {𝑛 𝑒 −2 𝑛 }+∞
𝑛=1
𝑛+ln 𝑛 𝑛=1
ln(𝑛+2) +∞
(5) { } (6) {tan−1 𝑛}+∞
𝑛=1
𝑛+2 𝑛=1
𝑛 +∞ 2𝑛 +∞
(9) { 𝑛} (10) { }
2 𝑛=1 1+2𝑛 𝑛=1
- 240 -
10 +∞ 𝑛𝑛 +∞
(11) { 𝑛2
} (12) { }
2 𝑛=1 𝑛! 𝑛=1
1 +∞ 4 𝑛−1 +∞
(15) {𝑛 − } (16) { }
𝑛 𝑛=1 5 𝑛+2 𝑛=1
𝑛 +∞ 𝜋 +∞
(17) { 𝑛 } (18) {cos }
5 𝑛=1 2 𝑛 𝑛=1
- 241 -
CHAPTER (VII)
INFINITE SERIES
- 242 -
CHAPTER (VII)
INFINITE SERIES
1. Infinite Series
Definition 1.
An infinite series is an expression of the form
𝑢1 + 𝑢2 + 𝑢3 + ⋯ + 𝑢𝑘 + ⋯
or in sigma notation
∞
∑ 𝑢𝑘
𝑘=1
- 243 -
𝑛
𝑠𝑛 = 𝑢1 + 𝑢2 + 𝑢3 + ⋯ + 𝑢𝑛 = ∑ 𝑢𝑘
𝑘=1
partial sums.
Example 1.
For the infinite series
3 3 3 3
+ 2+ 3+ 4+⋯
10 10 10 10
the partial sums are
3
𝑠1 =
10
3 3 33
𝑠2 = + 2=
10 10 100
3 3 3 333
𝑠3 = + 2+ 3=
10 10 10 1000
3 3 3 3 3333
𝑠4 = + 2+ 3+ 4=
10 10 10 10 10000
⋮
- 244 -
Definition 2.
Let {𝑠𝑛 } be the sequence of partial sums of the series
∑∞
𝑘=1 𝑢𝑘 . If the sequence {𝑠𝑛 }converges to a limit S, then
Or
9 3 1
𝑠𝑛 = (1 − 10𝑛 ).
10 10
Or
1 1
𝑠𝑛 = (1 − 𝑛 ) (3)
3 10
1
Since → 0 𝑎𝑠 𝑛 → +∞, it follows from (3) that
10𝑛
1
𝑆 = lim 𝑠𝑛 = .
𝑛→+∞ 3
Thus,
1 3 3 3 3
= + + + ⋯+ + ⋯◄
3 10 102 103 10𝑛
Example 3.
Determine the series
1 − 1 + 1 − 1 + …
converges or diverges. If it converges, find its sum.
- 246 -
Solution.
The partial sums are 𝑠1 = 1 , 𝑠2 = 1 − 1 = 0 , 𝑠3 = 1 −
1 + 1 = 1, 𝑠4 = 1 − 1 + 1 − 1 = 0, and so forth. Thus
the sequence of partial sums is
1, 0, 1, 0, 1, 0, …
Since this is a divergent sequence, the given series
diverges and consequently has no sum.◄
●The series in Example 2 and 3 are examples of
geometric series. A geometric series is one of the forms
𝑎 + 𝑎 𝑟 + 𝑎𝑟 2 + ⋯ + 𝑎𝑟 𝑘−1 + ⋯ (𝑎 ≠ 0)
where each term is obtained by multiplying the previous
one by a constant r. the multiplier r is called the ratio for
the series. Some examples of geometric series are:
1 + 2 + 4 + 8 + ⋯ + 2𝑘−1 + ⋯ (𝑎 = 1 , 𝑟 = 2);
3 3 3 1
3+ + +⋯+ + ⋯ (𝑎 = 3 , 𝑟 = );
10 102 10𝑘−1 10
1 1 1 1 1 1
2
− + − ⋯ + (−1)𝑘+1
4 8 2𝑘
+ ⋯ (𝑎 =
2
,𝑟 = − )
2
;
1+ 1+⋯+1 +⋯ (𝑎 = 1 , 𝑟 = 1);
1 − 1 + 1 − ⋯ + (−1)𝑘+1 + ⋯ (𝑎 = 1 , 𝑟 = −1).
The following theorem is the fundamental result on
convergence of geometric series
- 247 -
Theorem 3.
A geometric series
𝑎 + 𝑎𝑟 + 𝑎𝑟 2 + ⋯ + 𝑎𝑟 𝑘−1 + ⋯ (𝑎 ≠ 0)
converges if |𝑟| < 1 and diverges if |𝑟| ≥ 1. When the
series converges the sum is
𝑎
= 𝑎 + 𝑎𝑟 + 𝑎𝑟 2 + ⋯ + 𝑎𝑟 𝑘−1 + ⋯
1−𝑟
Proof.
Let us treat the case|𝑟| = 1 first. If 𝑟 = 1, then the
series is
𝑎 + 𝑎 + ⋯ + 𝑎 + ⋯.
So that the nth partial sum is 𝑠𝑛 = 𝑛𝑎 and lim 𝑠𝑛 =
𝑛→+∞
- 248 -
Multiplying both sides of (4) by r yields
𝑟 𝑠𝑛 = 𝑎 𝑟 + 𝑎𝑟 2 + ⋯ + 𝑎𝑟 𝑛−1 + 𝑎𝑟 𝑛 (5)
and subtracting (5) form (4) gives
𝑠𝑛 − 𝑟𝑠𝑛 = 𝑎 − 𝑎𝑟 𝑛
or (1 − 𝑟)𝑠𝑛 = 𝑎 − 𝑎𝑟 𝑛 (6)
Since 𝑟 ≠ 1in the case we are considering, this can be
rewritten as
𝑎−𝑎𝑟 𝑛 𝑎 𝑎𝑟 𝑛
𝑠𝑛 = = − (7)
1−𝑟 1−𝑟 1−𝑟
- 249 -
1
Since |𝑟| = < 1, the series converges and the sum is
4
𝑎 5 20
= 1 = ◄
1−𝑟 1− 3
4
Example 5.
Find the rational number represented by repeating
decimal 0.784784784 …
Solution.
We can write
0.784784784 … = 0.784 + 0.000784 +
0.000000784 + …
so the given decimal is the sum of a geometric series with
𝑎 = 0.784and 𝑟 = 0.001. Thus
𝑎 0.784 0.784 784
0.784784784…= = = = .◄
1−𝑟 1−0.001 0.999 999
Example 6.
Determine if the series
∞
1 1 1 1
∑ = + + +⋯
𝑘(𝑘 + 1) 1.2 2.3 3.4
𝑘=1
- 250 -
𝑛
1 1 1 1 1
𝑠𝑛 = ∑ = + + + ⋯+
𝑘(𝑘 + 1) 1.2 2.3 3.4 𝑛(𝑛 + 1)
𝑘=1
- 251 -
Example 7.
One of the most famous and important of all diverging
series is the harmonic series
∞
1 1 1 1
∑ =1+ + + +⋯
𝑘 2 3 4
𝑘=1
- 253 -
Exercise Set (7.1)
(1) In each part, find the first four partial sums, find a
closed form for the nth partial sum; determine if this
series converges, and if so give the sum.
2 1
(a) ∑∞
𝑘=1 (b) ∑∞
𝑘=1
5𝑘−1 (𝑘+1)(𝑘+2)
2𝑘−1
(c) ∑∞
𝑘=1 4
2 𝑘+2 7
(4) ∑∞
𝑘=1 (3) ; (5) ∑∞
𝑘=1(−1)
𝑘−1
;
6𝑘−1
3 𝑘+1
(6) ∑∞
𝑘=1 4
𝑘−1
; (7) ∑∞
𝑘=1 (− 2) ;
1 1 1
(8) ∑∞
𝑘=1 ( − ); (9) ∑∞
𝑘=1 ;
𝑘+3 𝑘+4 (𝑘+2)(𝑘+3)
1 1 1
(10) ∑∞
𝑘=1 ( − ); (11) ∑∞
𝑘=1 ;
2𝑘 2𝑘+1 9𝑘 2 +3𝑘−2
1 4𝑘+2
(12) ∑∞
𝑘=1 ; (13) ∑∞
𝑘=1 ;
𝑘 2 −1 7𝑘−1
𝑒 𝑘−1 1 𝑘
(14) ∑∞
𝑘=1 (𝜋) ; (15) ∑∞
𝑘=1 (− 2) ;
- 254 -
5
(16) ∑∞
𝑘=1 ;
𝑘−2
√𝑘+1−√𝑘
(26) Show ∑∞
𝑘=1 =1
√𝑘 2 +𝑘
- 255 -
1
(b) ∑∞ 𝑘
𝑘=1(𝑥 − 3) = if 2 < 𝑥 < 4
4−𝑥
1
(c) ∑∞ 𝑘 2𝑘
𝑘=1(−1) 𝑥 = if −1 < 𝑥 < 1.
1+𝑥 2
- 256 -
2. CONVERGENCE; THE INTEGRAL TEST
Theorem 1.
If the series∑ 𝑢𝑘 converges, then lim 𝑢𝑘 = 0.
𝑘→+∞
Proof.
The term 𝑢𝑘 can be written
𝑢𝑘 = 𝑠𝑘 − 𝑠𝑘−1 (1)
where 𝑠𝑘 is the sum of the first k terms and 𝑠𝑘−1 is the sum
of the first k-1 terms. If S denotes the sum of the series,
then lim 𝑠𝑘 = 𝑆 and since(𝑘 − 1) → +∞ 𝑎𝑠 𝑘 → +∞ we
𝑘→+∞
Proof.
It is just an alternate phrasing of the above theorem and
needs no additional proof.
Example 1.
The series
∞
𝑘 1 2 3 𝑘
∑ = + + +⋯+ +⋯
𝑘+1 2 3 4 𝑘+1
𝑘=1
- 257 -
𝑘 1
diverges since lim = lim 1 = 1 ≠ 0.◄
𝑘→+∞ 𝑘+1 𝑘→+∞ 1+𝑘
WARNING:
The converse of Theorem 1 is false. To prove that a series
converges it does not suffice to show that lim 𝑢𝑘 = 0.
𝑘→+∞
to zero as 𝑘 → +∞.
Theorem 3 (The proof left as exercise).
(a) If∑ 𝑢𝑘 𝑎𝑛𝑑 ∑ 𝑣𝑘 are convergent series, then∑(𝑢𝑘 +
𝑣𝑘 ) are convergent series and the sums of these series
are related by ∑(𝑢𝑘 ± 𝑣𝑘 ) = ∑∞ ∞
𝑘=1 𝑢𝑘 ± ∑𝑘=1 𝑣𝑘 .
- 258 -
(c) Convergence or divergence is unaffected by deleting a
finite number of terms from the beginning of a series;
that is, for any positive integer K, the series
∑∞
𝑘=1 𝑢𝑘 = 𝑢1 + 𝑢2 + 𝑢3 + ⋯
and
∑∞
𝑘=𝐾 𝑢𝑘 = 𝑢𝑘 + 𝑢𝑘+1 + 𝑢𝑘+2 + ⋯
Solution.
3 3 3 3
The series ∑∞
k=1 = + + + ⋯is a convergent
4k 4 42 43
3 1
geometric series(𝑎 = ,𝑟 = ) and the series
4 4
2 2 2 2
∑∞
𝑘=1 =2+ + + + ⋯is also a convergent
5𝑘−1 5 52 53
1
geometric series(𝑎 = 2 , 𝑟 = ).Thus by part (a) of
5
- 259 -
Example 3.
5 5 5 5
The series∑∞
𝑘=1 = 5 + + + ⋯ + + ⋯
𝑘 2 3 𝑘
harmonic series.
Theorem 4.
If∑ 𝑎𝑘 is a series with positive terms, and if there is a
constant 𝑀 such that 𝑠𝑛 = 𝑎1 + 𝑎2 + ⋯ + 𝑎𝑛 ≤ 𝑀for
every n, then the series converges and the sum
𝑆 satisfies 𝑆 ≤ 𝑀. If no such 𝑀 exists then the series
diverges.
1
●If we have a series with positive terms, say ∑∞
𝑘=1 𝑘2
- 260 -
+∞ 1
and if we form the improper integral∫1 𝑑𝑥 whose
𝑥2
- 261 -
Example 5.
1
Determine if ∑∞
𝑘=1 converges or diverges.
𝑘2
Solution.
1
If we replace k by x in the formula 𝑎𝑘 = we obtain the
𝑘2
1
function𝑓 (𝑥 ) = which satisfies the hypotheses of the
𝑥2
- 262 -
+∞ ℓ
1 1
∫ 𝑑𝑥 = lim ∫ 𝑑𝑥 = lim [In ℓ − In 1] = +∞
𝑥 ℓ→+∞ 𝑥 ℓ→+∞
1 1
- 263 -
Remark.
where 𝑝 > 0.
Examples of p-series are
∞
1 1 1 1
∑ = 1 + + + ⋯+ + ⋯ (𝑝 = 1)
𝑘 2 3 𝑘
𝑘=1
∞
1 1 1 1
∑ = 1 + + + ⋯ + +⋯ (𝑝 = 2)
𝑘2 22 32 𝑘2
𝑘=1
- 264 -
∞
1 1 1 1 1
∑ =1+ + +⋯+ + ⋯ (𝑝 = )
√𝑘 √2 √3 √𝑘 2
𝑘=1
1 𝑘 2 +1
(17) ∑∞
𝑘=1 (𝑘+1)[𝑙𝑛 (18) ∑∞
𝑘=1
(𝑘+1)]2 𝑘 2 +3
- 266 -
1 𝑘 1
(19) ∑∞
𝑘=1 (1 + 𝑘 ) (20) ∑∞
𝑘=1 √𝑘 2 +1
tan−1 𝑘
(21) ∑∞
𝑘=1 (22) ∑∞ 2
𝑘=1 sech 𝑘
1+𝑘 2
1 3
(23) ∑∞ 2 2
𝑘=1 𝑘 sin (𝑘 ) (24) ∑∞ 2 −𝑘
𝑘=1 𝑘 𝑒
1
(25) Prove∑∞
𝑘=2 converges if 𝑝 > 1 and
𝑘 (𝑙𝑛 𝑘)𝑝
diverges if 𝑝 ≥ 1.
1
(28) Prove ∑∞
𝑘=3 converges if 𝑝 >
𝑘(ln 𝑘)[ln (ln 𝑘)]𝑝
1 and diverges if 𝑝 ≤ 1.
(29) Prove : If
∑ 𝑢𝑘 converges and∑ 𝑣𝑘 diverges, then ∑(𝑢𝑘 + 𝑣𝑘 )
diverges and∑(𝑢𝑘 − 𝑣𝑘 ) diverges.
(30) Find examples to show that∑(𝑢𝑘 +
𝑣𝑘 )and∑(𝑢𝑘 − 𝑣𝑘 ) may converge or may diverge
if∑ 𝑢𝑘 and∑ 𝑣𝑘 both diverge.
(31) With the help of Exercise (29) determine if the
given series converges or diverges.
∞ 2 𝑘−1 1 ∞ 𝑘2 1
(a)∑𝑘=1 [( ) + ](b) ∑𝑘=1 [ 2 + ]
3 𝑘 1+𝑘 𝑘(𝑘+1)
1 1 1 1
(c) ∑∞
𝑘=1 [ + 3 ](d) ∑∞
𝑘=2 [ 2 − ].
3 𝑘+2 𝑘(𝑙𝑛 𝑘) 𝑘2
𝑘2
- 267 -
3. ADDITIONAL CONVERGENCE TESTS
In this section we will develop some additional
convergence tests for series with positive terms.
Theorem 1 (The Comparison Test).
Let∑ 𝑎𝑘 and∑ 𝑏𝑘 be series with positive terms and
suppose𝑎1 ≤ 𝑏1 , 𝑎2 ≤ 𝑏2 , … , 𝑎𝑘 ≤ 𝑏𝑘 , …
(a) If the "bigger series" ∑ 𝑏𝑘 converges, then the
"smallest series" ∑ 𝑎𝑘 also converges.
(b) If the "smaller series” ∑ 𝑎𝑘 diverges, then the
"bigger series"∑ 𝑏𝑘 also diverges.
Theorem 2 (The Ratio Test).
Let∑ 𝑢𝑘 be a series with positive terms and suppose
𝑢𝑘+1
lim =𝜌
𝑘→+∞ 𝑢𝑘
- 268 -
∞ ∞
1 1
∑ 𝑎𝑛𝑑 ∑
𝑘 𝑘2
𝑘=1 𝑘=1
𝑢𝑘+1 𝑘 + 1 2𝑘 1 𝑘+1 1
𝜌 = lim = lim 𝑘+1 . = lim =
𝑘→+∞ 𝑢𝑘 𝑘→+∞ 2 𝑘 2 𝑘→+∞ 𝑘 2
so that 𝜌 < 1.◄
- 269 -
Example 3.
𝑘𝑘
The series ∑∞
𝑘=1 diverges by the ratio test since
𝑘!
𝑢𝑘+1 (𝑘 + 1)𝑘+1 𝑘 !
𝜌 = lim = lim . 𝑘
𝑘→+∞ 𝑢𝑘 𝑘→+∞ (𝑘 + 1)! 𝑘
(𝑘 + 1)𝑘 1 𝑘
= lim = lim (1 + ) = 𝑒
𝑘→+∞ 𝑘𝑘 𝑘→+∞ 𝑘
Since 𝜌 = 𝑒 > 1, the series diverges.◄
Example 4.
Determine whether the series
1 1 1 1
1+ + + +⋯+ +⋯
3 5 7 2𝑘−1
converges or diverges.
Solution.
The ratio test is of no help since
𝑢𝑘+1 1 2𝑘−1
𝜌 = lim = lim .
𝑘→+∞ 𝑢𝑘 𝑘→+∞ 2 (𝑘 + 1) − 1 1
2𝑘−1
= lim =1
𝑘→+∞ 2 𝑘 + 1
- 270 -
1 ℓ
= lim 𝑙𝑛 (2𝑥 − 1)] = +∞.◄
ℓ→+∞ 2 1
Example 5.
The series
2! 4! 6! (2 𝑘)!
+ 2 + 3 + ⋯+ 𝑘 + ⋯
4 4 4 4
diverges since
𝑢𝑘+1 [2 (𝑘 + 1)]! 4𝑘
𝜌 = lim = lim .
𝑘→+∞ 𝑢𝑘 𝑘→+∞ 4𝑘+1 (2 𝑘)!
(2 𝑘 + 2)! 1 1
= lim ( . ) = lim (2 𝑘 + 1)(2 𝑘 + 2)
𝑘→+∞ (2 𝑘)! 4 4 𝑘→+∞
= +∞.◄
Theorem 3 (The Root Test).
Let∑ 𝑢𝑘 be a series with positive terms and suppose
𝜌 = lim 𝑘√𝑢𝑘 = lim (𝑢𝑘 )1/𝑘
𝑘→+∞ 𝑘→+∞
- 271 -
Example 6.
4𝑘−5 𝑘
The series ∑∞
𝑘=1 ( ) diverges by the root test since
2 𝑘+1
4 𝑘−5
𝜌 = lim 𝑢𝑘 1/𝑘 = lim = 2 > 1.◄
𝑘→+∞ 𝑘→+∞ 2 𝑘+1
Example 7.
1
The series ∑∞
𝑘=1 (𝑙𝑛 converges by the root test, since
(𝑘+1)𝑘
1
lim (𝑢𝑢 )1/𝑘 = lim = 0 < 1.◄
𝑘→+∞ 𝑘→+∞ 𝐼𝑛 (𝑘+1)
∑ 𝑢𝑘 (5)
𝑘=1
can be written as
∞
2𝑘−1 22 23
∑ =1+2+ + +⋯ (7)
(𝑘 − 1)! 2! 3!
𝑘=1
- 272 -
However, for purpose of applying the convergence tests,
it is not necessary that the series have form (5). For
example, we can apply the ratio test to (6) without
converting to the more complicated form (7). Doing so
yields
𝑢𝑘+1 2𝑘+1 𝑘! 2
𝜌 = lim = lim . 𝑘 = lim =0
𝑘→+∞ 𝑢𝑘 𝑘→+∞ (𝑘 + 1)! 2 𝑘→+∞ 𝑘 + 1
1 1 𝑘
(3) ∑∞
𝑘=2 (4) ∑∞
𝑘=1 𝑘 (2)
5𝑘
𝑘! 𝑘
(5) ∑∞
𝑘=1 (6) ∑∞
𝑘=1
𝑘3 𝑘 2 +1
- 273 -
𝑘 1
(9) ∑∞
𝑘=1 (10) ∑∞
𝑘=1
5𝑘 𝑘2
2 𝑘
(19) ∑∞
𝑘=1 𝑘 (3) (20) ∑∞
𝑘=1 𝑘
𝑘
1 2 𝑘
(21) ∑∞
𝑘=2 𝑘 𝐼𝑛 𝑘 (22) ∑∞
𝑘=1 𝑘 3 +1
4 𝑘 (𝑘 !)2 2𝑘
(23) ∑∞
𝑘=1 ( ) (24) ∑∞
𝑘=1 (2
7 𝑘−1 𝑘+2)!
(𝑘 !)2 1
(25) ∑∞
𝑘=0 (2 (26) ∑∞
𝑘=1
𝑘)! 𝑘 2 +25
1 𝑘𝑘
(27) ∑∞
𝑘=1 (28) ∑∞
𝑘=1
1+√𝑘 𝑘!
𝐼𝑛 𝑘 𝑘!
(29) ∑∞
𝑘=1 (30) ∑∞
𝑘=1 2
𝑒𝑘 𝑒𝑘
2
(𝑘+4)! 𝑘 𝑘
(31) ∑∞
𝑘=0 4 !𝑘 ! 4 𝑘 (32) ∞
∑𝑘=1 ( )
𝑘+1
- 274 -
1 .2 1 .2 .3 1 .2 .3 .4
1+ + + +⋯
1 .3 1 .3 .5 1 .3 .5 .7
𝑎𝑘
(34) For which positive values of a does ∑∞
𝑘=1 𝑘𝑎
converges?
(35) (a) Show lim (𝑙𝑛 𝑘)1/𝑘 = 1 = 1
𝑘→+∞
(b) Use the result in (a) and the root test to show
𝑙𝑛 𝑘
that∑∞
𝑘=1 converges.
3𝑘
(c) Show that the series converges using the ratio test.
𝑘!
(36) Prove: lim =0
𝑘→+∞ 𝑘 𝑘
- 275 -
4. Applying the comparison test
Informal Principle 1.
Constants added or subtracted in the formula for𝑢𝑘 may
usually be deleted without affecting the convergence or
divergence behavior of the series.
Example 1.
Use the above principle to help guess whether the
following series converge or diverge.
∞ ∞ ∞
1 1 1
(𝑎) ∑ 𝑘 (𝑏) ∑ (𝑐) ∑
2 +1 √𝑘 − 2 1 3
𝑘=1 𝑘=5 𝑘=1 (𝑘 + )
2
Solution.
(a) Deleting the constant 1 suggest that
∞ ∞
1 1
∑ 𝑘 behaves like ∑ 𝑘
2 +1 2
𝑘=1 𝑘=1
- 276 -
1
The modified series is a divergent p-series(𝑝 = ) , so the
2
Informal Principle 2.
Solution.
(a) Deleting the term 2 k suggest that
- 277 -
∞ ∞ ∞
1 1 1
∑ behaves like ∑ =∑
√𝑘 3 + 2 𝑘 √𝑘 3 𝑘 3/2
𝑘=1 𝑘=1 𝑘=1
- 278 -
∞
1
∑
2 𝑘2 + 𝑘
𝑘=1
converges or diverges
Solution.
Using Principle 2, the series behaves like the series
∞ ∞
1 1 1
∑ = ∑
2 𝑘2 2 𝑘2
𝑘=1 𝑘=1
- 279 -
Example 4.
Use the comparison test to determine whether
∞
1
∑
2 𝑘2 − 𝑘
𝑘=1
converges or diverges.
Solution.
Using Principle 2, the series behaves like the convergent
series
∞ ∞
1 1 1
∑ = ∑
2 𝑘2 2 𝑘2
𝑘=1 𝑘=1
- 280 -
1 1 1
replacing k by𝑘 2 to obtain ≤ = .
2𝑘 2 −𝑘 2 𝑘 2 −𝑘 2 𝑘2
1
Since∑∞
𝑘=1 is a convergent p-series, the given series
𝑘2
Example 5.
Use the comparison test to determine whether
∞
1
∑ 1
𝑘=1 𝑘 − 4
converges or diverges.
Solution.
Using Principle 1, the series behaves like the divergent
1
harmonic series ∑∞
𝑘=1 𝑘
- 281 -
1
and since∑∞
𝑘=1 diverges, the given series diverges by the
𝑘
comparison test.◄
Example 6.
Use the comparison test to determine whether
1
∑∞
𝑘=1 converges or diverges
√𝑘+5
Solution.
Using Principle 1, the series behaves like the divergent
1
p-series∑∞
𝑘=1 √𝑘
𝐾.
1 𝑎𝑘 3
Or on substituting (1) 𝜌 < < 𝜌 when 𝑘 ≥ 𝐾.
2 𝑏𝑘 2
1 3
Or 𝜌 𝑏𝑘 < 𝑎𝑘 < 𝜌 𝑏𝑘 when 𝑘 ≥ 𝐾 (2)
2 2
If∑∞
𝑘=1 𝑎𝑘 converges, then∑∞
𝑘=𝐾 𝑎𝑘 converges by
Theorem3 (c) in Section 2. It follows from the
comparison test and left-hand inequality in
1
(2),that∑∞ ∞
𝑘=𝐾 𝜌 𝑏𝑘 converges. Thus∑𝑘=𝐾 𝑏𝑘 converges
2
- 283 -
3
Conversely, if∑∞ ∞
𝑘=1 𝑏𝑘 converges, ∑𝑘=𝐾 𝜌 𝑏𝑘 converges
2
Example 7.
Use the limit comparison test to determine if
3𝑘 3 −2𝑘 2 +4
∑∞
𝑘=1 converges or diverges.
𝑘 5 −𝑘 3 +2
Solution.
From Principle2, the series behave like
3𝑘 3 3
∑∞
𝑘=1 = ∑∞
𝑘=1 (3)
𝑘5 𝑘2
- 284 -
Exercise Set (7.4)
In Exercise (1)-(6) prove that the series converges by the
comparison test.
1 2 1
(1) ∑∞
𝑘=1 (2) ∑∞
𝑘=1 (3) ∑∞
𝑘=1 .
3𝑘 +5 𝑘 4 +𝑘 5𝑘 2 −𝑘
𝑘 2𝑘 −1 5 sin2 𝑘
(4)∑∞
𝑘=1 (5)∑∞
𝑘=1 (6)∑∞
𝑘=1 .
8 𝑘 2 +𝑘 2 −1 3𝑘 +2 𝑘 𝑘!
- 285 -
√𝑘 4
(23) ∑∞
𝑘=1 (24) ∑∞
𝑘=1
𝑘 2 +1 2+𝑘3𝑘
1 2+(−1)𝑘
(25) ∑∞
𝑘=1 (26) ∑∞
𝑘=1
√𝑘(𝑘+1) 5𝑘
2+√𝑘 4+|cos 𝑘|
(27) ∑∞
𝑘=1 (𝑘+1)3 (28) ∑∞
𝑘=1
−1 𝑘3
1 √𝑘In 𝑘
(29) ∑∞
𝑘=1 (30) ∑∞
𝑘=1
4+2𝑘 𝑘 3 +1
∞ tan−1 𝑘 ∞ 5𝑘 +𝑘
(31) ∑𝑘=1 2 (32) ∑𝑘=1
𝑘 𝑘 !+3
In 𝑘
(33) ∑∞
𝑘=1 𝑘 √𝑘
𝜋
[Hint : Compare with the series∑∞
𝑘=1 ]] 𝑘
- 286 -
(38) Show that𝑘 𝑘 ≥ 𝑘! and use this result to prove that
the series ∑∞
𝑘=1 𝑘
−𝑘
converges by the comparison
test.Prove convergence using the root test.
(39) Use the limit comparison test to investigation
(𝑘+1)2
convergence of∑∞
𝑘=1 (𝑘+2)! .
∑ 𝑎𝑘 converges.
𝑎
(b) If lim ( 𝑘 ) + ∞ and ∑ bk diverges, then
k→+∞ 𝑏 𝑘
∑ 𝑎𝑘 diverges.
- 287 -
5. ALTERNATING SERIES; CONDITIONAL
CONVERGENCE
In this section we discuss series containing negative
terms.
Of special importance are series whose terms are
alternately positive and negative. These are called
alternating series. Such series have one of two possible
forms :
𝑎1 − 𝑎2 + 𝑎3 − 𝑎4 + ⋯ + (−1)𝑘+1 𝑎𝑘 + ⋯
or
−𝑎1 + 𝑎2 − 𝑎3 + 𝑎4 − ⋯ + (−1)𝑘 𝑎𝑘 + ⋯
where the 𝑎𝑘 ′𝑠 are all positive.
The following theorem is the key result on convergence
of alternating series.
Theorem 1 (Alternating Series Test).
An alternating series
𝑎1 − 𝑎2 + 𝑎3 − 𝑎4 + ⋯ + (−1)𝑘+1 𝑎𝑘 + ⋯
or
−𝑎1 + 𝑎2 − 𝑎3 + 𝑎4 − ⋯ + (−1)𝑘 𝑎𝑘 + ⋯
converges if the following two conditions are satisfied :
(𝑎)𝑎1 ≥ 𝑎2 ≥ 𝑎2 ≥ ⋯ ≥ 𝑎𝑘 ≥ ⋯
- 288 -
(𝑏) lim 𝑎𝑘 = 0
𝑘→+∞
- 290 -
Form (1) and (3), the sequence 𝑠1 , 𝑠2 , 𝑠3 , … , 𝑠𝑛 , …
converges to S, so the series converges.◄
Example 1.
The series
1 1 1 1
1− + − + ⋯ + (−1)𝑘+1 + ⋯
2 3 4 𝑘
is called the alternating harmonic series. Since
1 1
𝑎𝑘 = > = 𝑎𝑘+1
𝑘 𝑘+1
and
1
lim 𝑎𝑘 = lim =0
𝑘→+∞ 𝑘→+∞ 𝑘
converges or diverges.
Solution.
Requirement (b) of the alternating series test is satisfied
since
- 291 -
1 3
𝑘+3 + 2
𝑘 𝑘
lim 𝑎𝑘 = lim = lim =0
𝑘→+∞ 𝑘→+∞ 𝑘(𝑘 + 1) 𝑘→+∞ 1 + 1
𝑘
Remark 1.
1 1 1 1 1 1
The series 1 − − 2 + 3 + 4 − 5 − + ⋯does not fit
2 2 2 2 2 26
converges.
- 293 -
Example 4.
The series
1 1 1 1 1 1
1− − 2+ 3+ 4− 5− 6+⋯
2 2 2 2 2 2
converges absolutely since the series of absolute values
1 1 1 1 1 1
1+ + 2+ 3+ 4− 5− 6+⋯
2 2 2 2 2 2
is a convergent geometric series. On the other hand, the
alternating harmonic series
1 1 1 1
1− + − + −⋯
2 3 4 5
does not converge absolutely since the series of absolute
values
1 1 1 1
1+ + + + +⋯
2 3 4 5
diverges.◄
Theorem 4.
If the series
∞
- 294 -
∞
∑ 𝑢𝑘 = 𝑢1 + 𝑢2 + ⋯ + 𝑢𝑘 + ⋯
𝑘=1
- 295 -
Example 5.
In Example (4) we showed that
1 1 1 1 1 1
1− − 2+ 3+ 4− 5− 6+⋯
2 2 2 2 2 2
converges absolutely. It follows from Theorem 4 above
that the series converges.◄
Example 6.
cos 𝑘
Show that the series ∑∞
𝑘=1 converges.
𝑘2
Solution.
cos 𝑘 1 cos 𝑘
Since |cos 𝑘| ≤ 1for all k, | |≤ 2 . Thus
∑∞
𝑘=1 | |
𝑘2 𝑘 𝑘2
- 296 -
series, diverges. Yet in each case the series of absolute
values is
1 1 1
1+ + + ⋯+ + ⋯
2 3 𝑘
which diverges. A series such as (4), which is convergent,
but not absolutely convergent, is called conditionally
convergent.
The following version of the ratio test is useful for
investigating absolute convergence.
Theorem 5 (The Ratio Test for Absolute
Convergence).
Let∑ 𝑢𝑘 be a series with nonzero terms and suppose
|𝑢𝑘+1 |
lim =𝜌
𝑘→+∞ |𝑢𝑘 |
- 297 -
|𝑢𝑘+1 |
𝜌 = lim
𝑘→+∞ |𝑢𝑘 |
2𝑘+1 𝑘! 2
= lim . = lim = 0 < 1.◄
𝑘→+∞ (𝑘+1)! 2𝑘 𝑘→+∞ 𝑘+1
Example 8.
We proved earlier (Theorem 3 in Section 1 in this
Chapter) that a geometric series
𝑎 + 𝑎𝑟 + 𝑎𝑟 2 + ⋯ + 𝑎𝑟 𝑘−1 + ⋯
converges if|𝑟| < 1 and diverges if |𝑟| ≥ 1. However, a
stronger statement can be made - the series converges
absolutely if|𝑟| < 1. This follows from Theorem 5 in
Section 4 above since
|𝑢𝑘+1 | |𝑎𝑟 𝑘 |
𝜌 = lim = lim = lim |𝑟| = |𝑟|
𝑘→+∞ |𝑢𝑘 | 𝑘→+∞ |𝑎𝑟 𝑘−1 | 𝑘→+∞
- 298 -
Review of Convergence Tests
NAME STATEMENT COMMENTS
Divergence test If lim 𝑢𝑘 ≠ 0 then∑ 𝑢𝑘 diverges If lim 𝑢𝑘 =
𝑘→+∞ 𝑘→+∞
0 , ∑ 𝑢𝑘 may or may
not converge.
Integral test Let∑ 𝑢𝑘 be a series with positive Use this test when
terms and let 𝑓(𝑥) be the function 𝑓(𝑥) is easy to
that results when k is replaced by x integrate.
in the formula for 𝑢𝑘 . If f is
decreasing and continuous for 𝑥 ≥
+∞
1, then ∑∞
𝑘−1 𝑢𝑘 𝑎𝑛𝑑 ∫1 𝑓(𝑥)𝑑𝑥
both converges or both diverges.
Comparison test Let∑ 𝑎𝑘 and∑ 𝑏𝑘 be series with Use this test as a last
positive terms such that reason easier to apply.
𝑎1 ≤ 𝑏1 , 𝑎2 ≤ 𝑏2 , … , 𝑎𝑘 ≤ 𝑏𝑘 , …
If
∑ 𝑏𝑘 converges, then∑ 𝑎𝑘
converges, and if∑ 𝑎𝑘 diverges,
then∑ 𝑏𝑘 diverges
Ratio test let∑ 𝑢𝑘 be a series with positive Try this test when 𝑢𝑘
𝑢𝑘+1
terms and let lim =𝜌 involves factorials or
𝑘→+∞ 𝑢𝑘
kth powers.
(a) Series converges if 𝜌 < 1.
(b) Series diverges if 𝜌 > 1 or
𝜌 = +∞
(c) No conclusion if 𝜌 = 1
- 299 -
NAME STATEMENT COMMENTS
Root test Let be∑ 𝑢𝑘 a series with positive Try this test when 𝑢𝑘
terms such that 𝜌 = lim 𝑘√𝑢𝑘 involves kth power.
𝑘→+∞
Ratio test for Let∑ 𝑢𝑘 be a series with nonzero The series need not
absolute |𝑢𝑘+1 | have positive terms
terms such that𝜌 = lim
𝑘→+∞ |𝑢𝑘 |
convergence and need not be
(a) Series converges absolutely
alternating to use this
if 𝜌 < 1
test
(b) Series diverges if 𝜌 > 1
or𝜌 = +∞
(c ) No conclusion if 𝜌 = 1.
- 300 -
Exercise Set (7.5)
In Exercise (1)-(6), use the alternating series test to
determine if the series converges or diverges
(−1)𝑘+1 𝑘
(1) ∑∞
𝑘=1 2 𝑘+1 (2) ∑∞
𝑘=1(−1)
𝑘+1
3𝑘
𝑘+1 𝑘+4
(3) ∑∞
𝑘=1(−1)
𝑘+1
(4) ∑∞
𝑘=1(−1)
𝑘+1
3 𝑘+1 𝑘 2 +𝑘
𝐼𝑛 𝑘
(5) ∑∞
𝑘=1(−1)
𝑘+1 −𝑘
𝑒 (6) ∑∞
𝑘=3(−1)
𝑘
𝑘
3𝑘 𝑘
(9) ∑∞
𝑘=1(−1)
𝑘+1
(10) ∑∞ 𝑘
𝑘=1(−1) (5𝑘 )
𝑘2
∞ 𝑘3 𝑘+1 𝑘
𝑘
(11) 𝑘
∑𝑘=1(−1) ( 𝑘 ) (12) ∑∞
𝑘=1( −1 )
𝑒 𝑘!
- 301 -
𝑘+2 (−1)𝑘+1
(19) ∑∞
𝑘=1(−1)
𝑘+1
( ) (20) ∑∞
𝑘=1
3 𝑘−1 𝑘 2 +1
𝑘+2 (−1)𝑘+1 𝑘 2
(21) ∑∞
𝑘=1(−1)
𝑘+1
(22) ∑∞
𝑘=1
𝑘 (𝑘+3) 𝑘 3 +1
𝑘𝜋 sin 𝑘
(23) ∑∞
𝑘=1 sin (24) ∑∞
𝑘=1
𝑘3 𝑘3
(−1)𝑘 (−1)𝑘
(25) ∑∞
𝑘=2 (26) ∑∞
𝑘=1
𝑘 𝐼𝑛 𝑘 √𝑘 (𝑘+1)
1 (−1)𝑘+1
(27) ∑∞
𝑘=2 (− ) (28) ∑∞
𝑘=1
𝐼𝑛 𝑘 √𝑘+1+√𝑘
(−1)𝑘 (𝑘 2 +1) 𝑘 cos 𝑘𝜋
(29) ∑∞
𝑘=2 (30) ∑∞
𝑘=1
𝑘 3 +2 𝑘 2 +1
- 302 -
value of n for which the nth partial sum approximates the
sum of the series to the stated accuracy.
(−1)𝑘+1
(35) ∑∞
𝑘=1 ; |𝑒𝑟𝑟𝑜𝑟| < 0.0001 .
𝑘
(−1)𝑘+1
(36) ∑∞
𝑘=1 ; |𝑒𝑟𝑟𝑜𝑟| < 0.00001
𝑘!
(−1)𝑘+1
(37) ∑∞
𝑘=1 ; |𝑒𝑟𝑟𝑜𝑟| < 0.005 .
√𝑘
(−1)𝑘+1
(38) ∑∞
𝑘=1 (𝑘+1) ; |𝑒𝑟𝑟𝑜𝑟| < 0.1.
𝐼𝑛 (𝑘+1)
- 303 -
6. POWER SERIES
If 𝑐0 , 𝑐1 , 𝑐2 , … are constants and x is a variable, then a
series of the form
∞
∑ 𝑐𝑘 𝑥 𝑘 = 𝑐0 + 𝑐1 𝑥 + 𝑐2 𝑥 2 + ⋯ + 𝑐𝑘 𝑥 𝑘 + ⋯
𝑘=0
∑ 𝑥𝑘 = 1 + 𝑥 + 𝑥2 + 𝑥3 + ⋯
𝑘=0
∞
𝑥𝑘 𝑥2 𝑥3
∑ =1+𝑥+ + +⋯
𝑘! 2! 3!
𝑘=0
∞
𝑥 𝑘+1 𝑥2 𝑥3 𝑥4
∑ (−1 )𝑘 =𝑥− + −
𝑘+1 2 3 4
𝑘=0
∞
𝑥2 𝑘 𝑥2 𝑥4 𝑥6
∑ (−1 )𝑘 =1− + − +⋯
(2 𝑘) ! 2! 4! 6!
𝑘=0
∞
𝑥 2 𝑘+1 𝑥3 𝑥5 𝑥7
∑ (−1 )𝑘 =𝑥− + − +⋯
(2 𝑘 + 1) ! 3! 5! 7!
𝑘=0
∑ 𝑥𝑘 = 1 + 𝑥 + 𝑥2 + ⋯ + 𝑥𝑘 + ⋯
𝑘=0
Solution.
For every x, the given series is a geometric series with
ratio 𝑟 = 𝑥.By Example 8 of Section 5 in this chapter,
the series converges absolutely if −1 < 𝑥 < 1 and
diverges if|𝑥 | ≥ 1. So, the interval of convergence is
(−1, 1) and the radius of convergence is 𝑅 = 1.
Example 2.
Find the interval of convergence and radius of
𝑥𝑘
convergence of ∑∞
𝑘=0 .
𝑘!
Solution.
We will apply the ratio test for absolute convergence. for
every real number x,
𝑢𝑘+1 𝑥 𝑘+1 𝑘!
𝜌 = lim | | = lim | . 𝑘|
𝑘→+∞ 𝑢𝑘 𝑘→+∞ (𝑘 + 1) ! 𝑥
𝑥
= lim | |=0
𝑘→+∞ 𝑘+1
Since 𝜌 < 1 for all x, the series converges absolutely for
all x. Thus the interval of convergence is(−∞ , +∞) and
the radius of convergence is 𝑅 = +∞ .◄
- 306 -
REMARK.
There is a useful byproduct of Example 2. Since
∞
𝑥𝑘
∑
𝑘!
𝑘=0
∑ 𝑘 ! 𝑥𝑘
𝑘=1
Solution.
If 𝑥 = 0, the series has only one nonzero term and
therefore converges. If 𝑥 ≠ 0, the ratio test yields
𝑢𝑘+1 (𝑘 + 1) ! 𝑥 𝑘+1
𝜌 = lim | | = lim | |
𝑘→+∞ 𝑢𝑘 𝑘→+∞ 𝑘 ! 𝑥𝑘
= lim |(𝑘 + 1) 𝑥 | = +∞
𝑘→+∞
Solution.
Since|(−1)𝑘 | = |(−1)𝑘+1 | = 1 we obtain
𝑢𝑘+1 𝑥 +1 3𝑘 (𝑘 + 1)
𝜌 = lim | | = lim | 𝑘+1 . |
𝑘→+∞ 𝑢𝑘 𝑘→+∞ 3 (𝑘 + 2) 𝑥𝑘
|𝑥 | 𝑘 + 1
= lim [ .( )]
𝑘→+∞ 3 𝑘+2
|𝑥| 1+1/𝑘 |𝑥|
= lim ( )= .
3 𝑘→+∞ 1+2/𝑘 3
- 308 -
∞ ∞ ∞
(−1)𝑘 (−3)𝑘 (−1)𝑘 (−1)𝑘 3𝑘 1
∑ 𝑘 =∑ = ∑
3 (𝑘 + 1) 3𝑘 (𝑘 + 1) 𝑘+1
𝑘=0 𝑘=0 𝑘=0
Solution.
Since|(−1)𝑘 | = |(−1)𝑘+1 | = 1, we have
𝑢𝑘+1 𝑥 2(𝑘+1) (2 𝑘) !
𝜌 = lim | | = lim | . 2𝑘 |
𝑘→+∞ 𝑢𝑘 𝑘→+∞ [2 (𝑘 + 1)] ! 𝑥
- 309 -
𝑥 2 𝑘+2 (2 𝑘) !
= lim | |
𝑘→+∞ (2 𝑘 + 2) ! 𝑥 2 𝑘
𝑥2
= lim | |
𝑘→+∞ (2 𝑘 + 2)(2 𝑘 + 1)
1
= 𝑥 2 lim
𝑘→+∞ (2 𝑘 + 2)(2 𝑘 + 1)
= 𝑥 2 .0 = 0
Thus 𝜌 < 1 for all x, which means that the interval
of convergence is(−∞ , +∞) and the radius of
convergence is 𝑅 = +∞.◄
Theorem 2.
For a power series∑ 𝑐𝑘 (𝑥 − 𝑎)𝑘 , exactly one of the
following is true:
(a) The series converges only for 𝑥 = 𝑎.
(b) The series converges absolutely for all x.
(c) The series converges absolutely for all x in some
finite open interval (𝑎 − 𝑅, 𝑎 + 𝑅) and diverges if 𝑥 <
𝑎 − 𝑅 𝑜𝑟 𝑥 > 𝑎 + R. At the points 𝑥 = 𝑎 − 𝑅 and
𝑥 = 𝑎 + 𝑅, the series may converge absolutely,
converge conditionally, or diverge, depending on the
particular series.
- 310 -
In case (a), (b) and (c) of Theorem 8.2 the series is said to
have radius of convergence, 0, +∞, and R respectively.
The set of all values of x for which the series converges is
called the interval of convergence.
Example 6.
Find the interval of convergence and radius of
convergence of the series
∞
(𝑥 − 5)𝑘
∑
𝑘2
𝑘=1
Solution.
We apply the ratio test for absolute convergence.
𝑢𝑘+1
𝜌 = lim | |
𝑘→+∞ 𝑢𝑘
(𝑥 − 5)𝑘+1 𝑘2
= lim | . |
𝑘→+∞ (𝑘 + 1)2 (𝑥 − 5)𝑘
𝑘 2
= lim [|𝑥 − 5| ( ) ]
𝑘→+∞ 𝑘+1
- 311 -
2
1
= |𝑥 − 5| lim ( 1)
𝑘→+∞ 1+
𝑘
= |𝑥 − 5|.
Thus, the series converges absolutely if|𝑥 − 5| < 1, or
−1 < 𝑥 − 5 < 1, or 4 < 𝑥 < 6. The series diverges
if
𝑥 < 4 or 𝑥 > 6.
To determine the convergence behavior at end
points 𝑥 = 4 and 𝑥 = 6, we substitute these values in
the given series, if 𝑥 = 6 the series becomes
1 𝑘 1 1 1 1
∑∞
𝑘=1 𝑘 2 = ∑∞
𝑘=1 =1+ + + +⋯
𝑘2 22 32 42
- 312 -
Exercise Set (7.6)
In Exercise (1)-(24), find the radius of convergence and
the interval of convergence.
∞ 𝑥𝑘
(1) ∑𝑘=0 ; (2) ∑∞ 𝑘 𝑘
𝑘=0 3 𝑥 ;
𝑘+1
(−1) 𝑥 𝑘 𝑘 𝑘!
(3) ∑∞
𝑘=0 ; (4) ∑∞
𝑘=0 𝑥𝑘;
𝑘! 2𝑘
5𝑘 𝑥𝑘
(5) ∑∞
𝑘=1
𝑘
2𝑥 ; (6) ∑∞
𝑘=1 ;
𝑘 𝑙𝑛 𝑘
𝑥𝑘 (−2)𝑘 𝑥 𝑘+1
(7) ∑∞
𝑘=1 ; (8) ∑∞
𝑘=0 ;
𝑘 (𝑘+1) 𝑘+1
𝑥𝑘 (−1)𝑘 𝑥 2𝑘
(9) ∑∞
𝑘=1(−1)
𝑘−1
; (10) ∑∞
𝑘=0 ;
√𝑘 (2𝑘) !
𝑥 2𝑘+1 𝑥 3𝑘
(11) ∑∞ 𝑘
𝑘=0(−1) (2𝑘+1) ; (12) ∑∞
𝑘=0(−1)
𝑘
;
! 𝑘 3/2
3 𝑘 𝑥𝑘
(13) ∑∞ 𝑘
𝑘=0 𝑘 ! 𝑥 ; (14) ∞
∑𝑘=2(−1) 𝑘+1
;
𝑘 (𝑙𝑛 𝑘)2
𝑥𝑘 (𝑥−3)𝑘
(15) ∑∞
𝑘=0 2; (16) ∑∞
𝑘=0 ;
1+𝑘 2𝑘
(𝑥+1)𝑘 (𝑥−4)𝑘
(17) ∑∞
𝑘=1(−1)
𝑘+1
; (18)∑∞ 𝑘
𝑘=0(−1) (𝑘+1)2 ;
𝑘
3 𝑘 (2𝑘+1) !
(19) ∑∞ 𝑘 ∞
𝑘=0 (4) (𝑥 + 5) ; (20) ∑𝑘=1 (𝑥 − 2)𝑘 ;
𝑘3
(𝑥+1)2𝑘+1 (𝐼𝑛 𝑘)(𝑥−3)𝑘
(21) ∑∞
𝑘=1(−1)
𝑘
; (22) ∑∞
𝑘=1 ;
𝑘 2 +4 𝑘
𝜋𝑘 (𝑥−1)2 𝑘 (2 𝑥−3)𝑘
(23) ∑∞
𝑘=0 ; (24) ∑∞
𝑘=1 .
(2 𝑘+1) ! 4 2𝑘
- 313 -
(25) Use the root test to find the interval of
𝑥𝑘
convergence of ∑∞
𝑘=2 .
(𝑙𝑛 𝑘)𝑘
where 𝑏 > 0.
(27) Find the radius of convergence of the power
(𝑝𝑘)!
series ∑∞
𝑘=0 (𝑘 𝑥 𝑘 , where p is a positive integer.
!)𝑝
integers.
(29) Prove: If lim |𝑐𝑘 |1/𝑘 = 𝐿 , where 𝐿 ≠ 0, then
𝑘→+∞
convergence 1/𝐿.
- 314 -
7. TAYLOR AND MACLAURIN SERIES
In this section we will study the approximation of
function by polynomials and introduce an important class
of power series.
Suppose we are interested in approximating a
function f by a polynomial
𝑝(𝑥 ) = 𝑐0 + 𝑐1 𝑥 + ⋯ + 𝑐𝑛 𝑥 𝑛 (1)
over an interval centered at 𝑥 = 0. Because 𝑝(𝑥) has
𝑛 + 1 coefficients, it seems reasonable that we will be
able to impose 𝑛 + 1 conditions on this polynomial. We
will assume that the first n derivatives of f exist at 𝑥 =
0, and we will choose these 𝑛 + 1 conditions to be:
𝑓(0) = 𝑝(0), 𝑓 ′ (0) = 𝑝′ (0), 𝑓 ′′ (0) = 𝑝′′ (0), …,
𝑓 (𝑛) (0) = 𝑝(𝑛) (0) (2)
These conditions require that the value of 𝑝(𝑥) and its
- 315 -
𝑝(𝑥) will remain close over some interval (possibly
𝑝(𝑥 ) = 𝑐0 + 𝑐1 𝑥 + 𝑐2 𝑥 2 + 𝑐3 𝑥 3 + ⋯ + 𝑐𝑛 𝑥 𝑛
𝑝′ (𝑥 ) = 𝑐1 + 2𝑐2 𝑥 + 3𝑐3 𝑥 2 + ⋯ + 𝑛𝑐𝑛 𝑥 𝑛−1
𝑝′′ (𝑥 ) = 2𝑐2 + 3.2𝑐3 𝑥 + ⋯ + 𝑛(𝑛 − 1)𝑐𝑛 𝑥 𝑛−2
𝑝′′′ (𝑥 ) = 3.2𝑐3 + ⋯ + 𝑛(𝑛 − 1)(𝑛 − 2)𝑐𝑛 𝑥 𝑛−3
⋮
𝑝(𝑛) (𝑥 ) = 𝑛(𝑛 − 1)(𝑛 − 2) … (1)𝑐𝑛
we obtain on substituting 𝑥 = 0
𝜌(0) = 𝑐0
𝑝′ (0) = 𝑐1
𝑝′′ (0) = 2𝑐2 = 2! 𝑐2
𝑝′′′ (0) = 3 .2𝑐2 = 3! 𝑐3
⋮
𝑝(𝑛) (0) = 𝑛(𝑛 − 1)(𝑛 − 2) … 𝑐𝑛 = 𝑛! 𝑐𝑛
Thus, from (2),
𝑓(0) = 𝑐0
⋮
𝑓 ′ (0) = 𝑐1
𝑓 ′′ (0) = 2! 𝑐2
- 316 -
𝑓 ′′′ (0) = 3! 𝑐3
⋮
𝑓 (𝑛) (0) = 𝑛 ! 𝑐𝑛
So
′(
𝑓 ′′ (0) 𝑓 ′′′ (0)
𝑐0 = 𝑓(0), 𝑐1 = 𝑓 0), 𝑐2 = , 𝑐3 = , … , 𝑐𝑛
2! 3!
𝑓 (𝑛) (0)
=
𝑛!
Substituting these values in (1) yields a polynomial,
called the nth Maclaurin polynomial for f.
Definition 1.
If f can be differentiated n times at 0, then we define the
nth Maclaurin polynomial for f to be
𝑓 ′ (0) 𝑓 ′′ (0) 2 𝑓 ′′′ (0) 3
𝑝𝑛 (𝑥 ) = 𝑓(0) + 𝑥+ 𝑥 + 𝑥 +⋯
1! 2! 3!
𝑓 (𝑛) (0) 𝑛
+ 𝑥
𝑛!
(3)
This polynomial has the property that its value and the
values of its first n derivatives match the value of 𝑓(𝑥)
and its first n derivatives when 𝑥 = 0.
- 317 -
Example 1.
Find the Maclaurin polynomials 𝑝0 , 𝑝1 , 𝑝2 , 𝑝3 and𝑝𝑛 for
𝑒𝑥.
Solution.
Let 𝑓(𝑥) = 𝑒 𝑥 .
Thus 𝑓 ′ (𝑥 ) = 𝑓 ′′ (𝑥 ) = ⋯ = 𝑓 (𝑛) (𝑥 ) = 𝑒 𝑥 .
and 𝑓(0) = 𝑓 ′ (0) = 𝑓 ′′ (0) = ⋯ = 𝑓 (𝑛) = 𝑒 0 = 1.
Therefore
𝑝0 (𝑥 ) = 𝑓 (0) = 1;
𝑝1 (𝑥 ) = 𝑓 (0) + 𝑓 ′ (0) = 1 + 𝑥;
𝑓 ′′ (0) 𝑥2
𝑝2 (𝑥 ) = 𝑓 (0) + 𝑓 ′ (0)𝑥 + 𝑥2 = 1 + 𝑥 + =1+
2! 2!
1
𝑥 + 𝑥 2;
2
′(
𝑓 ′′ (0) 2 𝑓 ′′′ (0) 3
𝑝3 (𝑥 ) = 𝑓 (0) + 𝑓 0) + 𝑥 + 𝑥
2! 3!
𝑥2 𝑥3 1 1
=1+𝑥+ + = 1 + 𝑥 + 𝑥2 + 𝑥3;
2! 3! 2 6
′(
𝑓 ′′ (0) 2 𝑓 (𝑛) (0) 𝑛
𝑝𝑛 (𝑥 ) = 𝑓(0) + 𝑓 0) + 𝑥 +⋯+ 𝑥
2! 𝑛!
𝑥2 𝑥𝑛
=1+𝑥+ +⋯+ ;
2! 𝑛!
- 318 -
In the figure we have sketched the graphs of 𝑒 𝑥 and
its Maclaurin polynomials of degree 1, 2, and 3. Note that
the graphs of 𝑒 𝑥 and𝑝3 (𝑥) are virtually indistinguishable
over the interval from −0.5 𝑡𝑜 + 0.5.
Example 2.
Find the nth Maclaurin polynomial for 𝑙𝑛 (𝑥 + 1).
Solution.
Let 𝑓(𝑥) = 𝑙𝑛 (𝑥 + 1) and arrange the computations as
follows:
𝑓(𝑥 ) = ln (𝑥 + 1) ⟹ 𝑓(0) = ln 1 = 0;
1
𝑓 ′ (𝑥 ) = ⟹ 𝑓 ′ (0) = 1;
𝑥+1
1
𝑓 ′′ (𝑥 ) = − ⟹ 𝑓 ′′ (0) = −1;
(𝑥+1)2
- 319 -
2
𝑓 ′′′ (𝑥 ) = (𝑥+1)3 ⟹ 𝑓 ′′′ (0) = 2;
3 .2
𝑓 4 (𝑥) = (𝑥+1)4 ⟹ 𝑓 (4) (0) = −3 !;
4 .3 .2
𝑓 (5) (𝑥) = (𝑥+1)5 ⟹ 𝑓 (5) (0) = 4 !;
⋮ ⋮
(𝑛 − 1) !
𝑓 (𝑛) = (−1)𝑛+1 ⟹
(𝑥 + 1)𝑛
𝑓 (𝑛) (0) = (−1)𝑛+1 (𝑛 − 1) !.
Substituting these values in (3) yields
𝑥2 𝑥3 𝑥𝑛
𝑝𝑛 (𝑥 ) = 𝑥 − + − ⋯ + (−1)𝑛+1 .◄
2 3 𝑛
Example 3.
In the Maclaurin polynomials for sin 𝑥, only the odd
powers of x appear explicitly. To see this, let 𝑓(𝑥) = sin
x, thus
𝑓 (𝑥 ) = sin 𝑥 ⟹ 𝑓(0) = 0
𝑓′(𝑥) = cos 𝑥 ⟹ 𝑓′(0) = 1
𝑓"(𝑥) = − sin 𝑥 ⟹ 𝑓"(0) = 0
𝑓 ′′′ (𝑥 ) = − cos 𝑥 ⟹ 𝑓 ′′′ (0) = −1
Since𝑓 (4) (𝑥) = sin 𝑥 = 𝑓(𝑥), the pattern 0, 1, 0, −1
will repeat over and over as we evaluate successive
- 320 -
derivatives at 0. Therefore, the successive Maclaurin
polynomials for sin x are
𝑝1 (𝑥) = 0 + 𝑥 = 𝑥 ;
𝑝2 (𝑥) = 0 + 𝑥 + 0 = 𝑥 ;
𝑥3 𝑥3
𝑝3 (𝑥 ) = 0 + 𝑥 + 0 − =𝑥− ;
3! 3!
𝑥3 𝑥3
𝑝4 (𝑥 ) = 0 + 𝑥 + 0 − +0=𝑥− ;
3! 3!
𝑥3 𝑥5 𝑥3 𝑥5
𝑝5 (𝑥 ) = 0 + 𝑥 + 0 − +0+ =𝑥− + ;
3! 5! 3! 5!
𝑥3 𝑥5 𝑥3 𝑥5
𝑝6 (𝑥 ) = 0 + 𝑥 + 0 − +0+ +0=𝑥− + ;
3! 5! 3! 5!
𝑥3 𝑥5 𝑥7
𝑝7 (𝑥 ) = 0 + 𝑥 + 0 − + 0 + + 0 −
3! 5! 7!
𝑥3 𝑥5 𝑥7
=𝑥− + −
3! 5! 7!
In general, the Maclaurin polynomials for sin x are
𝑥3 𝑥5 𝑥7
𝑝2𝑛+1 (𝑥 ) = 𝑝2𝑛+2 (𝑥 ) = 𝑥 − + − + ⋯+
3! 5! 7!
𝑥 2𝑛+1
(−1)𝑛 , (𝑛 = 0, 1, 2, … ).◄
(2𝑛+1)!
Example 4.
The successive Maclaurin polynomials for cos 𝑥 are
𝑝0 (𝑥 ) = 𝑝1 (𝑥 ) = 1;
- 321 -
𝑥2
𝑝2 (𝑥 ) = 𝑝3 (𝑥 ) = 1 − ;
2!
𝑥2 𝑥4
𝑝4 (𝑥 ) = 𝑝5 (𝑥 ) = 1 − =𝑥− ;
2! 4!
𝑥2 𝑥4 𝑥6
𝑝6 (𝑥 ) = 𝑝7 (𝑥 ) = 1 − + − ;
2! 4! 6!
𝑥2 𝑥4 𝑥6 𝑥8
𝑝8 (𝑥 ) = 𝑝9 (𝑥 ) = 1 − + − + .
2! 4! 6! 8!
(𝑛 = 0, 1, 2, … ).◄
Definition 2.
If f can be differentiated n times at a, then we define the
nth Taylor polynomials for f about 𝑥 = 𝑎 to be
′(
𝑓 ′′ (𝑎)
𝑝𝑛 (𝑥 ) = 𝑓(𝑎) + 𝑓 𝑎)(𝑥 − 𝑎) + (𝑥 − 𝑎)2
2!
𝑓 ′′′ (𝑎) 𝑓 𝑛 (𝑎)
+ (𝑥 − 𝑎)3 + ⋯ + (𝑥 − 𝑎)𝑛 (5)
3! 𝑛!
- 322 -
Solution.
Let 𝑓(𝑥 ) = sin 𝑥 . Thus
𝜋 √3
𝑓(𝑥 ) = sin 𝑥 ⇒ 𝑓(𝜋/3) = sin = ;
3 2
𝜋 1
𝑓 ′ (𝑥 ) = cos 𝑥 ⇒ 𝑓(𝜋/3) = cos = ;
3 2
𝜋 √3
𝑓 ′′ (𝑥 ) = − sin 𝑥 ⇒ 𝑓 ′′ (𝜋/3) = − sin = − ;
3 2
𝜋 1
𝑓 ′′′ (𝑥 ) = − cos 𝑥 ⇒ 𝑓 ′′′ (𝜋/3) = − cos = − .
3 2
- 323 -
Frequently, it is convenient to express the defining
formula for the Taylor polynomials in sigma notation. To
do this, we use the notation 𝑓 (𝑘) (𝑎) to denote the kth
derivative of f at 𝑥 = 𝑎, and we make the added
convention that 𝑓 (0) (𝑎) denotes 𝑓(𝑎). This enables us to
write
𝑓 (𝑘) (𝑎)
∑𝑛𝑥=0 (𝑥 − 𝑎)𝑘 = 𝑓(𝑎) + 𝑓 ′ (𝑎)(𝑥 − 𝑎)
𝑘!
Definition 3.
If f derivatives of all orders at a, then we define the
Taylor Series for f about 𝑥 = 𝑎 to be
∞
𝑓 (𝑘) (𝑎)
∑ (𝑥 − 𝑎)𝑘 = 𝑓(𝑎) + 𝑓 ′ (𝑎)(𝑥 − 𝑎)
𝑘!
𝑘=0
𝑓 ′′ (𝑎) 𝑓 (𝑘) (𝑎)
+ (𝑥 − 𝑎)2 + ⋯ + (𝑥 − 𝑎)𝑘 + ⋯ (6)
2! 𝑘!
- 324 -
Definition 4.
If f has derivatives of all orders at 0, then we define the
Maclaurin series for f to be
∞
𝑓 (𝑘) (0) 𝑘
∑ 𝑥
𝑘!
𝑘=0
𝑓 ′′ (0) 2
′(
= 𝑓(0) + 𝑓 0)𝑥 + 𝑥 +⋯
2!
𝑓 (𝑘) (0) 𝑘
+ 𝑥 + ⋯ (7)
𝑘!
Observe that the Maclaurin series for f is just the Taylor
series for f about 𝑎 = 0.
Example 6.
In Example 1, we found the nth Maclaurin polynomial
for 𝑒 𝑥 to be
𝑥 𝑘 𝑥2 𝑥𝑛
∑∞
𝑘=0 𝑘 ! =1+𝑥+ + ⋯+
2! 𝑛!
- 325 -
and from Example 4 the Maclaurin series for cos x is
∞
𝑥2 𝑘 𝑥2 𝑥4 𝑥6
∑ (−1 )𝑘 =1− + − +⋯
(2𝑘) ! 2! 4! 6!
𝑘=0
Example 7.
Find the Taylor series about x = 1 for 1/x.
Solution.
Let 𝑓(𝑥) = 1 / 𝑥 so that
1
𝑓(𝑥 ) = ⇒ 𝑓(1) = 1;
𝑥
1
𝑓 ′ (𝑥 ) = − ⇒ 𝑓 ′ (1) = −1;
𝑥2
2
𝑓 ′′ (𝑥 ) = ⇒ 𝑓 ′′ (1) = 2 !;
𝑥3
3 .2
𝑓 ′′′ (𝑥 ) = − ⇒ 𝑓 ′′′ (1) = −3 !;
𝑥4
4 .3 .2
𝑓 (4) (𝑥 ) = ⇒ 𝑓 (4) (1) = 4 !;
𝑥5
⋮ ⋮
𝑘!
𝑓 (𝑘) (𝑥 ) = (−1)𝑘 ⇒ 𝑓 (𝑘) (1) = (−1)𝑘 𝑘 !;
𝑥 𝑘+1
⋮ ⋮
Thus, substituting in (6) with 𝑎 = 1 yields
(−1)𝑘 𝑘 !
∑∞
𝑘=0 (𝑥 − 1)𝑘 = ∑∞ 𝑘
𝑘=0(−1) (𝑥 − 1)
𝑘
𝑘!
= 1 − (𝑥 − 1) + (𝑥 − 1)2 − (𝑥 − 1)3 + ⋯◄
- 326 -
Exercise Set (7.7)
In Exercise (1)-(12), find the fourth Maclaurin
polynomial (𝑛 = 4) for the given function.
1
(1) 𝑒 −2𝑥 ; (2) ;
1+𝑥
- 327 -
𝑥
(27) ln(1 + 𝑥);(28)sin 𝜋 𝑥; (29)cos ( );
2
(34) 𝑒 𝑘 , 𝑎 = 2; (35) ln 𝑥 , 𝑎 = 1;
𝜋 1
(36) cos 𝑥 , 𝑎 = ; (37) sin 𝜋 𝑥 , 𝑎 = ;
2 2
1
(38) , 𝑎 = 3; (39) sinh 𝑥 𝑥 , 𝑎 = ln 4.
𝑥+2
𝑓 (𝑛) (𝑎)
(40) Prove that the value of𝑝𝑛 (𝑥 ) = ∑𝑛𝑘=0 (𝑥 −
𝑘!
- 328 -
CHAPTER (VIII)
DIFFERENTIAL EQUATION
- 329 -
CHAPTER (VIII)
DIFFERENTIAL EQUATION
1. Introduction
The words differential and equations clearly indicate
solving some kind of equation involving derivatives.
Differential equations are interesting and important
because they express relationships involving rates of
change. Such relationships form the basis for developing
ideas and studying phenomena in the sciences,
economics, engineering, finance, medicine and in short
without any exaggeration every aspect of human
knowledge.
2. Definitions and Terminology
Definition 1.
equation (DE).
- 330 -
Definition 2.
𝜕2 𝑢 𝜕2 𝑢 𝜕𝑢 𝜕𝑢 𝜕𝑣
= −2 and =−
𝜕𝑥 2 𝜕𝑡 2 𝜕𝑡 𝜕𝑦 𝜕𝑥
- 331 -
Definition 4.
The order of a differential equation (ODE or PDE) is the
order of the highest derivative in the equation.
Example 3.
(i) Order of the differential equation
𝑑2 𝑦 𝑑𝑦 4
+ 5 ( ) − 4𝑦 = 𝑒 𝑥 is 2.
𝑑𝑥 2 𝑑𝑥
Definition 5.
The degree of a differential equation is the degree of the
highest order derivative in the equation.
Example 4.
(i) The degree of ODE
𝑑𝑦 𝑑𝑦 2
𝑦 − √𝑥 ( ) − 5 = 0 is 2.
𝑑𝑥 𝑑𝑥
Remarks 1.
(i) Very often notation 𝑦 ′ , 𝑦 ′′ , . . . , 𝑦 (𝑛) are respectively
𝑑𝑦 𝑑2 𝑦 𝑑3 𝑦 𝑑𝑛 𝑦
used for , 2, 3,…,
𝑑𝑥 𝑑𝑥 𝑑𝑥 𝑑𝑥 𝑛
- 332 -
(ii) In symbols we can express an nth order ordinary
differential equation in one dependent variable by the
general form
𝐹(𝑥, 𝑦, 𝑦 ′ , 𝑦 ′′ , . . . , 𝑦 (𝑛) ) = 0 (1)
where F is a real-valued function of 𝑛 + 2 variables
𝑥, 𝑦, 𝑦 ′ , 𝑦 ′′ , . . . , 𝑦 (𝑛) .
Definition 6.
An n th-order ordinary differential equation is said to be
linear in y if it can be written in the form
𝑎𝑛 (𝑥)𝑦 (𝑛) + 𝑎𝑛−1 (𝑥)𝑦 (𝑛−1) + . . . + 𝑎1 (𝑥)𝑦’ + 𝑎0 (𝑥)𝑦
= 𝑓(𝑥)
where𝑎0 , 𝑎1 , 𝑎2 , … , 𝑎𝑛 and f are functions of x on some
interval, and 𝑎𝑛 (𝑥) 0 on that interval. The functions
𝑎𝑘 (𝑥), 𝑘 = 0, 1, 2, . . . . , 𝑛 are called the coefficient
functions. A differential equation that is not linear is
called non-linear.
Example 5.
(i)𝑦 ′′ = 4𝑦 ′ + 3𝑦 − 𝑥 4 and 𝑥𝑦′′ + 𝑦𝑒 𝑥 + 6 = 0
are linear differential equations.
𝑑3 𝑦 𝑑𝑦
(ii) (𝑦 − 𝑥)𝑑𝑥 + 4𝑥𝑑𝑦 = 0, 𝑥 3 − 4𝑥 + 12𝑦 = 𝑒 𝑥
𝑑𝑥 3 𝑑𝑥
- 333 -
𝑦’‘ − 4𝑦’ + 𝑦 = 0
are linear differential equations.
𝑥 𝑑2 𝑦
(iii) (1 + 𝑦)𝑦’ + 2𝑦 = 𝑒 , + cos 𝑦 = 0,
𝑑𝑥 2
𝑑3 𝑦
+ 𝑦 2 = 0are non-linear.
𝑑𝑥 3
Remark 2.
An ordinary differential equation is linear if the following
conditions are satisfied.
(i) The unknown function and its derivatives occur in the
first degree only.
(ii) There are no products involving either the unknown
function and its derivatives or two or more derivates.
(iii) There are no transcendental functions involving the
unknown function or any of its derivatives.
Definition 7.
(i) A solution or a general solution of an nth-order
differential equation of the form (1) on an interval 𝐼 =
[𝑎, 𝑏] = {𝑥ℝ: 𝑎 ≤ 𝑥 ≤ 𝑏} is any function possessing
all the necessary derivatives, which when substituted
for𝑥, 𝑦, 𝑦 ′ , 𝑦 ′′ , . . . , 𝑦 (𝑛) , reduces the differential equation
to an identity. In other words, an unknown function is a
- 334 -
solution of a differential equation if it satisfies the
equation.
(ii) A solution of a differential equation of order n will
have n independent arbitrary constants. Any solution
obtained by assigning particular numerical values to some
or all of the arbitrary constants is called a particular
solution.
(iii) A solution of a differential equation that is not
obtainable from a general solution by assigning particular
numerical values is called a singular solution.
(iv) A real function 𝑦 = (𝑥) is called an explicit
solution of the differential equation
𝐹(𝑥, 𝑦, 𝑦 ′ , 𝑦 ′′ , . . . , 𝑦 (𝑛) ) = 0on [𝑎, 𝑏]
if𝐹(𝑥, (𝑥), ’(𝑥), . . . , (𝑛) (𝑥) ) = 0 on [𝑎, 𝑏].
(v) A relation 𝑔(𝑥, 𝑦) = 0 is called an implicit solution
of the differential equation 𝐹(𝑥, 𝑦, 𝑦 ′ , 𝑦 ′′ , . . . , 𝑦 (𝑛) ) =
0on [𝑎, 𝑏] if 𝑔(𝑥, 𝑦) = 0 defines at least one real
function f on [𝑎, 𝑏] such that 𝑦 = 𝑓(𝑥) is an explicit
solution on this interval.
We now illustrate these concepts through the following
examples:
- 335 -
Example 6.
(i) 𝑦 = 𝑐1 𝑒 𝑥+𝑐2 is a solution of the equation
𝑦’’ − 𝑦 = 0.
This ODE is of order 2 and so its solution involves 2
arbitrary constants c1 and c2. It is clear that 𝑦 ′ = 𝑐1 𝑒 𝑥+𝑐2 ,
𝑦′′ = 𝑐1 𝑒 𝑥+𝑐2 and so 𝑐1 𝑒 𝑥+𝑐2 − 𝑐1 𝑒 𝑥+𝑐2 = 0.
Hence𝑦 = 𝑐1 𝑒 𝑥+𝑐2 is a general solution or simply a
solution.
(ii)𝑦 = 𝑐𝑒 2𝑥 is a solution of ODE𝑦’ − 2𝑦 = 0,
because 𝑦′ = 2𝑐𝑒 2𝑥 and 𝑦 = 𝑐𝑒 2𝑥 satisfy the ODE. Since
given ODE is of order 1, solution contains only one
constant.
1
(iii) 𝑦 = 𝑐𝑥 + 𝑐 2 is a solution of the equation
2
1
(𝑦’)2 + 𝑥𝑦’ – 𝑦 = 0.
2
- 336 -
To check the validity, we compute 𝑦’ and 𝑦’’ and put
values in these equation.
𝑦 ′ = 2𝑐1 𝑒 2𝑥 − 𝑐2 𝑒 −𝑥 , 𝑦′′ = 4𝑐1 𝑒 2𝑥 + 𝑐2 𝑒 −𝑥
L.H.S. of the given ODE is =
(4𝑐1 𝑒 2𝑥 + 𝑐2 𝑒 −𝑥 ) − (2𝑐1 𝑒 2𝑥 − 𝑐2 𝑒 −𝑥 )
− 2(𝑐1 𝑒 2𝑥 + 𝑐2 𝑒 −𝑥 ) = 0
Example 7.
(i) Choosing𝑐 = 1 we get a particular solution of
differential equation considered in Example 6(iii).
(ii) For 𝑐1 = 1 we get a particular solution of differential
equation in Example 6(i) that is,𝑦 = 𝑒 𝑥+𝑐2 is a particular
solution of𝑦′′ − 𝑦 = 0.
Example 8.
1
(i) 𝑦 = − 𝑥 2 is a singular solution of differential equation
2
in Example 6(iii).
(ii)𝑦 = 0 is a singular solution of 𝑦’ = 𝑥𝑦1/2 .
Verification.
1
(i) 𝑦 = − 𝑥 2 is not obtainable from the general solution𝑦 =
2
1
𝑐𝑥 + 𝑐 2 . However, it is a solution of the given differential
2
- 339 -
if it defines a real function on (−, 4). Solving the
equation 𝑦 2 + 𝑥 − 4 = 0for y, we get𝑦 =
±√4 − 𝑥.Since both𝑦1 = √4 − 𝑥 and𝑦2 = −√4 − 𝑥
and their derivatives are functions defined for all x in the
interval (−, 4), we conclude that 𝑦 2 + 𝑥 − 4 = 0 is
an implicit solution on this interval. ◄
Remark 3.
It is very pertinent to note that a relation 𝑔(𝑥, 𝑦) = 0
can reduce a differential to an identity without
constituting an implicit solution of the differential
equation. For example 𝑥 2 + 𝑦 2 + 1 = 0satisfies𝑦𝑦’ +
𝑥 = 0, but it is not an implicit solution as it does not
define a real-valued function. This is clear from the
solution of the equation 𝑥 2 + 𝑦 2 + 1 = 0 or𝑦 =
±√4 − 𝑥 2 , imaginary number.The relation 𝑥 2 + 𝑦 2 +
1 = 0is called a formal solution of 𝑦𝑦’ + 𝑥 = 0. That is
it appears to be a solution. Very often we look for a
formal solution rather than an implicit solution.◄
- 340 -
♠ Formation of a differential Equation
Let us consider an equation containing n arbitrary
constants. Then by differentiating it successively n times
we get n more equations containing n arbitrary constants
and derivatives. Now by eliminating n arbitrary constants
from the above (𝑛 + 1) equations and obtaining an
equation which involves derivatives up to the nth order,
we get a differential equation of order n. The concept of
obtaining differential equations from a family of curves is
illustrated in following examples.
Example 11.
Find the differential equation of the family curves𝑦 =
𝑐𝑒 2𝑥 .
Solution.
Given 𝑦 = 𝑐𝑒 2𝑥 . Differentiating we get
𝑦 ′ = 2𝑐𝑒 2𝑥 = 2𝑦
or
𝑦’ − 2𝑦 = 0 (3)
Thus, arbitrary constant c is eliminated and equation (3)
is the required equation of the family of curves given by
equation (2).◄
- 341 -
Example 12.
Find the differential equation of the family of curves
𝑦 = 𝑐1 cos 𝑥 + 𝑐2 sin 𝑥 (4)
Solution.
Differentiating (4) twice we get
𝑦 ′ = −𝑐1 sin 𝑥 + 𝑐2 cos 𝑥 (5)
𝑦 ′′ = − 𝑐1 cos 𝑥 − 𝑐2 sin 𝑥 (6)
c1 and c2 can be eliminated from (4) and (6) and we
obtain the different equation
𝑦’’ + 𝑦 = 0 (7)
Therefore (7) is the differential equation of the family of
curves given by (4).◄
Example 13.
Find the differential equation which has
𝑦 = 𝑐1 𝑒 𝑥 + 𝑐2 𝑒 −𝑥 + 3𝑥
as its general solution.
Solution.
Differentiate the given expression twice
𝑦 = 𝑐1 𝑒 𝑥 + 𝑐2 𝑒 −𝑥 + 3𝑥(8)
𝑦 / = 𝑐1 𝑒 𝑥 − 𝑐2 𝑒 −𝑥 + 3(9)
𝑦 // = 𝑐1 𝑒 𝑥 + 𝑐2 𝑒 −𝑥 (10)
- 342 -
Eliminate 𝑐1 and 𝑐2 by subtracting (10) from (8) to obtain
𝑦 − 𝑦 // = 3𝑥 gives which is the desired differential
equation.Note that the desired differential equation is free
from the arbitrary constants.◄
3. Initial-Value and Boundary-Value Problems
A general solution of an nth order ODE contains n
arbitrary constants. To obtain a particular solution, we are
required to specify n conditions on solution function and
its derivatives and thereby expect to find values of n
arbitrary constants. There are two well known methods
for specifying auxiliary conditions. One is called initial
conditions and other is said to be boundary conditions.
It may be observed that an ODEdoes not have solution or
unique solution. However, by imposing initial and
boundary conditions uniqueness can be ensured for
certain classes of differential equations.
Definition 1. (initial-Value Problem)
If the auxiliary conditions for a given differential
equation relate to a single x value, the conditions are
called initial conditions. The differential equation with its
initial conditions is called an initial-value problem.
- 343 -
Definition 2. (boundary-value problem)
If the auxiliary conditions for a given differential
equation relate to two or more x values, the conditions are
called boundary conditions or boundary values. The
differential equation with its boundary conditions is
called boundary-value problem.
Example 1.
- 344 -
(i) 𝑦′ + 𝑦 = 3, 𝑦(0) = 2 is a first-order initial value
problem. Order of initial value problem is nothing but
order of the given equation. 𝑦(0) = 2 is an initial
condition.
(ii) 𝑦’’ + 2𝑦 = 0, 𝑦(1) = 2, 𝑦’(1) = −3 is a second-
order initial value problem. Initial conditions are 𝑦(1) =
2 and 𝑦’(1) = −3. Values of function 𝑦(𝑥) and its
derivative are specified for value 𝑥 = 1.
(iii) 𝑦’’ − 𝑦’ + 𝑦 = 𝑥 3 , 𝑦(0) = 4, 𝑦’(1) = −2 is a
second-order boundary-value problem.
Boundary conditions are specified at two points namely
𝑥 = 0 and 𝑥 = 1.
One may specify boundary conditions for different values
of x say 𝑥 = 2 and 𝑥 = 5. In this case the boundary-
value problem is
𝑦’’ − 𝑦’ + 𝑦 = 𝑥 3 , 𝑦(2) = 4, 𝑦’(5) = −2.◄
● The following questions are quite pertinent as boundary
value and initial value problems represent important
phenomena in nature:
- 345 -
Problem 1.
When does a solution exist? That is, does an initial-value
problem or a boundary value problem necessarily have a
solution?
Problem 2.
Is a known solution unique? That is, is there only one
solution of an initial-value problem or a boundary-value
problem?
The following theorem states that under the specified
conditions, a first-order initial-value problem has a
unique solution.
Theorem 1.
𝜕𝑓
Let f and 𝑓𝑦 ( ) be continuous functions of x and y in
𝜕𝑦
- 346 -
Example 2.
(i) 𝑦 = 3 𝑒 𝑥 is a solution of the initial-value
problem𝑦’ = 𝑦, 𝑦(0) = 3
This means that the solution of the differential equation
𝑦’ = 𝑦passes through the point (0, 3).
(ii) Find a solution of the initial-value problem 𝑦’ = 𝑦,
𝑦(1) = −2. That is, find a solution of differential
equation 𝑦’ = 𝑦 which passes through the point (1, −2).
Verification.
(i) Let 𝑦 = 𝑐 𝑒 𝑥 , where c is an arbitrary constant.
Then 𝑦′ = 𝑐 𝑒 𝑥 = 𝑦.Thus, 𝑦 = 𝑐 𝑒 𝑥 is a general
solution of the given equation 𝑦’ = 𝑦.
By applying initial condition, we get 3 = 𝑦(0) =
𝑐 𝑒 0 = 𝑐 or c = 3. Therefore 𝑦 = 3 𝑒 𝑥 is a solution of
the given initial value problem.
(ii) As seen in part (i) 𝑦 = 𝑐 𝑒 𝑥 is a solution of the
given equation. By imposing given initial condition we
get −2 = 𝑦(1) = 𝑐 𝑒 1 or 𝑐 = −2/𝑒.
2
Therefore𝑦 = − 𝑒 𝑥 = −2𝑒 𝑥−1 is a solution of the
𝑒
initial-value problem.◄
- 347 -
Example 3.
𝑑𝑦
= 𝑥𝑦1/2 , 𝑦(0) = 0
𝑑𝑥
has at least two solutions, namely 𝑦 = 0 and
𝑦 = 𝑥 4 /16.
Example 4.
(i) Does a solution of the boundary value problem 𝑦’’ +
𝑦 = 0, 𝑦(0) = 0, 𝑦() = 2 exist?
(ii) Show that the boundary value problem
𝑦’’ + 𝑦 = 0, 𝑦(0) = 0, 𝑦() = 0
Has infinitely many solutions.
Solution.
(i)𝑦 = 𝑐1 cos 𝑥 + 𝑐2 sin 𝑥 is a solution of the
differential equation 𝑦’’ + 𝑦 = 0.Using given boundary
conditions in𝑦 = 𝑐1 cos 𝑥 + 𝑐2 sin 𝑥, we get
0 = 𝑐1 cos 0 + 𝑐2 sin 0
and
2 = 𝑐1 cos 𝜋 + 𝑐2 sin 𝜋
The first equation yields 𝑐1 = 0 and the second yields
𝑐1 = −2 which is absurd, hence no solution exists.
- 348 -
(ii) The boundary values yield
0 = 𝑐1 cos 0 + 𝑐2 sin 0
and
0 = 𝑐1 cos 𝜋 + 𝑐2 sin 𝜋
Both of these equations lead to the fact that 𝑐1 = 0. The
constant c2 is not assigned a value and therefore takes
arbitrary values. Thus, there are infinitely many solutions
represented by 𝑦 = 𝑐2 sin 𝑥.◄
Example 5.
Examine existence and uniqueness of a solution of the
following initial-value problems:
𝑦
(i) 𝑦 ′ = , 𝑦(2) = 1;
𝑥
𝑦
(ii) 𝑦 ′ = , 𝑦(0) = 3;
𝑥
- 349 -
𝜕𝑓
Hence f and satisfy the conditions of Theorem 1 in any
𝜕𝑦
- 350 -
4. Solutions of First Order Differential Equations
The systemic development of techniques for solving
differential equations logically begins with the equations
of the first order and first degree.
Equations of this type can in general, be written as
𝑑𝑦
= 𝑓(𝑥, 𝑦), (1)
𝑑𝑥
- 351 -
𝑒 3𝑥 2𝑥 3
𝑦= + + 𝑠𝑖𝑛 𝑥 + 𝐴𝑥 + 𝐵
3 3
Note that integrating twice has introduced the 2 arbitrary
constants A and B.◄
𝒅𝒏 𝒚
♣ If you have a DE of the form = 𝒇(𝒙)then you can,
𝒅𝒙𝒏
- 352 -
Example 1.
𝑑 𝑦 𝑦+1
Solve the equation =
𝑑𝑥 𝑥−1
Solution.
Rewriting the given equation, we get:
𝑑𝑦 𝑑𝑥
∫ =∫
𝑦+1 𝑥−1
or 𝑙𝑜𝑔𝑒 ( 𝑦 + 1) = 𝑙𝑜𝑔𝑒 ( 𝑥 − 1) + 𝑙𝑜𝑔𝑒 𝐶
where 𝐶is an arbitrary constant.
𝑦+1
Hence = 𝐶 is the general solution.◄
𝑥−1
Exercise.
Given the boundary condition that 𝑦 = 1 at 𝑥 = 0.
Use the expression for the general solution in the above
example to work out the particular solution.
[Solution𝑦 = 2(1 − 𝑥) − 1)].
Example 2.
Solve the equation
𝑑𝑦 2𝑥 + 1
=
𝑑𝑥 𝑦
Solution.
Rewriting the given equation we obtain
𝑦𝑑𝑦 = (2𝑥 + 1)𝑑𝑥
- 353 -
Or,∫ 𝑦𝑑𝑦 = ∫(2𝑥 + 1)𝑑𝑥.
1
Hence 𝑦 2 = 𝑥 2 + 𝑥 + 𝑐,where c is an arbitrary
2
𝑑𝑦 𝑥𝑑𝑥
Integrating ∫ = ∫ (𝑥 2 .
𝑦 +1)
1
ln|𝑦| = ln|𝑥 2 + 1| + 𝐴
2
- 354 -
Example 4.
4𝑥 2 +1 𝑑𝑦
Solve the equation = 𝑥𝑦 given 𝑦(1) = 0.
𝑦+1 𝑑𝑥
Solution.
4𝑥 2 +1
This can be written as 𝑑𝑥 = 𝑦(𝑦 + 1)𝑑𝑦.
𝑥
- 355 -
Solution.
Replace x and y by 𝑡𝑥 and𝑡𝑦 respectively, to
obtain(𝑡𝑥 )2 + (𝑡𝑥 )(𝑡𝑦) − (𝑡𝑦)2 = 𝑡 2 (𝑥 2 + 𝑥𝑦 − 𝑦 2 ).
The degree is 2.◄
♠ Consider the differential equation
𝑀(𝑥, 𝑦)𝑑𝑥 + 𝑁(𝑥, 𝑦)𝑑𝑦 = 0.
The equation is said to be homogeneous in x and y if M
and N are homogeneous functions of the same degree in x
and y. The technique for solving this equation is to make
the substitution 𝑦 = 𝑣𝑥 or 𝑥 = 𝑣𝑦 and is based upon the
following theorem.
Theorem 1.
Any homogeneous differential of the first order and first
degree can be reduced to the type of variables separable
by a substitution of 𝑦 = 𝑣𝑥 or 𝑥 = 𝑣𝑦.
Example2.
Find the general solution of the differential equation
2𝑥𝑦𝑑𝑥 − (𝑥 2 − 𝑦 2 )𝑑𝑦 = 0.
Solution.
A simple check reveals that the equation is homogeneous.
- 356 -
Let𝑦 = 𝑣𝑥and 𝑑𝑦 = 𝑣𝑑𝑥 + 𝑥𝑑𝑣, and substitute into the
differential equation to obtain
2𝑥(𝑣𝑥)𝑑𝑥 − (𝑥 2 − 𝑣 2 𝑥 2 )(𝑣𝑑𝑥 + 𝑥𝑑𝑣) = 0
𝑥 2 (𝑣 + 𝑣 3 )𝑑𝑥 + 𝑥 3 (𝑣 2 − 1)𝑑𝑣 = 0.
Dividing by 𝑥 3 (𝑣 + 𝑣 3 ) separates the variables:
𝑑𝑥 (𝑣 2 −1)𝑑𝑣
+ = 0. Integration yields
𝑥 𝑣(1+𝑣 2 )
𝑙𝑛 𝑥 − 𝑙𝑛 𝑣 + 𝑙𝑛( 𝑣 2 + 1) = 𝑙𝑛 𝑐
or 𝑥(𝑣 2 + 1) = 𝑐𝑣
𝑦
Rewriting the original variables by substituting 𝑣 = , we
𝑥
Example3.
𝑑𝑦 𝑦(𝑥+𝑦)
Solve the equation = , given 𝑦(1) = 1.
𝑑𝑥 𝑥(𝑦−𝑥)
- 357 -
Solution.
Substitute𝑦 = 𝑣𝑥 we obtain
𝑑𝑣 𝑣𝑥(𝑥+𝑣𝑥) 𝑣(1+𝑣)𝑥 2
𝑥 +𝑣 = = .
𝑑𝑥 𝑥(𝑣𝑥−𝑥) (𝑣−1)𝑥 2
- 358 -
Solution.
𝑑𝑦 √𝑥 2 −𝑦 2 +𝑦
This can be written as = .
𝑑𝑥 𝑥
Solution.
𝑑𝑦 𝑥+𝑦
Rearrange the given equation: = .
𝑑𝑥 𝑥
𝑑𝑣 𝑥+𝑣𝑥
Putting 𝑦 = 𝑣𝑥 gives:𝑥 +𝑣 = = 1 + 𝑣.
𝑑𝑥 𝑥
𝑑𝑣
Simplify: 𝑥 = 1.
𝑑𝑥
𝑑𝑥
Separate the variables: 𝑑𝑣 =
𝑥
- 359 -
1
Integrate: 𝑣 = ∫ 𝑑𝑥 = 𝑙𝑛|𝑥 | + 𝐴.
𝑥
- 360 -
𝑑𝑦
Comparing left hand side with 𝑒 𝑓(𝑥) ( + 𝑓′(𝑥 )𝑦)
𝑑𝑥
𝑦𝑒 −3𝑥 = ∫ 𝑒 −𝑥 𝑑𝑥 = −𝑒 −𝑥 + 𝐴
𝑦 = 𝑒 2𝑥 + 𝐴𝑒 3𝑥 .◄
- 361 -
Example 2.
Given
𝑑𝑦
2 cos 𝑥 + 4𝑦 sin 𝑥 = sin 2 𝑥
𝑑𝑥
𝜋
Find 𝑦(𝑥) (y as a function of x), given that 𝑦( ) = 0.
3
Solution.
Divide through by 2 cos 𝑥 to write it in the form
𝑑𝑦
+ 𝑃(𝑥 )𝑦 = 𝑄 (𝑥 )
𝑑𝑥
dy 2 sin x sin 2 x 2 sin x cos x
+ y= =
dx cos x 2 cos x 2 cos x
dy
+ (2 tan x ) y = sin x
dx
The equation is linear, integrating factor
2𝑥
𝑒 ∫ 2 𝑡𝑎𝑛 𝑥 𝑑𝑥 = 𝑒 ln 𝑠𝑒𝑐 = 𝑠𝑒𝑐 2 𝑥
Multiply by I. F. We obtain
𝑑𝑦
𝑠𝑒𝑐 2 𝑥 + (2 𝑠𝑒𝑐 2 𝑥 𝑡𝑎𝑛 𝑥 )𝑦 = 𝑠𝑒𝑐 2 𝑥 𝑠𝑖𝑛 𝑥
𝑑𝑥
Then
𝑑(𝑦 𝑠𝑒𝑐 2 𝑥 ) 𝑠𝑖𝑛 𝑥
= 𝑠𝑒𝑐 2 𝑥 𝑠𝑖𝑛 𝑥 =
𝑑𝑥 𝑐𝑜𝑠 2 𝑥
𝑠𝑖𝑛 𝑥 1
Integrate 𝑦 𝑠𝑒𝑐 2 𝑥 = ∫ 𝑑𝑥 = + 𝐴.
𝑐𝑜𝑠2 𝑥 𝑐𝑜𝑠 𝑥
- 362 -
Hence 𝑦 = 𝑐𝑜𝑠 𝑥 + 𝐴 𝑐𝑜𝑠 2 𝑥.
𝜋
But𝑦 = 0 when𝑥 = .
3
𝜋 𝜋
Hence 0 = 𝑐𝑜𝑠 + 𝐴 𝑐𝑜𝑠 2
3 3
1 1
0= + 𝐴 ⇒ 𝐴 = −2
2 4
Particular solution𝑦(𝑥) = 𝑐𝑜𝑠 𝑥 − 2 𝑐𝑜𝑠 2 𝑥.◄
Example 3.
𝑑𝑦
Solve the equation −𝑦 =𝑥
𝑑𝑥
Solution.
𝑑𝑦
The I. F. is 𝑒 ∫ −1𝑑𝑥 = 𝑒 −𝑥 giving 𝑒 −𝑥 − 𝑦𝑒 −𝑥 =
𝑑𝑥
𝑥𝑒 −𝑥 .
𝑑(𝑦𝑒 −𝑥 )
Hence = 𝑥𝑒 −𝑥 .
𝑑𝑥
So 𝑦𝑒 −𝑥 = ∫ 𝑥𝑒 −𝑥 𝑑𝑥.
Integrating the right side by parts we obtain
𝑦𝑒 −𝑥 = −𝑥𝑒 −𝑥 − 𝑒 −𝑥 + 𝐴.
Thus
𝑦 = 𝐴𝑒 𝑥 − 𝑥 − 1.◄
- 363 -
Exercise Set (8.1)
1. Classify the given differential equation by order and
tell whether it is linear or non-linear.
(a) 𝑦’ + 2𝑥𝑦 = 𝑥 2 ; (b) 𝑦’ (𝑦 + 𝑥) = 5;
(c) 𝑦 sin 𝑦 = 𝑦’’; (d) 𝑦 cos 𝑦 = 𝑦’’’;
(e)cos 𝑦 𝑑𝑦 = sin 𝑥 𝑑𝑥; (f)𝑦’’ = 𝑒 𝑦 .
2. State whether the given differential equation is linear
or non-linear. Write the order of each equation.
(a) (1 − 𝑥 )𝑦’’ − 6𝑥𝑦’ + 9𝑦 = sin 𝑥;
𝑥𝑑3 𝑦 𝑑𝑦
(b) − 2( )2 + 𝑦 = 0;
𝑑𝑥 3 𝑑𝑥
(c) 𝑦𝑦’ + 2𝑦 = 2 + 𝑥 2 ;
𝑑2 𝑦
(d) + 9𝑦 = 𝑠𝑖𝑛 𝑦;
𝑑𝑥 2
𝑑𝑦 𝑑2 𝑦 2 1
(e) = (1 + ( 2 ) )2 ;
𝑑𝑥 𝑑𝑥
𝑑2 𝑟 𝑘
(f) = − 2.
𝑑𝑡 2 𝑟
- 364 -
6. 𝑥 2 𝑑𝑦 + 2𝑥𝑦 𝑑𝑥 = 0; 𝑦 = −1/𝑥 2
7. 𝑦’’’ − 3𝑦’’ + 3𝑦’ − 𝑦 = 0; 𝑦 = 𝑥 2 𝑒 𝑥
8. 𝑦’ = 𝑦 + 1; 𝑦 = 𝑒 𝑥 − 1
9. 𝑦’’ + 9𝑦 = 8 sin 𝑥 ;
𝑦 = sin 𝑥 + 𝑎 cos 3𝑥 + 𝑏 sin 3𝑥
10. Solve the following differential equations
𝑑𝑦 𝑑𝑦
(i) 𝑥 = 𝑦2 , (ii) 𝑥 (𝑥 2 + 1) = ;
𝑑𝑥 𝑑𝑥
2𝑥𝑦 𝑑𝑦 𝑑𝑦 𝑥 2 +𝑥
(iii) = ;(iv) = ;
𝑥 2 +1 𝑑𝑥 𝑑𝑥 𝑦 2 +𝑦
𝑑𝑦 𝑑𝑦
(v) 𝑐𝑜𝑠 2 𝑥 = 𝑐𝑜𝑠 2 𝑦; (vi) 𝑒 𝑥+𝑦 = 1;
𝑑𝑥 𝑑𝑥
𝑑𝑦 𝑦−1
(vii) 𝑥 = − 𝑦;
𝑑𝑥 𝑦+1
𝑑𝑦
(viii) (𝑥 + 1)𝑦 = 𝑥 , where 𝑦(1) = 1;
𝑑𝑥
𝑑𝑦 𝜋
(ix) 𝑡𝑎𝑛 𝑥 = 𝑐𝑜𝑡 𝑦 where 𝑦(0) = ;
𝑑𝑥 2
𝑑𝑦
(x) (𝑠𝑖𝑛 𝑥 + 𝑐𝑜𝑠 𝑥 ) = 𝑐𝑜𝑠 𝑥 − 𝑠𝑖𝑛 𝑥.
𝑑𝑥
- 365 -
𝑑𝑦 𝑑𝑦
4. 𝑥𝑦 2 = 𝑥 3 − 𝑦 3 5.𝑥 = 𝑦 + √𝑥 2 − 𝑦 2
𝑑𝑥 𝑑𝑥
𝑑𝑦
6. 𝑦 = −𝑥 + 2𝑦
𝑑𝑥
𝑑𝑦
7. 𝑥 (𝑥 2 + 𝑦 2 ) = 𝑦 (𝑦 2 − 𝑥 2 )
𝑑𝑥
𝑑𝑦
8. (𝑥 3 + 𝑥𝑦 2 ) = (𝑥 2 𝑦 − 𝑦 3 )
𝑑𝑥
- 366 -
MISCELLANEOUS PROBLEMS
1- Suppose𝐴 is an 𝑚 × 𝑛 matrix such that 𝐴𝐴𝑡 = 𝟎.
Prove that 𝐴 = 𝟎.
1 2 1 2 1 0 7
1 2 4 −1 4 3 1
2- Put the matrix 𝐴 = 2 4 5 1 5 −2 3 in row-
2 4 8 −2 8 6 2
[1 2 1 2 6 0 7]
echelon form.
3- Let 𝐴 and 𝐵 be an 𝑛 × 𝑛 non-singular matrices prove
that 𝐴𝐵 = 𝐵𝐴 if and only if (𝐴𝐵 )2 = 𝐴2 𝐵2
4- Determine whether the following matrix is unitary and
find the inverse of it if it is unitary.
𝑖 1 1
1+ −
2 2 2
𝑖 1 𝑖
−
√3 √3 √3
3+𝑖 4 + 3𝑖 5𝑖
[2√15 2√15 2√15]
−1 3 5
5- Prove that [ 1 −3 −5] is idempotent.
−1 3 5
- 367 -
6- Prove that the following system is consistent if and
only if 𝛿𝛼 − 𝛽 − 11𝛾 + 5𝛿 = 0.
Solve the system when 𝛼 = 𝛽 = −1, 𝛾 =3 and 𝛿 = 8.
𝑥1 − 𝑥2 − 𝑥4 − 5𝑥5 = 𝛼
2𝑥1 + 𝑥2 − 𝑥3 − 4𝑥4 + 𝑥5 = 𝛽
𝑥1 + 𝑥2 + 𝑥3 − 4𝑥4 − 6𝑥5 = 𝛾
𝑥1 + 4𝑥2 + 2𝑥3 − 8𝑥4 − 5𝑥4 = 𝛿
7- Write the general solution of the following systems as
a linear combination of particular solutions.
𝑥1 − 𝑥2 + 𝑥3 = 0
−2𝑥1 − 3𝑥2 = 0
𝑥1 + 2𝑥2 − 𝑥3 = 0
8. Find the minors and cofactors of the matrix
2 −3 1
𝐴 = [4 −2 5].
1 −1 3
1 𝑥 𝑥2
9. Prove that |1 𝑦 𝑦 2 | = (𝑥 − 𝑦)(𝑦 − 𝑧)(𝑧 − 𝑥 ).
1 𝑧 𝑧2
10. Use Cramer's Rule to find 𝑥1 and 𝑥2 for the system
𝑥 = 𝑥1 cos 𝜃 − 𝑦1 sin 𝜃
𝑥 = 𝑥1 sin 𝜃 + 𝑦1 cos 𝜃
- 368 -
11. Prove that 𝑃2 is a linear space.
12. Prove that 𝑉 = ℝ2 with addition and scalar
multiplication defined by: (𝑎, 𝑏) + (𝑐, 𝑏) =
(𝑎 + 𝑐, 𝑏 + 𝑑) and 𝛼 (𝑎, 𝑏) = (|𝛼 |𝑎, |𝛼 |𝑏) is not a linear
space.
13. Let 𝑉 be a linear space and 𝒖, 𝒗 ∈ 𝑉, 𝛼, 𝛽 ∈ ℝ.
(a) If 𝛼𝒗 = 𝛽𝒗, 𝒗 ≠ 𝟎, prove that 𝛼 = 𝛽.
(b) If 𝛼𝒗 = 𝛼𝒖 and ≠ 0 , prove that 𝒗 = 𝒖.
14. Let 𝑉 = {𝐴𝑋: 𝐴 ∈ 𝑀𝑚×𝑛 and X ∈ Mn×1 }. Prove that
𝑉 is a linear space with addition and scalar
multiplication of matrices
15. Let 𝑈, 𝑊 be two subspaces of V. Prove that
(a) 𝑈 ∩ 𝑊 is a subspace of 𝑉.
(b) 𝑈 ∪ 𝑊 may not be a subspace of 𝑉.
(c) 𝑈 ∪ 𝑊 is a subspace of 𝑉 if and only if
𝑈 ⊆ 𝑉 or W ⊆ V.
(d) 𝑈 + 𝑊 = {𝒖 + 𝒘: 𝒖 ∈ 𝑈, 𝒘 ∈ 𝑊 } is a subspace
of 𝑉.
16. Let 𝒖1 = (1,0,1), 𝒖2 = (−1, −1,0),
𝒗1 = (1,2,0), 𝒗2 = (0,0,1)
(a) Prove that {𝒖1 , 𝒖2 } ≠ {𝒗1 , 𝒗2 }.
- 369 -
(b) Find 𝛼1 , 𝛼2 , 𝛽1 and 𝛽2 such that
𝛼1 𝑢1 + 𝛼2 𝑢2 = 𝛽1 𝑣1 + 𝛽2 𝑣2
17. Let {𝒖, 𝒗} be linearly independent in a vector space V.
Prove that
(a) {𝒖, 𝒖 + 𝒗} is linearly independent.
(b){𝒖, 𝒗, 𝒖 + 𝒗} is linearly dependent.
18. Determine the dimension of and a basis for the
solution space of the systems:
𝑥 + 𝑦 + 𝑧 = 0, 2𝑥 + 2𝑦 − 2𝑧 = 0
4𝑥 + 3𝑦 − 𝑧 = 0, 6𝑥 + 5𝑦 + 𝑧 = 0.
19. Find the coordinate vector of v relative to the basis
𝑆 = {𝒗1 , 𝒗2 , 𝒗3 } , where 𝑣 = (2, −1, 3); 𝒗1 =
(1, 0, 0) , 𝒗2 = (2, 2, 0) , 𝒗3 = (3, 3, 3)
〈𝐴, 𝐵〉 = 𝑎1 𝑏1 + 𝑎2 𝑏2 + 𝑎3 𝑏3 + 𝑎4 𝑏4 .
- 370 -
22. Let 𝒖, 𝒗 be a linearly dependent vectors in an inner
product space V. Compute the determinate
〈𝒗, 𝒗〉 〈𝒖, 𝒗〉
| |
〈𝒗, 𝒖〉 〈𝒖, 𝒖〉
- 371 -
1 0
03 −4
1 0 5 −2 −1
𝐴 = [0 04 2 ] , 𝐴 = [
1 ]
0 1 −2 4 2
0 0
15 0
1 1
27. Let 𝐴 = [ ]. Find the eigenvectors of A.
−2 4
5 −2 6 −1
0 3 −8 0
28. Let 𝐴 = ( ).
0 0 5 4
0 0 0 1
Find the eigenvalues and eigenvectors of A.
100𝑛 +∞
29. Is{ } a monotone sequence? Is it a convergent
𝑛! 𝑛=1
1 .3 .5…(2 𝑛−1) +∞
31. Show that{ } is an increasing
𝑛! 𝑛=1
sequence.
32. Show that if {𝑎𝑛 }+∞
𝑛=1 a nonincreasing sequence,
then{−𝑎𝑛 }+∞
𝑛=1 is a nondecreasing sequence.
- 372 -
(2 𝑛−1)𝜋 𝑛+1
37. 𝑎𝑛 = sin ( ); 38. 𝑎𝑛 = 𝑛(𝑛+2).
2
∑∞ 𝑘 ∑∞ 𝑘
(c) 𝑘=2 1/(ln𝑞 ); (d) 𝑘=2 1/(ln𝑞 ) ;
(b) Why can't you use the integral test for the series
∑∞ 2
𝑘=1(2 + cos 𝑘 )/𝑘 ?Test for convergence using a
- 373 -
44.In parts (a)-(d), use the comparison test to
determine whether the series converges.
2 𝑘−1 2 𝑘+1
(a) ∑∞
𝑘=1 ; (b) ∑∞
𝑘=1 ;
3 𝑘 2 −𝑘 3 𝑘 2 +𝑘
2 𝑘−1 2 𝑘+1
(c) ∑∞
𝑘=1 2; (d) ∑∞
𝑘=1 .
3 𝑘 3 −𝑘 3 𝑘 3 +𝑘 2
√𝑘 𝑘 𝑘 3𝑘 𝑘!
49. ∑∞
𝑘=1 𝑘 2 +7; 50. ∞ ∞
∑𝑘=1 ( ) ;51.∑𝑘=1 ;
𝑘+1 (2 𝑘)!
∞ 𝑘 6 3𝑘 ∞ 5 𝑘 3𝑘
52 ∑𝑘=1 ; 53.∑𝑘=1 ( ) .
(𝑘+1) 2 𝑘+1
- 374 -
𝑘 𝑘 𝑘 𝑘+1
59. ∑∞ ∞
𝑘=0(−1) /(2 𝑘 + 1); 60.∑𝑘=0(−1) 3 /2 .
In Exercise (27)-(32), determine the radius of
convergence and the interval of convergence of the given
power series.
(𝑘−1)𝑘 (2 𝑥)𝑘 (1−𝑥)2𝑘
61. ∑∞
𝑘=1 ; 62. ∑∞
𝑘=1 ; 63.∑∞
𝑘=1 ;
𝑘 √𝑘 3𝑘 4𝑘𝑘
- 375 -
72 Use the identity 𝑥 = 𝑎 + (𝑥 − 𝑎) and the known
Maclaurin series for𝑒 𝑥 , sin 𝑥 , cos 𝑥 , and 1 / (1 − 𝑥) to
find the Taylor series about 𝑥 = 𝑎 for
(a)𝑒 𝑥 (b) sin x and (c) 1 / x.
73. Find the partial fraction decomposition of
𝑥 2 +2𝑥+5 𝑥 2 +1
; ;
𝑥 3 +3𝑥 2 −𝑥−3 𝑥(𝑥−1)3
2𝑥 3 +7𝑥+5 𝑥 4 +𝑥 3 +𝑥 2 −𝑥+1
(𝑥 2 +𝑥+2)(𝑥 2 +1)
; .
𝑥(𝑥 2 +1)2
2 when𝑥 = 0.
Give y in terms of x.
- 376 -
REFERENCES
55608-010-4
ISBN 978-0-13-229654-0
- 377 -