ATOM 02 Algebra Intermediate Methods
ATOM 02 Algebra Intermediate Methods
Volume / Tome II
ALGEBRA —
INTERMEDIATE METHODS
Revised Edition
Bruce Shawyer
Memorial University of Newfoundland
Publisher: Canadian Mathematical Society
Managing Editor: Graham Wright
Editor: Richard Nowakowski
Cover Design: Bruce Shawyer and Graham Wright
Typesetting: CMS CRUX with MAYHEM Office
Printing and Binding: The University of Toronto Press Inc.
Shawyer, Bruce
c 1999 Canadian Mathematical Society
All rights reserved. No part of this publication may be reproduced or transmitted in any
form or by any means, electronic or mechanical, including photocopying, recording, or
any information or retrieval system now known or to be invented, without permission
in writing from the publisher: The Canadian Mathematical Society, 577 King Edward
Avenue, P.O. Box 450, Station A, Ottawa, Ontario K1N 6N5, Canada, except so far as
may be allowed by law.
ISBN 0–919558–11–9
This is a PDF version of the original printed volume. There are changes
in fonts and format that could result in different line and page breaks.
Ceci est une version PDF de l’ouvrage imprim. Il est possible que des
changements de polices ou de format modifient les coupures de ligne
ou de page.
A TASTE OF MATHEMATICS
Volume / Tome II
ALGEBRA —
INTERMEDIATE METHODS
Revised Edition
Bruce Shawyer
Memorial University of Newfoundland
The ATOM series
The booklets in the series, A Taste of Mathematics, are published by the
Canadian Mathematical Society (CMS). They are designed as enrichment mate-
rials for high school students with an interest in and aptitude for mathematics.
Some booklets in the series will also cover the materials useful for mathematical
competitions at national and international levels.
La collection ATOM
Publiés par la Société mathématique du Canada (SMC), les livrets
de la collection Aime-t-on les mathématiques (ATOM) sont destinés au
perfectionnement des étudiants du cycle secondaire qui manifestent un intérêt
et des aptitudes pour les mathématiques. Certains livrets de la collection ATOM
servent également de matériel de préparation aux concours de mathématiques sur
l’échiquier national et international.
Bruce Shawyer
Memorial University of Newfoundland / Université Memorial de Terre-Neuve
1 Introduction 1
2 Mathematical Induction 2
2.1 Induction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
2.2 Horses . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
2.3 Strong Induction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
3 Series 5
3.1 Telescoping Series . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
3.2 Sums of Powers of Natural Numbers . . . . . . . . . . . . . . . . . . . . . . . . . 6
3.3 Arithmetic Series . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
3.4 Geometric Series . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
3.5 Infinite Series . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
4 Binomial Coefficients 12
4.1 Factorials . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
4.2 Binomial Coefficients . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
4.3 Pascal’s Triangle . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
4.4 Properties . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
Foreward
This volume contains a selection of some of the basic algebra that is useful in solving
problems at the senior high school level. Many of the problems in the booklet admit
several approaches. Some worked examples are shown, but most are left to the ingenuity
of the reader.
While I have tried to make the text as correct as possible, some mathematical and
typographical errors might remain, for which I accept full responsibility. I would be
grateful to any reader drawing my attention to errors as well as to alternative solutions.
Also, I should like to express my sincere appreciation for the help given by Ed Barbeau
in the preparation of this material.
It is the hope of the Canadian Mathematical Society that this collection may find its way
to high school students who may have the talent, ambition and mathematical expertise
to represent Canada internationally. Those who wish more problems can find further
examples in:
1. The International Mathematical Talent Search (problems can be obtained from the
author, or from the magazine Mathematics & Informatics Quarterly, subscriptions
for which can be obtained (in the USA) by writing to Professor
Susan Schwartz Wildstrom, 10300 Parkwood Drive, Kensington, MD USA 20895
ssw@ umd5.umd.edu, or (in Canada) to Professor Ed Barbeau, Department of
Mathematics, University of Toronto, Toronto, ON Canada M5S 3G3
barbeau@ math.utoronto.ca);
2. The Skoliad Corner in the journal Crux Mathematicorum with Mathematical May-
hem (subscriptions can be obtained from the Canadian Mathematical Society, 577
King Edward, PO Box 450, Station A, Ottawa, ON, Canada K1N 6N5);
3. The book The Canadian Mathematical Olympiad 1969–1993 L’Olympiade
mathématique du Canada, which contains the problems and solutions of the first
twenty five Olympiads held in Canada (published by the Canadian Mathematical
Society, 577 King Edward, PO Box 450, Station A, Ottawa, ON, Canada K1N
6N5);
4. The book Five Hundred Mathematical Challenges, by E.J. Barbeau, M.S.
Klamkin & W.O.J. Moser (published by the Mathematical Association of America,
1529 Eighteenth Street NW, Washington, DC 20036, USA).
1 Introduction
The purpose of this booklet is to gather together some of the miscellaneous math-
ematics that is useful in problem solving, but is often missing from high school
mathematics textbooks.
The topics are grouped under five main headings
1. Mathematical Induction.
2. Series.
3. Binomial Coefficients.
1. Mathematical Induction.
This method is very useful for proving results about whole numbers. It is,
in fact, basic to the number system that we use.
2. Series.
Although infinite series is really a university level topic, a gentle introduction
leads to some very useful results.
3. Binomial Coefficients.
Binomial coefficients are everywhere. Many of the properties can be obtained
using the results of the previous two section. However, many of the properties
can be obtained using counting arguments. See, for example, the forthcoming
ATOM booklet on Combinatorics.
2 Mathematical Induction
A very important method of proof is the Principle of Mathematical Induction.
This is used to prove results about the natural numbers, such as the fact that the
n(n + 1)
sum of the first n natural numbers is .
2
We use functional notation: P (n) means a result (proposition) for each
natural number n. For example, the statement that the sum of the first n natural
n(n + 1)
numbers is equal to is written:
2
n
n(n + 1)
P (n) : k = .
2
k=0
2.1 Induction
Proof by induction consists of three parts:
TEST: Find an appropriate starting number for the result (if it is not already
given in the statement). Usually 1 is appropriate (sometimes 0 is better). For the
purposes of this explanation, we shall take 1 as the starting number.
Test (check) the result for this number: is P (1) correct?
If “YES”, we proceed: if “NO”, the suggested result is false.
STEP: We assume that the result is indeed true for some particular (general)
natural number, say k. This is the inductive hypothesis. With this assumed result
P (k), we deduce the result for the next natural number k + 1. In other words, we
try to prove the implication
P (k) =⇒ P (k + 1) .
n(n + 1)
Problem 1 1 + 2 + · · · + n = .
2
3
n(n + 1)(2n + 1)
Problem 2 12 + 22 + · · · + n2 = .
6
n2 (n + 1)2
Problem 3 13 + 23 + · · · + n3 = .
4
n
n k n−k
Problem 4 α β = (α + β)n .
k
k=0
n
1 √
Problem 5 √ < 2 n.
n
k=1
n5 n4 n3 n
Problem 6 + + − is an integer for n = 0, 1, 2, . . ..
5 2 3 30
Problem 7 If x ∈ [0, π], prove that | sin(nx)| ≤ n sin(x) for n = 0, 1, 2, . . ..
for n = 1, 2, . . ..
2.2 Horses
Just to check that you really understand the Principle of Mathematical Induction,
find the fallacy in the following “proof”.
Proposition. Let P (n) mean “in any set of n horses, all the horses are the same
colour”.
TEST. Consider a set consisting of exactly one horse. All the horses in it are the
same colour (since there is only one).
STEP. Assume that P (k) is true for some natural number k, that is, in any set
of k horses, all the horses are the same colour. This is the inductive hypothesis.
Now, consider a set consisting of k + 1 horses. We place each horse in a stable, in
a row, and number the stables, 1, 2, . . ., k, k + 1.
1 2 3 k−1 k k+1
1 2 3 k−1 k k+1
4
This is a set of k horses, and so, by the inductive hypothesis, must consist of
horses, all of the same colour.
Now, consider the horses in stables numbered 2, 3, . . ., k, k + 1.
..............................................................................................................................................................................................................................................................................................................................................................................................................................................................
.. ..
1 2 3 k−1 k k+1
This is a set of k horses, and so, by the inductive hypothesis, must consist of
horses, all of the same colour.
By observing the overlap between the two sets, we see that all the horses in the
set of k + 1 of horses must be of the same colour. And so, we are done!
Clearly, this is nonsense! We know that all horses are not the same colour. The
reader is asked to examine this “proof” and find out where it goes wrong. The
explanation may be found on page 42.
Problem 9 Pick’s theorem states that the area of a polygon, whose vertices have
integer coordinates (that is, are lattice points) and whose sides do not cross, is
given by
B
I + − 1,
2
where I and B are then numbers of interior and boundary lattice points respectively.
Prove Pick’s Theorem for a triangle directly.
Use strong induction to prove Pick’s theorem in general.
Problem 10 Prove that every natural number may be written as the product of
primes.
Problem 12 Show that every positive integer can be written as a sum of distinct
Fibonacci numbers. 1
2
Problem 13 For the Fibonacci numbers, show that Fn+1 + Fn2 = F2n+1 .
3 Series
We start with a given sequence {aj }, and we use “sigma” notation for addition:
n
aj := ak+1 + ak+2 + · · · + an .
j=k+1
We have “series” when we add up sequences. They may start with term 0 or with
term 1 as is convenient. We define the partial sum of a series by
k
Ak := aj ,
j=0
so that n
ak+1 + ak+2 + · · · + an = aj = An − Ak .
j=k+1
aj = fj − fj−1 ,
and so we get
n
n
aj = (fj − fj−1 ) = fn − f0 .
j=1 j=1
1 1 1 −1 −1 1
For example = − = − , so that fj = − , and
j(j + 1) j j+1 j+1 j j+1
n
n
1 −1 −1 −1 −1 1
= − = − = 1− .
j=1
j(j + 1) j=1
j + 1 j n + 1 1 n + 1
1 Fibonacci numbers are given by
F0 = 0 ,F1 = 1 ,for n ≥ 2 , Fn = Fn−1 + Fn−2 .
6
n
1
Problem 14 Evaluate .
j=1
j(j + 1)(j + 2)
n
1
Problem 15 Evaluate .
j=1
j(j + 1)(j + 2)(j + 3)
Generalize this!
n
j
Problem 16 Evaluate .
j=1
j4 + j2 + 1
n2 − (n − 1)2 = 2n − 1
(n − 1)2 − (n − 2)2 = 2(n − 1) − 1
..
.
22 − 12 = 2(2) − 1
12 − 02 = 2(1) − 1 .
2 2
⎛ sum⎞of the left sides is n − 0 = n and the sum of the right sides is
The
n
⎝2 j ⎠ − n (did you get the last term?) This gives
j=1
n
n(n + 1)
j = .
j=1
2
n
2
3 n(n + 1)
Problem 18 Show that j = .
j=1
2
n
Problem 19 Calculate a formula for j4.
j=1
n
Problem 20 Calculate a formula for j5.
j=1
aj = aj−1 + δ = aj−2 + δ + δ = · · · = a0 + j δ .
Thus
n
n
n
aj = (a0 + j δ) = (n − k)a0 + δ j
j=k+1 j=k+1 j=k+1
⎛ ⎞
n
k
= (n − k)a0 + δ ⎝ j − j⎠
j=0 j=0
n(n + 1) k(k + 1)
= (n − k)a0 + δ − . (1)
2 2
Another way to look at arithmetic series, is to write the sum down in both direc-
tions!
Sum = a0 + a1 + · · · + an−1 + an ,
Sum = an + an−1 + · · · + a1 + a0 .
Adding gives
Note that
a0 + an = a1 + an−1
= ··· = ak + an−k
= ··· = an + a0 .
Therefore
so that
(n + 1) (a0 + an ) (n + 1) (First + Last)
Sum = = .
2 2
Put into words, we see that the sum of an arithmetic series is the average of the
first and last terms, multiplied by the number of terms.
aj = aj−1 · r = aj−2 · r · r = · · · = a0 · rj .
n
n
aj = a rj = a rk+1 + rk+2 + rk+3 + · · · + rn−1 + rn .
j=k+1 j=k+1
Also
n
n
r a rj = a rj+1
j=k+1 j=k+1
In the ATOM volume on trigonometry 2 , you will see how to use geometric
series to find
n n
cos(jθ) and sin(jθ) .
j=k+1 j=k+1
1
8. If lim ak = 0, then ak does not necessarily converge; for example k.
k→∞
∞
(See below) rk converges for |r| < 1 ;
k=1
∞
(See below) rk diverges for |r| ≥ 1 ;
k=1
∞
1
(See below) diverges ; (2)
k
k=1
∞
1
(See below) converges ; (3)
k2
k=1
∞
1
converges for p > 1;
kp
k=1
∞
1
diverges for p ≤ 1;
kp
k=1
∞
1
diverges ;
k log(k)
k=1
∞
1
diverges .
k log(k) log(log(k))
k=1
If r = 1, the partial sum to the nth term is an, which does not tend to a
∞
finite limit. Thus a diverges.
k=1
If |r| < 1, then lim rn = 0, and this gives the sum to infinity
n→∞
∞
1
rn = .
1−r
k=0
For (3), we note that, for each integer n, there is an integer m such that 2m ≤
n < 2m+1 . Thus
11
n
1 1 1 1
= 1+ + 2 + ··· + 2
k2 2 2 3 n
k=1
1 1 1 1 1 1
< 1+ + + + + +
4 4 16 16 16 16
1 1 1
+···+ + 2m + · · · + 2m
22m 2 2
m
1 1 1 1
= 1 + + + ···+ = 2 1 − m+1 < 2 .
2 4 2 2
Thus, the sequence of partial sums is bounded above. Since series of positive terms
either converge or diverge to plus infinity, it follows that (3) converges. We do not
know the value of the sum — only that it is less than or equal to 2. For those
interested in numerical values, the actual sum is approximately 1.644934.
∞
1
Problem 21 Investigate the exact value of .
k2
k=1
2
m
1 1 1 1 1
= 1+ + + + ···+
k 2 3 4 2m
k=1
1 1 1 1 1 1
> 1+ + + + ···+ + + · · · +
2 4 4 2m 2m 2m
1 1 1 m
= 1+ + + ···+ = 1+ .
2 2 2 2
Thus, the sequence of partial sums increases without bound. Since series of positive
terms either converge or diverge to plus infinity, it follows that (2) diverges.
∞
1
We also note that the partial sums of grow slowly. If you try to sum this
k
k=1
series on a computer, you may reach the conclusion that it converges, because the
rate of growth will be too slow for the accuracy of the computer to detect. The
partial sums grow like log(n). In fact
n
1
= γ + log(n) + n ,
n
k=1
where γ is Euler’s constant, and n is an error term that tends to zero and n tends
to infinity. For those interested in numerical values, γ ≈ 0.577215665.
is a convergent series if p > 0. What we need here is the alternating series test,
which states that
If an ≥ 0 and lim an = 0, then
n→∞
∞
(−1)k ak
k=1
is convergent.
Care is required about what is meant by the sum of such a series. For
∞
1
example (−1)k = log(2). But this is only so if the terms are added up in
k
k=1
the order given and not re-arranged.
4 Binomial Coefficients
4.1 Factorials
For every natural number k, we define “k factorial” 4 , written as k! as the running
product of all the natural numbers from 1 up to k:
k! := 1 · 2 · 3 · · · · · (k − 1) · k .
This can also be defined inductively from
1! := 1
k! := k × (k − 1)!
We also define 0! = 1 for reasons of consistency that will become clear.
n n n
We see that = = 1 and that > 0.
n 0 k
We also note these coefficients in the Binomial Theorem:
If n is a positive integer, then
n
n n k
(1 + x) = x .
k
k=0
1
1 1
1 2 1
1 3 3 1
1 4 6 4 1
1 5 10 10 5 1
1 6 15 20 15 6 1
1 7 21 35 35 21 7 1
. . . . . . . . . . . . . . .
n
The binomial coefficient occurs at position (n, k) of this matrix.
k
4.4 Properties
The following formulae should be proved:
Problem 23
n n−1
k = n .
k k−1
Problem 24
n n
= .
k n−k
Problem 25
n+1 n n
= + .
k+1 k k+1
Problem 26
n n n−k
= · .
k+1 k k+1
Problem 27
n+1 n n
= + .
k k k−1
Problem 28
n+1 k k+1 n
= + + ···+ .
k+1 k k k
Problem 29
k+n+1 k k+1 k+2 k+n
= + + + ··· + .
n 0 1 2 n
Problem 30
n n
= = n.
1 n−1
15
Problem 31
n n n n
2n = + + + ···+ .
0 1 2 n
Problem 32
n n n n
0 = − + − · · · + (−1)n .
0 1 2 n
Problem 33
n n n
2n−1 = + + + ··· .
0 2 4
Problem 34
n n n
2n−1 = + + + ··· .
1 3 5
Problem 35
2 2 2 2
2n n n n n
= + + + ···+ .
n 0 1 2 n
Problem 36
n n n n
n 2n−1 = +2 +3 + ···+ n .
1 2 3 n
Problem 37
n n n n
0 = −2 +3 − · · · + (−1)n−1 n .
1 2 3 n
Problem 38
k
m+n n m
= .
k j=0
j k−j
There are several methods for proving these inequalities. For example, one
could use mathematical induction (as developed earlier in this booklet), or the
binomial theorem. They can also be obtained combinatorically. See, for example
the ATOM booklet on Combinatorics 6 .
6 A planned ATOM booklet.
16
b c
0 = x2 + x+ . (5)
a a
The standard technique for solution is to complete the square. Since
(x + u)2 = x2 + 2xu + u2 , we get
2 2
2 b b c b
0 = x + x+ + −
a 2a a 2a
2 2
b b − 4ac
= x+ − . (6)
2a 4a2
The expression b2 − 4ac, called the Discriminant and usually denoted by ∆7 , must
be non-negative if the quadratic equation is to have a real number solution. If
∆ = 0, then the quadratic equation has only one solution: we have a perfect
square. If ∆ > 0, then we have two distinct solutions.
If we allow x to be a complex number, in particular, when ∆ < 0, then the
solutions occur in conjugate pairs, that is, of the form p ± i q, where p, q are real
numbers.
If we also allow the coefficients, a, b, c to be complex numbers, and allow com-
plex solutions, then we always have two (distinct) solutions unless
∆ = 0, which we can interpret as two identical solutions.
The word root of an equation is another word for the solution of an equation.
This is frequently used in the circumstances like those in the next paragraph.
7 This is the upper case Greek letter, delta.
17
Suppose that α and β are the roots of (5). Then we have that
b c
(x − α)(x − β) = x2 + x+ .
a a
b c
α+β = − and αβ = .
a a
In words, this tells us that in the quadratic equation (5), the constant term is
the product of the roots, and that the coefficient of x is minus the sum of the
roots. For more on this, see the section entitled symmetric functions, later in
this booklet.
1 1
0 = x− x−
α β
1 1 1
= x2 − + x+
α β αβ
α + β 1
= x2 − x+
αβ αβ
−5 1
= x2 − x+ ,
7 7
or 7x2 + 5x + 1 = 0.
is
1 1
0 = x− 2 x− 2
α β
1 1 1
= x2 − + 2 x+
α2 β (αβ)2
2
2 α + β2 1
= x − x+
(αβ)2 (αβ)2
2
2 α + 2αβ + β 2 − 2αβ 1
= x − 2
x+
(αβ) (αβ)2
2
(α + β) − 2αβ 1
= x2 − 2
x+
(αβ) (αβ)2
2
(−5) − 2 × 7 1
= x2 − 2
x+ 2
7 7
11 1
= x2 − x + ,
49 49
or 49x2 − 11x + 1 = 0.
to give us
0 = x3 + 2x2 − 7x − 16 .
We have one equation in two unknowns, so there is not a unique solution. We thus
have the freedom to impose the condition: 3uv + p = 0. Now, we have a system
of two equations in two unknowns:
u3 + v 3 = −q ,
p3
u3 v 3 = − .
27
Thinking now about the roots of quadratic equations, we see that u3 and v 3 are
the roots of the quadratic equation
p3
0 = x2 + q x − .
27
20
where
q 2 p 3
∆= + , (10)
2 3
and the cube roots chosen so that uv = − p3 . Let
y1 = u+v, (11)
u+v √ u−v
y2 = − +i 3 , (12)
2 2
u+v √ u−v
y3 = − −i 3 . (13)
2 2
Now, we can check that y1 , y2 and y3 are the three solutions of the reduced
equation.
There are three cases.
If ∆ > 0, then we get one real solution and two conjugate complex solutions.
If ∆ = 0, then we get three real solutions (including a double root).
If ∆ < 0, then we get three real solutions, which can be found from trigonometry
(see the ATOM volume on trigonometry 8 if necessary):
φ |p|
y1 = 2 cos ; (14)
3 3
φ 2π |p|
y2 = 2 cos − ; (15)
3 3 3
φ 2π |p|
y3 = 2 cos + ; (16)
3 3 3
⎛ ⎞
⎜ −q ⎟
⎜ ⎟
where φ = cos −1 ⎜ 2 ⎟
⎜ 3 ⎟ .
⎝ |p| ⎠
3
8 A forthcoming volume in this series.
21
has a root between the values x1 and x2 , that is, if p(x1 ) and p(x2 ) have opposite
signs, then x3 is an approximate value to a root of p(x) = 0, where
(x2 − x1 )p(x1 )
x3 = x1 − .
p(x2 ) − p(x1 )
xq2 x3 q x0 q xq1
S1 = x1 + x2 + · · · + xn
= − an−1 ; (20)
S2 = x1 x2 + x1 x3 + · · · + x2 x3 + x2 x4 + · · ·
= +x3 x4 + x3 x5 + · · · + xn−1 xn
= + an−2 ; (21)
S3 = x1 x2 x3 + x1 x2 x4 + · · · + x2 x3 x4 + x2 x3 x5 + · · ·
= +x3 x4 x5 + x3 x4 x6 + · · · + xn−2 xn−1 xn
= − an−3 ; (22)
..
.
Sn = x1 x2 x3 · · · xn
= (−1)n a0 . (23)
In the problems that follow, you will be making use of the known result that
any symmetric polynomial in the roots can be expressed as a polynomial of the
elementary symmetric functions.
3
(ii) x1 (x22 − x23 ) + x2 (x23 − x21 ) + x3 (x21 − x22 ).
k=1
1 1 1 1 1 1
(iii) + , + , + ,
x1 x2 x2 x3 x3 x1
x1 x2 x3
(iv) , , .
x2 x3 x3 x1 x1 x2
This is all very appealing, but, unfortunately, in general, we cannot conclude that
the sequence converges. It can lead to “strange attractors” or “chaos”. It is
suggested that the reader try the following example.
x + y := (x1 + y1 , x2 + y2 , . . . , xn + yn ) = x + y .
Note that it is necessary for both vectors to have the same number of coordinates.
λ x := (λ x1 , λ x2 , . . . , λ xn ) .
Note that it is necessary for both vectors to have the same number of coordinates.
This leads to the length or norm of a vector being defined by
√
|x| := x· x.
x × y := (x2 y3 − x3 y2 , x3 y1 − x1 y3 , x1 y2 − x2 y1 ) .
x×y
6
........ ........................
.................. .............
........... ........
........ .......
. ........ .....
.... ...
..
....
P
θ PPP
... ..
..... ...
. ......
....... ..... .
.......................... ......
.........
.............
-
............................................................................
........ ..
.
q y
P
.
6.8 Matrices
A Matrix is a rectangular array of elements. We shall refer to the rows and columns
of this matrix.
Suppose that the matrix has n rows and k columns. We then write
where ai,j is the element in the ith row and the j th column. Written in full, this
is
⎛ ⎞
a1,1 a1,2 a1,3 a1,4 ··· a1,k
⎜ a2,1 a2,2 a2,3 a2,4 ··· a2,k ⎟
⎜ ⎟
⎜ a3,1 a3,2 a3,3 a3,4 ··· a3,k ⎟
A=⎜ ⎟.
⎜ .. .. .. .. .. .. ⎟
⎝ . . . . . . ⎠
an,1 an,2 an,3 an,4 ··· an,k
29
Similarly, we can multiply a matrix by a column vector (of course, the num-
ber of coordinates in the columns vector must be the same as the number of rows
in the matrix). This gives,
⎛ ⎞
k
Ax = ⎝ ai,j xj ⎠ .
j=1
1≤i≤n
We are now ready to multiply two matrices! This first condition is that the
number of columns in the left matrix must equal the number of rows in the right
matrix. So, let
(A B) C = A (B C) .
A special square matrix is the one with 1 in each position where i = j (the
main diagonal), and 0 in every other position. For example
⎛ ⎞
1 0 0
I := ⎝ 0 1 0 ⎠ .
0 0 1
31
This is known as the Identity matrix, for it is easy to show that, for any square
matrix,
AI = I A = A.
We now ask if, given a square matrix A, there exists a square matrix, say B,
such that A B = I. If such a matrix exists, we call it a right inverse of A. (Is it
also a left inverse?) If a matrix is both a left inverse and a right inverse, we call it
1
the inverse of A, and denote it by A−1 . Note that this does not mean , because
A
that expression has no meaning!
For example, if
1 1
A = ,
0 1
then
−1 1 −1
A = .
0 1
The condition for a square matrix to have an inverse is well known. It is
that a quantity known as the determinant of the matrix should have a non-zero
value.
6.13 Determinants
We shall start by considering a set of two equations in two unknowns:
ax + by = c, (25)
px + qy = r. (26)
You can think of this algebraically, or, if you know some Cartesian geometry,
geometrically, as two lines.
A unique solution of these equations will exist under certain conditions,
and geometrically, this is if the lines are not parallel. If the lines are parallel, then
there will either no solution (when the lines are distinct) or there will be an infinite
number of solutions (when the lines coincide). In the cases for no solution, the
slopes of the lines must be the same: that is (unless the lines are parallel to the
y–axis, that is b = q = 0)
a p
− =− or aq − bp = 0 .
b q
(Note that the second form is also true in the case when the lines are parallel to
the y–axis.)
So, if we assume that aq − bp = 0, we can find the unique solution of (25)
and (26). This is
cq − br
x = , (27)
aq − bp
ar − cp
y = . (28)
aq − bp
32
So, we see that the system of two simultaneous linear equations, (25) and
(26), has a solution if and only if
a b
p q = 0 .
ax + by + cz = p, (31)
dx + ey + f x = q, (32)
gx + hy + iz = r. (33)
⎛ ⎞
a b c
We shall define the determinant of ⎝ d e f ⎠ to be
g h i
is given by (29) and (30). These two equations can be written in matrix form as
follows:
1 q −b c x
= .
aq − bp −p a r y
a b 1 q −b
In other words, if A = , then the matrix is the
p q det(A) −p a
inverse of the matrix A, and a necessary condition for this is that the determinant
a b
|A| = is non-zero.
p q
Writing this in matrix form, we have that Ax = p has a solution
Bp = x, provided that det(A) is non-zero. Here, B is the inverse of A. Note
that det(B) = det(A).
6.16 Generalities
To bring all this together, we have defined the determinants for 2 × 2 and 3 × 3
matrices as follows: if
a1,1 a1,2
A = ,
a2,1 a2,2
then
det(A) = |A| := (a1,1 a2,2 − a1,2 a2,1 ) .
This is also written as
a a1,2
A = 1,1 ,
a2,1 a2,2
35
a1,1 a1,2 a1,3
|A| = a2,1 a2,2 a2,3
a3,1 a3,2 a3,3
a a2,3 a2,1 a2,3 a2,1 a2,2
= a1,1 2,2 − a
1,2 + a
1,3 .
a3,2 a3,3 a3,1 a3,3 a3,1 a3,2
To the element ai,j in the matrix A, the minor Ai,j is the matrix obtained
from A by deleting the row with index i and the column with index j. in other
words, delete the row and column containing ai,j . We also associate with the
position (i, j), the number δi,j := 1 if (i + j) is even, and δi,j := −1 if (i + j) is
odd. This gives an array of signs:
+ − + − + − ···
− + − + − + ···
+ − + − + − ···
− + − + − + ···
.. .. .. .. .. .. ..
. . . . . . .
a1,1 a1,2 a1,3
|A| = a2,1 a2,2 a2,3
a3,1 a3,2 a3,3
Note that taking the sum of the elements of one row multiplied by the minors
corresponding to elements of a different row (or similarly for columns) will result
in a value of 0 (why?).
In general, we get
k
|A| = δ1,j a1,j |A1,j | .
j=1
However, it is not required to expand along any particular row (or column), for
we have
k
|A| = δi,j ai,j |Ai,j | .
j=1
36
Since we know that addition of multiples of rows (or columns) to a row (or
column) does not change the value of a determinant, we note that it may be a
good strategy to do such additions first before expanding a determinant.
Finally, for 3 × 3 matrices, there is a diagonal process that parallels the 2 × 2
case. But it must be emphasised that this does not extend to higher orders.
We consider diagonals that go down and right to be positive, and those that
go up and right to be negative.
a b
|A| = = ad − bc
c d
⎡ ⎤ ⎡ ⎤
a (b) (a) b
= ⎣ ⎦ − ⎣ ⎦;
(c) d c (d)
a b c
|A| = d e f
g h i
which we write as
Ax = p .
This has a unique solution
|A1 | |A2 | |A3 |
x = , y = , z = ,
|A| |A| |A|
if and only if |A| = 0, where A1 is the matrix obtained for A by replacing the
elements in the first row by the elements of p, A2 is the matrix obtained for A
by replacing the elements in the second row by the elements of p, and A3 is the
matrix obtained for A by replacing the elements in the third row by the elements
of p.
This result (which is also true for a system of n linear equations in n vari-
ables) is called Cramer’s Rule.
Problem 58 Prove Cramer’s Rule for a system of two linear equations in two
variables.
Problem 59 Prove Cramer’s Rule for a system of three linear equations in three
variables.
ax + by + cz = 0, (35)
px + qy + rz = 0. (36)
These tell us that (x, y, z) is perpendicular to (a, b, c) and (p, q, r), so that (x, y, z)
is proportional to the cross product of (a, b, c) and (p, q, r). This is system is
satisfied by
(x, y, z) = (br − cq, cp − ar, aq − bp) .
aX + bY = −c ,
pX + qY = −r .
Aw = v ,
a b X −c
where A = ,w = and v = .
p q Y −r
38
6.19 Conics
The general equation of a conic
can be written as
⎛ ⎞⎛ ⎞
(x, y, 1) a h g x = 0,
⎝ h b f ⎠⎝ y ⎠
g f c 1
or
xT A x = 0 .
(Reminder: xT means the transpose [interchange of rows and columns] of the
[column] matrix x.)
There are two ways of remembering the positions of the coefficients of the
terms in x and y. One is that they follow alphabetical order as one travels down
the main diagonal, and returns to the starting position via the sides.
a ← h← g
↑ ↑
h b f
↑ ↑
g ← f ← c
can be solved uniquely for u and v. With these values, we obtain (38) with
A = a ,, H = h, B = b,
ab − h2 (gu + f v + c)
= ab − h2 + (hu + bv)(af − gh) + (au + hv)(bg − f h)
= abc − ch2 − af 2 − bg 2 + 2f gh ,
X = −2(gx + f y) ,
Y = ax + by ,
and that ab − h2 = 0. The next step is to rotate the coordinate axes to obtain a
form without an “xy” term. So, we let
X = x cos θ − y sin θ ,
(41)
Y = x sin θ + y cos θ .
This is equivalent to
x = X cos θ + Y sin θ ,
(42)
y = −X sin θ + Y cos θ .
The axis X = 0 in the new system corresponds to the line y = x cot θ in
the old system, and the axis Y = 0 in the new system corresponds to the line
y = −x tan θ in the old system.
We can write this in vector–matrix from as
⎛ ⎞ ⎛ ⎞
X x
(X Y 1) = (x y 1)R and ⎝ Y ⎠ = R−1 ⎝ y ⎠ ,
1 1
where ⎛ ⎞
cos θ sin θ 0
R = ⎝ − sin θ cos θ 0 ⎠.
0 0 1
Substituting (41) into ax2 + 2hxy + by 2 + c = 0 yields
The only case in which the rotation of axes is indeterminate occurs when
h = a − b = 0. Here, equation (40) has the form a(x2 + y 2 ) + c = 0, and this is
the equation of a circle when ac < 0, and has a null locus when ac > 0 or a = 0.
When a = 0 and c = 0, we have the point (0, 0).
We now consider equation (43). If A, B and C = 0 all have the same sign,
then the locus is null. If A and B have the same sign and C has the opposite sign,
then the locus is an ellipse. If A and B have opposite signs and C = 0, then the
locus is a pair of intersecting straight lines. If A and B have opposite signs and
C = 0, then the locus is a hyperbola.
Because of the invariance under translation and rotation of axes, we can describe
the locus of (37) without actually having to effect the transformations.
y 2 + 2px + q = 0 ,
⎛ ⎞
0 0 p
⎝ 0 1 0 ⎠.
p 0 q
The determinant of this matrix is −p2 . So, if p = 0, we have parallel lines and if
p = 0, we have a parabola.
In summary
42
˛ ˛
˛ a h g ˛ ˛ ˛
˛ ˛ ˛ a h ˛˛
˛ h b f ˛ ˛ Conic
˛ ˛ ˛ h b ˛
˛ g f c ˛
Problem 60 Investigate the inability of the above to distinguish the Parabola for
the parallel line pair of lines equidistant
⎛ from the
⎞ origin by considering the co-
a h g
factors of other terms in the matrix ⎝ h b f ⎠.
g f c
Horses
The proof fails because the inductive step is not valid for the step from k = 1 to
k = 2. The inductive step depends on there being at least three horses in P (k + 1).
ATOM
A Taste Of Mathematics / Aime-T-On les Mathématiques
www.cms.math.ca/Publications/books
or contact:
ou contacter :
CMS/SMC Publications
577 King Edward, Ottawa (Ontario) Canada K1N 6N5
Email: [email protected]
Courriel : [email protected]
Canadian Mathematical Society / Société mathématique du Canada
577 King Edward, POB/CP 450-A
Ottawa, Ontario, Canada K1N 6N5
www.cms.math.ca