Real Analysis
Real Analysis
P. Ouwehand
Department of Mathematics and Applied Mathematics
University of Cape Town
Note to the Student
These notes are a very rough rst draft for a short course in Real Analysis at the under-
graduate level. This is the rst time that I am teaching this particular course, and Im still
thinking hard about how to present the material; Im likely to change my mind at short
notice. At present, these notes are unnished, i.e. still being written. There is no guarantee
that you will be provided with a nished product by the end of this course, though I will
try to do so. These notes are therefore meant as a supplement to the notes you take in
class, and are not a substitute. Expect mistakes, but note that though all mistakes are my
fault, its your responsibility to nd and correct them.
How will you do that? Go to the library, which houses many books on real analysis.
Two books that you will nd particularly useful are Principles of Mathematical Analysis,
by Walter Rudin, and The Elements of Real Analysis, by Robert G. Bartle.
This course is but thirty lectures long. There is more material in the notes than can
be covered in class, and many sections can be safely ignored. (Already the preliminary
Chapter 0 is ridiculously long, and needs some serious editing.) What you need to know,
and what you can omit, will be made clear in lectures.
The content of the course remains similar to what it has been in previous years, though
the perspective has shifted slightly: More emphasis is placed on the the importance of sets
of reals, and on topological notions. Im also hoping to tackle some additional topics, such
as the RiemannStieltjes integral, if time permits.
Peter Ouwehand
June 2004
i
Contents
0 Preliminaries 1
0.1 What is Analysis? . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
0.2 Basic Set Theory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
0.2.1 Operations on sets . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
0.2.2 Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
0.2.3 Relations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
0.2.4 Countable and Uncountable Sets . . . . . . . . . . . . . . . . . . . 25
0.3 Prelude to an Axiomatic Development of the Real Number System . . . . . 32
0.3.1 Why we need Axioms . . . . . . . . . . . . . . . . . . . . . . . . . . 32
0.3.2 A Brief Note on the Philosophy of Mathematics
. . . . . . . . . . 35
0.3.3 Logic, Formal Languages, Quantiers . . . . . . . . . . . . . . . . . 38
1 An Axiomatic Development of the Real Number System 45
1.1 Fields and Arithmetic . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45
1.2 Ordered Fields . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 53
1.3 The Continuum . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 56
1.4 The Completeness Axiom . . . . . . . . . . . . . . . . . . . . . . . . . . . 60
1.5 Construction of the Set of Reals
. . . . . . . . . . . . . . . . . . . . . . . 65
2 The Geometry and Topology of R
n
67
2.1 The Geometry of R
n
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 67
2.2 Some Inequalities in R
n
. . . . . . . . . . . . . . . . . . . . . . . . . . . . 69
2.3 Sets in R
n
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 74
2.4 Sets in R
n
: Open and Closed Sets . . . . . . . . . . . . . . . . . . . . . . . 78
2.5 The BolzanoWeierstrass Theorem . . . . . . . . . . . . . . . . . . . . . . 85
2.6 Sets in R
n
: Compact Sets . . . . . . . . . . . . . . . . . . . . . . . . . . . 87
A The Place of the Reals within Mathematics 93
iii
Chapter 0
Preliminaries
0.1 What is Analysis?
Roughly speaking, analysis deals with numbers, sets of numbers, and operations on num-
bers. It is particularly concerned with what happens if certain operations are performed
an arbitrarily large number of times, perhaps innitely often.
These days we perform most calculations on a computer. Now a computer can handle
only rational numbers: Each number is stored using only a nite number of bits, 0 and 1,
and thus necessarily rational. For example,
101.11
binary
= (1)(2
2
) + (0)(2
1
) + (1)(2
0
) + (1)(2
1
) + (1)(2
2
) =
23
4
It is clear, therefore, that any number expressed in nitely many bits is equal to
integer
power of 2
,
and thus necessarily rational.
Since practically all our calculations are handled by computers, and since computers
handle only rational numbers, it would seem that the set of rational numbers is suciently
rich for all our calculations. However, we can imagine an operation being performed in-
nitely often, something that a computer cannot do. Allowing the innite to creep into
our operations results in the creation of something new, namely irrational numbers.
For example, start with 1 and perform the following operations over and over: add 1,
invert the result, and then add 1, i.e.
x
0
= 1
x
n+1
=
1
x
n
+ 1
+ 1 for n 1
Each x
n
is a rational number (i.e. a ratio of integers). If we perform this operation innitely
often, we get
2, an irrational number
1
.
1
A proof that
2 is irrational, i.e. not the ratio of two integers, will be provided shortly.
1
2 Basic Set Theory: Operations on Sets
Also consider the following pseudocode:
LET X = 1;
LET Y = 1;
FOR N = 1 TO
LET Y = Y/N;
LET X = X +Y ;
PRINT X;
Of course, the output is just
n=1
1
n!
= e Thus this algorithm starts with two rational
values for X and Y , and uses only the operations of addition and division. Both these
operations preserve rational numbers, yet the output of this algorithm is an irrational
number.
In the rst example, we took the limit of a sequence of rational numbers, and in the
second a limit of a sum of rational numbers. The concept of limit captures the notion
performing an operation innitely often. The rational numbers are not suciently rich to
handle limits, forcing us to extend the number system to also include irrational numbers.
Thus the set of real numbers is in essence obtained from the set of rational numbers by
allowing the taking limits.
The notion of limit is fundamental to analysis, and many of the results we prove in these
notes about the set of real numbers are simply not true for the set of rational numbers.
Most of the fundamental concepts of calculus involve limits.
A derivative is a limit:
df
dx
= lim
h0
f(x +h) f(x)
h
A Taylor series is a limit:
e
x
=
k=1
x
k
k!
= lim
n
n
k=1
x
k
k!
If we write p
n
(x) =
n
k=1
x
k
k!
, then each p
n
(x) is a polynomial. Thus we have here a sequence of
polynomials whose limit is not a polynomial. Again, the taking of limits has created a new kind of
object.
Similarly, every Fourier series is a limit of sums.
A denite integral is a limit: If f is continuous on the interval [a, b], then
_
b
a
f(x) dx is a limit of
lefthand sums
_
b
a
f(x) dx = lim
x0
[
ba
x
]
k=1
f(a +kx)x
Here [y] denotes the greatest integer less than or equal to y.
Continuity is dened in terms of a limit: A function f is continuous at a point x
0
if and only if
lim
h0
f(x
0
+h) = f(x
0
).
Preliminaries 3
0.2 Basic Set Theory
Because it became accepted in the 20th century that, in principle, mathematical objects
should be sets and mathematical notions should be expressible as relationships between
sets, every mathematician needs just a little set theory. The material in this section is not
dicult, and no doubt you have seen it all before. We include it merely as a reminder and
to x notation.
Intuitively, a set is just a collection of objects.
If A is a set and x is some mathematical object, we say that
x A (x is an element of A)
if x is amongst the objects collected in A, and we write
x , A
if it isnt.
The idea is that a set is characterized entirely by its elements. Thus if two sets A and B
have exactly the same elements, then we must have A = B. For example, the sets A = a
and B = a, a have the same elements, namely only a. Thus A = B. The fact that B
seems to have two copies of A is immaterial.
For the philosophically minded: This means, for example that
Evening Star = Morning Star
as both sets are equal to the planet Venus. Yet the Evening Star is seen only in the evening, whereas
the Morning Star is seen only in the morning. . .
Instead of set, we will also sometimes say class, collection or family; instead of saying
x is an element of A we will sometimes say x is a member of A or x belongs to A.
There are two ways to represent sets: (i) by listing its elements, and (ii) by some
dening property. For example, if a set A has nitely many elements a
1
, . . . , a
n
then it can
be represented by A = a
1
, a
2
, . . . , a
n
. On the other hand if A is the set of all x having a
certain property P(x), then A can be denoted by A = x : P(x).
Example 0.2.1 The set A of all integers between -1 and 3 can be represented in two ways:
(i) A = 1, 0, 1, 2, 3
(ii) A = n : n is an integer and 1 n 3
We may also write B A instead of A B; they mean the same thing (just as x y and
y x mean the same thing).
Remarks 0.2.4 Note that A = B if and only if A B and B A.
Further note that
N Z Q R C
Preliminaries 5
0.2.1 Operations on sets
There are several ways of combining sets to form new sets. In this section we dene and
give some examples of the setoperations union, intersection, dierence, complementation,
cartesian product and power set formation.
Denition 0.2.6 (Union, intersection and dierence of two sets)
Suppose that A, B are sets.
(a) The union of A and B is the set of all elements which are either in A or in B (or both).
A B = x : x A or x B
(b) The intersection of A and B is the set of all elements which belong to both A and B.
A B = x : x A and x B
(c) The set dierence of A and B is the set of all elements which belong to A, but not to
B.
A B = x : x A and x , B
Two sets A, B are said to be disjoint if they have no members in common, i.e. if
A B = . In that case, A B = A, B A = B.
Often we work within some universe, which is just the set of all objects under consid-
eration at that time. The sets that we deal with are then typically subsets of the universe.
Which set is the universe depends very much on context. If one is dealing with real numbers, the obvious
choice of universe is R, but if one is dealing with complex numbers as well, then it would be C. If one is
trying to nd the solution of an n
th
order dierential equation, then the universe will generally be the set
of all ntimes dierentiable functions.
Given a universe, we also have a unary operation on sets, called complementation.
Denition 0.2.7 Let the universe be , and let A . The complement of A is the set
of all elements in the universe which are not in A.
A
c
= x : x , A
Note that A
c
= A. Also note that A B = A B
c
.
Exercise 0.2.8 Show that A, B are disjoint if and only if A B
c
.
A set is completely determined by its elements. The order in which those elements
are arranged does not matter. For example, a, b = b, a. When we want the order to
matter, we have to deal with ordered tuples. An ordered pair is denoted by (a, b), and
should be thought of as a collection containing a and b, in that order. Thus (a, b) ,= (b, a).
Note that
(a, b) = (c, d) a = c and b = d
Generally, an ordered ntuple is denoted by (a
1
, a
2
, . . . , a
n
), and should be thought of as a
collection containing a
1
, a
2
, . . . , a
n
, in that order.
The pair (a, b) is often dened to be the set a, a, b. You can check that this denition yields the
required property that (a, b) = (c, d) i a = c and b = d.
(a, b, c) is then dened to be (a, (b, c)) (which is just the set a, a, b, b, c), etc. This is in keeping
with the notion that all mathematical objects should be sets. On rst encounter, however, you might nd
this arbitrary, clumsy, and unnecessary, and you wouldnt be far wrong: The main thing that you need to
keep in mind is that an ordered tuple is a collection in which the order matters.
Using ordered tuples, we can dene one more way of making new sets from old:
Denition 0.2.11 (Cartesian product) Suppose that A
1
, A
2
, . . . , A
n
are sets. The carte-
sian product of A
1
, . . . , A
n
is the set of all ntuples (a
1
, . . . , a
n
), with each a
k
A
k
.
A
1
A
2
A
n
= (a
1
, a
2
, . . . , a
n
) : a
k
A
k
for k = 1, 2, . . . , n
Exercises 0.2.16 (1) Draw the following sets in the xyplane (i.e. R
2
):
(i) 1, 2, 3 3, 4, 5
(ii) 1 [0, 1]
(iii) [0, 1] 1
(iv) (0, 1] [2, 3)
Preliminaries 9
(2) Describe the set [0, 1] [0, 1] [0, 1].
(3) Consider the cylinder of unit radius about the zaxis in R
3
:
c = (x, y, z) : x
2
+y
2
= 1
Represent c as a product of two sets.
Thus far, we have considered union, intersection and cartesian product as binary op-
erations, involving just two sets. Frequently, however, we may need to consider these as
innitary operations: We can, for example, take the union of innitely many sets. We
dene the union, intersection and cartesian product of a family of sets as follows:
Denition 0.2.17 (Union, intersection and product of a family of sets)
If / = A
i
: i I is a family of sets, we may dene
(a) the union
_
/ =
_
iI
A
i
= x : x A
i
for some i I
(b) the intersection
/ =
iI
A
i
= x : x A
i
for all i I
(c) the cartesian product
/ =
iI
A
i
= (a
i
)
I
: a
i
A
i
for all i I
Here (a
i
)
I
is a generalized tuple, indexed by I.
In essence, (a
i
)
I
is a function with domain I and range
iI
A
i
. We will return to this later.
We will frequently write
I
A
i
or
i
A
i
instead of
iI
A
i
. We will also write
n=1
A
n
instead
of
nN
A
n
. The same holds for
and
n=1
A
n
and
n=1
A
n
.
(2) Let B
r
= x R
3
: [x[ r. Calculate
r(0,1]
B
r
and
r(0,1]
B
r
.
n=1
mn
A
m
(b) We dene the limit inferior of the sets (A
n
) by
liminf
n
A
n
=
n=1
mn
A
m
Here is another way of making new sets from old: Given a particular set, one should
be able to collect all of its subsets together into a new set, called the power set.
Denition 0.2.22 (Power set)
If A is a set, then the power set of A is the set of all subsets of A.
T(A) = B : B A
Note that , A T(A). They are, respectively, its smallest and biggest members.
Example 0.2.23 Let A = 1, 2, 3. Then the powerset of A is the 8element set
T(A) = , 1, 2, 3, 1, 2, 1, 3, 2, 3, 1, 2, 3
Preliminaries 11
a, where C
Exercises 0.2.25 Prove the above proposition again, using the binomial theorem.
[Hint: Recall that the binomial coecient
_
n
m
_
=
n!
m!(nm)!
describes the number of ways in which m
objects can be chosen from a collection of n objects.For example , there are
_
20
11
_
ways of choosing a soccer
team from a group of twenty individuals. Also recall the binomial theorem:
(a +b)
n
=
n
m=0
_
n
m
_
a
m
b
nm
Use these facts to prove that if A is a set with n elements, then T(A) is a set with 2
n
elements.]
Preliminaries 13
For example, the points (0, 0), (1, 2), (2, 16), (3, 54) belong to the graph. Now we assert
that a function is its graph. Thus the function f(x) = 2x
3
is nothing but the set (x, y) :
y = 2x
3
R R.
Youve already met more than just a few functions in your mathematical education up to date. The
most obvious ones are functions from R
n
to R
m
, such as f(x) = x
2
, g(x, y) = sin(x
3
+ y), h(x, y, z) =
(xy, xlnz), etc. Here are a few more that you might not yet have considered as functions:
Examples 0.2.27 (a) Dene Z
f
T(Z) by: f(n) = m : m divides n. Then f is a function which
maps a number to a set. For example,
f(12) = 1, 2, 3, 4, 6, 12 = f(12)
(b) Let c
0
(R, R) = f : f is a continuous map from R to R, and let a b R. Then
_
b
a
: c
0
(R, R) R
is a function which assigns to every continuous map its denite integral.
(c) Let c
1
(R, R) be the set of all maps from R to R which have continuous rst derivatives. Then the
derivative operator is a map D : c
1
(R, R) c
0
(R, R).
(d) curl is a map from the set of vector elds on R
3
to itself. div is a map from the set of vector elds on
R
3
to the set of functions on R
3
R. grad is a map from the set of dierentiable functions R
3
R
to the set of vector elds on R
3
.
(e) An n m matrix A can be regarded as a map from A : R
m
R
n
.
(f) Addition and multiplication are functions from R
2
to R. Addition can, in fact, be described by the
1 2matrix (1 1), for (1 1)
_
a
b
_
= a +b.
(g) If is a universal set, then union and intersection can be regarded as functions from T() T() to
T(), which map the ordered pair (A, B) to A B and A B respectively.
(h) We can also regard the bigger version
of union as a map, but this time we have
: T(T())
T(). It assigns to any family of subsets of its union. (Note that a family of subsets of is just a
set of elements of T(), i.e. it is a subset of T(), and therefore an element of T(T()).) The same
goes for intersection.
For any set A, there is an important function on A called the identity function. It is
denoted by id
A
, and is dened by
id
A
: A A id
A
(a) = a
Thus id
A
= (a, a) : a A.
Examples 0.2.28 (a) The identity function on R is just the function y = x.
(b) The identity function on R
n
is the identity matrix
I
n
=
_
_
_
_
_
_
_
1 0 0 . . . 0
0 1 0 . . . 0
0 0 1 . . . 0
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
0 0 0 . . . 1
_
_
_
_
_
_
_
as
follows:
f[A
is a map from A
Remarks 0.2.31 A function f : A B is injective if no two distinct members of a map to the same
b B, i.e. if every b B has at most one preimage.
f is surjective if and only if every b in B gets mapped onto by some a A, i.e. if every b B has at least
one preimage. In that case B is the range of f, i.e. ran(f) = codom(f).
f is a bijection if and only if every b B has exactly one preimage.
It should be clear that there is a bijection from a nite set A to another set B if and only if A and B
have the same number of elements.
Note that the composition does in one step what f and g do in two:
A
f
B
g
C a
f
f(a)
g
g(f(a))
A
gf
C a
gf
g(f(a))
Also note that g f means:
Do f rst, then g
i.e. the last shall be rst.
An often used fact is that composition is an associative operation on functions, i.e.
h (g f) = (h g) f
By this equation we mean that: one side is dened if and only if the other side is dened,
and in that case they are equal.
For if A
f
B, B
g
C, and C
h
D, then h (g f) is a function from A to D which
works as follows: First do g f, then do h. But to do g f, you must rst do f, then g.
The combined result is
First do f, then g, and then h: (h (g f))(a) = h(g(f(a)))
Similarly, (h g) f is a function from A to D which works as follows: First do f, then
h g. But to do h g, you must rst do g, then h. The combined result is therefore
First do f, then g, and then h: ((h g) f)(a) = h(g(f(a)))
and thus h (g f) = (h g) f, as claimed.
Example 0.2.34 Consider the following functions (note their domains and codomains):
R
f
R
+
: x x
2
+ 1
R
+
g
R
+
: y
y
R
+
h
[1, 1] : z sin(z)
Then
R
gf
R
+
: x
x
2
+ 1
R
+1
hg
[1, 1] : y sin(
y)
and thus
R
h(gf)
[1, 1] : x sin(
x
2
+ 1)
R
(h)gf
[1, 1] : x sin(
x
2
+ 1)
16 Basic Set Theory: Functions
Preliminaries 17
Note that if f
1
exists, then
f
1
(b) = a if and only if f(a) = b
Proposition 0.2.37 A function f : A B is invertible if and only if it is a bijection.
Proof: Suppose that f is invertible, i.e. that f
1
exists. Then f
1
is a function from B
to A. We rst show that f is surjective: Let b B. Since the domain is B, f
1
(b) must
be dened, i.e. there must be some a A such that f
1
(b) = a. But then f(a) = b. Hence
every b B has a preimage.
Next we show that f is injective. For suppose that f(a
1
) = f(a
2
) = b. Then f
1
(b) = a
1
and f
1
(b) = a
2
. Since f
1
is a function, we must have a
1
= a
2
(check the denition of
function), and hence f is injective.
This proves that if f is invertible, then f is a bijection.
Now we prove the converse. If f is a bijection, then it is onto B. Hence for every b B
there is some a A such that f(a) = b. Moreover, since f is one-to-one, that a has to be
unique. So we may dene f
1
(b) to be the unique a such that f(a) = b. This makes f
1
into a welldened function f
1
: B A.
x.
(b) The function f(x) = x
2
does not have an inverse, since it is not a bijection. However,
if we restrict f to the nonnegative reals, then f[R
+
is a bijection. Its inverse is the
square root function.
(c) The function f : R (0, +) dened by f(x) = e
x
is bijective. Its inverse is the
natural logarithm ln x.
(d) The function sin x is neither injective, nor surjective; however, if we restrict sin x and
regard it as a function [
2
,
2
] [1, 1], then it is a bijection, and its inverse is
arcsin x.
(e) If A is an n nmatrix, regarded as a function on R
n
, then A has an inverse function
if and only if A has an inverse matrix. Since composition is just matrix multiplication,
the inverse function of A is just the inverse matrix A
1
.
x ,=
1
x
3
.
18 Basic Set Theory: Functions
The number x
1
=
1
x
is the inverse of x under the operation of multiplication, in that
x x
1
= 1 x
1
x = 1
noting that 1 is the identity for multiplication.
The function f
1
is the inverse of f under the operation of composition, in that
f f
1
= id f
1
f = id
noting that id is the identity for composition.
The same notation for inverse, i.e.
1
, refers to dierent operations, so theres no reason to believe that
there is any relationship between them.
We have already noted the confusion that may possibly arise by the two uses of the
symbol
1
. We have but few symbols at our disposal, and many of them must therefore
serve more than one function. Thus you must always be aware of the context in which a
particular symbol is used.
You have to do this when using ordinary language: You know in what sense the newspaper headline
School kids make great snacks at fund raiser
is meant, even though the other sense oers greater amusement value.
I say this because we are about to add to the possible confusion. With every function
f : A B (not necessarily invertible), we can associate two new functions between the
power sets of A and B
f[] : T(A) T(B) : A
b B : There is a
) = b where A
A
f
1
[] : T(B) T(A) : B
a A : f(a) B
where B
B
Preliminaries 19
Thus f[] assigns to each subset A
of A a subset f[A
] B. Similarly, f
1
[] transforms
each subset B
of B into a subset f
1
[B
] A.
We will, for the moment, use square brackets to distinguish the various functions, but will
drop this convention later. Which function is meant will be clear from context. We shall
also call f[A
along f, and f
1
[B
along f.
Note that
f[A
whereas
f
1
[B
Similarly,
b f[A
) = b
Examples 0.2.43 (a) Suppose that f : R R : x x
2
. Then
f[1, 2] = [0, 4] , f[Z] = 0, 1, 4, 9, . . . , f[4] = 16
Also
f
1
[0, 1] = [1, 1], f
1
[4] = 2, 2, , f
1
[4] =
In each case, a set is transformed into a set.
(b) Suppose that A = a
1
, a
2
, a
3
, B = b
1
, b
2
, b
3
, and that f : A B is dened by
f(a
1
) = f(a
3
) = b
1
, and f(a
2
) = b
3
. Then
f[a
1
] = f[a
3
] = f[a
1
, a
3
] = b
1
, f[a
2
] = b
3
, f[A] = b
1
, b
3
, f[] =
and
f
1
[b
3
] = a
2
, f
1
[b
2
] = f
1
[] = , f
1
(B) = f
1
[b
1
, b
3
] = A
A, B
B.
(a) Show that A
f
1
[f[A
]]
(b) Show that B
f[f
1
[B
]]
20 Basic Set Theory: Relations
(c) Show that A
= f
1
[f[A
= f[f
1
[B
, and so
b B
.
(c) If a f
1
[f[A
]. Thus there is a
).
But since f is injective, a = a
, and so a A
.]
2. Inverse images preserve the set operations: Let f : A B, and suppose that G, H are
subsets of B. Then
(a) If G H, then f
1
[G] f
1
[H];
(b) f
1
[G H] = f
1
[G] f
1
[H];
(c) f
1
[G H] = f
1
[G] f
1
[H];
(d) f
1
[GH] = f
1
[G] f
1
[H];
3. Direct images are not quite so well behaved: Let f : A B, and suppose that G, H A.
(a) Suppose that G H. Show that f[G] f[H];
(b) Show that f[G H] = f[G] f[H];
(c) Show that f[G H] f[G] f[H];
(d) Give an example to show that we may not have f[G H] = f[G] f[H];
(e) Show that f[G] f[H] f[GH] f[G];
(f) Give an example to show, in (e), that both s may fail to be =s.
We end this section with some notation: Suppose that A, B are nite sets, and that A
has n elements, and B m elements. How many functions are there from A to B?
For each a A we have m choices for the value f(a) B. Thus there are m
n
functions
from A to B. For that reason
Denition 0.2.45 Let A, B be sets. Then we dene
B
A
= set of all functions from A to B
Some authors use
A
B instead of B
A
.
A
= (a, a) : a A
Examples 0.2.49 (a) If X is the set of all people, and if P is the relation parent of, then P
1
is the
relation child of.
(b) Similarly, if P is the relation parent of, then P P is the relation grandparent of: For a(P P)c
if and only if there is b such that aPb and bPc.
(c) Moreover, S = P P
1
X
is the relation sibling of: For a(P P
1
)c if and only if there is a b
such that a P
1
b Pc, i.e. i a is the child of b and b is the parent of c, i.e. i a, c have a common
parent. Thus (a, c) S implies that a, c have a common parent. Since (a, c) S implies (a, c) ,
X
,
we see that a ,= c, and thus that a, c are brother and/or sister.
(d)
1
= , since b a if and only if a b.
(e) n divides m if and only if m is a multiple of n. Thus the multiple of relation is the inverse of the
divisor of relation.
(f) Perpendicularity between vectors is its own inverse, i.e.
1
=: x y i y x.
Preliminaries 23
Exercises 0.2.50 1. Let W be the set of all women, and let S, M be relations from W to W described
as follows: aSb i a us a sister of b; aMb i a is a mother of b. Describe
(a) M S;
(b) (M S)
1
;
(c) S
1
and M
1
;
(d) S
1
M
1
2. Suppose that R is a relation from A to B and that S is a relation form B to C. Show that
(S R)
1
= R
1
S
1
Lets take a look at equivalence relations from another angle: They are very closely related to partitions.
Denition 0.2.53 Let A be a set. A family / = A
i
: i I is called a partition of A provided that
(i) The A
i
are mutually disjoint, i.e. if i ,= j, then A
i
A
j
= for all i, j I.
(ii)
I
A
i
= A
Thus A
i
: i I is a partition of A provided that every element of A belongs to exactly one A
i
. If
A
i
: i I is a partition of A, then we can dene an equivalence relation on A by:
a b a, b belong to the same A
i
Exercise 0.2.54 Prove that is an equivalence relation.
On the other hand, if is an equivalence relation on A, then behave roughly like =. When we lump
together all elements that are the same under , we get an equivalence class.
Denition 0.2.55 Let be an equivalence relation on A. For each a A, dene the equivalence class
E(a) of a as follows:
E(a) = b A : a b
Note that E(a) = E(b) if and only if a b. If a , b, then E(a) E(b) = . Thus the sets E(a) are
either equal or disjoint. Hence the set E(a) : a A is a partition of A.
Exercise 0.2.56 Verify the above statements.
Examples 0.2.57 (a) If is the identity relation on A, then the equivalence classes are singletons:
E(a) = a.
(b) Suppose that is congruency modulo 3. Then the equivalence classes are A
1
= . . . , 6, 3, 0, 3, 6, . . . ,
A
2
= . . . , 5, 2, 1, 4, 7, . . . and A
3
= . . . , 4, 1, 2, 6, 8, . . . . The elements of A
1
leave remainder
0 when divided by 3, those of A
2
leave remainder 1, and those of A
3
leave remainder 2. Note that
A
1
, A
2
, A
3
are mutually disjoint, and that A
1
A
2
A
3
= Z.
(c) Let be the equal length relation L
2
on R
3
. The equivalence classes are spheres centred at the
origin.
Preliminaries 25
0.2.4 Countable and Uncountable Sets
In this section, we investigate the idea of the size or cardinality of a set. For nite sets,
we can determine the size of a set by counting its elements. Thus for example, the set
a, b, c has cardinality 3 (it has 3 elements). We are going to extend this idea of counting
to obtain the size to innite sets, and we will show that innity comes in many sizes.
Lets explore the idea of counting: For the moment, let n = 1, 2, . . . , n be the set
of the rst n natural numbers. To say that A = a, b, c has 3 elements is equivalent to
saying that there is a one-to-one correspondence between the sets A and 3. Indeed, this is
the heart of the idea of counting: When we count the elements of A, we are setting up a
bijection between A and 3. We go a rst, b second, c third. This is equivalent to a map
f : A
= 3 dened by f(a) = 1, f(b) = 2, f(c) = 3. Thus the idea of counting the elements
of a nite set X involves nding a bijection between X and some n. If there is a bijection
from X to n, then X has n elements.
Now for some reason, mathematicians often like to start counting at zero. In the
mathematical literature, the sets n are therefore often dened as
n = 0, 1, 2, . . . , n 1
This is the convention that we shall adopt henceforth.
It is obvious that two nite sets A and have the same size if and only if there is a
one-to-one correspondence f : A
= . We dont even have to count A and to know that
they have the same number of elements. If A = a, b, c, d and = , , , , then the
existence of the bijection f : A
= given by
f(a) = , f(b) = , f(c) = , f(d) =
is sucient to show that A and have the same number of elements. It doesnt tell us
that this number is 4.
Thus two sets have the same size if and only if there is a bijection between them; we can
bypass the idea of number. This is important, because we cannot actually count innite
sets. But we can establish bijective correspondences between innite sets. We shall adopt
this idea as our basic idea of size.
Denition 0.2.58 We dene an equivalence relation between sets as follows: If A, B
are sets, we say that A B if and only if there is a bijection from A to B. If A B,
we say that A and B have the same cardinality. We may also indicate this by saying
[A[ = [B[.
Note that having the same cardinality is an equivalence relation between sets, i.e. that
(i) [A[ = [A[ (Reexivity)
(ii) If [A[ = [B[, then [B[ = [A[ (Symmetry)
26 Basic Set Theory: Countable and Uncountable Sets
(iii) If [A[ = [B[ and [B[ = [C[, then [A[ = [C[ (Transitivity)
Exercise 0.2.59 Prove this assertion. (Note that the assertion is not obvious: When we
say that [A[ = [B[, we are not actually claiming that there are two equal numbers. What
we are saying is that there is a bijection from A to B. To prove (i), for example, you have
to nd a bijection from A to A.)
Examples 0.2.60 (a) Two nite sets have the same cardinality if and only if they have
the same number of elements.
(b) For nite sets, if A is a proper subset of B, then [A[ < [B[. This breaks down completely
for innite sets. Consider, for example, the sets N and Z. It is certainly true that
N Z. However, the map N
f
Z dened by
f(n) =
_
_
_
n
2
if n is even
n 1
2
if n is odd
is a bijection: f(1) = 0, f(2) = 1, f(3) = 1, f(4) = 2, f(5) = 2, f(6) = 3 . . . . (Note
that we are zigzagging from the positive integers to the negative integers.) Thus N
and Z have the same cardinality, even though N seems to contain fewer elements than
Z.
(c) We also have [Q[ = [N[. This can be seen as follows. Put the set of strictly positive
rational numbers Q
+
in an array
1/1 2/1 3/1 4/1 5/1 . . .
1/2 2/2 3/2 4/2 5/2 . . .
1/3 2/3 3/3 4/3 5/3 . . .
1/4 2/4 3/4 4/4 5/4 . . .
1/5 2/5 3/5 4/5 5/5 . . .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
We can then trace a zigzag path that moves through all the rational numbers as
follows. Start at the top line and move diagonally down to the left until you reach the
leftmost line. Repeat. We thus obtain a sequence
1
1
,
2
1
,
1
2
,
3
1
,
2
2
,
1
3
,
4
1
,
3
2
,
2
3
,
1
4
,
5
1
. . .
All of the strictly positive rational numbers occur in this sequence, and they all occur
innitely many times. For example,
1
1
,
2
2
,
3
3
. . . lie along the diagonal, and they are all
equal. To obtain a bijection from N to Q
+
, we follow the above sequence of rationals,
Preliminaries 27
but we omit any number that has already occurred to ensure that the function is
one-to-one, i.e. we prune away the repeated values. We therefore dene the function
N
f
Q
+
by
f(1) =
1
1
, f(2) =
2
1
, f(3) =
1
2
, f(4) =
3
1
, f(5) =
1
3
, f(6) =
4
1
, . . .
Note that f(5) ,=
2
2
, which is after f(4) =
3
1
in the sequence, because
2
2
=
1
1
has already
occurred as f(1). Then f is a bijection from N to Q
+
. Now even though we havent
found a formula for f, it is nevertheless a perfectly good function, and all its values
can be calculated. Can you see that f(16) =
2
5
?
In the same way, we can set up a bijection g from N to the negative rationals. Just put
g(n) = f(n). Finally, we can dene a bijection h : N Q using f, g and another
zigzag: We dene
h(1) = 0, h(2) = f(1), h(3) = g(1), h(4) = f(2),
h(5) = g(2), h(6) = f(3), h(7) = g(3), . . .
Again, we have no formula for h, but it is certainly a welldened function, and all its
values can be calculated. Check that h(23) =
1
5
.
(d) If A is any set, nite or innite, then T(A) 2
A
.(Recall that 2
A
is the set of all
functions from A to 2 = 0, 1). This can be seen as follows: If B A, dene the
indicator function (or characteristic function) I
B
: A B by
I
B
(a) =
_
1 if a B
0 else
Clearly I
B
= I
C
if and only if B = C, and so the map 1 : T(A) 2
A
dened by
1(B) = I
B
is an injection. Now suppose that 2
A
, i.e. A
0, 1. Dene a
subset B A by
a B (a) = 1
It is clear that 1(B) = I
B
= , and thus that 1 is surjective as well. This proves that
[T(A)[ = [2
A
[.
A very basic question that arises is the following: Are all innite sets countable? The
answer is No!
Example 0.2.63 We show that the unit interval I = [0, 1] is uncountable, i.e. that we
cannot nd an enumeration
I = x
n
: n N
The proof is by contradiction: Suppose that we can nd such an enumeration I = x
1
, x
2
, x
3
, x
4
, . . . ,
i.e that every real number in [0, 1] is equal to x
n
for some n. Now every number x
n
has a
decimal expansion of the form
x
n
= 0.x
n1
x
n2
x
n3
x
n4
x
n5
. . .
where x
nm
is the m
th
number in the decimal expansion of x
n
. Of course some real numbers
have two distinct decimal expansions, a terminating one and a nonterminating one. For
example, 1.0000 = 0.9999 . . . . We will choose the nonterminating decimal expansions
for our x
n
.
We now create a new real number x from the x
n
by a process called diagonalization.
We choose a
n
1, 2, . . . , 9 such that the following hold:
a
1
,= x
11
, a
2
,= x
22
, a
3
,= x
33
, . . . , a
n
,= x
nn
, . . .
To avoid a situation where we obtain a number x with a terminating decimal expansion,
we havent permitted a
n
= 0; this is just a technicality. We can now dene x: Put
x = 0.a
1
a
2
a
3
a
4
. . .
Here comes the heart of the argument: Clearly x I = [0, 1]. Now if I can be written as
a list x
1
, x
2
, x
3
, . . . , then there must be some n such that x = x
n
. But the rst decimal
place of x diers from the rst decimal place of x, since a
1
,= x
11
; hence x ,= x
1
. Similarly,
the second decimal place of x diers from the second decimal place of x
2
, since a
2
,= x
22
;
Preliminaries 29
hence x ,= x
2
. We can continue in this way to show that x ,= x
n
for any n N, i.e. x is
not on the list x
1
, x
2
, x
3
, . . . .
This proves the result! Given any list x
1
, x
2
, x
3
, . . . of real numbers in [0, 1], we now
have a technique for producing a new real number x that is not on the list. It thus follows
that no such list can contain all the real numbers in [0, 1], i.e. there is no bijection from N
to [0, 1].
Remarks 0.2.64 Cantor, who discovered the above argument for the uncountability of the reals, wrote
to a friend
I see it, but I dont believe it.
Hence there are uncountable sets. Clearly R is also uncountable, because otherwise we
could nd an enumeration r
1
, r
2
, r
3
, . . . of R. By omitting any reals which are not in
[0, 1], we could prune this into an enumeration of [0, 1].
The fact that R is uncountable causes much trouble in analysis. We shall see some
more examples of uncountable sets later on.
Denition 0.2.65 If A, B are sets, we say that the cardinality of A is less than or equal to the cardinality
of B, and write
[A[ [B[
if there is an injection from A into B. We write [A[ < [B[ if [A[ [B[, but [A[ ,= [B[, i.e. if there is an
injection from A to B, but no bijection.
The idea is that [A[ < [B[ if and only if A has fewer elements than [B[. Clearly the following holds:
Proposition 0.2.66 (a) If A B, then [A[ [B[.
(b) If [A[ [B[ and [B[ [C[, then [A[ [C[.
(c) If [A[ [B[, then [T(A)[ [T(B)[.
(d) If [A[ [B[, then [C
A
[ [C
B
[
In fact, the relation is a partial ordering between sets: Reexivity is obvious, and transitivity was
left to the exercise above. The main thing that needs to be shown is antisymmetry:
Theorem 0.2.68 (SchroderBernstein Theorem) Suppose that [A[ [B[ and that [B[ [A[. Then
[A[ = [B[.
This proposition shows that you cant make uncountable sets using nite products and countable
unions. You can, however, make uncountable sets using innite products and the powerset operation.
Proposition 0.2.70 Let A be a set. Then [A[ < [T(A)[ = 2
A
.
Proof: We know that for any set A, T(A) 2
A
, by Example 0.2.60(d). So it suces to show that
[A[ < [T(A)[. Now it is obvious that there is an injection from A into T(A): The map a a will do
the trick. Hence certainly [A[ [T(A)[. Suppose now that there is a bijection A
f
T(A), and dene
A
a
A by A
a
= f(a). Since a bijection is surjective, we must have T(A) = A
a
: a A. We shall now
show that this is impossible.
Note that A
a
A and that a A. Thus it may happen that a A
a
, or it may not. Dene B to be
the set of all a for which it does not happen, i.e. let
a B a , A
a
Preliminaries 31
The B A. Since the listing A
a
: a A is supposed to be a complete list of all the elements of T(A),
there must be some b A such that B = A
b
. However, if b B, then b , A
b
, and if b , B, then b A
b
.
Hence B cannot equal A
b
, since b belongs to one set, but not the other. The assumption that A
a
: a A
is a complete list of all the subsets of A therefore leads to a contradiction.
We omit the proof, which can be found in almost any textbook on set theory.
Exercises 0.2.72 (1) Prove that if A is uncountable and B is countable, then AB is uncountable.
(2) Prove that R [0, 1]. (Hint: Note that all non-empty nite intervals have the same cardinality as [0, 1].
First prove that all closed intervals have the same cardinality. If I is any nite interval, whether open,
closed, or halfopen, we can nd closed intervals I
1
, I
2
such that I
1
I I
2
. The SchroderBernstein
Theorem then implies that they all have the same cardinality. Now dene a map Z [0, 1)
f
R as
follows: If n Z and if x [0, 1), then dene
f(n, x) = n +x
This is clearly a bijection. Now [Z[ [[0, 1][ = [[0, 1)[, and therefore R Z [0, 1) [0, 1].)
Example 0.2.73 R 2
N
. Heres a clever way of seeing this: Every real number has a dyadic or binary
expansion, as opposed to a decimal expansion. The dyadic expansion uses only the numbers 0 and 1. For
example, if we have a dyadic number 101.011, this is
101.011
. .
dyadic
= 1 2
2
+ 0 2
1
+ 1 2
0
+ 0 2
1
+ 1 2
2
+ 1 2
3
= 5.375
. .
decimal
Hence every real number can be turned into a sequence of zeroes and ones, and vice versa. Now such a
sequence is essentially just a map from N to 2. For example, the sequence 1011001 . . . can be thought of as
the function f : N 2 which has f(1) = 1, f(2) = 0, f(3) = 1, f(4) = 1, f(5) = 0, f(6) = 0, f(7) = 1 . . . .
There are only two problems: (i) Where to put the decimal point, and (ii) Some real numbers have two
distinct dyadic expansions. For example,
1
2
= 0.1000 = 0.01111 . . . . However, if a real number has two
dyadic expansions, it is easy to see that the one must eventually end in all 0
s
, and the other must end in
all 1
s
. We call the former expansion terminating, and the latter expansion nonterminating.
We now overcome the two problems as follows: Since R [0, 1), it suces to show that 2
N
[0, 1).
Now given any x [0, 1), its nonterminating dyadic expansion x = 0.x
1
x
2
x
3
. . . will give us a sequence
x
1
, x
2
, x
3
. . . of zeroes and ones. This clearly gives us an injective map F : [0, 1) 2
N
. It is, however,
not a surjective map. But only the sequences that eventually end in all zeroes have been missed out, and
there are only countably many such. To be precise, if A = range F, then = 2
N
A is countable. Hence
[2
N
[ = [A [ = [A[ = [(0, 1][ = [R[.
32 Prelude to an Axiomatic Development of the Real Number System
Example 0.2.74 (The Cantor set) The Cantor set is a subset of [0, 1] which is constructed as follows:
Let C
0
= [0, 1]. It is a single interval of length 1. Now let C
1
be C
0
with its middle third removed, i.e.
C
1
= [0,
1
3
] [
2
3
, 1]. Thus C
1
consists of two disjoint intervals, each of length
1
3
. Now remove the middle
thirds of these two intervals to form C
2
, i.e. C
2
= [0,
1
9
] [
2
9
,
1
3
] [
2
3
,
7
9
] [
8
9
, 1]. Then C
2
is a disjoint union
of 4 intervals, each of length
1
9
. Continue in this way, removing the middle thirds of each of the intervals
comprising C
n
to form C
n+1
. It follows that C
n
consists of 2
n
intervals, each of length (
1
3
)
n
, and thus that
(C
n
) = (
2
3
)
n
. Finally, let C =
n=0
C
n
. C is the Cantor set.
How much of [0, 1] did we remove when we created C? First we removed an interval of length
1
3
, then
we removed 2 intervals, each of length
1
9
. After that, we removed 4 intervals, each of length
1
27
, etc. Thus
we have removed disjoint sets with a total length
2
0
3
1
+
2
1
3
2
+
2
2
3
3
+ =
1
3
k=0
_
2
3
_
k
which is a geometric series with sum
1
3
1
1
2
3
= 1. It seems, therefore, that we have removed the entire
length of the unit interval [0, 1]. There is no length left.
Nevertheless, C is not empty; in fact, C is uncountable. Here is one way to see this: Every real number
a [0, 1] can be written as an innite sum
i=1
a
i
3
i
, where a
i
= 0, 1 or 2. Thus the ternary expansion (as
opposed to decimal expansion) of a is 0.a
1
a
2
a
3
. . . . For example,
1
3
= 0.1000 . . . ,
5
9
=
1
3
+
2
9
= 0.1200 . . . ,
etc. A little thought will reveal that the Cantor set is formed by removing all numbers which have a 1
occurring in their ternary expansion. Thus C
1
is formed by removing all numbers which have a 1 in the
rst decimal place, C
2
is formed by removing all numbers in C
1
which have a 1 in the second decimal
place, and so on. Thus the Cantor set is just the set of all numbers a in [0, 1] which can be written as a
sum
i=1
a
i
3
i
, where a
i
= 0 or 2, but not 1. There is a bijection : 2
N
C dened as follows: If f 2
N
,
then (f) is the number with decimal expansion 0.a
1
a
2
a
3
. . . , where a
n
= 0 if f(n) = 0, and a
n
= 2 if
f(n) = 1. Hence [C[ = [2
N
[ = [R[.
2 = (q Q : q
2
2, q Q : Q
2
> 2)
(iii) John Horton Conway regards a real number as a game played by two individuals. I wont elaborate.
0.3.2 A Brief Note on the Philosophy of Mathematics
In the previous section, we got quite philosophical. Before we continue, its a good idea to
have a look at the main mathematical schools of thought. Our main concern is with the
schools of Platonism and Formalism. For completeness, I present brief and oversimplied
caricatures of these and other leading schools below. If you want an honest exposition,
youd better consult a book on the philosophy of mathematics.
36 Prelude to an Axiomatic Development of the Real Number System
Platonism: The belief that mathematical objects, though not part of the physical
universe, nevertheless have an existence which is independent of the human mind.
We speak of making mathematical discoveries, which suggests that we are somehow
able to observe mathematical objects.
Some people also speak about mathematical creations. Did da Vinci have the freedom to create a
Mona Lisa with a grimace, rather than a smile? Probably. He preferred a smile. But did Pythagoras
have the freedom to create a rightangled triangle for which his theorem that the square of the
hypotenuse is the sum of the squares on the rightangle sides fails?
Logicism: An attempt to reduce mathematics to logic. Frege and Russell are the
main protagonists. The fourvolume Principia Mathematica, by Russell and White-
head, is the most wellknown exposition of this school.
Constructivism: Constructivists require that, in order to show that a mathematical
object exists, one must explicitly show how to construct it. This leads them to reject
of the Law of the Excluded Middle, which states that for any statement , either is
true, or not- is true, i.e. theres no middle between and not-. It also leads to
the rejection of proofs by contradiction. There are many varieties of constructivism;
the most wellknown is Intuitionism, whose main proponent was Brouwer.
For example, suppose that S is a set, and that is a property. In classical logic, the following
statement is true:
Either there is a member of S that has property ,
or every member of S has property not-. ()
For example, consider the Riemann Hypothesis, which states that all the (complex) roots of the
Riemann function have a real part equal to
1
2
:
1 +
1
2
z
+
1
3
z
+ = 0 = Re(z) =
1
2
This is currently the unsolved problem in mathematics. For the classical logician, the Riemann
Hypothesis is either true or false we just dont know which. Not so for the constructivist: To
say that it is either true or false, we must either prove that it is true, or show that it is false. So
the constructivist does not accept (). For this statement to hold, we must either show how to
construct a member of S with the property , or we must show that each member of S has the
property not-.
If S is a nite set, we could, in principle, look at each of the elements of S in turn, to see if it satises
. If S is innite, however, this is generally not possible. Constructivists are happy to apply the
Law of the Excluded Middle to nite sets; its application to innite sets they regard as a colossal
mistake an unwarranted and unjustiable extrapolation of methods of reasoning designed for the
nite to the innite. Indeed, some constructivists deny the existence of innite objects altogether.
As another example, suppose that we want to prove that every real cubic polynomial p(x) =
x
3
+ax
2
+bx+c has a real root. One way to do it is to appeal to the Intermediate Value Theorem:
We see that p(x) > 0 for all suciently large positive x, and thus that p(x) lies above the Xaxis,
for all suciently large positive x. Similarly, p(x) < 0 for all suciently large negative x, so that
p(x) lies below the Xaxis, for all suciently large negative x. Hence, since p(x) is continuous,
Preliminaries 37
there must be a place where p(x) cuts the Xaxis, and that place would be a root of p(x). This
proof is nonconstructive: Weve shown that there is a root, but we havent shown how to nd it.
Formalism: This school of thought dates back to Hilbert in the late 19
th
century.
At that time, certain paradoxes in set theory shook the foundations of mathematics,
and mathematicians were suddenly confronted with the possibility that their subject
is inconsistent, i.e. selfcontradictory.
The most famous of these is Russells paradox. If we admit a naive concept of set a set is any
old collection of objects then it is possible for a set to belong to itself. For example if
A = The set of all objects that can be dened in English using fewer than twenty words
then A A, because weve just dened A using fewer than twenty words. Now consider a set of
sets R, dened as follows:
A R i A , A
Since R is a set, we may legitimately ask if it belongs to itself. By denition of R, we see that
R R i R , R
If R belongs to R, then it doesnt; and if R does not belong to R, then it does! This paradox,
usually credited to the logicist Russell in 1899??, but already noted by Zermelo in 1896?? caused
quite a lot of concern.
Hilbert, the most powerful mathematician of his era, set up a programme aimed at
proving the internal consistency of mathematics by socalled nitist means. The
formalist regards mathematics as a oneplayer game, rather like Patience (or Free-
cell). A proof of a statement , for example, is merely a sequence of statements
1
,
2
, . . . ,
n
, ending with
n
= . Each
k
must either be an axiom, or must be
obtained from previous
j
by certain permitted moves, or rules of deduction.
For example, a commonly used rule of deduction is Modus Ponens:
From and deduce
The mathematician seeking to prove the statement is like the player of Patience,
trying out permitted sequences of moves until she hits upon a sequence that works.
The idea behind the Hilbert Programme is to formalize mathematics:
Write all mathematics in a formal language;
Reduce all proofs to formal deductions;
Show that no contradictions can be derived within this formal system.
Hilbert had hoped that it would be possible to show that all of mathematics could
be thus reduced, and proved consistent. Thus commenced a massive attempt to
formalize and axiomatize all of mathematics, and the way that we now do and see
mathematics has been heavily inuenced by the Hilbert programme.
One of the rst branches of mathematics to be formalized was set theory, where the paradoxes had
been found. The ZermeloFraenkel axioms of set theory banish Russells paradox, but at a costs: It
38 Prelude to an Axiomatic Development of the Real Number System
is no longer possible for a set to belong to itself, and the intuition of a set as just any old collection
of objects had to be abandoned. It was found possible to squeeze nearly all of mathematics inside
the formal system of axiomatic set theory. Unfortunately, Hilberts student Godel proved in 1931
that the Hilbert programme was doomed to failure. In a paper entitled On formally undecidable
statements in Principia Mathematica and related systems he showed that in any formalist reduction
of mathematics there would be statements that are true, but unprovable. He also showed that no
such reduction is capable of proving its own consistency. This proved the death knell for the Hilbert
programme, though not for the Formalist school. (Godel himself was a Platonist.) With hindsight,
it is remarkable how close the Hilbert programme came to succeeding.
Platonism and Formalism disagree (quite violently) about the nature of mathematical
objects: For the Platonist, these have an existence independent of the human mind; by the
mysterious faculty of intuition we apprehend basic truths (axioms) about mathematical
objects, and then use reason to deduce ever more complex truths (theorems). For the
Formalist, there are no mathematical objects, just rules for transforming one string of
symbols into another.
Nevertheless, both schools of thought agree on what constitutes a valid mathematical
proof. The average practicing mathematician has been described as a Platonist on week-
days, and a Formalist on Sundays. That is, when he is doing mathematics he is convinced
hat he is dealing with an objective reality whose properties he is attempting to determine.
But then, when challenged to give a philosophical account of this reality, he nds it easiest
to pretend that he does not believe in it after all.
And thats what our position will be. To begin with, we will be rm Platonists: We
will believe in the objective existence of the real number system, and use our intuition to
apprehend basic truths. Recognising that our intuition is fallible, however, we wont let
it stray to far. Instead, we opt soon to formalise our intuitions into a system of axioms.
After that, intuition is only allowed to make suggestions, and only those statements that
can be seen to admit a formal proof will be admitted to the status of theoremhood.
0.3.3 Logic, Formal Languages, Quantiers
The aim of this section is to cover the bare minimum about formal theories just enough
to make our construction of the real number system intelligible.
A formal language is a collection of L whose logical symbols include
Logical Connectives
and
or
implies
if and only if
not
Preliminaries 39
It is enough to use just two connectives, e.g. and . We can then dene the remainder by
( )
( ) ( )
Just a reminder: is inclusiveor: p q is true if and only if at least one of p, q is true, possibly
both.
Quantiers
For all
There exists
We have
x x() x x()
Variables
x, y, z, x
1
, x
2
, x
3
. . .
Identity relation
A special binary relation symbol denoted =.
Logical symbols have the same meaning, regardless of context. L also has nonlogical
symbols, whose meaning depends on context:
Relation symbols
For example, if we want to talk about partial orderings, we will want a symbol ; if
we want to talk about sets, we will want symbols and .
Function symbols
For example, if we want to talk about arithmetic, we will want binary function
symbols +, . We may want unary function symbols ,
1
. If we want to talk about
sets, we will want binary function symbols , , unary function symbols
c
, T;
Constant symbols
These are specially named elements, and are often regarded as nullary function sym-
bols. For example, if we want to talk about addition, a distinguished element denoted
by 0 plays an important role. If we want to talk about sets, the set deserves its
own name.
A formal language will generally not contain all of the above nonlogical symbols, only
those needed to talk about the domain of discourse. L will also have brackets (, ), [, ], etc.
The symbols of a formal language may be strung together to form two types: terms
and formulas.
Terms are dened as follows:
40 Prelude to an Axiomatic Development of the Real Number System
(i) Every variable and every constant is a term;
(ii) If t
1
, . . . , t
n
are terms, and if F is an nary function symbol, then F(t
1
, . . . , t
n
)
is a term;
(iii) A string is a term only if it can be shown to be so by a nite number of
applications of (i) and (ii);
Formulas are dened as follows:
(i) If t
1
, . . . , t
n
are terms, and if R is an nary relation symbol, then R(t
1
, . . . , t
n
) is
a formula. (This includes the case where R is the logical binary relation symbol
=).
(ii) If , are formulas, then so are ( ), ( ), ( ), ( );
(iii) If is a formula, then so is ;
(iv) If is a formula and x is a variable, then x and x are formulas;
(v) A string is a formula only if it can be shown to be so by a nite number of
applications of (i)-(iv).
We often omit brackets when there is no danger of confusion. Moreover, we may also
abbreviate xy by x, y.
If is a formula, we write (x, y, z) to show that the variables of are (amongst)
x, y, z.
Example 0.3.1 Partial orderings
Consider the following language L: In addition to the logical symbols, L has a single binary relation
symbol . There are no function and constant symbols. Thus the only terms of L are the variables. Some
example of formulas are
x y, x(x y y z) z((z x))
The theory of partial orderings has the following axioms
(i) x(x x);
(ii) x, y(x y y x x = y);
(iii) x, y, z(x y y z x z).
This theory has many interpretations. One is the twoelement chain C
2
= 0, 1 with 0 1. This is a
linear ordering, i.e. it satises the axiom x, y(x y y x). Another example is the powerset T(A) of
a set A, where is interpreted as subset. This ordering is nonlinear if A has more than one element.
Thus dierent structures may satisfy the same axioms.
In this section, I have presented the briefest possible introduction to formal theories,
and Ive taken numerous short cuts. If you want more extensive (and more accurate)
coverage, you will have to consult a text on mathematical logic. We end this section with
some brief comments on quantiers and negation.
Consider the following formulas:
xy(y > x)
To check the truth of such a statement, it is convenient to regard it as a game between
two players, and . In this game, opens play and chooses and x. If can nd a y such
that y > x, then wins the game. If she cant, wins. The formula is true if can always
win, i.e. if has a winning strategy; else, the formula is false.
Whether or not the formula is true or false depends on where it is played. If we play it
on the natural numbers N, then has a winning strategy: If chooses x, the can choose
y = x +1. Then y > x. This works for any x that might choose. Hence has a winning
strategy: The formula is true for N.
Preliminaries 43
Suppose, however, that the game is played not in N, but on the twoelement chain
C
2
= 0, 1. Then if chooses x = 1, cannot nd a y C
2
with y > x. Hence has a
winning strategy, and the statement is false for C
2
.
Exercise 0.3.3 Give a similar analysis for the statement
yx(y > x)
Finally, a note about negating quantiers: A negation sign can creep past a quantier,
but it ips the quantier in the process:
x x() x x()
For example,
[xy(y > x)] x[y(y > x)]
xy(y ,> x)