Ch1PeanoAxioms
Ch1PeanoAxioms
1. Introduction
We begin our exploration of number systems with the most basic number
system: the natural numbers N. Informally, natural numbers are just the or-
dinary whole numbers 0, 1, 2, . . . starting with 0 and continuing indefinitely.1
For a formal description, see the axiom system presented in the next section.
Throughout your life you have acquired a substantial amount of knowl-
edge about these numbers, but do you know the reasons behind your knowl-
edge? Why is addition commutative? Why is multiplication associative?
Why does the distributive law hold? Why is it that when you count a finite
set you get the same answer regardless of the order in which you count the
elements? In this and following chapters we will systematically prove basic
facts about the natural numbers in an effort to answer these kinds of ques-
tions. Sometimes we will encounter more than one answer, each yielding
its own insights. You might see an informal explanation and then a for-
mal explanation, or perhaps you will see more than one formal explanation.
For instance, there will be a proof for the commutative law of addition in
Chapter 1 using induction, and then a more insightful proof in Chapter 3
involving the counting of finite sets.
We will use the axiomatic method where we start with a few axioms
and build up the theory of the number systems by proving that each new
result follows from earlier results. In the first few chapters of these notes
there will be a strong temptation to use unproved facts about arithmetic
and numbers that are so familiar to us that they are practically part of our
mental DNA. Resist this temptation! In the context of a formal proof, take
the attitude that such familiar facts are not certain until they are proved.
So they cannot be used in a formal proof until after they have been proved.
A similar thing can be said of definitions: pretend that your intuitive ideas
of even basic things such as + and < are inaccessible until you can have
a formal definition. In the beginning, the only terms that can be used are
Copyright c 2007–2019 by Wayne Aitken and Linda Holt. The copyright holders au-
thorize individuals to make a single paper copy of this edition for personal, noncomercial
use.
1Warning: some authors do not include 0 in the set of natural numbers. This will be
discussed in the next section.
1
2 THE PEANO AXIOMS
terms from logic and set theory, explained in Chapter 0, and the primitive
terms. The only facts that can be used are the axioms together with facts
from logic and set theory as summarized in Chapter 0, including general
facts about equality, functions, and relations.2
The system of axioms we use here is a famous system called the Dedekind-
Peano axioms (Section 2), or the Peano axioms for short. We will add
to this an axiom about iterating functions (Section 3), but in an optional
section (Section 9) to this chapter, we will see that this iteration axiom is
not necessary since it can actually be proved from Peano’s axioms. Thus
it is strictly speaking a convenient “temporary” axiom: one could replace
the iteration axiom by a theorem that says the same thing. We take it as a
temporary axiom in these notes since the proof of the iteration axiom is a
bit subtle, and is at a higher level than most of the other theorems of the
chapter. I do not want to start off the chapter by scaring away readers.
Remark 1. Although we will be strict about not using unproved assertions in
the formal development, you do not need to be so shy about using your prior
knowledge in the informal exercises. Such prior knowledge is also useful for
temporarily guiding your thinking until a firmer foundation is laid down in
the formal development.
This distinction between formal and informal is especially important in
the many exercises that will arise in these notes. The informal exercises will
be labeled as such. The rest are considered to be formal exercises.
The formal exercises may require you to fill in details of sketchy proofs
or even to write complete proofs for theorems whose proofs are not too
hard or are similar to earlier proofs. These constitute part of the official
development of the number systems, and the facts established in them can
be used in future proofs. On the other hand, the informal exercises are
designed to help familiarize you with facts or definitions, or to lead you in
interesting but tangential directions. These do not have to be solved with
a formal proof, and can appeal to prior knowledge. They are considered to
be outside the logical development of the number systems, and so cannot be
cited in a later formal proof.
For example, suppose an informal exercise asks for an example of an
associative binary operation that is not commutative. Suppose you know
about matrix multiplication from a linear algebra course. Then you can
use your knowledge of linear algebra to help solve the problem. On the
other hand, you cannot use matrix multiplication in a formal exercise since
matrices are not developed in this course.
2In these notes, we start almost at the very beginning of mathematics, but you should be
aware that there are other approaches that start with less and begin by proving theorems
about set theory first before developing the number systems. For example, set theorists
typically start with the Zermelo-Fraenkel axioms for set theory, and from there develop
set theory, the number systems, and (most of) the rest of mathematics.
THE PEANO AXIOMS 3
Remark 2. In the above discussion, the term theorem refers to any result
that has a proof. Keep in mind that other terms for theorems are commonly
used including proposition, lemma, and corollary. The term lemma is used
for a theorem that is only important as a stepping stone in proving other
theorems, and a corollary is a theorem that follows fairly easily, for example
as an interesting special case, from a previous theorem. Some authors also
make a distinction between the terms theorem and proposition, using the
label proposition for more ordinary theorems and using theorem only for
the more important theorems. These are informal guidelines: one can find
exceptions.
Remark 3. As mentioned above, in the formal development of the natural
numbers we begin by assuming that everything about the natural numbers
is as yet unknown territory. On the other hand, we do allow logic as ex-
pressed in everyday, but careful, language. This leads to a point that needs
to be clarified: even though we are developing the natural numbers from
scratch, we will allow ourselves to use a few number-related terms such as
“pair”, “unique”, “first”, “second”, and so on. We do so because we can
safely treat such basic terms as forming part of our logical vocabulary.3 We
will also use numerals for the labeling of sections, theorems, exercises, and
such. These labels have no arithmetic content, and could have just as easily
been any string of symbols. They are being used informally to help keep the
chapter organized. On the other hand, we will not take any truly mathe-
matical or arithmetic fact for granted, for example facts about addition and
multiplication. These all must be proved.
2. The axioms
Forget everything you think you know about the natural numbers, even
something as basic as 1 + 1 = 2. Pretend you don’t even know the definition
of addition. In what follows, we will recreate all this knowledge on a solid
logical foundation by proving all the elementary theorems and definining all
the basic ideas. (Of course this self-imposed forgetting should be confined
to the official formal development of the natural numbers, and the formal
proofs. Your past knowledge will come in handy for thinking up strategies
for proofs, for helping you mentally digest definitions, and for warning you
when you are about to make an error.)
At this point, the only thing that you are officially allowed to know con-
cerning the natural numbers is what is expressed in the following axioms.
They function partially as descriptions of the primitive terms, and partially
as a list of facts that we can use in later proofs. These axioms are called
3For example, the statement “the set S has at least two elements” does not really
require the number 2. It can be translated easily into basic logic as follows:
∃x ∃y (x ∈ S) ∧ (y ∈ S) ∧ (x 6= y) .
4 THE PEANO AXIOMS
the Dedekind-Peano axioms since they are based on the axioms of the Ger-
man mathematician Richard Dedekind (1831 – 1916) and the the Italian
mathematician Giuseppe Peano (1858 – 1932).4
We begin with the primitive terms described in the axioms. They are
called primitive because they do not have to be formally defined, but instead
are described in the axioms. All other terms, such as + or < must be defined.
Such definitions can build on primitive terms, notions from Chapter 0, or
any previously defined term.
Primitive Terms. The three primitive terms are N, 0, and σ.
The axioms then tell you everything you are allowed to assume about the
meaning of these terms. For example, the first axiom tells you broadly what
type of object these terms denote.
Axiom 1. (i) N is a set, (ii) 0 is an element of N, and (iii) σ is a function
σ:N→N
with domain and codomain equal to N.
We call N the “set of natural numbers”, and we call its elements “natural
numbers”. We call 0 the “zero element”, or just “zero”. We call σ the
“successor function”. If n ∈ N we call σn the “successor of n.” Informally,
the successor of n is the next number following n. This is informal since we
have not yet defined an order < on N.
Axiom 2. The image of σ : N → N does not contain 0:
¬ ∃ n ∈ N, σn = 0 .
it for n (the inductive hypothesis) and then proving it for the next number
after n (which we call σn, but it is commonly called n + 1).
The two versions of induction, however, are really the same. To see why,
we need to think about the connection between properties and sets. A basic
fact of set theory is that every property of natural numbers defines a subset
of N. Showing that a property holds for all natural numbers is the same
as showing that the corresponding subset S is all of N. To see an example,
consider the following problem from number theory (using ideas we haven’t
defined formally yet). Suppose you want to prove that every natural number
is the sum of four squares. Then instead of using the property “n is the
sum of four squares” for an inductive hypothesis, our version of induction
(Axiom 4) would use the set
S = {n ∈ N | n is the sum of four squares}.
The base step is to show 0 ∈ S. By the definition of S this actually amounts
to showing that 0 is the sum of four squares (0 = 02 + 02 + 02 + 02 ), so it
amounts to the same thing as the base case of the other form of induction.
Next you need to establish (ii) by assuming n ∈ S and showing σn ∈ S. By
definition of S this means that you assume that n is the sum of four squares
(the inductive hypothesis), and somehow try to show that the successor σn
is also the sum of fours squares (this is the hard part of the proof). Once the
base step (i) and the inductive step (ii) have been established, the induction
axiom shows that S = N. In other words, all natural numbers are the sum
of four squares. As this illustrates, using a set instead of a property is just
a very minor change of outlook, the actual work is the same.
In a later chapter we will discuss another type of induction, strong in-
duction, which is truly different from that described above. We will also
discuss versions where 0 is replaced by other “base cases”. Our first version
of induction is an axiom (Axiom 4), but the later versions will be theorems.
The later versions cannot be used until they are proved.
Remark 8. The induction axiom is more complicated than the others. There
is a cleaner way of stating this axiom using the notion of “closed” which we
now explain. If A is a subset of N then the image set σ[A] is necessarily also
a subset of N since σ is a function N → N. The subset A ⊆ N is said to be
closed under successor if σ[A] ⊆ A. In other words, σ cannot move you out
of A: for all n ∈ A, we have σn ∈ A.
Using this concept, we can express the axiom as follows:
If A ⊆ N contains 0 and is closed under successor then A = N.
Informal Exercise 2. Describe three distinct subsets of N that are closed
under σ but that are not all of N. By the above remark, none of your
examples can contain 0. This shows the importance of checking the “base
case” since all of these satisfy (ii) but not (i) of Axiom 4. Hint: since this is
informal you have available the formula σn = n + 1 even though it has not
been proved yet. Also, one of your examples can be the empty set.
THE PEANO AXIOMS 7
7A random person on the street might think of numerals and numbers as the same
thing. But numerals are symbols. If numbers are not symbols, what are they? This
comes back to the philosophical question: what are numbers really? As mentioned above
we sidestep this as follows: numbers are what the axioms postulate to exist. The axioms
do not specify what they really are, they just specify some of their properties. Numerals,
on the other hand, are names we give to the objects described by the axioms. In summary,
the axioms supply the numbers, but we supply the numerals to refer to these numbers,
and can do so any way we choose
8In definitions “if” really means “if and only if” in common mathematical writing.
8 THE PEANO AXIOMS
Now we see the first proof by induction. It is subtle in one respect. One
might want S to be the set {x ∈ N | x has a predecessor in N} for the
induction, but this definition of S does not contain 0. So Axiom 4 cannot
be used! We do not yet have a form of induction that starts at 1 (we will
establish such an induction later). So instead we just artificially put 0 in S
by using {x ∈ N | (x = 0) ∨ (x has a predecessor in N)}. We only use this
trick when we want to prove something about everything but 0.
Theorem 2. Every nonzero element of N has a predecessor in N.
Proof. Our goal is to use the induction axiom (Axiom 4). To do so we need
to define a set:
def
S = {x ∈ N | (x = 0) ∨ (x has a predecessor in N)}.
Observe (i) 0 ∈ S by definition of S.
Next we will establish that (ii) n ∈ S =⇒ σn ∈ S for all n ∈ N. So
assume n ∈ S. Since N is the codomain of σ we have σn ∈ N. Observe
that n is a predecessor of σn by Definition 3, so σn has a predecessor. Thus
σn ∈ S by definition of S.
Now that we have established (i) and (ii) above, we can use Axiom 4 to
conclude that S = N. Since S = N, every element of N is either 0 or has a
predecessor in N. So if n ∈ N and n 6= 0 we have that n has a predecessor
in N.
9We do not use ! by itself to mean “unique”. The use of the exclamation mark to mean
“unique” is only used after ∃.
THE PEANO AXIOMS 9
In one sense the predecessor function and the successor function are in-
verses since one undoes the effect of the other. However this cannot be
literally true. Since π is a function N+ → N, its inverse (if it exists) must
be a function N → N+ . To deal with this technicality we define a modified
successor function.
Definition 6 (Modified successor). We define the modified successor func-
tion σ 0 : N → N+ as follows: Given n ∈ N we define σ 0 n to be σn. Since
σn is not 0 (Axiom 2), and since σ 0 n is just σn, we know that σ 0 n is in the
set N+ . So this definition yields a function with codomain N+ as desired.
Observe that σ 0 n = σn for all n ∈ N. The only difference between the
functions is the codomain.
Exercise 5. Let a ∈ N and b ∈ N+ . Show that πb = a if and only if σ 0 a = b.
Exercise 6. Let a ∈ N. Show that π(σ 0 a) = a. Hint: let b = σ 0 a and
substitute for b in the above exercise.
Exercise 7. Let b ∈ N+ . Show that σ 0 (πb) = b.
Exercise 8. Show that π and σ 0 are inverse functions. Conclude that they
are both bijections.
Hint: recall that f : A → B and g : B → A are called inverse functions if
(i) g(f (x)) = x for all x ∈ A, and (ii) f (g(y)) = y for all y ∈ B. Recall also
that a function is a bijection if it is both injective and surjective. Finally,
recall that a function f : A → B is bijective if and only if it has an inverse
function (from B to A).
Exercise 9. We know that σ 0 is bijective. Show that σ is not a bijection.
Exercise 10. The mathematician Dedekind defined a set S to be infinite
if there is a bijection S → T where T is a proper subset of S. Explain
why N is infinite according to Dedekind’s definition.10 (We will give another
definition of infinite in Chapter 3).
Exercise 11. Show that if n ∈ N then n 6= σn. Do so by defining a certain
set S ⊆ N and using the induction axiom to show S = N.
In particular this shows that 0 6= 1, and 1 6= 2, and so on. It does not
mean 0 6= 2 though, this has to be proved separately!
3. Iteration
At this point the only operations we have are successor and predeces-
sor. But any self-respecting theory of arithmetic also needs addition and
multiplication. Our strategy for developing these operations is simple: we
define addition in terms of iterated successor, and multiplication in terms of
iterated addition. Continuing on, we will define exponentiation in terms of
iterated multiplication. These definitions all rely on the general concept of
iteration, so in order to reach our goal of basic arithmetic, we need to take
a side trip through iteration.
10A proper subset of S is a subset that is not equal to S.
10 THE PEANO AXIOMS
4. Addition
As mentioned above, we define addition in terms of iteration of successor.
Informally, you get m + n by starting with m and taking the successor n
times. This idea motivates the formal definition.
Definition 8 (Addition). Let m, n ∈ N. Let σ n : N → N be the nth
iteration of the successor map. Then
def
m + n = σ n (m).
Observe that addition defines a function N × N → N.
Remark 13. Functions S × S → S are called binary operations. Thus + is a
binary operation on N.
Remark 14. This is not the only way of viewing addition. In Chapter 3, we
will show how + can be understood in terms of counting the elements in a
disjoint union.
The following are consequences of the iteration axiom and Definition 8.
Theorem 4. For all m ∈ N
m + 0 = m.
12 THE PEANO AXIOMS
So σz ∈ Sx,y .
By the induction axiom, Sx,y = N. This is true for any x, y ∈ N. So if
x, y, z ∈ N are arbitrary, z ∈ Sx,y which implies x + (y + z) = (x + y) + z.
Remark 18. This proof by induction is valid, but, like many induction proofs,
is weak on conveying an understanding why associativity is true. In Chap-
ter 3 we give a second, more insightful proof involving the set theoretic
identity A ∪ (B ∪ C) = (A ∪ B) ∪ C.
Warning: we do not yet have the commutative law. Thus the next two
lemmas are not redundant. They do not merely repeat Theorems 4 and 6,
but assert something truly new. They are lemmas since, once the com-
mutative law is proved, they will become redundant. So they are only of
temporary use.
1 + σn = 1 + (n + 1)
= (1 + n) + 1
= σn + 1
= σ(σn).
So σn ∈ S.
We conclude that S = N.
Exercise 22. Prove Lemma 8. Complete the above sketchy proof of Lemma 9
by justifying every step by referring to earlier results, definitions, assump-
tions, or axioms, or by referring to the definition of S.
x + y = y + x.
Now assume n ∈ Sx . So
x + σn = σ(x + n)
= σ(n + x)
= 1 + (n + x)
= (1 + n) + x
= σn + x
We conclude that σn ∈ Sx .
By the induction axiom, Sx = N. This is true of any x ∈ N since x
was chosen to be any arbitrary element of N. Now let x, y ∈ N be any two
elements of N. Since Sx = N, we have y ∈ Sx . By definition of Sx we
conclude that x + y = y + x.
Remark 19. In Chapter 3 we see a more insightful proof of the commutative
law involving the set theoretic identity A ∪ B = B ∪ A.
Exercise 23. Justify every step in the above proof by referring to earlier
results, assumptions, or axioms, or by referring to the definition of Sx .
Exercise 24. Prove (x + y) + z = (x + z) + y without using induction.
5. Multiplication
As mentioned above, our strategy for defining multiplication is to use
iteration of addition. To understand, how this works, first consider the
following familiar informal definition:
m · n = |m + m + ·{z
· · + m + m} .
n times
We can interpret the phrase “n times” in terms of iteration. To see this,
notice how we can build up to this sum in n steps:
defn
m · n = αm (0).
In particular, multiplication defines a binary operation N × N → N. As is
common, we do not always need to write the dot ·, but can use juxtaposition
to indicate multiplication.
Remark 20. This is not the only way of viewing multiplication. In Chap-
ter 3, we will show how multiplication can be thought of in terms of counting
the elements in the Cartesian product of two finite sets. Another popu-
lar approach is through recursion (using the equations of Theorem 11 and
Lemma 12).
Exercise 25. Prove the following theorem, lemma, and theorem using the
iteration axiom and the definition of multiplication.
Theorem 11. For all m ∈ N
m · 0 = 0.
Lemma 12. For all m, n ∈ N
m · σn = (m · n) + m.
Theorem 13. For all m ∈ N
m · 1 = m.
Exercise 26. We already have proved that 2 + 2 = 4 and 3 + 2 = 5 (Exer-
cise 21). Use the results concerning addition to give proofs of the following:
4 + 2 = 6, 5 + 2 = 7, 6 + 2 = 8, 7 + 2 = 9, 3 + 3 = 6, 4 + 3 = 7, 5 + 3 = 8,
6 + 3 = 9, 4 + 4 = 8, 5 + 4 = 9. Use these addition facts together with The-
orems 11 and 13 and Lemma 12 to show the following: 0 · 0 = 0, 0 · 1 = 0,
0 · 2 = 0, 1 · 1 = 1, 1 · 2 = 2, 2 · 0 = 0, 2 · 1 = 2, 2 · 2 = 4, 2 · 3 = 6,
2 · 4 = 8, 3 · 2 = 6, 3 · 3 = 9.
Theorem 14 (Distributive Law: part 1). For all x, y, z ∈ N
(x + y)z = xz + yz.
Remark 21. We adopt the usual conventions for dropping parentheses. Thus,
when the parentheses and the dots are restored, the above equation is
(x + y) · z = (x · z) + (y · z).
Exercise 27. Prove the distributive law. Do so by defining, for any fixed
x, y ∈ N, a set Sx,y ⊆ N. Show that Sx,y = N by the axiom of induction. In
order to give a complete and rigorous proof, do not leave any parentheses
out in this proof.
16 THE PEANO AXIOMS
Remark 22. This induction proof is valid but, like many induction proofs,
weak on conveying an understanding why the result is true. In Chapter 3
we will see a second proof using the set theoretic identity
(A ∪ B) × C = (A × C) ∪ (B × C).
Lemma 15. If n ∈ N then 0 · n = 0.
Lemma 16. If n ∈ N then 1 · n = n.
Exercise 28. Prove the above two lemmas using the induction axiom.
Theorem 17 (Commutative Law). For all x, y ∈ N
xy = yx.
Proof. Fix x ∈ N. Let Sx = {u ∈ N | xu = ux}. We wish to show y ∈ Sx .
We do so by showing all natural numbers are in Sx (via induction).
By Theorem 11 and Lemma 15, we get 0 ∈ Sx .
Now assume n ∈ Sx . Then
x · σn = xn + x
= nx + x
= n·x+1·x
= (n + 1) · x
= σn · x.
We conclude that σn ∈ Sx .
By the induction axiom, Sx = N. Thus y ∈ Sx which implies xy = yx.
6. Exponentiation
Just as repeated addition gives multiplication, repeated multiplication
gives exponentiation. In other words, you can define exponentiation via the
iteration of a multiplication function. How we do this for exponentiation is
similar to how we developed multiplication, so the details will be left to the
reader.
Definition 10. Let m, n ∈ N. Let µm : N → N be defined by the rule
x 7→ xm. Let µnm be the nth iteration of µm . Then
def
mn = µnm (1).
Remark 24. One amusing aspect of our approach is that exponential nota-
tion is used for iteration (Section 3) before it is used in the traditional way
for exponentiation itself (here in Section 6). This is a symptom of the large
emphasis we place on functions and their iterates. Our convention is that
when an exponent is used with a function it refers to iteration, but when it
is used with a number it refers to exponentiation.
Informal Exercise 32. In contrast with the previous section, we start
with 1 instead of 0 in our iterative definition. What would happen if we
used 0 instead of 1 in Definition 10?
Remark 25. This is not the only way of viewing exponentiation. In Chap-
ter 3, we will see how it can be defined in terms of counting the number of
functions between two sets. It can also be defined using recursion.
Informal Exercise 33. Do you expect (m, n) 7→ mn to be a commutative
binary operation N × N → N? Do you expect it to be associative? If you
said ‘no’ to either question, back up your answer with a counter-example.
Theorem 20. For all m ∈ N
m0 = 1.
Lemma 21. For all m, n ∈ N
mσn = mn · m
18 THE PEANO AXIOMS
m1 = m.
(xy)n = xn y n .
Exercise 36. Prove the above using induction on n. In other words, apply
the induction axiom to a certain set Sx,y .
xm+n = xm xn .
Exercise 37. Prove the above using induction on n. In other words, apply
the induction axiom to a certain set Sx,m .
0n = 0.
Exercise 38. Prove the above without induction. Use Theorem 2 to first
show that n = σm for some m.
and so on. This function will depend, of course, on z and s, and we could
write it as ϕz,s if we want to make this dependency clear. It turns out that
the condition ϕ◦σ = s◦ϕ is what is needed to force ϕ to be such a “stepping
function” (see the corollary).
This informal discussion helps motivate the following theorem. It states
that is is possible to find such a function ϕ.
N = σ2 [N ] ∪ {(0, z)}.
but declaring the starting point to be 1, and declaring the stepping func-
tion to be µm (x) = xm. Following a path in this structure would give you
1, m, m2 , m3 , m4 , . . ..
The collection of all possible path structures forms something that math-
ematicians call a category.
Now, among all path structures, N is very special: Theorem 33 shows
it maps (uniquely) to any other path structure in a special way. More
specifically, given any other path structure A, there is a function ϕ : N → A
such that (i) 0 7→ z and (ii) ϕ ◦ σ = s ◦ ϕ. The first condition says that ϕ
sends the start to the start. The second condition matches σ with s. We
can illustrate the second condition with the following commutative diagram:
ϕ
N −−−−→ A
σ s
y y
ϕ
N −−−−→ A
What this diagram expresses is that both ways of going from the top left
set to the bottom right set gives the same image.13 This diagram expresses
the equation ϕ ◦ σ = s ◦ ϕ.
The existence of this special ϕ is called the universal property of N. In
other words, N has the universal property of being able to map uniquely
into any other path structure (in a path compatible way).
Remark 27. The universal property (Theorem 33) is important for other
reasons besides describing iteration. In fact, it makes it easy to show that
any two models of the Peano axioms are “isomorphic”. However, we will
skip this important isomorphism theorem since explaining in precisely will
lead us too far afield.