Set Theory Oxford
Set Theory Oxford
2 Set Theory
Martin Bays
HT23 Oxford
Contents
0.1 Acknowledgements . . . . . . . . . . . . . . . . . . . . . . . . . . 2
1 Introduction 3
1.1 Russell’s paradox . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.2 Zermelo-Fraenkel Set Theory . . . . . . . . . . . . . . . . . . . . 3
1.3 Foundations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.4 Consistency . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.5 Why study set theory? . . . . . . . . . . . . . . . . . . . . . . . . 4
1.6 Cardinality . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.7 Structure of the course . . . . . . . . . . . . . . . . . . . . . . . . 5
1
CONTENTS 2
5.5 Arithmetic on N . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
5.6 Defining Z, Q, R (not on syllabus) . . . . . . . . . . . . . . . . . 19
6 Cardinality 20
6.1 Classes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
6.2 Cardinalities . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
6.3 Comparing cardinalities . . . . . . . . . . . . . . . . . . . . . . . 21
6.4 Finite sets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
6.5 Countable sets . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
6.6 Cardinal arithmetic . . . . . . . . . . . . . . . . . . . . . . . . . . 24
10 Cardinal numbers 40
10.1 Cardinal arithmetic with Choice . . . . . . . . . . . . . . . . . . 42
10.2 Cardinal exponentation and CH (off-syllabus) . . . . . . . . . . . 43
A References 45
A.1 Exam errata . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 47
0.1 Acknowledgements
These notes follow the outline of Jonathan Pila’s notes for this course, which
were themselves originally based on notes of Robin Knight. Other sources in-
clude the books of Hils-Loeser, Goldrei, Jech, and Kunen, (see references below),
and lecture notes of Itay Kaplan.
1 INTRODUCTION 3
1 Introduction
What is a set? One standard informal answer might be: “A set is an unordered
collection of objects, called its elements”. We can formalise this intuitive idea
of the data given by a set as a test for equality:
Principle of Extensionality: Two sets A and B are equal if and only if they
have the same elements,
A = B ⇔ ∀x (x ∈ A ↔ x ∈ B).
But this leaves a trickier question: What sets are there? If we want to say
something holds “for all sets A”, which A must we consider? It is tempting to
give a broad definition, such as:
But take P (x) to be the property: x is a set which is not an element of itself.
Suppose R is the set whose elements are precisely the x satisfying P (x). Then
for any set x,
x∈R⇔x∈ / x.
In particular,
R∈R⇔R∈
/ R.
This contradiction shows that Unrestricted Comprehension is inconsistent.
1.3 Foundations
It has become common (though not universal) practice to consider ZFC as
forming the foundations of mathematics.
Such a position gives in particular a clear answer to the question: what is a
proof of a mathematical statement? This proceeds as follows.
Given a mathematical statement S, we first encode it into a formal (first-
order) statement σ about sets – remarkably, such encoding appears to always
be possible. We then identify the notion of a “proof of S” with the notion of
a formal proof of σ from ZFC; the latter has a clear unambiguous definition
(presented in B1.1 Logic).
1.4 Consistency
Is ZFC consistent? Could there be some paradox which it fails to avoid?
Sadly, Gödel’s 2nd Incompleteness Theorem shows that if ZFC is consistent,
then we can not prove (in the above sense, i.e. from ZFC) that it is consistent.
Mathematicians and set theorists vary in the extent to which they “believe”
that it is consistent.
What we can say for sure is that the mathematics humanity has developed
so far has not revealed any inconsistency in ZFC.
This course concentrates on (1) and (2), but also introduces the necessary
preliminaries for the Part C course Axiomatic Set Theory, which concentrates
on (3).
1.6 Cardinality
One key concern in the study of sets is the size (“cardinality”) of a set. Sets
can be finite, countable, or uncountable – but there is much more to say than
that.
We will define sets X and Y to have the same cardinality if there is a bijection
X → Y , and we will see that the ZFC axioms suffice to make this a rich and
useful concept, answering in particular questions like:
• Is there a complex number which is not the zero of any integer polynomial?
• How many lines does it take to cover the real plane?
• Can the subsets of R be exhaustively indexed by real numbers?
2 THE FIRST AXIOMS 5
∀x ∀y (x = y ↔ ∀z (z ∈ x ↔ z ∈ y)).
Proof. Existence is by the axiom Empty Set. Suppose x and y each have no
elements, i.e. ∀z z ∈
/ x and ∀z z ∈
/ y. Then x and y have the same elements, i.e.
∀z (z ∈ x ↔ z ∈ y), since both sides of “↔” are false for all z. So x = y by
Extensionality.
2.3 Pairing
ZF3 (Pairing): For any x and y (not necessarily distinct), there is a set whose
elements are precisely x and y:
∀x ∀y ∃z ∀w (w ∈ z ↔ (w = x ∨ w = y)).
(Here, ∨ means “(inclusive) or”; P ∨ Q is true iff at least one of P and Q is.)
Again, we have uniqueness by Extensionality:
Theorem 2.2. For any x and y there is a unique set whose elements are pre-
cisely x and y.
We denote this unique set by {x, y}, and by {x} in the case that y = x.
This is already enough to build up a rich collection of sets: we have the
existence of ∅, {∅}, {∅, {∅}}, {∅, {∅, {∅}}}, and so on. But we don’t yet know
that any sets with more than two elements exist!
End of lecture 1
2.4 Union
ZF4 (Union): For any set x, there is a set whose elements are precisely the
elements of the elements of x:
∀x ∃y ∀z (z ∈ y ↔ ∃w (z ∈ w ∧ w ∈ x)).
2.5 Powerset
A set x is a subset of a set y, written x ⊆ y, if every element of x is an element
of y.
x ⊆ y ⇔ ∀z (z ∈ x → z ∈ y).
ZF5 (Powerset): For any set x, there exists a set whose elements are precisely
the subsets of x:
∀x ∃y ∀z (z ∈ y ↔ z ⊆ x).
3.2 Comprehension
ZF6 (Comprehension): For any formula ϕ(x) with parameters and any set
y, there is a set z whose elements are precisely those elements x of y which
satisfy ϕ(x):
∀y ∃z ∀x (x ∈ z ↔ (x ∈ y ∧ ϕ(x))).
4 PRODUCTS AND RELATIONS 9
is an L-sentence expressing the instances of ZF6 for those formulas with param-
eters which result from substituting sets for the variables w1 , . . . , wn . So we can
express ZF6 by the axiom scheme consisting of one such sentence for each such
formula.
With this restricted version of comprehension, the argument of Russell’s
paradox does not lead to inconsistency; instead, it proves something interesting.
Theorem 3.2. There is no set of all sets: there is no set Ω such that ∀x x ∈ Ω.
End of lecture 2
Comprehension allows us to implement some more of the usual constructions
of set theory.
T
Lemma 3.3. Let a be a non-empty set. Then there is a unique set a such
that for all x, \
x∈ a ⇔ ∀y ∈ a x ∈ y.
Definition 4.1. Given sets x and y, the ordered pair with first co-ordinate
x and second co-ordinate y is the set
Proof. ⇐: Immediate.
⇒: Suppose
{{x}, {x, y}} = {{x′ }, {x′ , y ′ }}.
We can now define the Cartesian product using Powerset and Comprehen-
sion.
Proposition 4.3. Let X and Y be sets. There is a unique set X × Y , called the
Cartesian product of X and Y , with the property that the elements of X × Y
are precisely the ordered pairs ⟨x, y⟩ where x ∈ X and y ∈ Y .
has the desired property – this set exists by Comprehension, since z = ⟨x, y⟩
can be expressed in L, namely by
∀w (w ∈ z ↔ (∀u (u ∈ w ↔ u = x) ∨ ∀u (u ∈ w ↔ (u = x ∨ u = y)))).
4 PRODUCTS AND RELATIONS 11
Notation 4.4. From now on we will allow ourselves to use the ⟨x, y⟩ notation in
S formulas used in Comprehension, as well as our other defined terms {x, y},
the
x, P(x), ∅, and so on. As in the previous proof, it is always possible to
eliminate these expressions to write an equivalent formula directly in L. The
following example illustrates the general technique (and how much paper we
will save with it!):
n [ o
∀x x, y ∈ z
n [ o
⇔ ∀x ∃w (w = x, y ∧ w ∈ z)
[
⇔ ∀x ∃w (∀u (u ∈ w ↔ (u = x ∨ u = y)) ∧ w ∈ z)
⇔ ∀x ∃w (∀u (u ∈ w ↔ (u = x ∨ ∀v (v ∈ u ↔ ∃t (v ∈ t ∧ t ∈ y)))) ∧ w ∈ z)
4.2 Relations
Definition 4.5. A binary relation is a set R of ordered pairs; we then write
xRy to mean ⟨x, y⟩ ∈ R (and we use this as an abbreviation in formulas).
SS
Remark 4.6. If ⟨x, y⟩ ∈ R then x, y ∈S R, since x, y ∈ {x,
SS y} ∈ ⟨x, y⟩ ∈ R
(from which we obtain x, y ∈ {x, y} ∈ R and hence x, y ∈ R).
Definition 4.7. The domain of a binary relation R is the set
dom(R) := {x : ∃y xRy},
ran(R) := {y : ∃x xRy};
Note that with this definition, = and ∈ and ⊆ are not relations, since the
domain would be a set of all sets, though their restrictions to any given set are
relations.
End of lecture 3
4.2.1 Functions
Definition 4.8. A function is a binary relation f with the property that for
all x, there is at most one y such that ⟨x, y⟩ ∈ f .
We write f (x) = y to mean ⟨x, y⟩ ∈ f (and we use this as an abbreviation
in formulas).
The restriction of f to a set a ⊆ dom(f ) is the function
Remark 4.11. To partially explain the notation Y X : for finite sets X and Y we
have |Y X | = |Y ||X| .
Remark 4.12. The empty set is a function, called the empty function, ∅ : ∅ → ∅.
So Y ∅ = {∅} for any set Y .
Irreflexivity: ¬x < x;
Transitivity: if x < y and y < z then x < z.
It is a strict total order if also we have for all x, y ∈ X that x < y or
y < x or x = y.
Reflexivity: x ∼ x;
5 THE AXIOM OF INFINITY AND THE NATURAL NUMBERS 13
Symmetry: if x ∼ y then y ∼ x;
Transitivity: if x ∼ y and y ∼ z then x ∼ z.
X/ ∼ := {S ∈ P(X) : ∃x ∈ X S = {y ∈ X : y ∼ x}}
= {{y ∈ X : y ∼ x} : x ∈ X}.
∃x (∅ ∈ x ∧ ∀y (y ∈ x → y ∪ {y} ∈ x)).
End of lecture 4
Proof.
∀n ∈ N ∀x ∀y ((x ∈ y ∧ y ∈ n) → x ∈ n).
contradicting n ∈
/ n.
(i) n+ ̸= 0.
(ii) If n ∈ m then n+ ∈ m+ .
(iii) If n ̸= 0, then n = k + for a unique k ∈ N.
End of lecture 5
Proof. Uniqueness given existence is immediate: if n and n′ are each least, then
n ≤ n′ ≤ n, so n = n′ .
For existence, suppose X ⊆ N has no least element. We show X = ∅ by
proving ∀n ∈ N ∀m ∈ n m ∈ / X by induction. For n = 0 this is trivial, and if it
holds for n then it holds for n+ , since otherwise n would be a least element of
X.
Remark. By an easy induction, any n ∈ N is a subset of N (i.e. N is transitive)
(Exercise Sheet 2). So n = {m ∈ N : m < n}, and we can see this as justifying
our informal notation n = {0, . . . , n − 1}.
5.4 Recursion on N
Theorem 5.12 (Definition by recursion on N). Let X be a set and g : X → X
a function, and let x0 ∈ X. Then there exists a unique function f : N → X
such that
• f (0) = x0 ;
• f (n+ ) = g(f (n)) for all n ∈ N.
H := {h ∈ P(N × X) : ∃n ∈ N [h is an n-approximation]}.
S
Let f := H.
We show that f is as required.
• f : N → X: Let n ∈ N. There exists an n-approximation h, so
⟨n, h(n)⟩ ∈ f . If ⟨n, x⟩ ∈ f , then ⟨n, x⟩ ∈ h′ for some h′ ∈ H. Then
h′ is an m-approximation for some m with n ≤ m, so h′ |n+ is an
n-approximation, so h′ |n+ = h|n+ , and so x = h(n).
• f (0) = x0 : If h ∈ H is the 0-approximation, we have ⟨0, x0 ⟩ ∈ h ⊆ f .
• f (n+ ) = g(f (n)) for n ∈ N: Let h ∈ H be the n+ -approximation.
Then f (n+ ) = h(n+ ) = g(h(n)) = g(f (n)).
• f (a, 0) = g0 (a);
• f (a, n+ ) = g+ (a, f (a, n)) for all n ∈ N.
is as required.
More concisely:
5.5 Arithmetic on N
Definition 5.14. Define functions +, ·, b : N × N → N using Corollary 5.13
such that for all n, m ∈ N
• – n+0=n
– n + m+ = (n + m)+
• – n·0=0
– n · m+ = n · m + n
• – nb0 = 1
– nbm+ = (nbm) · n
We normally write nm for nbm. We use the usual operator precedence rules,
so n + m · k means n + (m · k) rather than (n + m) · k.
(i) n + 1 = n+ .
(ii) (n + m) + k = n + (m + k) (Associativity of +).
(iii) 0 + k = k
(iv) n + k + = n+ + k
(v) n + k = k + n (Commutativity of +).
(vi) n · 1 = n.
(vii) n · (m + k) = n · m + n · k (Distributivity of · over +).
(viii) (n · m) · k = n · (m · k) (Associativity of ·).
(ix) n · k = k · n (Commutativity of ·).
(x) mn+k = mn · mk .
(xi) mn·k = (mn )k .
(i) n + 1 = n + 0+ = (n + 0)+ = n+ .
(ii) • Base case: (n + m) + 0 = n + m = n + (m + 0).
• Inductive step: (n + m) + k + = ((n + m) + k)+ = (n + (m + k))+ =
n + (m + k)+ = n + (m + k + ).
5 THE AXIOM OF INFINITY AND THE NATURAL NUMBERS 19
and defining addition and multiplication accordingly. You may like to think how
we could continue in this vein to define your favourite objects of mathematics
(including those of logic!).
End of lecture 6
6 Cardinality
One key contribution of set theory is to give a rigorous mathematical develop-
ment of the notion of the “size” of an infinite object, which we call its cardinality.
We first explore what we can understand of this with the axioms we have so far.
Then we add another axiom, Replacement, which will let us reach larger cardi-
nalities. Later, we will add the Axiom of Choice, and see that this significantly
clarifies the structure of cardinalities (while still leaving some natural questions
undecided).
6.1 Classes
If ϕ(x) is a formula, it may or may not be that there is a set {x : ϕ(x)}. We
have {x : x ̸= x} = ∅, but there is no set {x : x = x}.
Nonetheless, it is convenient to reuse some of the notation and terminology
we use for sets to talk about {x : ϕ(x)}.
Definition 6.1.
Remark 6.2.
6.2 Cardinalities
Definition 6.3. Sets X and Y have the same cardinality (or are equinu-
merous), written X ∼ Y , if there exists a bijection X → Y .
ϕ(X, Y ) := ∃f [f is a bijection X → Y ].
• X ∼ X.
• If X ∼ Y then Y ∼ X.
• If X ∼ Y and Y ∼ Z then X ∼ Z.
Lemma 6.8 (Tarski’s Fixed Point Theorem). Let X be a set. Then any mono-
tone function H : P(X) → P(X) has a fixed point, where:
So H(C) = C, as required.
Theorem 6.9 (Schröder-Bernstein Theorem5 ). If |X| ≤ |Y | ≤ |X| then |X| =
|Y |.
A ⊆ B ⊆ X ⇒ f [A] ⊆ f [B]
′ ′
⇒ f [A]c ⊇ f [B]c
′ ′
⇒ g[f [A]c ] ⊇ g[f [B]c ]
′ ′
⇒ H(A) = g[f [A]c ]c ⊆ g[f [B]c ]c = H(B).
Corollary 6.10. < is a strict partial (class) order on V/ ∼, i.e. for all X, Y, Z:
Proof. Exercise (sheet 2). Hint: For (i) and (ii), prove the result by induction
when X = n ∈ N, and then compose with bijections when X is arbitrary. For
the inductive step in (ii), if f : n+ → n+ is an injection but k ∈ n+ \ ran(f ),
then f ′ := σ ◦ f : n+ → n is injective where σ = (n k) : n+ → n+ is the
transposition, hence f ′ |n : n → n is also injective, but f ′ (n) ∈ n \ ran(f ′ |n ),
contradicting the induction hypothesis.
Lemma 6.13. Let n, m ∈ N. Then n < m ⇔ |n| < |m|.
Proof. First note that if n ≤ m, then n ⊆ m by Lemma 5.10, so |n| ≤ |m| since
the inclusion n → m is injective.
Suppose n < m, so in particular n ≤ m and so |n| ≤ |m|. If |n| = |m|, then
there is a bijection f : m → n, but then f is also a function f : m → m which
is injective but not surjective, contradicting Lemma 6.12(ii). So |n| < |m|.
Conversely, if |n| < |m| then |n| ̸≥ |m| so n ̸≥ m, so n < m.
So the order on the natural numbers agrees with the order on their cardi-
nalities, which partially justifies:
Notation 6.14. If n ∈ N, we usually write the cardinality |n| as n.6 e.g. |∅| = 0,
|{3}| = 1.
• countable if |X| ≤ ℵ0 .
• countably infinite if it is countable and infinite.
• uncountable if it is not countable.
Theorem 6.17. A set X is countably infinite if and only if |X| = ℵ0 .
In other words, there is no infinite cardinality below ℵ0 .
6 For now this is an abuse of notation, since the set n is not actually equal to the proper
• |X| + |Y | := |X ∪ Y | if X ∩ Y = ∅.
• |X| · |Y | := |X × Y |.
• |X||Y | := |X Y |.
Exercise 6.21. These do define well-defined operations on cardinalities (con-
sider bijections). (To see that |X| + |Y | is always defined, note that |X| =
|{0} × X| and |Y | = |{1} × Y |, and ({0} × X) ∩ ({1} × Y ) = ∅.)
6 CARDINALITY 25
End of lecture 8
Proposition 6.22.
Proof.
D := {x ∈ X : x ∈
/ f (x)} ⊆ X.
Proof.
• ℵ0 ≤ ℵ0 · ℵ0 : n 7→ ⟨n, 0⟩ is an injection N → N × N.
(i) |Q| = ℵ0 .
(ii) |R| = 2ℵ0 .
Proof.
(ii) The map which associates a real with its Dedekind cut, x 7→ {q ∈ Q : q <
x}, is an injection R → P(Q) since Q is dense in R. So |R| ≤ |P(Q)| =
2|Q| = 2ℵ0 .
For the converse, we can use ternary expansions. Define Φ : 2N → R by
∞
X f (n)
Φ(f ) := .
n=0
3n
P∞
Then Φ is injective (using n=1 3−n = 1
2 < 1; binary expansions would
not work), so 2ℵ0 ≤ |R|.
7 REPLACEMENT AND FOUNDATION 27
7.1 Replacement
Definition 7.1. If X and Y are classes, a formula with parameters ϕ(x, y)
defines a class function F : X → Y if:
End of lecture 9
Example 7.2. P : V → V is the class function defined by
ψ(x, y) := ∀w (w ∈ y ↔ w ⊆ x).
• f (0) = x0 ;
• f (n+ ) = G(f (n)) for all n ∈ N.
Proof. Exactly as in the proof of Theorem 5.12, for each n there is a unique n-
approximation. Then F(n) := [the unique n-approximation] is a class function
F : N → V, so by Replacement,
S H := F[N] = {h : ∃n ∈ N [h is an n-approximation]}
is a set. Set f := H, and conclude exactly as in Theorem 5.12.
8 WELL-ORDERED SETS AND ORDINALS 28
ℵ0
Example 7.6. There is a cardinality greater than any of ℵ0 , 2ℵ0 , 22 , . . ., in the
following sense.
Applying recursion with x0 := N and G := P, we obtain a function f with
dom(f ) = N and f (0) = N, f (1) = P(N), f (2) = P(P(N)), . . ..
Then ran(f ) = f [N] is a set which we could write as {N, P(N), P(P(N)), . . .}.
S
Let X := f [N]. Then for any n ∈ N, we have f (n) ⊆ X and hence
|X| ≥ f (n).
One can show that ZF1-7 do not suffice to prove the existence of such a
cardinality.
7.2 Foundation
ZF9 (Foundation): Every non-empty set x has an ∈-minimal element, i.e. an
element y ∈ x such that no element of x is an element of y:
∀x (x ̸= ∅ → ∃y ∈ x y ∩ x = ∅).
Proof. (i) If x ∈ x, then {x} violates Foundation: the only element of {x} is
x, but x ∩ {x} = {x} =
̸ ∅.
One can show that if ZF1-8 are consistent, then so are ZF1-9: adding Foun-
dation can not introduce a contradiction. In particular, any set we prove to
exist using ZF1-8 (such as N and R) does not violate Foundation. So adding
Foundation is “harmless”, and substantially simplifies the set theoretic universe.
However, Foundation will not actually be used in the remainder of these notes.
ZF1-9 form the axiom system ZF. Later we will add the Axiom of Choice
(AC) to form ZFC, but we delay this until we need it.
End of lecture 10
Example 8.2.
• Z is not well-ordered by its usual order <, since Z has no least element.
Same for R.
• [0, 1] ⊆ R is not well-ordered by <, since (0, 1] lacks a least element.
• {− n1 : n ∈ N \ {0}} ∪ N ⊆ R is well-ordered by <.
Lemma 8.6. If (X, <) is a well-order and θ : (X, <) → (X, <) is an embedding,
then θ(x) ≥ x for all x ∈ X.
Proof. Suppose not. Then a := min{x ∈ X : θ(x) < x} exists. But then
θ(a) < a, so θ(θ(a)) < θ(a) since θ is an embedding, contradicting minimality
of a.
Lemma 8.7. A well-order is not isomorphic to any of its proper initial seg-
ments.
Proof. Define
Y ′ = Y , as required.
End of lecture 11
8 WELL-ORDERED SETS AND ORDINALS 31
8.2 Ordinals
Definition 8.10. A set a is transitive if every element of a is a subset of a,
i.e. x ∈ y ∈ a ⇒ x ∈ a.
Definition 8.11. An ordinal is a transitive set which is well-ordered by ∈.
That is, an ordinal is a transitive set α such that (α, <) is a well-ordered8
set, where < := {⟨β, γ⟩ ∈ α × α : β ∈ γ}.
We use < and ∈ interchangeably to denote the order on an ordinal.
We denote the class of ordinals by ON.
(i) α ∈
/ α (Irreflexivity)
(ii) α ∈ β ∈ γ ⇒ α ∈ γ (Transitivity)
(iii) α ∈ β or α = β or β ∈ α (Totality)
(iv) Any non-empty class of ordinals has an ∈-least element. (Well-foundedness)
(iv) This is immediate from Foundation, but we can also argue directly as
follows. Given a non-empty class Γ of ordinals and γ ∈ Γ, if Γ ∩ γ = ∅
then min Γ = γ, and otherwise min Γ = min(Γ ∩ γ), which exists since γ
is an ordinal.
Proof. Theorem 8.14 shows that ∈ defines a well-order on any set of ordinals.
Theorem 8.16. ON is a proper class.9
Proof. Suppose for a contradiction that |α| ≤ |X| for all α ∈ ON. Then for
every α ∈ ON, there is an injection f : α → X, and then f (x) < f (y) ⇔ x ∈ y
defines a well-order on f [X] ⊆ X which is isomorphic to α.
Considering a well-order (Y, <) as an ordered pair ⟨Y, <⟩, and let W ⊆
P(X) × P(X × X) be the set of all well-orders (Y, <) with Y ⊆ X such that
(Y, <) is isomorphic to some ordinal, and let F : W → ON be the class function
such that F((Y, <)) is the ordinal isomorphic to (Y, <), which is unique by
Lemma 8.17. Then F[W ] = ON by the previous paragraph, so ON is a set by
Replacement, contradicting Theorem 8.16.
Theorem 8.19. Every well-ordered set (X, <) is isomorphic to a unique ordinal
by a unique isomorphism.
Proof. Uniqueness of the ordinal is by Lemma 8.17, and uniqueness of the iso-
morphism is by Lemma 8.8.
For existence, by Theorem 8.18 say α ∈ ON with |α| ̸≤ |X|. Then α is not
isomorphic to an initial segment of X, so by Theorem 8.9, X is isomorphic to
an initial segment of α, which is an ordinal by Lemma 8.13.
Lemma 8.20.
End of lecture 12
F(α) = G(F|α ).
(This make sense because, by Replacement, F|α : α → F[α] is a set for any
α ∈ ON.)
• F(0) = x0 .
• F(α+ ) = S(F(α)) for all α ∈ ON.
• If η ∈ ON is a limit ordinal, then F(η) =
S S
{F(β) : β ∈ η} = F[η].
• V0 = ∅
• Vα+ = P(Vα )
• Vη = {Vβ : β ∈ η} if η is a limit ordinal.
S
This is called the von Neumann cumulative hierarchy. One proves by transfinite
induction that Vα ⊆ Vβ for α ⊆ β. The rank of a set x is then defined as the
least α ∈ ON such that x ⊆ Vα (i.e. x ∈ Vα+ ), if such exists. The axiom of
Foundation is equivalent, modulo the other axioms of ZF, to the statement that
every set is an element of some Vα , i.e. that every set has a rank.
• – α+0=α
– α + β + = (α + β)+
8 WELL-ORDERED SETS AND ORDINALS 35
S
– α+η = {α + β : β ∈ η} for η a limit ordinal.
• – α·0=0
– α · β+ = α · β + α
S
– α · η = {α · β : β ∈ η} for η a limit ordinal.
• – α0 = 1
+
– αβ = (αβ ) · α
– αη = {αβ : β ∈ η} for η a limit ordinal.
S
Example 8.28.
• 1 + ω = n∈ω 1 + n = ω ̸= ω + = ω + 1.
S
• 2ω = n∈ω 2n = ω ̸= ω · ω = ω 2 .
S
• The sum order is the linear order (A, <A ) + (B, <B ) := ((A × {0}) ∪
(B × {1}), <+ ) where for all a, a′ ∈ A and b, b′ ∈ B:
(a) (α + β, ∈) ∼
= (α, ∈) + (β, ∈).
(b) (α · β, ∈) ∼
= (α, ∈) × (β, ∈).
End of lecture 13
(α · γ + , ∈) = (α · γ + α, ∈) ∼
= (α, ∈) × (γ, ∈) + (α, ∈) ∼
= (α, ∈) × (β, ∈),
where the penultimate isomorphism uses the IH and (a), and the final iso-
morphism is by the definitions of the sum and product orders.
(β, ∈) ∼
= (α, ∈) + (β \ α, ∈) ∼
= (α + δ, ∈),
so β = α + δ.
(b) Exercise. Consider product orders, and apply Lemma 8.32 for (iv). (iii) can
also be proven by induction.
(a) AC
(b) Every set X has a choice function, a function h : P(X) \ {∅} → X such
that h(A) ∈ A for all ∅ =
̸ A ⊆ X.
End of lecture 14
Y := {{A} × A : ∅ =
̸ A ⊆ X}
is a choice function.
• If (X, <) is a partially ordered set in which every chain has an upper
bound, then (X, <) has a maximal element.
Proof. ⇒: Let (X, <) be a partial order in which every chain has an upper
bound.
By WO (and Lemma 9.4), there exists a bijection θ : α → X for some
ordinal α.
Define an increasing sequence of chains by transfinite recursion (Corol-
lary 8.24):
– C0 := ∅;
(
Cβ ∪ {θ(β)} if β ∈ α and θ(β) > x for all x ∈ Cβ
– Cβ + :=
Cβ else;
S
– Cη := β∈η Cβ if η is a limit ordinal.
Remark 9.10. Zorn’s Lemma is often applied in the following special form (which
the above proof shows is actually equivalent to our statement): if a is a set and
X ⊆ P(a) is a non-empty set of subsets of a which is closed under unions
S of non-
empty chains, i.e. if ∅ =
̸ C ⊆ X is totally ordered by inclusion then C ∈ X,
then X has a maximal element with respect to inclusion. This follows from our
statement of Zorn’s Lemma by considering the partial order (X, ⊆); indeed, the
empty chain has an upper bound since X is non-empty, and any non-empty
chain is upper-bounded by its union.
9.4 ZFC
From now on, we assume AC. We could take any of the above equivalent forms
as the axiom; we use our first formulation.
10 CARDINAL NUMBERS 40
End of lecture 15
10 Cardinal numbers
By WO, every set is equinumerous with an ordinal (this was Lemma 9.4). Using
this, we now redefine our notation |X|:
Definition 10.1. The cardinality |X| of a set X is the smallest ordinal equinu-
merous with X:
|X| := min{α ∈ ON : α ∼ X}.
(i) |X| = |Y | ⇔ X ∼ Y .
(ii) |X| ≤ |Y | if and only if and only if an injection X → Y exists.
Lemma 10.5. (i) If κ is a cardinal, then there exists a cardinal greater than
κ.
S
(ii) If K is a set of cardinals, then K is a cardinal.
• ℵ0 = ω;
• ℵα+ is the smallest cardinal greater than ℵα ;
• ℵη = β∈η ℵβ if η is a limit ordinal.
S
Theorem 10.7.
(i) κ · κ = κ
(ii) If λ is a cardinal 1 ≤ λ ≤ κ, then κ + λ = κ = κ · λ.
(iii) If λ is an infinite cardinal, then κ + λ = max(κ, λ) = κ · λ.
Proof. (i) By transfinite induction. Assume |α| · |α| = |α| for all infinite
ordinals α < κ. Then
κ≤κ+λ≤κ+κ=κ·2≤κ·κ=κ
κ≤κ·λ≤κ·κ=κ
Proof. Let h be a choice function for X. Then g(y) := h({x ∈ X : f (x) = y})
defines an injection Y → X. So |Y | ≤ |X|.
Theorem 10.11. A countable union of countable sets is countable.
S cardinal, and X is a set such that |X| ≤ κ
More generally, if κ is an infinite
and |a| ≤ κ for all a ∈ X, then | X| ≤ κ.
Definition 10.12.
αi < 2ℵ0 (i.e. 2ℵ0 does not have countable cofinality). This is all ZFC
tells us about 2ℵ0 , in the sense that for any ℵα which is not of this form,
it is consistent with ZFC that 2ℵ0 = ℵα .
End of lecture 16
Proof. (i) We apply Zorn’s Lemma. Consider the set I of linearly independent
subsets of V as a partial order, ordered by inclusion, ⊆. If C ⊆ I is a
chain, then Sits union is also linearly independent, since any finitely many
elements of C are already elements of some I ∈ C. So by Zorn’s Lemma,
there exists a maximal element B ∈ I. We conclude by showing that B is
spanning. Suppose not, say v ∈ V \ ⟨B⟩. Then one verifies directly that
B ∪ {v} is linearly independent, and v ∈ / B, contradicting maximality of
B.
A REFERENCES 45
(ii) Let B and B ′ be bases. The case where B or B ′ is finite was done in
Prelims. So suppose B and B ′ are infinite.
Let P <ω (B) := {B0 ⊆ B : |B0 | < ℵ0 } be S the set of finite subsets of B.
Then |P <ω (B)| = |B|, since P <ω (B) = n∈N B (n) where B (n) := {B0 ⊆
B : |B0 | = n}, and B n → B (n) ; (b1 , . . . , bn ) 7→ {b1 , . . . , bn } is a surjection,
so |B (n) | ≤ |B n | = |B| so |P <ω (B)| ≤ |B| by Theorem 10.11.
If B0 ∈ P <ω (B),Sthen ⟨B0 ⟩ ∩ B ′ is finite by the finite-dimensional case.
But V = ⟨B⟩ = {⟨B0 ⟩ : B0 ∈ P <ω (B)}, so B ′ = {⟨B0 ⟩ ∩ B ′ : B0 ∈
S
P <ω (B)} is a union of |P <ω (B)| = |B| finite sets, so by Theorem 10.11
again, |B ′ | ≤ |B|.
By symmetry, |B ′ | = |B|.
A References
[Copied directly from Jonathan Pila’s notes, with a few amendments]
Text to expand and supplement these notes:
[1] D. Goldrei, Classic set theory, Chapman and Hall/CRC, Boca Raton, 1998.
[4] R. Rucker, Infinity and the mind, Princeton University Press, 1995.
[5] J. Stillwell, Roads to Infinity, CRC Press, AK Peters, 2010.
[6] P. Aczel, Non-well founded sets, CSLI Lecture Notes, 14, Stanford, CA. On
web.
[7] M. Hils and F. Loeser, A First Journey through Logic, AMS Student Math-
ematical Library volume 89, 2019.
[13] P. Cohen, Set theory and the continuum hypothesis, W. A. Benjamin, 1966.
[14] P. Cohen and R. Hersch, Non-Cantorian set theory, Scientific American 217
(1967), 104–116.
[15] R. Dedekind, Essays on the theory of numbers, Dover, 1963.
[16] K. Gödel, The consistency of the axiom of choice and the generalized con-
tinuum hypothesis with the axioms of set theory, Annals of Mathematics
Studies 3, Princeton University Press, 1940.
[17] K. Gödel, What is Cantor’s continuum problem? American Mathematical
Monthly, 54 (1964), 515–525. Revised version in Benacerraf and Putnam,
Philosophy of mathematics: selected readings, Prentice-Hall, 1964.