Notes On Ultrafilters: 1 Basic Theory
Notes On Ultrafilters: 1 Basic Theory
Alex Kruckman
1 Basic theory
Let X be a set. An (ultra)filter on X is a consistent choice of which subsets of X are “large”.
Definition. A filter on X is F ⊆ P(X) such that
1. X ∈ F (the whole set is large).
Lemma (FIP). Any subset S ⊆ P(X) with the FIP has a minimal filter containing it, the
filter generated by S.
Proof. Close S downward under finite intersections, then upward under supersets.
Definition. A filter F on X is an ultrafilter if
5. For any A ⊆ X, A ∈ F or X \ A ∈ F (every set is either large or co-large).
Exercise 1. Given a filter F, show that F is an ultrafilter iff ni=0 Ai ∈ F implies that
S
Ai ∈ F for some i (a large set cannot be a finite union of small sets).
Note that principal filters are ultrafilters, but the trivial filter and the cofinite filter are
not. Are there any nonprincipal ultrafilters? The answer is yes, but not constructively! The
following lemma is a weak form of the Axiom of Choice, i.e. it is not provable in ZF.
1
Lemma (Ultrafilter lemma). Every filter is contained in an ultrafilter.
2 Generalized limits
Consider a function f : N → [0, 1], determining a sequence a0 , a1 , a2 , . . . We say that
lim an = L if for every open set U containing L, all but finitely many natural numbers
n→∞
are mapped into U by f , i.e. f −1 (U ) is in the cofinite filter on N.
Now given distinct points a and b in [0, 1], the sequence a, b, a, b, . . . does not converge. If
we take disjoint neighborhoods a ∈ U , b ∈ V , f −1 (U ) and f −1 (V ) are both infinite, coinfinite
subsets of N. An ultrafilter on N would give preference to one of these two sets and decide
whether a or b should be the limit of the sequence.
Definitions
We’ll take a more general perspective by transporting (ultra)filters from the index set to the
target space (from N to [0, 1] in the example above).
Now we can define generalized limit points of maps. With this definition, the usual notion
of convergence of a sequence is F-convergence, where F is the cofinite filter on N.
Filter convergence works best with ultrafilters on compact Hausdorff spaces, as illustrated
by the following theorem.
2
Theorem (Ultrafilter convergence theorem). Let Y be a topological space.
1. Y is compact if and only if every ultrafilter F on Y converges to at least one point.
2. Suppose for contradiction that Y is Hausdorff, but y 6= y 0 are both limit points of F.
Take disjoint open sets y ∈ U , y 0 ∈ U 0 . Now U, U 0 ∈ F, but U ∩ U 0 = ∅, contradition.
Conversely, suppose that Y is not Hausdorff. Then there are points y 6= y 0 such that
every open neighborhood of y intersects every open neighborhood of y 0 . This means
that {U | y ∈ U open} ∪ {U 0 | y 0 ∈ U 0 open} has the FIP. Let F be an ultrafilter
containing it. Then y and y 0 are both limit points of F.
3
Exercise 4. Check the details.
• βX is a compact Hausdorff space for all X. In fact, it is a Stone space: compact,
Hausdorff, and totally disconnected.
• φ is continuous.
• φ is unique.
Note that since every ultrafilter on a finite set is principal, if X is finite, βX = X with
the discrete topology.
3 Ultraproducts
Application: Prime ideals in products of rings
Exercise 5. In a finite product of rings ni=0 Ai , all prime ideals have the following form:
Q
{(a0 , . . . , ai , . . . , an ) | ai ∈ p}, where p is a prime ideal in Ai for some i. Hence a prime
ideal Q in the product `n is determined by a choice of a prime idea in one of the factors, and
Spec( ni=0 Ai ) ∼ = i=0 Spec(Ai ) (you can think of this as an isomorphism of sets, topological
spaces, or schemes).
The situation is not so simple for an infinite product of rings. For example, if {Ai | i ∈ I}
Q infinite collection of rings, the set {(ai ) | ai = 0 for all but finitely many i} is an ideal
is an
in i∈I Ai . It can be extended to a maximal ideal, which cannot be contained in any prime
ideal of the form described in the exercise.
4
This situation is analagous to one we’ve seen before: on a finite set, all ultrafilters are
principal. But on an infinite set, there are other ultrafilters, obtained by extending the
cofinite filter. One can give a complete description of the primes in an infinite product of
rings using ultrafilters, but to keep things simple, we’ll focus on products of fields here.
Q
Theorem. Let {Fi | i ∈ I} be a collection of fields. The prime ideals in the ring i∈I Fi
are in bijection with the ultrafilters on I. The ultrafilter F corresponds to the prime ideal
{(ai ) | the set of indices i such that ai = 0 is in F}. In the same way, the proper ideals in
the ring are in bijection with the filters on I.
Q
Proof. Given an element a = (ai ) ∈ i Fi , let Za = {i ∈ I | ai = 0}. Notice that Zab =
Za ∪ Zb , and Za+b ⊇ Za ∩ Zb .
Let F be a filter on I. We claim that p = {a | Za ∈ F} is a proper ideal. Indeed, 0 ∈ p
since I ∈ F, and 1 6∈ p since ∅ 6∈ F.
Suppose a, b ∈ p. Then Za , Zb ∈ F. Now Za+b ⊇ Za ∩ Zb , so Za+b ∈ F, and a + b ∈ p.
Suppose a ∈ p and c is another ring element. Then Zca = Zc ∪ Za ⊇ Za , so Zca ∈ F, and
ca ∈ p.
We have established that p is an ideal. Suppose now that F is an ultrafilter, and let a
and b be ring elements such that ab ∈ p. Now Zab = Za ∪ Zb is in F. Then Za ∈ F or
Zb ∈ F, so a ∈ p or b ∈ p, and hence p is prime.
1. I ∈ Fp , since 0 ∈ p.
2. ∅ ∈ Fp , since 1 6∈ p.
5. We have established that Fp is a filter. Suppose now that p is a prime ideal, and let
A be any subset of I. Let a and b be ring elements such that Za = A and Zb = I \ A.
Then ab = 0 ∈ p, so one of a or b is in p, and hence one of A or I \ A is in Fp , so Fp is
an ultrafilter.
5
Syntax and semantics
The ultraproduct construction is a very general method for putting together algebraic struc-
tures to obtain structures with specified properties. To speak in the correct level of generality,
we need a quick introduction to the language of universal algebra and model theory.
Definition. A language L is a set of symbols. Each symbol is specified to be a constant
symbol, a function symbol (of a specified finite arity), or a relation symbol (of a specified
finite arity). An L-structure is a set M , together with interpretations of the symbols in L:
• For each constant symbol c, an element cM ∈ M .
• Rings and fields are Lr -structures, where Lr = {+, −, ·, 0, 1}, + and · are binary
functions, − is a unary function, and 0 and 1 are constants.
There is an obvious notion of homomorphism of L-structures (a map of underlying sets
which preserves the interpretations of the symbols). There is also a natural way to define
a product of L-structures, which, as one might hope, gives the categorical product in the
category of L-structures.
But this situation is unsatisfying: the class of L-structures is too broad. At this point,
we have no way of specifying that an Lr -structure is a ring, or that ≤ is interpreted as an
order in an L≤ -structure.
In order to impose more interesting conditions on our structures, we need more expressive
syntax. We’ll use the syntax of first-order logic.
Definition. Fix a language L. A term is built up from the constant symbols of L and a
supply of variables x0 , x1 , x2 , . . . by applications of the function symbols of L.
An atomic formula is of the form t1 = t2 or R(t1 , . . . , tn ), where the ti are terms and R
is a relation symbol of arity n.
A formula is built up from atomic formulas by Boolean connectives ∧, ∨, and ¬, and
quantifiers ∀xi and ∃xi .
A sentence is a formula with no free variables. That is, every variable appearing in the
formula is quantified.
Sentences in a language L have a familiar semantics in L-structures, in which they express
properties of these structures. In particular, given a sentence φ and an L-structure M , φ is
either true or false in M . In the same way, formulas with free variables express properties
of elements. Given a formula φ(x) with free variable x and an element a ∈ M , φ(a) is either
true or false in M .
For example, the following sentence asserts that the interpretation of ≤ is a transitive
relation:
∀x1 ∀x2 ∀x3 (x1 ≤ x2 ∧ x2 ≤ x3 ) → x1 ≤ x3 .
6
Here φ → ψ is being used as a convenient abbreviation for ¬φ ∨ ψ.
The following formula in the free variable x1 expresses that x1 has a multiplicative inverse:
∃x2 x1 · x2 = 1.
Definition. If the sentence φ is true in the L-structure M , we say that M satisfies φ and
write M |= φ.
A theory is a set of sentences in a language L.
If T is a theory and the L-structure M |= φ for all φ ∈ T , then M is a model for T ,
written M |= T .
Exercise 6. Write down the field axioms as a first-order theory. Now find a theory whose
models are exactly the algebraically closed fields.
Note that the definition of formulas has an inductive structure: each formula is built
from strictly simpler formulas. This property allows us to conduct arguments by induction
on the complexity of formulas.
Since φ ∨ ψ is equivalent to ¬(¬φ ∧ ¬ψ) and ∀x φ(x) is equivalent to ¬∃x ¬φ(x), it suffices
to consider formulas built up by ¬, ∧, and ∃x.
• If f is a function symbol of arity n, f M ([(ai1 )], . . . , [(ain )]) = [(f Mi (ai1 , . . . , ain ))].
Exercise 7. Check that this is well-defined: that ∼ is an equivalence relation and that the
interpretations of f and R are independent of the choice of representative for each equivalence
class.
7
Theorem (Loś’s theorem). Let {Mi | i ∈ I} be a collection of L-structures, and let F be an
ultrafilter on I. Let φ(x) be a first-order Q
formula in the free variables
Q x, and let [(ai )] be a
tuple of elements from the ultraproduct I Mi /F |= φ. Then I Mi /F |= φ([(ai )]) if and
only if {i ∈ I | Mi |= φ(ai )} ∈ I.
Exercise 10. Let {Fi | i ∈ I} be a collection of fields. Explain the connection between
the ultraproduct of these fields with respect to the ultrafilter F and the prime ideal in the
product of these fields corresponding to F identified in the previous section.
Exercise 11. Use the ultrapower construction to exhibit a nonstandard model for Th(N),
i.e. a semiring which satisfies all the same first-order statements as N in the language of
rings, but which has infinite elements.
Exercise 12. Use the ultraproduct construction to construct a field of characteristic 0 which
has exactly one algebraic extension of each degree.
8
Application: The compactness theorem of first-order logic
The compactness theorem is a powerful tool for showing that first-order theories are sat-
isfiable, i.e. that structures with certain properties exist. There are several proofs of this
theorem, of which the ultraproduct proof given here is the slickest.
Theorem (Compactness theorem). A theory T is satisfiable if and only if every finite subset
of T is satisfiable.
Proof. If T is satisfiable, then there is a model M |= T . Now M |= ∆ for any finite ∆ ⊆ T ,
so every finite subset of T is satisfiable.
Conversely, suppose every finite subset of T is satisfiable. Then we have a collection of
structures {M∆ | ∆ ∈ I} indexed by the collection of all finite subtheories of T , with M∆ |= ∆
for all ∆ ∈ I.
The idea is to take the ultraproduct of Q the M∆ with respect to some ultrafilter on I, in
such a way that the ultraproduct M = I M∆ /F is a model for T . By Loś’s theorem, it
suffices to find an ultrafilter F such that for all φ ∈ T , {∆ | M∆ |= φ} ∈ F.
Now certainly {∆ | M∆ |= φ} ⊇ {∆ | φ ∈ ∆}, since M∆ |= ∆. So we just need to pick F
so that Aφ =T {∆ | φ ∈ ∆} ∈ F for all φ ∈ T . We can do this if {Aφ | φ ∈ T } has the FIP.
And it does: ni=0 Aφi 6= ∅, since it contains the finite theory {φ0 , . . . , φn }.
To apply the compactness theorem, just write down any theory you like. If you can show
that there are no inconsistencies arising from finite pieces of the theory, the entire theory
has a model. Write down what you want, and then get what you want!
Exercise 13. For each of the examples in the “Building funny structures” section, show
how to use the compactness theorem to prove that a structure with the desired properties
exists.
9
Question: Is every Boolean algebra isomorphic to the powerset algebra of a set X?
The answer to the question is no, but the same idea that led to the construction of the
Stone-Cech compactification can lead us to a positive result: every Boolean algebra can be
embedded as a subalgebra of a powerset algebra.
Note that if B is the powerset algebra of the set X, then βB is the Stone-Cech compact-
ification of X.
Measurable cardinals
Another way to view ultrafilters is as finitely additive 0-1-valued measures on the σ-algebra
P(X) (assigning measure 1 to “large” subsets and measure 0 to “small” subsets).
Since the usual convention in measure theory is to consider countably additive measures,
it is natural to ask whether we can find countably additive 0-1-valued measures. These would
be ultrafilters satisfying the countable intersection property (CIP), instead of the FIP.
It turns out that any ultrafilter with the CIP in fact must be an ultrafilter on a set X
of cardinality κ and satisfy the (<κ)IP (any intersection of fewer than κ many subsets is
nonempty), where κ is a measurable cardinal.
Measurable cardinals are one of the mysterious large cardinals studied in set theory. They
must necessarily be much much larger than any of the cardinalities encountered in everyday
mathematics, so large that their existence cannot be proven in ZFC. Think of the comparison
of cardinals like ℵ0 and 2ℵ0 with κ as being analagous to the comparison of cardinals like 2
and 47 with ℵ0 .
In fact, the situation is not just that we cannot prove measurable cardinals exist: we
cannot prove that the consistency of ZFC implies the consistency of ZFC + “there exists
a measurable cardinal” (so adding this axiom could make the system inconsistent), and
moreover, we can prove that such a thing cannot be proven. It’s entirely possible that ZFC
proves that there are no measurable cardinals.
But we have no such proof, and most set theorists believe that this and other “large
cardinal axioms” are not only consistent, but also philosophically motivated, and provide an
extension of the ZFC set theory which is too elegant and well-structured to be ignored. But
that’s another story...
10