Skript PAdicAnalysis Pub Screen (1)
Skript PAdicAnalysis Pub Screen (1)
Contents
1. Introduction 2
2. p-adic numbers 3
3. Newton Polygons 14
4. Multiplicative seminorms and Berkovich spaces 19
5. The Berkovich affine and projective line 24
6. Analytic spaces and functions 34
7. Berkovich spaces of curves 39
8. Integration 45
References 52
1. Introduction
We all know the fields R and C of real and complex numbers as the completion of
the field Q of rational numbers and its algebraic closure. In particular, we have
a canonical embedding Q ,→ R ⊂ C, which we can sometimes use to get number
theoretic results by applying analysis over R or C.
Now R is not the only completion of Q. Besides the usual absolute value, there are
more absolute values on Q; to be precise, up to a natural equivalence (and except
for the trivial one), there is one absolute value |·|p for each prime number p. (We
will explain what an absolute value is in due course.) Completing Q with respect
to |·|p leads to the field Qp of p-adic numbers; we can then take its algebraic
closure Q̄p and the completion Cp of that (Q̄p is, in contrast to R̄ = C, not
complete), which has similar properties as C: it is the smallest extension field
of Q that is algebraically closed and complete with respect to the p-adic absolute
value.
There is a general philosophy in Number Theory that ‘all completions are created
equal’ and should have the same rights. In many situations, one gets the best
results by considering them all together. Since Qp and Cp are complete metric
spaces (and the former is, like R or C, locally compact), we can try to do analysis
over them. Many concepts and results of ‘classical’ analysis carry over without
great problems. But we will see that there is one feature of the p-adic topology
that is a stumbling block for an easy transfer of certain parts of analysis (like
for example line integrals) to the p-adic setting: this topology is totally discon-
nected. One goal of this lecture course is to explain a way to resolve this problem,
which is to embed Cp (say) into a larger ‘analytic space’ Canp that is (in this case,
even uniquely) path-connected. This approach is due to Vladimir Berkovich; the
analytic spaces constructed in this way are also known as Berkovich Spaces. V.G. Berkovich
c MFO 2012
§ 2. p-adic numbers 3
2. p-adic numbers
In this section we will recall (or introduce) the field Qp of p-adic numbers and its
properties. We begin with a rather general definition.
(1) |x| = 0 ⇐⇒ x = 0.
(2) (Multiplicativity) |xy| = |x| · |y|.
(3) (Triangle Inequality) |x + y| ≤ |x| + |y|.
Properties (1), (2) and (3) imply that d(x, y) = |x − y| defines a metric on K.
((2) is needed for the symmetry: it implies that | − 1| = 1, so that |y − x| =
| − 1| · |x − y| = |x − y|.) The absolute value is then continuous as a real-valued
function on the metric space (K, d).
One can show that two absolute values on K are equivalent if and only if they
induce the same topology on K (Exercise).
2.2. Lemma. Let (K, |·|) be a field with a non-archimedean absolute value. Then LEMMA
R = {x ∈ K : |x| ≤ 1} is a subring of K. R is a local ring (i.e., it has exactly one valuation ring
maximal ideal) and has K as its field of fractions.
Proof. Exercise! q
2.3. Definition. A metric space (X, d) is said to be complete, if every Cauchy DEF
sequence in X converges in X: if (xn ) is a sequence in X such that complete
metric space
lim sup d(xn , xn+m ) = 0 ,
n→∞ m≥1
So the p-adic absolute value of x ∈ Q is small when (the numerator of) x is divisible
by a high power of p, and it is large when the denominator of x is divisible by a
high power of p. If x ∈ Z, then we clearly have |x|p ≤ 1: contrary to the familiar
situation with the usual absolute value, the integers form a bounded subset of Q
with respect to |·|p ! This explains the word ‘non-archimedean’ — the Archimedean
Axiom states that if x, y ∈ R>0 , then there is n ∈ Z such that nx > y; this is
equivalent to saying that Z is unbounded.
2.6. Lemma. The p-adic absolute value is a non-archimedean absolute value LEMMA
on Q. |·|p is
abs. value
Proof. We check the properties in Definition 2.1. Property (1) is clear from the
definition. Property (2) follows from vp (ab) = vp (a)+vp (b), which is a consequence
of unique factorization. Property (30 ) follows from vp (a + b) ≥ min{vp (a), vp (b)},
which is a consequence of the elementary fact that pn | a and pn | b together imply
that pn | a + b. q
2.7. Example. Let (K, |·|) be a field with absolute value. Then for every x ∈ K EXAMPLE
with |x| < 1, the series geometric
∞
X series
xn
n=0
converges to the limit 1/(1 − x): we have
N −1
X 1 −xN
xn − = = |1 − x|−1 · |x|N ,
n=0
1−x 1−x
§ 2. p-adic numbers 5
which tends to zero, since |x| < 1 (note that |1 − x| ≥ 1 − |x| > 0, so the fraction
makes sense).
For example,
1
1 + p + p2 + p 3 + p4 + . . . =
1−p
in (Q, |·|p ). ♣
2.8. Theorem. Let (K, |·|) be a field with an absolute value. Then there is a THM
field (K 0 , |·|0 ) extending K such that |·|0 restricts to |·| on K, (K 0 , |·|0 ) is complete completion
and K is dense in K 0 .
Proof. Let C(K) be the ring of Cauchy sequences over K (with term-wise addition
and multiplication). The set N (K) of null sequences (i.e., sequences converging
to zero) forms an ideal in the ring C(K). We show that this ideal is actually
maximal. It does not contain the unit (all-ones) sequence, so it is not all of C(K).
If (xn ) ∈ C(K)\N (K), then it follows from the definition of ‘Cauchy sequence’ that
there are n0 ∈ Z>0 and c > 0 such that |xn | ≥ c for all n ≥ n0 . Then the sequence
(yn ) given by yn = 0 for n < n0 and yn = 1/xn for n ≥ n0 is a Cauchy sequence
and (1) = (xn ) · (yn ) + (zn ) with (zn ) ∈ N (K), so N (K) + C(K) · (xn ) = C(K).
Since N (K) is a maximal ideal in C(K), the quotient ring K 0 := C(K)/N (K) is a
field. There is a natural inclusion K ,→ C(K) by mapping a ∈ K to the constant
sequence (a), which by composition with the canonical epimorphism C(K) → K 0
gives an embedding i : K ,→ K 0 . We define |·|0 by
|[(xn )]|0 = lim |xn |
n→∞
(where [(xn )] denotes the residue class mod N (K) of the sequence (xn ) ∈ C(K)).
The properties of absolute values and of Cauchy sequences imply that this is well-
defined (i.e., the limit exists and does not depend on the choice of the represen-
tative sequence). That |·|0 is an absolute value follows easily from the assumption
that |·| is, and it is clear that |·|0 restricts to |·| on K.
It is also easy to see that K is dense in K 0 . Let x = [(xn )] ∈ K 0 , then |i(xn ) − x|0 =
limm→∞ |xn − xn+m | tends to zero as n tends to infinity, so we can approximate x
arbitrarily closely by elements of K.
It remains to show that K 0 is complete. So let (x(ν) )ν be a Cauchy sequence in K 0
(ν)
and represent each x(ν) by a Cauchy sequence (xn )n in K. Since K is dense
(ν)
in K 0 , we can do this in such a way that |i(xn ) − x(ν) |0 ≤ 2−n for all ν and n.
(The reason for this requirement is that we need some uniformity of convergence
(n)
to make the proof work.) Let yn = xn . We claim that (yn ) is a Cauchy sequence
(in K) and that y = [(yn )] = limν→∞ x(ν) (in K 0 ). To see the first, pick ε > 0.
There is ν0 such that |x(ν) − x(ν+µ) |0 < ε for all ν ≥ ν0 and µ ≥ 0. Then
(n+m)
|yn − yn+m | = |x(n)
n − xn+m |
(n) 0 (n+m)
≤ |i(x(n)
n )−x | + |x(n) − x(n+m) |0 + |x(n+m) − i(xn+m )|0
≤ 2−n + ε + 2−n−m < 2ε
§ 2. p-adic numbers 6
when 2−n < ε/2 and n ≥ ν0 . To see the second, note that
|x(ν) − y|0 = lim |x(ν) (n)
n − xn |
n→∞
(ν) 0
≤ lim sup |i(x(ν) (ν)
− x(n) |0 + |x(n) − i(x(n) 0
n ) − x | + |x n )|
n→∞
One can show that (K 0 , |·|0 ) is determined up to unique isomorphism (of extensions
of K with absolute value), but we will not need this in the following.
Since |·|0 extends |·|, we will usually use the same notation for both. We will also
consider K as a subfield of K 0 .
2.9. Definition. Let p be a prime number. The completion Qp of Q with respect DEF
to the p-adic absolute value is called the field of p-adic numbers. Its valuation field of
ring Zp is the ring of p-adic integers. ♦ p-adic
numbers
Since |x|p takes a discrete set of values for x 6= 0, it follows that
Zp = {x ∈ Qp : |x|p ≤ 1} = {x ∈ Qp : |x|p < p}
is closed and open in Qp . This implies that for any two distinct elements x, y ∈ Qp ,
there are disjoint open neighborhoods X of x and Y of y such that Qp = X ∪ Y :
Qp is totally disconnected. To see this, let δ = |x − y|p > 0. Then the open
ball X around x of radius δ (which is x + pn+1 Zp , if δ = p−n ) is also closed,
so its complement Y = Qp \ X is open as well, and y ∈ Y . One consequence
of this is that any continuous map γ : [0, 1] → Qp is constant (otherwise let x
and y be two distinct elements in the image and let X and Y be as above; then
[0, 1] is the disjoint union of the two open non-empty subsets γ −1 (X) and γ −1 (Y ),
contradicting the fact that intervals are connected).
Next we want to show that Zp is compact. For this we need a fact about approx-
imation of p-adic numbers by rationals.
2.10. Lemma. Let x ∈ Qp \ {0} with |x|p = p−n . There is a ∈ {0, 1, 2, . . . , p − 1} LEMMA
such that |x − apn |p < |x|p . approximation
of p-adic
numbers
Proof. Replacing x by p−n x, we can assume that |x|p = 1, in particular, x ∈ Zp .
Since Q is dense in Qp , there is r/s ∈ Q such that |x − r/s|p < 1. By the
ultrametric triangle inequality, |r/s|p ≤ 1, so that we can assume that p - s. Let
a ∈ {0, 1, . . . , p − 1} be such that as ≡ r mod p. Then
|x − a|p = (x − r/s) + (r/s − a) p
≤ max{|x − r/s|p , |r/s − a|p } < 1 ,
2.11. Theorem. The ring Zp is compact with respect to the metric induced by THM
the p-adic absolute value. In particular, Qp is locally compact. Zp is compact
Proof. We show that any sequence (xn ) in Zp has a convergent subsequence. Since
Zp is complete (Zp = {x ∈ Qp : |x|p ≤ 1} is a closed subset of the complete
space Qp ), it is enough to show that there is a subsequence that is a Cauchy
sequence. We do this iteratively. Let n1 be the smallest index n such that there are
infinitely many m > n with |xn − xm |p < 1. Such an n must exist by Lemma 2.10,
which implies that there is some a such that |xn − a|p < 1 for infinitely many n.
Now let n2 be the smallest index n > n1 such that |xn − xn1 |p < 1 and such
that there are infinitely many m > n with |xn − xm | < p−1 . This again exists by
Lemma 2.10, where we restrict to the infinitely many n such that |xn − xn1 |p < 1.
We continue in this way: nk+1 is the smallest n > nk such that |xn −xnk |p < p−(k−1)
and such that there are infinitely many m > n with |xn − xm |p < p−k . Then
|xnk+1 − xnk |p < p−(k−1) for all k ≥ 1, which by the ultrametric triangle inequality
implies that (xnk )k is a Cauchy sequence:
n o
xnk+l − xnk p ≤ max xnk+m − xnk+m−1 p : 1 ≤ m ≤ l < p−(k−1) .
That Qp is locally compact follows, since any x ∈ Qp has the compact neighbor-
hood x + Zp . q
2.12. Definition. Let (K, |·|) be a complete field with a non-archimedean abso- DEF
lute value. By Lemma 2.2 the valuation ring R = {x ∈ K : |x| ≤ 1} is a local ring residue field
with unique maximal ideal M = {x ∈ K : |x| < 1}. The quotient ring k = R/M
is therefore a field, the residue field of K. The canonical map R → k is called the
reduction map and usually denoted x 7→ x̄. ♦
2.13. Lemma. Let (K, |·|) be a field with a non-archimedean absolute value and LEMMA
let x, y ∈ K with |x| > |y|. Then |x + y| = |x|. all triangles
If we have x , x , . . . , x ∈ K such that x + x + . . . + x = 0 and n ≥ 2, then are isosceles
1 2 n 1 2 n
there are at least two indices 1 ≤ j < k ≤ n such that
|xj | = |xk | = max{|x1 |, |x2 |, . . . , |xn |} .
Proof. We have |x + y| ≤ max{|x|, |y|} = |x|. Assume that |x + y| < |x|. Then
|x| = |(x + y) + (−y)| ≤ max{|x + y|, | − y|} < |x| ,
a contradiction.
For the second statement observe that if we had just one j such that |xj | is
maximal, then by the second statement we would have
X
0 = |x1 + x2 + . . . + xn | = xj + xk = |xj | ,
k6=j
2.14. Lemma. Let (K, |·|) be a field with a non-archimedean absolute value. For LEMMA
f = a0 + a1 x + a2 x2 + . . . + an xn ∈ K[x] we set extension of
abs. value
|f | := max{|a0 |, |a1 |, |a2 |, . . . , |an |} .
to K[x]
0
Then properties (1), (2) and (3 ) of Definition 2.1 are satisfied for elements
of K[x].
If K is complete with respect to |·|, then for each n ∈ Z≥0 , the space K[x]<n of
polynomials of degree < n is a complete metric space for the metric induced by |·|
on K[x].
Proof. Exercise. q
2.15. Theorem. Let (K, |·|) be a complete non-archimedean field with valuation THM
ring R and residue field k. Let F ∈ R[x] be a polynomial such that |F | = 1 Hensel’s
and suppose that we have a factorization F̄ = f1 f2 in k[x] such that f1 is monic Lemma
and f1 and f2 are coprime. (The notation F̄ means the polynomial in k[x] whose
coefficients are obtained by reduction from those of F .) Then there are unique
polynomials F1 , F2 ∈ R[x] such that F = F1 F2 , F1 is monic and F̄1 = f1 , F̄2 = f2 .
We ignore the nonlinear term H1 H2 and choose (H1 , H2 ) such that the linear terms
(0) (0)
correct the mistake, i.e., (H1 , H2 ) = Φ−1 (F − F1 F2 ). From the above we know
that |H1 |, |H2 | ≤ δ and therefore
(0) (0)
F − (F1 + H1 )(F2 + H2 ) = |H1 H2 | ≤ δ 2 .
(1) (0) (2)
Repeating this with the new approximations Fj = Fj + Hj we obtain Fj such
(1) (2) (2) (2)
that |Fj − Fj | ≤ δ 2 and |F − F1 F2 | ≤ δ 4 . Iterating this procedure, we
(m)
construct sequences (Fj )m≥0 in R[x]<nj such that
(m) (m+1) m (m) (m+1) m (m) (m) m
|F1 − F1 | ≤ δ2 , |F2 − F2 | ≤ δ2 and |F − F1 F2 | ≤ δ2
for all m. Since R[x]<nj is complete by Lemma 2.14, the sequences converge to
polynomials F1 and F2 with F = F1 F2 and F̄1 = f1 , F̄2 = f2 . This shows existence.
To show uniqueness, assume that F̃1 and F̃2 are another solution. Then
0 = F − F = F1 F2 − F̃1 F̃2 = (F1 − F̃1 )F2 + (F2 − F˜2 )F̃1 .
The map Φ as above, but using F̃1 and F2 , is still invertible (this only uses that the
reductions are f1 and f2 ), which immediately gives F1 −F̃1 = 0 and F2 −F̃2 = 0. q
2.16. Corollary. Let (K, |·|) be a field that is complete with respect to a non- COR
archimedean absolute value, let R be its valuation ring and k its residue field and Hensel’s
let f ∈ R[x] be monic. Assume that f¯ ∈ k[x] has a simple root a ∈ k. Then f has Lemma
a unique root α ∈ R such that ᾱ = a. for roots
Proof. This is the case f1 = x − a of Theorem 2.15. Note that the assumption
that a is a simple root of f¯ implies that the cofactor f2 = f¯/(x − a) is coprime
to f1 . q
2.17. Corollary. Let (K, |·|) be a field that is complete with respect to a non- COR
archimedean absolute value and let f = a0 + a1 x + . . . + an xn ∈ K[x] with an 6= 0. reducibility
Assume that there is 0 < m < n with |am | = |f | and that |a0 | < |f | or |an | < |f |. of certain
Then f is reducible. polynomials
Proof. After scaling f we can assume that |f | = 1. If |a0 | < 1, then let m be
minimal with |am | = 1. Then f¯ = xm f2 with f2 (0) 6= 0, so that f1 = xm and f2
are coprime. By Theorem 2.15 there is a factorization f = F1 F2 with deg F1 = m.
Since 0 < m < n = deg f , this shows that f is reducible.
If |an | < 1, then let m be maximal with |am | = 1. Then f¯ = f1 · ām with f1 monic
and (trivially) coprime to f2 = ām . Again by Theorem 2.15 there is a factorization
f = F1 F2 with deg F1 = m. Since again 0 < m < n = deg f , this shows that f is
reducible also in this case. q
2.18. Example. The residue field of Qp is Fp . This essentially follows from EXAMPLE
Lemma 2.10; the details are left as an exercise. ♣ Qp has Fp as
residue field
We will now look at field extensions of complete fields with absolute value. First
we introduce the norm and trace of an element in a finite field extension.
§ 2. p-adic numbers 10
2.19. Definition. Let K ⊂ L be a finite field extension and let α ∈ L. Then DEF
multiplication by α induces a K-linear map mα : L → L. We define the norm N(α) norm and
and trace Tr(α) of α to be the determinant and trace of mα , respectively. trace
If we want to make clear which fields are involved, we write NL/K (α) and TrL/K (α).
♦
2.20. Lemma. Let (K, |·|) be a field that is complete with respect to a non- LEMMA
archimedean absolute value and let R be its valuation ring. Let K ⊂ L be a finite integrality
field extension and let α ∈ L have norm N(α) ∈ R. Then α is integral over R,
i.e., α is a root of a monic polynomial with coefficients in R.
0
Proof. Let L0 = K(α) ⊂ L. Then N(α) = NL0 /K (α)[L:L ] . Let f ∈ K[x] be the
minimal polynomial of α. Its constant term a0 is ± NL0 /K (α) and so the assumption
implies |a0 | ≤ 1. If we had |f | > 1, then f would be reducible by Corollary 2.17,
a contradiction. So |f | = 1, meaning that f ∈ R[x], and α is a root of the monic
polynomial f . q
2.21. Corollary. In the situation of Lemma 2.20 we have | N(1 + α)| ≤ 1. COR
bound for
N(1 + α)
Proof. We know that α is a root of a monic polynomial f ∈ R[x]. Then 1 + α is
a root of the monic polynomial f (x − 1) ∈ R[x], and since a power of its constant
term, which is in R, is ± N(1 + α), it follows that | N(1 + α)| ≤ 1. q
2.22. Theorem. Let (K, |·|) be a complete field with a nontrivial non-archimedean THM
absolute value and let L be an algebraic extension of K. Then there is a unique extensions
absolute value |·|0 on L that extends |·|. If the extension K ⊂ L is finite, then of complete
(L, |·|0 ) is complete. fields
§ 2. p-adic numbers 11
We remark that (2.1) implies that the topology on L induced by |·|0 is the same as
the topology induced by any K-linear isomorphism L → K n , where K n has the
product topology.
Since |·|0 is uniquely determined and extends |·|, one simply writes |·| for the
absolute value on L.
We note that it can be shown that a field that is complete with respect to an
archimedean absolute value must be isomorphic to R or C (with the usual absolute
value) and that a locally compact (and then necessarily complete) non-archimedean
field of characteristic 0 must be isomorphic to a finite extension of Qp for some
prime number p. (In characteristic p, it will be isomorphic to a finite extension of
the field Fp ((t)) of formal Laurent series over Fp , which is the field of fractions of
the ring of formal power series Fp [[t]].)
2.23. Corollary. Let Q̄p be the algebraic closure of Qp . Then Q̄p has a unique COR
absolute value extending |·|p on Qp . If σ ∈ Aut(Q̄p /Qp ) is any automorphism, Q̄p has unique
then |σ(α)|p = |α|p for all α ∈ Q̄p . abs. value
Next we want to show that Q̄p is not complete (contrary to the algebraic closure C
of the completion R of Q with respect to the usual absolute value).
P∞ thatn [Qp (ζn+1 ) : Qp (ζn )] = 2 for all n ≥ 1. We now consider the series
such
n=1 ζn p . If Q̄p were complete, then series would converge (since |ζn |p = 1).
So we will show that the series does not converge in Q̄p . The proof will be by
contradiction. So we assume that the series has a limit α ∈ Q̄p . Then m :=
[Qp (α) : Qp ] is finite. Let σ ∈ Aut(Q̄p /Qp ). By Corollary 2.23, σ is continuous
with respect to the p-adic topology. This implies that
X∞
σ(α) = σ(ζn )pn .
n=2
Since [Qp (α) : Qp ] = m, there are exactly m values that σ(α) can take (namely,
the roots of the minimal P polynomial of α over Qp ). Now pick some n such that
2 > m. Note that sn = nk=1 ζk pk generates the field Qp (ζn ) (this is easily seen
n
and there are only m < 2n possibilities for the left hand side. This gives the
desired contradiction. q
element of K, we can assume that |α| ≤ 1 (recall that L has a unique absolute
value extending that of K 0 ). We must show that α ∈ K 0 . Let f ∈ K 0 [x] be the
minimal polynomial of α. Since K is dense in K 0 , there is a sequence (fn ) of monic
polynomials in K[x] such that |fn − f | < 2−n . Since K is algebraically closed,
each fn splits into linear factors. By the ultrametric triangle inequality (and using
|α| ≤ 1), we have
Y
|α − α0 | = |fn (α)| = |fn (α) − f (α)| ≤ |fn − f | < 2−n .
α0 : fn (α0 )=0
There must then be a root αn of fn such that |α − αn | < 2−n/ deg(f ) . This implies
that (αn ) converges in L to α. But K 0 is the closure of K, therefore the limit α
must already be in K 0 . q
3. Newton Polygons
Let (K, |·|) be a complete field with a nontrivial non-archimedean absolute value.
Consider a polynomial 0 6= f ∈ K[x]. Let α ∈ K̄ be a root of f . We have seen
that there is a unique extension of |·| to K̄, so it makes sense to consider |α|. We
will now describe a method that determines the absolute values of the roots of f
and how often they occur.
In this context it is advantageous to switch to a ‘logarithmic’ version of the absolute
value. We fix a positive real number c and set
v(x) = −c log |x| for x ∈ K × and v(0) = +∞ .
3.1. Definition. The map v : K → R ∪ {+∞} is the (additive) valuation asso- DEF
ciated to |·|. ♦ valuation
0 1 2 3 4 5
♣
What we did above amounts to the statement that the valuations of the roots of f
are among the slopes taken negatively of the Newton polygon. We now want to
prove a converse and give a more precise statement. We first introduce a variation
of the absolute value on the polynomial ring.
3.4. Definition. Let (K, |·|) be a field with a nontrivial non-archimedean abso- DEF
lute value and let r > 0. For f = a0 + a1 x + . . . + an xn ∈ K[x] we define |·|r on K[x]
|f |r := max{|aj |rj : 0 ≤ j ≤ n} .
When f 6= 0, then we set
`r (f ) = max{j : |aj |rj = |f |r } − min{j : |aj |rj = |f |r } ∈ Z≥0 . ♦
3.5. Lemma. Let f, g ∈ K[x] be nonzero polynomials and let r > 0. Then we LEMMA
have additivity
`r (f g) = `r (f ) + `r (g) . of `r
Proof. If f = a0 + a1 x + . . . + an xn , write
n− (f ) = min{j : |aj |rj = |f |r } and n+ (f ) = max{j : |aj |rj = |f |r }
and similarly for g and f g. Precisely as in the proof of property (2) for |·|r (i.e.,
basically Gauss’ Lemma) one sees that
n− (f g) = n− (f ) + n− (g) and n+ (f g) = n+ (f ) + n+ (g) ,
§ 3. Newton Polygons 16
3.6. Theorem. Let 0 6= f ∈ K[x] and let r > 0. Then the number of roots α ∈ K̄ THM
of f (counted with multiplicity) such that |α| = r is exactly `r (f ). roots and
Newton
In terms of the Newton polygon, this says that f has roots of valuation s if and polygon
only if its Newton polygon has a segment of slope −s, and the number of such
roots (counted with multiplicity) is exactly the length of the segment.
Proof. We can assume that K = K̄. The proof is by induction on the degree of f .
If f is constant, there is nothing to show. If f = x − α, then `r (f ) = 0 if |α| =
6 r
and `r (f ) = 1 if |α| = r.
Now assume f is not constant and let β ∈ K̄ be a root of f . Then f = (x − β)f1
for some 0 6= f1 ∈ K[x]. By the inductive hypothesis, the number of roots α of f1
such that |α| = r is `r (f1 ). If |β| = r, then the number of such roots of f is
`r (f1 ) + 1 = `r (f1 ) + `r (x − β) = `r (f ) (using Lemma 3.5). If |β| = 6 r, then the
number of such roots of f is `r (f1 ) = `r (f1 ) + `r (x − β) = `r (f ) again. q
3.7. Lemma. Let f ∈ K[x] be irreducible. Then the Newton polygon of f consists LEMMA
of a single segment. Newton
polygon of
Proof. We assume for simplicity that K has characteristic zero. Then K̄ is sepa- irreducible
rable over K and the roots of f form one orbit under the Galois group Aut(K̄/K). polynomial
Since the absolute value is invariant under the action of this group, we see that all
roots of f have the same absolute value. The claim then follows from Theorem 3.6.
n
(In characteristic p, let α be a root of f . Then for some n ≥ 0, αp is separable
n
over K and f = g(xp ) for an irreducible polynomial g ∈ K[x]. The previous
argument applies to g, but this then implies the claim also for f .) q
3.8. Lemma. Let 0 6= f ∈ K[x] with f (0) 6= 0 and let σ1 , . . . , σm be the segments LEMMA
of the Newton polygon of f . Then there is a factorization slope
factorization
f = f1 f2 · · · fm
such that the Newton polygon of fj is a single segment with the same slope and
length as σj .
We extend |·|r to an absolute value on the field K(x) of rational functions in one
variable over K in the usual way. For a given rational function f , we can then
study how |f |r varies with r.
§ 3. Newton Polygons 17
Here
ϕf (s + ε) − ϕf (s) ϕf (s) − ϕf (s − ε)
∂+ ϕf (s) = lim and ∂− ϕf (s) = lim
ε&0 ε ε&0 ε
are the right and left derivatives of the piecewise affine function ϕf .
Qn ej
Proof. We can write f = γ j=1 (x − αj ) for some γ ∈ K, αj ∈ K̄ and ej ∈ Z, so
n
X n
X
ϕf (s) = −c log |γ| − c ej log |x − αj |e−s/c = v(γ) + ej ϕx−αj (s) ,
j=1 j=1
3 0
2
2
1 4
0 1/2 1 3/2 2
-1
5
-2
§ 3. Newton Polygons 18
The green lines are the graphs of s 7→ v(aj ) + js (for j = 0, 2, 3, 4, 5); ϕf is the
minimum of these functions. The segments of the graph of ϕf correspond to the
vertices of the Newton polygon of f and the vertices of the graph of ϕf correspond
to the segments of the Newton polygon of f . ♣
3.11. Lemma. Let (K, |·|) be a complete non-archimedean field, let r > 0 and LEMMA
0 6= f ∈ K[x]. Then |·|r as
|f |r = sup{|f (α)| : α ∈ K̄, |α| ≤ r} . supremum
norm
The absolute values |·|r on the polynomial ring K[x] that we have studied in the
last section are examples of ‘multiplicative seminorms’.
4.1. Definition. Let K be a field with absolute value |·| and let A be a K-algebra DEF
(i.e., A is a ring with a ring homomorphism K → A that gives A a compatible multiplicative
structure as a K-vector space). A multiplicative seminorm on A is a map seminorm
A −→ R≥0 , a 7−→ kak Banach
algebra
such that
(1) k·k restricts to |·| on K;
(2) ka + bk ≤ kak + kbk for all a, b ∈ A;
(3) kabk = kak · kbk for all a, b ∈ A.
If in addition a = 0 is the only element with kak = 0, then k·k is a multiplicative
norm (which is the same as an absolute value on A extending |·|). In general, we
call ker k·k = {a ∈ A : kak = 0} the kernel of k·k; it is a prime ideal in A.
A K-algebra A with a fixed multiplicative norm such that A is complete with
respect to this norm is a Banach algebra over K. ♦
A seminorm on A only needs to satisfy kabk ≤ kak · kbk with equality for a ∈ K;
similarly for a norm.
If |·| is non-archimedean, then k·k also satisfies the ultrametric triangle inequality,
since then
n
n n
X n j n−j
ka + bk = k(a + b) k = ab
j=0
j
n n
X n j n−j
X
≤ kak kbk ≤ kakj kbkn−j ≤ (n + 1)(max{kak, kbk})n
j=0
j j=0
√
and n n + 1 → 1 as n → ∞.
We introduce the following notation.
4.2. Definition. Let (K, |·|) be a field with absolute value, let a ∈ K and r ≥ 0. DEF
Then D(a, r)
D(a, r) := DK (a, r) := {ξ ∈ K : |ξ − a| ≤ r}
is the closed disk of radius r around a. ♦
(3) Let (an ) be a sequence in K and (rn ) a strictly decreasing sequence in R>0
such that D(an+1 , rn+1 ) ⊂ D(an , rn ) for all n. Then
f 7−→ kf k = lim kf kan ,rn
n→∞
(1) is clear and (2) follows from the properties of |·|r . For (3) note that by
Lemma 3.11,
kf kan+1 ,rn+1 = sup{|f (α)| : α ∈ K̄, |α − an+1 | ≤ rn+1 }
≤ sup{|f (α)| : α ∈ K̄, |α − an | ≤ rn } = kf kan ,rn ,
so that the sequence (kf kan ,rn ) decreases, hence must have a limit. Properties (2)
and (3) from Definition 4.1 then follow by taking the limit in the corresponding
relations for the k·kan ,rn . ♣
In fact, this is essentially the full story, at least when K is algebraically closed and
complete.
4.4. Theorem. Let (K, |·|) be a complete and algebraically closed non-archimedean THM
field and let k·k be a multiplicative seminorm on K[x]. Then there is a decreasing classification
nested sequence of disks D(an , rn ) such that of mult.
seminorms
kf k = lim kf kan ,rn
n→∞ on K[x]
for all f ∈ K[x].
Proof. Let D = {D(a, r) : a ∈ K, r > 0, k·k ≤ k·ka,r } be the set of all closed
disks such that k·k is bounded above by the corresponding seminorm. Then for
all a ∈ K we have D(a, kx − ak) ∈ D: if f (x + a) = a0 + a1 x + . . . + an xn , then
kf k = ka0 + a1 (x − a) + . . . + an (x − a)n k
≤ max{|a0 |, |a1 |kx − ak, . . . , |an |kx − akn }
= |f (x + a)|kx−ak = kf ka,kx−ak .
In particular, D is non-empty. Conversely, we have |x − a|a,r = r, which implies
that if D(a, r) ∈ D, then r ≥ kx − ak. So
D = {D(a, r) : a ∈ K, r ≥ kx − ak} .
We claim that D does not contain two disjoint disks: assume that D1 = D(a1 , r1 )
and D2 = D(a2 , r2 ) are in D. Then we have kx − a1 k ≤ r1 and kx − a2 k ≤ r2 , so
that
|a1 − a2 | = k(x − a2 ) − (x − a1 )k ≤ max{r1 , r2 } .
If r1 ≤ r2 , then this implies that a1 ∈ D2 and then that D1 ⊂ D2 ; if r2 ≤ r1 , then
we see in the same way that D2 ⊂ D1 .
Now let ρ = inf{r > 0 : ∃a ∈ K : D(a, r) ∈ D} and choose sequences (an ) in K
and (rn ) in R>0 such that (rn ) is strictly decreasing with rn → ρ and D(an , rn ) ∈ D
for all n. Then we have D(an+1 , rn+1 ) ⊂ D(an , rn ). This implies
kf k ≤ lim kf kan ,rn .
n→∞
§ 4. Multiplicative seminorms and Berkovich spaces 21
We still have to show the reverse inequality. Since every nonzero f is a product
of a constant and polynomials of the form x − a, it suffices to prove this for the
latter. If |a − an0 | > rn0 for some n0 , then
kx − ak = k(x − an0 ) − (a − an0 )k = |a − an0 |
= lim max{|a − an |, rn } = lim kx − akan ,rn ,
n→∞ n→∞
since kx − an0 kT≤ rn0 < |a − an0 |. Otherwise, we have |a − an | ≤ rn for all n, which
says that a ∈ n D(an , rn ), so that D(an , rn ) = D(a, rn ) and we have
lim kx − akan ,rn = lim kx − aka,rn = kx − aka,ρ = ρ .
n→∞ n→∞
for all f ∈ K[x]. Since ρ ≤ r for any radiusT r of a disk in D and no two disks
in D are disjoint, it follows that D(a, ρ) = D. We can therefore distinguish the
following four types of multiplicative seminorms on K[x].
4.7. Lemma. The topological space X an as defined in Definition 4.6 is Hausdorff. LEMMA
X an is
Hausdorff
Proof. Let ξ, ξ 0 ∈ X an be distinct. We must show that there are disjoint open sets
U, U 0 ⊂ X an with ξ ∈ U and ξ 0 ∈ U 0 . Since ξ 6= ξ 0 , there must be f ∈ A such
that kf kξ 6= kf kξ0 . Assume without loss of generality that kf kξ < kf kξ0 and let
kf kξ < a < kf kξ0 . Then we can take U = Uf,−∞,a and U 0 = Uf,a,∞ . q
Note that we can alternatively define Berk A as the set of all K-algebra homomor-
phisms A → H into complete K-Banach algebras that are fields, up to an obvious
equivalence. The topology is then that of pointwise convergence. This can be
seen as similar to the definition of Spec A as the set of all K-algebra homomor-
phisms into fields, up to an obvious equivalence. Here the topology is again that of
pointwise convergence, but with the cofinal topology on the target fields (so that
basic open sets are defined by relations f 6= 0). Since a Banach algebra has more
structure than just a K-algebra, there are fewer equivalences and therefore more
points in Berk A than in Spec A. This is made precise by the following statements.
4.8. Lemma. Let K, A and X be as in Definition 4.6. Then there is a canonical LEMMA
inclusion of X(K) into X an . The inclusion is continuous when X(K) is given the X(K) ,→ X an
topology induced by the absolute value on K.
It is in fact the case that the image of X(K) in X an is dense. We will see this
later when X is the affine line.
There is also a natural map in the other direction.
4.9. Lemma. Let K, A and X be as in Definition 4.6. Then there is a canonical LEMMA
continuous map Berk A → Spec A (where Spec A has the Zariski topology). Berk A
→ Spec A
Proof. Let ξ ∈ Berk A. Then k·kξ is the pull-back of the absolute value from some
K-Banach algebra H that is a field under a homomorphism φ : A → H. Since H
is a field, the kernel of φ must be a prime ideal of A and so defines an element
of Spec A (or we just take H as a field, forgetting the absolute value).
It remains to show that the map Berk A → Spec A obtained in this way is continu-
ous. Let Uf = {ξ ∈ Spec A : f (ξ) 6= 0} be a basic open set in Spec A. Its pull-back
to Berk A is {ξ ∈ Berk A : kf kξ 6= 0} = Uf,0,∞ and therefore open, too. q
The composition of the two maps X(K) → X an → Spec A is the inclusion of X(K)
(the set of maximal ideals of A) into Spec A (the set of prime ideals of A).
The map is actually surjective: let ξ ∈ Spec A; this gives us a K-algebra homo-
morphism A → R into an integral domain finitely generated over K. One can
show that one can always define an absolute value on such an R that extends the
absolute value on K. Pulling back to A, we obtain a multiplicative seminorm; this
is a preimage of ξ.
It is also true that X an is (even path-)connected when X is connected as an
algebraic variety and that X an is locally compact. We will see this concretely for
X = A1 in the next section.
§ 5. The Berkovich affine and projective line 24
In the following, we will take a closer look at the Berkovich affine line over Cp ,
AC1,an
p
= Berk Cp [x]. According to the classification of Definition 4.5, we have four
types of points in A1,an 1
Cp . The type 1 points recover the points in A (Cp ) = Cp as in
Lemma 4.8. The type 2 and 3 points correspond to closed disks D(a, r) with r > 0
in, respectively, not in, the value group pQ of Cp . We will reserve the notation ζa,r
or ζD where D = D(a, r) for these points. We will frequently identify a with ζa,0 ,
however. Finally, the type 4 points correspond to nested sequences of disks with
empty intersection; these points are somewhat annoying, but don’t usually give
us problems. They are necessary to make the space locally compact. Two nested
sequences of disks (D(an , rn ))r and (D(a0n , rn0 ))n with empty intersection define
the same type 4 point if and only if D(an , rn ) ∩ D(a0n , rn0 ) 6= ∅ for all n.
Now fix ξ ∈ Cp and consider real numbers 0 ≤ r0 < r1 . Then there is a map
[r0 , r1 ] −→ AC1,an
p
, r 7−→ k·kξ,r = ζξ,r .
ξ ∨ η := ζξ,δ = ζη,δ .
Then
(
ζξ,r if 0 ≤ r ≤ δ,
γξ,η : [0, 2δ] −→ A1,an
Cp , r−
7 →
ζη,2δ−r if δ ≤ r ≤ 2δ
This shows that D(ξ, r) is uniquely determined by k·kξ,r and also shows that
D1 ⊂ D2 holds for two closed disks if and only if k·kD1 ≤ k·kD2 (the ‘only if’ part
follows also from the characterization of k·kξ,r as the sup norm on D(ξ, r)). This
prompts us to define
ξ ≤ ξ 0 ⇐⇒ ∀f ∈ Cp [x] : kf kξ ≤ kf kξ0
5.1. Lemma. Let ξ ∈ A1,an Cp be a type 4 point, represented by the nested sequence LEMMA
of disks D(an , rn ) with empty intersection. If ξ 0 ∈ A1,an 0 comparison
Cp with ξ ≤ ξ , then either
ξ 0 = ξ, or else ξ 0 is a point of type 2 or 3, corresponding to a disk D(a, r) such with
that D(an , rn ) ⊂ D(a, r) for all sufficiently large n. type 4 points
Proof. Let ξ 0 ≥ ξ and assume first that ξ 0 is of type 2 or 3, so ξ 0 = ζa,r for some
a ∈ Cp and r > 0. We claim that D(a, r) ∩ D(an , rn ) 6= ∅ for all n. Otherwise, we
would have empty intersection for all sufficiently large n, and then
kx − akξ0 = r < lim |a − an | = lim kx − akan ,rn = kx − akξ ,
n→∞ n→∞
0
which contradicts ξ ≤ ξ . If n is large enough so that rn ≤ r, then we must
therefore have D(an , rn ) ⊂ D(a, r).
Now assume that ξ 0 is of type 1 or 4. If ξ 0 is of type 1, then kx − ηkξ0 is zero when
η = ξ 0 , whereas kx−ηkξ ≥ r∞ > 0 for all η, so this is not possible. If ξ 0 is of type 4,
say represented by the nested sequence of disks D(a0n , rn0 ), then ξ ≤ ξ 0 ≤ ζa0n ,rn0 for
all n. By the argument above, it follows that D(an , rn ) ∩ D(a0m , rm 0
) 6= ∅ for all
m, n. But this exactly means that ξ = ξ 0 .
It remains to show that γξ,ξ0 is well-defined and continuous. For the first, note
that ζan ,ρ = ζan+1 ,ρ when ρ ≥ rn , since D(an , ρ) = D(an+1 , ρ). For the second,
consider f ∈ Cp [x]. We have to show that
[r∞ , r] 3 ρ 7−→ kf kγξ,ξ0 (ρ)
is continuous. This is clear on (r∞ , r], and it follows for the left endpoint by the
definition of k·kξ , which implies that
kf kξ = lim kf kan(ρ) ,ρ
ρ&r∞
For ξ 0 ≤ ξ, we define the path γξ,ξ0 as the reversal of the path γξ0 ,ξ .
Proof. This is clear if neither ξ nor ξ 0 are of type 4: if ξ = ζa,r and ξ 0 = ζa0 ,r0 ,
then ξ ∨ ξ 0 = ζa,ρ = ζa0 ,ρ , where ρ = max{r, r0 , |a − a0 |}. If ξ ≤ ξ 0 or ξ 0 ≤ ξ, then
the statement is also clear (with ξ ∨ ξ 0 = ξ or ξ 0 ). Otherwise, we can represent
ξ and ξ 0 by nested sequences of disks D(an , rn ) and D(a0n , rn0 ), respectively, such
that D(an , rn ) ∩ D(a0n , rn0 ) = ∅ for n large enough. For all such n, ζan ,rn ∨ ζa0n ,rn0 is
the same and therefore agrees with ξ ∨ ξ 0 .
The statement on path-connectedness is then clear. q
The equality in the definition of D(a, r) can be seen as follows. ‘⊂’ is clear. To
show ‘⊃’, assume that kx − akξ ≤ r. Then for any b ∈ Cp , we have
kx−bkξ = k(x−a)+(a−b)kξ ≤ max{kx−akξ , |a−b|} ≤ max{r, |a−b|} = kx−bka,r .
Since every f ∈ Cp [x] is a constant times a product of such terms, it follows that
ξ ≤ ζa,r .
and
{ξ ∈ A1,an −
Cp : kf kξ < a} = D(α1 , r1 ) ∪ . . . ∪ D(αn , rn )
−
(2) The open disks D(a, r)− and the complements of closed disks D(a, r) generate
the topology of A1,an
Cp (i.e., every open set is a union of finite intersections of
such sets).
(3) Two (open or closed) disks in A1,an
Cp are either disjoint or one is contained in
the other.
Proof.
(1) Exercise.
(2) We have Uf,a,b = Uf,−∞,b ∩ Uf,a,∞ . By part (1), we can write Uf,−∞,b as a union
of open disks, and we can write Uf,a,∞ , which is the complement of the set of ξ
such that kf kξ ≤ a, as the complement of a finite union of closed disks, which
is the same as a finite intersection of complements of closed disks. So each
basic open set Uf,a,b is a finite intersection of open disks and complements of
closed disks; this implies the claim.
(3) Assume that D(a, r) and D(a0 , r0 ) are not disjoint, so that there is ξ with
kx − akξ ≤ r and kx − a0 kξ ≤ r0 , w.l.o.g. such that r ≥ r0 . Then
|a − a0 | ≤ max{kx − akξ , kx − a0 kξ } ≤ r .
Now let ξ 0 ∈ D(a0 , r0 ) be arbitrary. Then
kx − akξ0 ≤ max{kx − a0 kξ0 , |a − a0 |} ≤ max{r0 , r} = r ,
so ξ 0 ∈ D(a, r). Hence D(a0 , r0 ) ⊂ D(a, r). The case when one or both disks
are open is similar. q
Proof. The first claim in (1) is clear (and holds in a similar way for open disks). For
the second claim, consider the intersection of a basic open set as above with D(a, r).
The set of type 1 points contained in this intersection is the intersection of D(a, r)
with the open disk in Cp corresponding to the open disk, minus the union of the
closed disks in Cp corresponding to the closed disks that were removed. Any such
set is non-empty (unless the whole disk D(a, r) is removed, but then the basic
open set has empty intersection with D(a, r)).
For (2), we can assume without loss of generality that a = 0. We know that
ξ ∈ D(0, r) ⇐⇒ k·kξ ≤ |·|r , so the image of D(0, r) in RCp [x] under the map
Φ : A1,an
Cp −→ R
Cp [x]
, ξ 7−→ (kf kξ )f ∈Cp [x]
Q
is the intersection of im(Φ) with the product C = f [0, |f |r ]. The definition of the
topology on A1,an
Cp is equivalent to saying that it is the subspace topology induced
by the map Φ above (with the product topology on RCp [x] ). The product C of
compact intervals is itself compact by Tychonoff’s Theorem. On the other hand,
the conditions defining a multiplicative seminorm are closed conditions, so the
image of Φ is closed. Now Φ(D(0, r)) is the intersection of a closed set and a
compact set, so it is itself compact. Since Φ is a homeomorphism onto its image,
it follows that D(0, r) is compact as well. q
in turn contains ξ. This means that kx − akξ < r and kx − aj kξ > rj . For
r0 < r sufficiently close to r and for rj0 > rj sufficiently close to rj , we still have
kx − akξ < r0 and kx − aj kξ > rj0 . Then
[
ξ ∈ V := D(a, r0 ) \ D(aj , rj0 )− ⊂ U .
j
0
S set D(a,0 r ) with a closed set) and
Now V is compact (we intersect the compact
0 −
contains the open neighborhood D(a, r ) \ j D(aj , rj ) of ξ. q
So the small metric gives the total change of diameter as we move along the path
joining ξ to ξ 0 , whereas the big metric does the same for the change in (additive)
valuation (= −c log diam). Both are indeed metrics, d on all of AC1,an
p
and ρ on the
1,an
‘hyperbolic part’ of ACp , which consists of the points of types 2, 3 and 4. The
former has the advantage that it is also defined on the points of type 1, but the
latter is in some sense more natural. It can be seen as analogous to the hyperbolic
metric on the upper half plane in C (which is given by ds2 = (dx2 + dy 2 )/y 2 when
x and y are the real and imaginary parts), which is invariant under the action
of PSL(2, R) by Möbius transformations. This analogous property will turn out
to be satisfied (with respect to PGL(2, Cp )) by the big metric.
Now we introduce the kind of tree structure that shows up here. (For more
information, consult [BR, Appendix B].)
5.9. Definition. An R-tree is a metric space (T, d) such that for any two points DEF
x, y ∈ T , there is a unique path [x, y] in T joining x to y, which is a geodesic R-tree
segment (this means that the map γ : [a, b] → T giving the path can be chosen so
that d(γ(u), γ(v)) = |u − v| for all u, v ∈ [a, b]).
A point x ∈ T is a branch point if T \ {x} has at least three connected components
(in the metric topology). x is an endpoint if T \ {x} is connected. If T \ {x} has
exactly two connected components, then x is said to be ordinary. The R-tree T
is said to be finite, if it is compact and has only finitely many branch points and
endpoints.
The strong topology on the R-tree is the metric topology on T . To define the
weak topology, we define a tangent direction at x ∈ T to be an equivalence class
of paths [x, y] with y 6= x, where two paths are equivalent when they share an
initial segment. The tangent directions at x are in one-to-one correspondence
with the connected components of T \ {x}. For a tangent direction v at x, we set
Bx,v = {y ∈ T : y 6= x, [x, y] ∈ v} (which is the connected component of T \ {x}
corresponding to v). Then the weak topology on T is the topology generated by
the sets Bx,v . ♦
The set Bx,v can be interpreted as the set of points of T that can be ‘seen’ from x
when looking into direction v. This is why the weak topology is also called the
‘observer’s topology’.
Now the following is a fact.
Proof. See [BR, Section 1.4], where the analogous statements are shown for D(0, 1).
For A1,an
Cp the argument is the same (except that there is no natural way to pick a
root). q
Let ξ ∈ A1,an 0
Cp be any point. If ξ is another point, then there are three possibilities:
ξ > ξ 0 , ξ < ξ 0 , or neither. In the last two cases, the path from ξ to ξ 0 starts ‘going
up’ to ξ ∨ ξ 0 (= ξ 0 in the second case). So the points ξ 0 6= ξ such that ξ 6> ξ 0
constitute the set Bξ,up . If ξ is of type 1 or 4, then there are no ξ 0 with ξ > ξ 0 , so
‘up’ is the only direction.
Now consider ξ = ζa,r of type 3. If ξ 0 < ξ, then there is some ξ 0 ≤ ζa0 ,r0 < ξ, and
we have D(a0 , r0 ) ⊂ D(a, r), so |a0 − a|p ≤ r. Since r ∈ / |C×
p |p , we can choose r
0
such that |a0 − a|p < r0 < r, but then ζa0 ,r0 = ζa,r0 , so the path from ξ to ξ 0 shares
an initial segment with [ζa,r , ζa,0 ]. This shows that other than ‘up’, there is exactly
one tangent direction ‘down’.
Finally, when ξ = ζa,r is of type 2, then by a similar argument, for any ξ 0 < ξ
there is a0 ∈ D(a, r) such that ξ 0 ∈ D(a0 , r)− . If two points ξ 0 , ξ 00 are in the same
open disk, then the paths from ξ to these two points share an initial segment;
otherwise ξ 0 ∨ ξ 00 = ξ and they do not. So the directions other than ‘up’ at ξ
are in one-to-one correspondence with open disks of radius r contained in D(a, r).
To show that the set of such disks corresponds to F̄p , we assume that a = 0 and
r = 1 (we can shift and scale to reduce to this case). Two disks D(a0 , 1)− and
D(a00 , 1)− contained in D(0, 1) are equal if and only if |a0 − a00 |p < 1, which means
that a0 , a00 ∈ R (the valuation ring of Cp ) have the same image in the residue class
field F̄p . So we see that the ‘downward’ directions at a type 2 point correspond to
the elements of F̄p ; together with the ‘up’ direction, which we can let correspond
to ∞, we get P1 (F̄p ). (For ζ0,1 the correspondence is canonical. For other points
it is so only up to an automorphism of P1F̄p , since there is a choice involved in the
shifting and scaling.)
If ξ = ζa,r is a point of type 2 or 3, then the basic open set Bξ,v of the weak topology
is the complement of D(a, r) when v = ‘up’ and is D(a0 , r)− with a0 as above when
v is a ‘downward’ direction. For ξ of type 1 or 4, Bξ,v is just AC1,an p
\ {ξ}. This
shows that the weak topology of the R-tree agrees with the Berkovich topology.
Here is a rough sketch of D(0, 1):
1E. Hrushovski, F. Loeser and B. Poonen, Berkovich spaces embed in Euclidean spaces,
L’Enseignement Math. 60 (2014), no. 3–4, 273–292.
§ 5. The Berkovich affine and projective line 30
ζ0,1
ζ0,r
ζ0,|p|1/2
ζ0,|p|2
ξ
ζ1,0 ζ1+p,0 ζ−1,0 ζ0,0 ζp2,0 ζ−p,0 ζp,0
Points of type 1, 2, 3 and 4 are green, red, purple and blue, respectively.
Let T be an R-tree and let S ⊂ T be nonempty and finite. Then one can consider
the convex hull Γ of S; this is the union of the paths joining the points in S.
Then Γ is a finite R-tree and there is a natural deformation retraction τΓ : T → Γ
that sends any point x ∈ T to the point of Γ that is hit first by the unique path
from x to any fixed point in Γ. It can be shown that the weak topology on T is
the weakest topology that makes all the maps τΓ continuous (where we take, say,
the metric topology on Γ). (This is an exercise.)
not contain the irrelevant ideal. Applying this to the polynomial ring Cp [x, y]
with the standard grading, this results in P1,an
Cp . We will discuss this in more
detail below. An advantage of this approach is that it makes it easy to see
that a morphism P1Cp → P1Cp induces a continuous map PC1,anp
→ P1,an
Cp .
Recall
Lthat a graded ring is a ring R coming with a direct sum decomposition
R = n≥0 Rn as an additive group such that for all f ∈ Rm and g ∈ Rn we have
f g ∈ Rm+n . The elements of Rn are said to be homogeneous of degree n. An
ideal I of R is said to be homogeneous if it is generated by homogenous elements.
If K is a field and the decomposition is a direct sum of K-vector spaces, then R is
a graded K-algebra. In algebraic geometry, Proj R is the L set of all homogeneous
prime ideals of R that do not contain the irrelevant ideal n≥1 Rn , with the Zariski
topology. For example, PnK = Proj K[X0 , X1 , . . . , Xn ] with the usual grading of
the polynomial ring. The ‘Berkovich Proj’ construction does something similar
for Berkovich spaces.
special fiber (i.e., the base change to F̄p ) is a configuration of several P1 ’s arranged
in tree form. (Possibly we return to that later.)
If we modify the definition of the diameter by defining it on PC1,an
p
as
(
diam(ξ) if ξ ∈ D(0, 1)
diam0 (ξ) =
diam(φ(ξ)) if ξ ∈ PC1,an
p
\ D(0, 1),
where φ is the involution induced by z 7→ 1/z, and set ξ ∨0 η to be the point
where [ξ, ζ0,1 ] and [η, ζ0,1 ] first meet, then we can define a small metric d0 on PC1,an
p
in the same way as before. Then P1,an Cp can be identified with the R-tree given by
the small metric. However, the small metric is not invariant under automorphisms
in general (the point ζ0,1 plays a special role in its definition; any automorphism
that moves ζ0,1 will change the metric).
To conclude this section, we give another interpretation of the seminorm associated
to a point in P1,anCp that (contrary to the interpretation as the supremum norm on
the associated disk when applied to polynomials) also works for rational functions.
Consider first a point ξ = ζa,r of type 2. If f is a nonzero polynomial with roots
α1 , . . . , αm in D(a, r), then for all α in the closed set D(a, r) \ m −
S
j=1 D(α j , r) we
have |f (α)|p = kf kξ . To see this, we can assume that a = 0 and f is monic. Write
f = (x − α1 ) · · · (x − αm )(x − αm+1 ) · · · (x − αn ) ,
where αm+1 , . . . , αn are the roots of f outside of D(0, r). Then
Yn Yn
kf kξ = kx − αj kξ = |α − αj |p = |f (α)|p ,
j=1 j=1
since for j ≤ m, we have kx−αj kξ = r = |α−αj |p (here we use that α ∈/ D(αj , r)− ),
and for j > m, we have kx − αj kξ = |αj |p = |α − αj |p . So |f |p is constant and
equal to kf kξ on D(a, r) outside a finite union of open disks contained in D(a, r).
We can apply this separately to the numerator and the denominator of a rational
function f = g/h ∈ Cp (x), which gives the same statement for such an f . So for
a type 2 point ξ = ζa,r , the induced seminorm k·kξ on Cp (x) is the absolute value
taken by any given function f ∈ Cp (x) on D(a, r) outside finitely many smaller
open disks (the set of disks to be removed depends on f ); this is also the same as
§ 5. The Berkovich affine and projective line 33
5.12. Lemma. Let ξ ∈ P1,an Cp and denote the corresponding multiplicative semi- LEMMA
norm on Cp (x) as usual by k·kξ . Then for every f ∈ Cp (x) (that does not have a characteri-
pole at ξ when ξ is of type 1) we have that kf kξ is the unique ν ∈ R≥0 such that for zation
every ε > 0 there is a closed neighborhood U of ξ in P1,an
Cp such that |f (α)|p −ν ≤ ε
of k·kξ
1
for all α ∈ P (Cp )∩U . The set U can be taken to be a closed Berkovich disk minus
a finite union of open Berkovich disks.
ϕ is doing on P1,an
Cp .
§ 6. Analytic spaces and functions 34
In this section, we follow roughly part of Brian Conrad’s text [AWS, Chapter 1].
Recall that a K-Banach algebra (for a complete field K with absolute value) is
a K-algebra A with a (submultiplicative) norm |·|A that restricts to the absolute
value on K and such that A is complete with respect to this norm. So we have
|a|A 6= 0 for a 6= 0, |a + b|A ≤ |a|A + |b|A and |ab|A ≤ |a|A · |b|A .
If K is non-archimedean, then |·|A also satisfies the ultrametric triangle inequality.
6.1. Definition. Let A be a K-Banach algebra; we write the norm on A as |·|A . DEF
A multiplicative seminorm k·k on A is said to be bounded, if kf k ≤ |f |A for all bounded
f ∈ A. ♦ multiplicative
seminorm
We note that it is sufficient to require kf k ≤ C|f |A for some constant C > 0, since
this implies
kf k = (kf n k)1/n ≤ (C|f n |A )1/n ≤ C 1/n |f |A ;
we obtain kf k ≤ |f |A by letting n tend to infinity. So the notion of ‘bounded
multiplicative seminorm’ does not change when we replace the norm on A by
an equivalent one (i.e., one that is bounded below and above by some positive
multiple of |·|A ).
6.2. Definition. Let A be a K-Banach algebra. The Berkovich space Berk A DEF
associated to A is the set of all bounded multiplicative seminorms on A, with the Berkovich
weakest topology that makes the maps k·k 7→ kf k continuous for all f ∈ A. ♦ space
associated
In a similar way as we did earlier, one shows the following. to A
6.3. Theorem. Let A be a K-Banach algebra. Then Berk A is a compact Haus- THM
dorff space, which is empty if and only if A = {0}. Berk A is
compact and
We note that one can define Berk A for any (commutative) ring A that is complete Hausdorff
with respect to some absolute value (a Banach ring); it does not have to be an
algebra over some complete field. One could take A = Z with the usual absolute
value, for example (completeness follows trivially, since every Cauchy sequence
must be eventually constant), or any ring with the trivial absolute value. It is an
instructive exercise to work out the structure of Berk Z!
From now on, K will be, as usual, a complete and algebraically closed non-
archimedean field, for example, K = Cp . There is an important class of K-Banach
algebras.
Here we use multi-index notation: for i = (i1 , . . . , in ) we set xi = xi11 · · · xinn and
ri = r1i1 · · · rnin ; also |i| = i1 + . . . + in . Then
X
ai xi := max ri |ai |
i
i
§ 6. Analytic spaces and functions 35
Note that by scaling the variables suitably, any Tate algebra in n variables is
isomorphic to the standard Tate algebra
Khx1 , . . . , xn i := Kh1−1 x1 , . . . , 1−1 xn i .
This is not true for generalized Tate algebras.
6.5. Example. We have Berk Cp hxi ∼ = D(0, 1). The map is induced by the EXAMPLE
inclusion Cp [x] ,→ Cp hxi; the image consists of all seminorms on Cp [x] that are D(0, 1)
bounded by the restriction of the norm on the Tate algebra to the polynomial
ring, which is exactly |·|1 = k·k0,1 . By definition, this set is D(0, 1). It also follows
easily from the definitions that the map induces a homeomorphism.
More generally, the same argument shows that Berk Cp hr−1 xi ∼ = D(0, r), for every
r > 0. We see that there is some kind of correspondence between type 2 points and
Tate algebras, and type 3 points and generalized Tate algebras. Quite explicitly, we
obtain Cp hr−1 xi as the completion of Cp [x] with respect to the norm corresponding
to ζ0,r . ♣
6.6. Lemma. The Tate algebras over K are noetherian (every ideal is finitely LEMMA
generated) unique factorization domains. They are also Jacobson rings (every Properties
prime ideal is the intersection of the maximal ideals it is contained in). Every of Tate
ideal is closed. algebras
The norm is intrinsically defined as |f | = maxφ |φ(f )|, where φ runs over all
continuous K-algebra homomorphisms to K.
The Tate algebra T = Khr1−1 x1 , . . . , rn−1 xn i has the following universal property.
Given any K-Banach algebra A and elements a1 , . . . , an ∈ A with |aj |A ≤ rj , there
is a unique continuous K-algebra homomorphism φ : T → A with φ(xj ) = aj .
6.7. Lemma. |·|A/I is a norm on A/I turning A/I into a Banach algebra such LEMMA
that the canonical homomorphism A → A/I is continuous. induced
norm
Proof. We have to show a number of things.
(1) |a + I|A/I = 0 only for a ∈ I.
If |a + I|A/I = 0, then there is a sequence (bn ) in I such that |a + bn |A → 0,
hence −bn → a. Since I is closed, this implies a ∈ I.
(2) |(a + a0 ) + I|A/I ≤ max{|a + I|A/I , |a0 + I|A/I }.
This follows easily from the definition.
§ 6. Analytic spaces and functions 36
Simple examples of affinoid domains are the closed Berkovich disks D(0, r), where
A is itself a (generalized) Tate algebra.
6.9. Example. Let 0 < r ≤ s. Then we can consider the algebra EXAMPLE
annuli
−1 −1 Khs−1 X, rY i
A = Khs X, rX i :=
hXY − 1i
with its induced norm. Using the relation XY = 1, its elements can be written
uniquely as the image of a series of the form
∞
X ∞
X
a−m Y m + a0 + an X n
m=1 n=1
the conditions on the coefficients then are equivalent to saying that this series
converges whenever we plug in some element α such that r ≤ |α| ≤ s. For K = Cp ,
we see that via the map Cp [x] → A sending x to X, Berk A gets identified with
A(r, s) := D(0, s) \ D(0, r)− , which is a closed Berkovich annulus. ♣
There are more properties similar to those of algebras of finite type in algebraic
geometry. For example, if A is K-affinoid and A0 is a K-algebra that is a finitely
generated A-module, then A0 is also K-affinoid. Any K-affinoid algebra is a finitely
generated module over some Tate algebra (this corresponds to the Noether nor-
malization theorem).
One can then construct more general Berkovich K-analytic spaces by gluing affi-
noid domains. There are some technical issues here, because the gluing is along
compact ‘sub-affinoid domains’ and not along open sets as one does in algebraic
or differential geometry.
For example, for fixed s > 0 one can glue the closed disks of radius r < s via
the natural inclusions D(0, r) ⊂ D(0, r0 ) whenever r < r0 < s to obtain the
open disk D(0, s)− . In a similar way, one gets open annuli A(r, s)− (for r < s).
Considering the affine line as ‘the open disk of radius ∞’, we obtain AC1,an
p
. We get
1,an
the projective line PCp by gluing two closed disks D(0, 1) along the annulus A(1, 1)
as discussed in the last section.
We also have the following (see the exercises). As usual, R denotes the valuation
ring of K and k the residue field.
For example, taking A = Cp hxi, we find à = F̄p [x], and the reduction map sends
ζ0,1 to the generic point of A1F̄p = Spec à and the points in the open Berkovich
disk D(α, 1)− (for α ∈ D(0, 1)) are mapped to the closed point ᾱ ∈ A1 (F̄p ). In
particular, we see that the preimage of a closed point is an open Berkovich disk,
and the preimage of the open set A1F̄p \ S, where S is a finite set of closed points,
is the closed set D(0, 1) \ s∈S D(αs , 1)− , where αs ∈ D(0, 1) has image s.
S
6.12. Example. Let 0 < r < s be in |K × | and consider as before the algebra EXAMPLE
A = Khs−1 X, rX −1 i. The induced norm on A is given by annuli
X
an X n = max {rn |an | : n ≤ 0} ∪ {sn |an | : n ≥ 0}
A
n∈Z
§ 6. Analytic spaces and functions 38
The anti-continuity of the reduction map ρA implies that the gluing together of
strict affinoids along closed sub-affinoids induces a corresponding gluing of the
Spec Ã’s along open sets. For example, gluing two copies of D(0, 1) along A(1, 1)
to obtain P1,an 1 1
Cp corresponds to gluing to copies of AF̄p along A \{0} (which is Spec Ã
for the annulus A(1, 1)), resulting in P1F̄p in the usual way. The reduction maps
also glue, which in the example gives the map identifying the tangent directions
at ζ0,1 ∈ P1,an 1 1
Cp with P (F̄p ) and sending ζ0,1 to the generic point of PF̄p .
A given space can usually be obtained in many different ways by gluing. This can
result in different ‘special fibers’ (the glued spectra of the algebras Ã) and therefore
different reduction maps. For example, we can obtain PC1,an p
also by gluing D(0, 1)
(via the inversion map) and D(0, 1/p) to A(1/p, 1) along the sub-annuli A(1, 1)
and A(1/p, 1/p). The special fiber of this object consists of two copies of P1F̄p
meeting transversally in one point. It is obtained from the special fiber of the
annulus — two A1 ’s meeting in one point — by gluing an A1 to each of the lines
(along the complement of the intersection point). In a similar way, we can glue
together also more general affinoids (the most general of which is a closed disk
minus a finite union of open ones) to obtain a ‘model’ of P1,an Cp . The skeleton
associated to such a model is the convex hull of the points mapping to generic
points of the special fiber; in the preceding example, this would be the interval
[ζ0,1/p , ζ0,1 ]. In general, it will be a finite sub-tree of P1,an
Cp with endpoints of type 2.
There is a converse to this, valid for general smooth projective irreducible curves C
over Cp : There is a bijective correspondence between finite subgraphs of Berk C
with vertices of type 2 and semistable models of C over the valuation ring R of Cp
(up to isomorphism), such that the vertices correspond to the generic points of
the special fiber of the semistable model under the reduction map.2
In this section we will take a closer look at Berkovich spaces of (smooth projective
irreducible) curves over Cp . We begin with some lemmas. Let Tr = Cp hr−1 xi be
the univariate (generalized) Tate algebra with parameter r > 0.
7.2. Lemma. Let F (x, y) ∈ Tr [y] be monic of degree n such that F (0, y) ∈ Cp [y] LEMMA
has no multiple roots. Then there is 0 < r0 ≤ r and there are h1 , . . . , hn ∈ Tr0 such étale
that F (x, y) = (y − h1 (x)) · · · (y − hn (x)). covering
of disk
Proof. Let η1 , . . . , ηn ∈ Cp be the n distinct roots of F (0, y). By assumption,
βj := F 0 (0, ηj ) 6= 0 (where F 0 denotes the derivative of F as a polynomial in y).
Let F̃j (x, y) := −βj−1 F (x, y + ηj ) Note that F̃j has no constant term and that
the coefficient of y is −1. So, writing F̃j (x, y) = f0 (x) + f1 (x)y + . . . and setting
γ = max{|f0 |r , |1 + f1 |r , |f2 |r , . . .}, we have for every 0 < r0 ≤ r that
|f0 |r0 ≤ γr0 /r , |1 + f1 (x)|r0 ≤ γr0 /r and |fm |r0 ≤ γ for m ≥ 2.
Now pick λ ∈ C×
p with |λ|p ≤ γ
−1
and set
F̃˜j (x, y) = λ−1 F̃j (x, λy) = λ−1 f0 (x) + f1 (x)y + λf2 (x)y 2 + . . . ;
we have |λ−1 f0 |r0 ≤ |λ|−1 0 0
p γr /r, |1+f1 |r0 ≤ γr /r and |λ
m−1
fm |r0 ≤ |λ|m−1
p γ ≤ γ 2−m
for m ≥ 2. Fix 0 < δ < 1 such that δ ≤ γ −1 . With r0 = r min{1, |λ|p γ −1 δ, γ −1 δ}
the assumptions of Lemma 7.1 are then satisfied for F̃˜ , so (translating back to
j
the original coordinates) there is hj ∈ Tr0 with hj (0) = ηj and F (x, hj (x)) = 0.
We can take r0 so that it works for all j; we then obtain n distinct roots of F
in Tr0 . q
§ 7. Berkovich spaces of curves 40
I think that one can take any r0 ≤ r here such that F (ξ, y) has no multiple roots
for all ξ ∈ Cp with |ξ|p ≤ r0 , at least when p > n, but so far I have found no proof
for this more precise statement. We will see below that it is true when n = 2 and
p > 2.
7.3. Example. The example F (x, y) = y 2 − y + x over C2 shows that the EXAMPLE
condition p > n is necessary in general. The two roots of F are necessity
√ ∞ of ‘p > n’
1 − 1 − 4x 2 3 4 5
X 1 2n − 2
n
h1 = = x + x + 2x + 5x + 14x + . . . = x
2 n=1
n n−1
n
and h2 = 1 − h1 . The coefficients are integers, but the coefficient of x2 is odd
for all n, so the coefficients of the series are bounded but do not tend to zero
2-adically. This means that h1 , h2 ∈ Tr for all r < 1, but not for r = 1. On
the other hand, the discriminant of F is 1 − 4x, which does not vanish even for
|x|2 < 4. This seems to be some issue related to ‘wild ramification’. ♣
7.4. Theorem. Let C be a smooth projective irreducible curve over Cp and let THM
φ : C → P1 be a morphism of degree n. Then the induced map φ∗ : C an → P1,an coverings
has fibers of size at most n. of P1
Proof. Let ζ ∈ P1,an be some point; we can assume without loss of generality that
ζ 6= ∞. The statement is clear when ζ is a type 1 point, so we consider ζ not
of type 1; in particular, the seminorm on Cp [x] associated to ζ is a norm. The
point ζ also corresponds to a homomorphism Cp [x] → H into a complete field H
with absolute value such that the absolute value pulls back to the seminorm given
by ζ and the image of Cp [x] in H generates a dense subfield. Let K be the
function field of C; we have an inclusion Cp (x) ⊂ K, which is a field extension
of degree n. Then H ⊗Cp (x) K is an algebra over H of degree n, which splits as
a direct product of finite field extensions of H (whose degrees sum to n). There
is a unique extension of the absolute value of H to each of these fields, which by
pulling back to K (and then restricting to the affine coordinate ring A of φ−1 (A1 ))
give rise to the various multiplicative (semi)norms on A extending k·kζ . So the
fiber φ−1
∗ (ζ) has as many points as there are fields in the splitting of H ⊗Cp (x) K;
this number is at most n. q
This basically shows that we can glue together C an from n copies of P1,an . If the
statement above on the choice of r0 is true and p > n, then it follows that φ∗ can
be branched (i.e., have fibers of size < n) only above points in the convex hull of
the type 1 points corresponding to branch points of φ. This might be true even
when this statement is wrong; in any case it would mean that C an deformation
retracts to a finite graph that is glued from n copies of a tree (= the convex hull
of the branch points).
We now consider hyperelliptic curves in more detail. We assume that p > 2. First
we need two elementary lemmas.
7.5. Lemma. Let f (x) ∈ Tr with |f |r < 1. Then there is h(x) ∈ Tr with |h|r < 1 LEMMA
such that 1 + f (x) = (1 + h(x))2 . square
roots
More generally, the same result applies to any complete ring R in place of Tr as
long as |2|R = 1 and 2 ∈ R× .
§ 7. Berkovich spaces of curves 41
7.6. Lemma. Let H be the completion of the field of fractions of Tr . Then x is LEMMA
not a square in H. x not a
square
Proof. First consider the case that r ∈ |C× p |. Then we can assume r = 1, since
Tr is isomorphic to T1 and the isomorphism only scales x. Assume that x = z 2
is a square in H. Since the field of fractions of T1 is dense in H, there must be
a, b ∈ T1 such that |z − a/b| is small, which implies that |a(x)2 − xb(x)2 | |b(x)|2 .
We can scale a and b so that |b| = 1; then also |a| ≤ 1. Looking at the reductions,
we must have that a(x)2 − xb(x)2 reduces to zero in F̄p [x], but this is impossible,
since the first term has even degree and the second term is nonzero and has odd
degree.
To deal with arbitrary r, we can enlarge Cp to obtain a field K (that is again
complete and algebraically closed) such that r ∈ |K × |. Then the argument above
shows that x is not even a square in a larger field. q
The preceding lemma can be generalized: if f ∈ T1◦ has reduction in F̄p [x] that is
not a square, then f is not a square in H. Conversely (assuming p is odd as we
do here), one can show using Lemma 7.5 that when the reduction of a polynomial
f ∈ Cp [x] ∩ T1◦ is a nonzero square, then f is a square in H. See the exercises.
A hyperelliptic curve C of genus g over Cp can be glued together from two affine
pieces, the plane curves
y 2 = f2g+2 x2g+2 + f2g+1 x2g+1 + . . . + f1 x + f0
and
Y 2 = f0 X 2g+2 + f1 X 2g+1 + . . . + f2g+1 X + f2g+2
with thePidentifications xX = 1 and Y = X g+1 y; we assume that the polynomial
f (x) = j fj xj has no multiple roots and degree at least 2g + 1. If we assume
that f2g+2 6= 0, then C → P1x is unbranched above infinity, so we can restrict our
attention to the first affine patch. We denote the double cover C → P1 given by
the x-coordinate by π, and we denote by Θ the set of branch points in P1 (Cp ) (i.e.,
the roots of f ), which we also consider as type 1 points of P1,an Cp .
7.7. Lemma. Let ζ = ζ0,r ∈ A1,an be of type 2 or 3. We have a natural partition LEMMA
of Θ into finitely many nonempty subsets Θv (indexed by the tangent direction v branching
in which the points in Θv are visible). Then π∗−1 (ζ0,r ) consists of two points if all of π
subsets Θv have even cardinality, and of one point otherwise.
/ |C×
Proof. If r ∈ p | (then ζ is of type 3), then there are just two tangent directions,
which we denote 0 and ∞. Otherwise we select one point a ∈ P1 (Cp ) representing
each occurring tangent direction different from ‘up’ and write Θa for the corre-
sponding subset (and Θ∞ for the subset of roots ξ such that |ξ|p > r). Then we
can write
Y x Y #Θa
Y ξ − a
f (x) = c 1− · (x − a) 1−
ξ∈Θ
ξ a ξ∈Θ
x−a
∞ a
§ 7. Berkovich spaces of curves 42
with some c ∈ C× p . By Lemma 7.5 (note that |x/ξ|r < 1 when |ξ|p > r) each factor
1 − x/ξ in the first product is a square in Tr . Now let H be the completion of the
field of fractions of Tr . Then for ξ ∈ Θa , we have |ξ − a|p < r and |x − a|r = r,
so (ξ − a)/(x − a) ∈ H has absolute value < 1, so 1 − (ξ − a)/(x − a) is also a
square in H using Lemma 7.5 again. Since Cp is Qalgebraically closed, c is of course
also a square. So f (x) is a square in H times a (x − a)#Θa . By Lemma 7.6 and
the remark following it, it follows that the latter and hence f (x) is a square in H
if and only if all exponents #Θa are even (if ζ is of type 3, then we only need
that x is not a square; if ζ is of type 2, we can assume that r = 1). On the other
hand, the degree 2 algebra over H induced by π is H[y]/hy 2 − f (x)i. This is a
field if and only if f (x) is not a square in H; otherwise it splits as a product of
two copies of H (by the Chinese Remainder Theorem). Since the number of fields
in the product decomposition of this algebra is the number of points in the fiber
of π∗ above ζ, this completes the proof. q
7.8. Corollary. Let p > 2 and let C : y 2 = f (x) be a hyperelliptic curve COR
over Cp with hyperelliptic double cover π : C → P1 . Then the induced double hyperelliptic
cover π∗ : C an → P1,an is branched exactly above points ζ such that there is at least Berkovich
one tangent direction at ζ that points to an odd number of roots of f . In particular, curve
branching only occurs along the convex hull T of the branch points of π (i.e., the
roots of f ) considered as points of type 1, and C an can be obtained by gluing two
copies of P1,an along parts of T .
Proof. Let ζ ∈ P1,an . If ζ ∈ / T , then there is some Berkovich disk D(a, r) con-
taining ζ such that D(a, r) does not contain branch points of π. The proof of
Lemma 7.7 in the case that Θ = Θ∞ (or also Lemma 7.5 directly) then shows that
f (x + a) is a square in Cp hr−1 xi, which implies that π∗ is unramified above D(a, r)
and in particular above ζ. So branching can only occur along T ; in particular, no
branching occurs at points of type 4 (which is consistent with the statement, since
there is only one tangent directions in which all 2g + 2 branch points can be seen).
For points ζa,r ∈ T not of type 1, the statement on branching is Lemma 7.7 (after
shifting x by a). For the points ζ of type 1, the statement is clear — note that
when ζ is a branch point of π, then one can see the other 2g + 1 branch points in
the unique tangent direction; otherwise one sees all 2g + 2 branch points in this
direction. q
7.9. Examples. The simplest case is when g = 0 (strictly speaking, a double EXAMPLES
cover of P1 is a hyperelliptic curve only when g ≥ 2, but we can consider them double
also for g = 0 or g = 1). Up to a change of coordinates, we can take f (x) = x; covers of P1,an
then the branch locus in P1 (Cp ) consists of the two points 0 and ∞. Their convex
hull T is simply the arc connecting the two (it consists of all points ζ0,r for r ≥ 0
together with ∞). So as a tree, T has two leaves connected by an odd edge. By
the remarks above, C an is simply connected, which is no surprise, since C ∼ = P1 .
1 1,an
In the sketches below, P (Cp ) is symbolized by the circle and P by the disk it
encloses; odd vertices and edges of T are red and even ones (which do not exist
here) are green.
The next case is when g = 1; it is a bit more interesting. We have four branch
points; by a change of coordinates, we can move three of them to 0, 1, ∞. Then
f (x) = x(x − 1)(x − λ) for some λ ∈ Cp \ {0, 1}. There are four automorphisms
of P1 that fix a set of four points, but there are 24 ways of selecting the three
points that are mapped to 0, 1, ∞, so each curve C arises from (in general) six
different choices of λ. If λ is one of them, then the others are
1 1 1 λ
1 − λ, , 1− , and .
λ λ 1−λ λ−1
In particular, we can assume that |λ|p ≤ 1. There are now two cases.
If |λ|p = |1 − λ|p = 1, then the four branch points lie in four different tangent
directions at ζ0,1 , since the reduction λ̄ ∈ P1 (F̄p ) is distinct from (the reductions of)
0, 1, ∞. So the tree T has one inner vertex of degree 4 and four edges connecting
it to the four leaves. In particular, all vertices and edges are odd, and C an is
simply connected. In this case C is an elliptic curve with good reduction.
In the second case, we can assume (perhaps after replacing λ by 1 − λ) that
|λ|p < 1. Then at ζ0,1 , we see one branch point each when looking to ∞ or 1,
but we see two (namely 0 and λ) when looking to 0. We get a similar partition
at ζ0,|λ|p , which is now {0}, {λ}, {1, ∞}. So T has the two (odd) inner vertices ζ0,1
and ζ0,|λ|p , each of degree 3, there is one (even) edge connecting them, and each of
the two vertices has two more (odd) edges connecting them to two of the branch
points (1 and ∞, resp., 0 and λ). Gluing two copies of T along the odd edges and
vertices then gives a graph that looks like T with the central edge doubled. This
double edge forms a circle to which everything else can be retracted. Its length in
the big metric is twice the length of the edge, so it is 2vp (λ), which is the same as
§ 7. Berkovich spaces of curves 44
−vp (j(C)), where j(C) is the j-invariant of the elliptic curve C, given by
(λ2 − λ + 1)3
j(C) = 28 .
λ2 (1 − λ)2
In any case, C an is not simply connected; its fundamental group is that of the circle,
so ∼= Z. Taking the universal covering space ‘unwraps’ the circle and replaces
it by a line; the universal covering C̃ an of C an is Gan m = P
1,an
\ {0, ∞}. (Gm
denotes the multiplicative group; its affine coordinate ring is Cp [x, x−1 ] and its
Cp -points are Gm (Cp ) = C× p .) Tate has shown (using different language) that the
covering map Gan m → C an
is actually a group homomorphism and that the group of
deck transformations is generated by ‘multiplication by q’, where q ∈ C× p satisfies
|q|p < 1. Pulling back the coordinate functions to Gm gives an explicit analytic
uniformization C(Cp ) ∼ = C× p /q
Z
that is similar in spirit to the uniformization
over the complex numbers: C(C) ∼ = C/(Z + Zτ ) ∼ = C× /q Z , where the second
2πiz 2πiτ
isomorphism is induced by z 7→ e and q = e , with τ ∈ C in the upper
half-plane (so |q| < 1).
∞ ∞
1 1
ζ 0,1 ζ 0,1
ζ 0,|λ|
λ
0 λ 0
8. Integration
By ‘reasonable’, we mean that this integral has the properties we expect from such
a notion, for example:
Z Z Z
(1) Linearity in the 1-form. (α1 ω1 + α2 ω2 ) = α1 ω1 + α2 ω2
γ γ γ
for α1 , α2 ∈ Cp and ω1 , ω2 closed 1-forms on X.
Z Z Z
(2) Additivity on paths. ω= ω+ ω
γ1 +γ2 γ1 γ2
when γ1 (1) = γ2 (0) and γ1 + γ2 is the path obtained by traversing first γ1 and
then γ2 .
Z
(3) Fundamental theorem of calculus. df = f (γ(1)) − f (γ(0))
γ
when f is a function on X.
Z Z
∗
(4) Change of variables. φω= ω
γ φ◦γ
when φ : Y → X is a morphism, γ is a path in Y and ω is a closed 1-form
on X.
Z Z
(5) Homotopy invariance. ω= ω
γ1 γ2
when γ1 and γ2 are homotopic with fixed endpoints. (This includes invariance
under orientation-preserving re-parameterization of the path.)
We note that Property (3) uniquely determines the integral when X = D(0, 1)−
is an open
P disk,n since then any 1-form on X has the form ω = f (t) dt where
f (t) = ∞ a
n=0 n t converges on D(0, 1) −
, and
∞
X an−1 n
f (t) dt = d t ,
n=1
n
where the series on the right converges, too (since for 0 ≤ r < 1 we have that
rn /|n|p ≤ nrn → 0). So every 1-form is exact and Property (3) applies. The same
is true for open Berkovich ‘poly-disks’ (which are the analytic spaces associated
to a product of open disks in Cp ), when we deal with higher-dimensional spaces.
As a more interesting example, let us consider the logarithm, which should be
defined on X = Gan
m = P
1,an
\ {0, ∞} by the integral
Zx
dt
log x = .
t
1
§ 8. Integration 46
We can do the same within any open disk not containing zero; using Property (2)
this gives us
for a ∈ C×
log a(1 + z) = log a + log(1 + z) p , |z|p < 1.
The question then is, how to fix the values of log a when |a − 1|p ≥ 1?
If we use another important property of the logarithm, namely its functional equa-
tion
log(xy) = log x + log y ,
then this determines log on the subgroup R× = {a ∈ C× ×
p : |a|p = 1}: If a ∈ R ,
×
then there is a root of unity ζ ∈ R such that |a−ζ|p < 1. The functional equation
forces us to set log ζ = 0, so
log a = log ζ + log(ζ −1 a) = log(1 + z)
with |z|p = |ζ −1 a − 1|p < 1. Note that the functional equation follows from the
properties an integral should have:
Zxy Zx Zxy Zx Zy
dt (2) dt dt (4) dt dt
log(xy) = = + = + = log x + log y ,
t t t t t
1 1 x 1 1
where we use that dt/t is invariant under the scaling map t 7→ xt.
This does not tell us, however, how to define log p, say. In fact, it turns out that
we can give log p an arbitrary value λ, but then log is indeed uniquely determined
as
log(upv ) = log u + vλ for |u|p = 1, v ∈ Q.
We can take λ = 0, or (at the other extreme) treat λ as an indeterminate and
consider our integrals to have values in Cp [λ]. We write logλ for the ‘branch of the
logarithm’ with logλ p = λ.
As it turns out, fixing λ already fixes the integral. The following was proved by
Berkovich [Ber, Thm. 9.1.1].
8.1. Theorem. If we fix λ, then there is a unique integral satisfying properties THM
(1) to (5) and Existence
and
Zp uniqueness
dt
(6) = λ. of integral
t
1
(In fact, Berkovich does more: he defines sheaves of functions on X that allow for
iterated integration.)
§ 8. Integration 47
8.2. Example. Consider the open Berkovich annulus A = A(r, R)− . In a similar EXAMPLE
way as for an open disk, the analytic functions on A can be written as Laurent Integral
series on an
∞
X annulus
−3 −2 −1 2
f (t) = . . . + a−3 t + a−2 t + a−1 t + a0 + a1 t + a2 t + . . . = an tn
n=−∞
that converge for all τ ∈ Cp such that r < |τ |p < R. Then f (t) dt = dg(t)+a−1 dt/t
where X an−1
g(t) = tn
n6=0
n
is again an analytic function on A. Recall that A is simply connected, so integrals
depend only on the endpoints. If τ1 , τ2 ∈ Cp satisfy r < |τj |p < R, then we obtain
Zτ2 Zτ2 Zτ2 Zτ2
dt (1) dt
f (t) dt = dg(t) + a−1 = dg(t) + a−1
t t
τ1 τ1 τ1 τ1
(3,4,6) τ2
= g(τ2 ) − g(τ1 ) + a−1 logλ .
τ1
Here we use the change of variables formula for the map A → Gan m that multiplies
−1
by τ1 and (6). So if we want the integral to be well-defined (i.e., independent of
the choice of λ), we need a−1 = 0, so that f (t) dt = dg(t) is exact. ♣
8.3. Example. Consider an elliptic curve E over Cp (say, p > 2, but this is EXAMPLE
not really necessary) such that vp (j(E)) < 0. Then (as Tate has shown) there is Integral
q ∈ C× ×
p such that |q|p < 1 and E(Cp ) ' Cp /q ; the latter extends to a covering
Z on Tate
map π : Gan an
m → Gm /q ' E
Z an
that exhibits Ganm as the universal covering of E
an ell. curve
an 1
(recall that Gm is simply connected). Let 0 6= ω ∈ Ω (E) be a regular (and
therefore invariant) differential on E and let γ : [0, 1] → E an be the closed path
∗
that is the image under π of the (unique) path from 1 to q in Gan m . Then π ω is
an invariant differential on Gm , so π ∗ ω = α dt/t for some α ∈ C×p . Then
Z Zq Zq
(4) dt (1) dt (6)
ω= α =α = α logλ q .
t t
γ 1 1
λ
There is a unique λ ∈ Cp such that log q = 0; if we choose this value, then
integrals over ω on E will not depend on the path. But this will not work for
two Tate curves simultaneously when their q parameters are not multiplicatively
dependent. So we cannot avoid the path-dependence of the integral. In particular,
there will not be a function f (in the sense of Berkovich’s integration theory) on
all of E an such that df = ω. This is somewhat analogous with the situation over C,
where there is no holomorphic function f on C× such that df = dz/z. ♣
There is a different kind of integral that one can define on elliptic curves, and more
generally, abelian varieties over Cp . We will do this here for regular (= invariant)
1-forms, but the theory can be extended to arbitrary algebraic 1-forms on analytic
spaces associated to algebraic varieties, see [Col].
This is similar to the standard logarithm, which is related to the group structure
on the multiplicative group Gm . Let ω be an invariant 1-form on an abelian
variety A over Cp (for example, an elliptic curve). There is an open (and closed)
neighborhood U of the origin 0 ∈ A(Cp ) that is a subgroup and analytically
isomorphic to an open polydisk D. Every closed 1-form is exact on an open
§ 8. Integration 48
Rx
polydisk, so we obtain a function logω : U → Cp that satisfies logω x = 0 ω
for x ∈ U . By the same argument as before, logω is a homomorphism on U .
Contrary to Gm , A is projective, which implies that A(Cp )/U is a torsion group
(every element has finite order), so in this case there is a unique extension of logω
to all of A(Cp ) as a group homomorphism logω : A(Cp ) → Cp : if x ∈ A(Cp ) is
arbitrary, then there is some n ∈ Z≥1 such that nx ∈ U ; we must then have
logω x = (logω (nx))/n (and this does not depend on the choice of n with nx ∈ U ).
We define, for ω as above and x, y ∈ A(Cp ),
Zy
Ab
ω = logω y − logω x .
x
Then this integral satisfies Properties (1), (2) and (4) for abelian varieties and in-
variant differentials, where in (4) we only consider morphisms as varieties (which
are morphisms as abelian varieties composed with a translation). This integral
depends only on the endpoints and does not involve a choice of path, so Prop-
erty (5) is trivially satisfied. We can extend it to (the Berkovich spaces associated
to) arbitrary smooth projective varieties and closed regular 1-forms as follows. Let
V be such a variety and let A be its Albanese variety; write [y − x] for the image
of (y, x) under the Albanese map V × V → A. Fixing x0 ∈ V , set α : V → A,
x 7→ [x − x0 ]. Then if ω is a closed regular 1-form on V , there is an invariant
1-form ωA on A such that α∗ ωA = ω (this does not depend on the choice of x0 ); in
fact, α∗ gives a canonical isomorphism of the space of invariant 1-forms on A with
the space of closed regular 1-forms on V . We then define the ‘abelian integral’ as
Zy [y−x]
Z
Ab Ab
ω= ωA = logωA [y − x] .
x 0
It satisfies (1), (2), and (4) for morphisms of varieties. If one extends it to arbitrary
algebraic closed 1-forms as in [Col], then it also satisfies (3) (for a suitable class of
functions) and (6). Also this more general integral depends only on the endpoints,
not on a path between them. This makes it clear that it differs from Berkovich’s
integral, which in general does depend on the chosen path.
(This does not contradict the uniqueness statement in Theorem 8.1, because
the abelian integral is restricted to algebraic objects and morphisms, whereas
Berkovich’s theorem talks about integration for analytic 1-forms, spaces and maps.)
The abelian integral has some useful applications, which derive from the following
easy result. The rank of an abelian group G is defined as rk G = dimQ G ⊗Z Q, or DEF
equivalently, the maximal number of elements of G that are linearly independent rank
over Z (rk G := ∞ if this number is unbounded).
8.4. Theorem. Let A be an abelian variety over Cp and let Γ ⊂ A(Cp ) be a THM
subgroup of rank rk Γ = r < g = dim A. Then there are at least g − r linearly 1-forms
independent (over Cp ) invariant 1-forms ω such that logω γ = 0 for all γ ∈ Γ. killing Γ
Proof. Write Ω1 (A) for the Cp -vector space of invariant 1-forms on A. This space
has dimension g = dim A. The map
Ω1 (A) × A(Cp ) −→ Cp , (ω, x) 7−→ logω x
§ 8. Integration 49
8.5. Corollary. Let C be a curve of genus g > 0 defined over a number field K, COR
with Jacobian variety J. Let P0 ∈ C(K); denote by i the associated embedding killing
C → J sending P0 to the origin. Let Kv be the completion of K at some p- rational
adic place v. If rk J(K) = r < g, then there are at least g − r > 0 Kv -linearly points on
independent 1-forms ω on C over Kv such that a curve
ZP
Ab
ω=0 for all P ∈ C(K).
P0
Proof. We apply Theorem 8.4 and its proof to Γ = J(K) ⊂ J(Kv ) ⊂ J(Cv ). If we
restrict to 1-forms defined over Kv , the proof goes through as before, working with
vector spaces over Kv instead of over Cp . This gives us ω1 , . . . , ωg−r ∈ Ω1 (J/Kv )
such that logωj x = 0 for all j and all x ∈ J(K). Pulling back 1-forms induces an
isomorphism between Ω1 (J/Kv ) and Ω1 (C/Kv ), and we have for any j and any
P ∈ C(K)
ZP i(P )
Z
Ab Ab
i∗ ωj = ωj = logωj i(P ) = 0 ,
P0 0
We would like to obtain a uniform bound for the number of points in C(K). The
following result allows us to reduce to disks and annuli.
8.6. Lemma. Let C and Kv be as above. Then there is a finite collection of LEMMA
analytic maps φm : D(0, 1)− → C an and ψn : A(ρn , 1)− → C an with ρn ∈ |Kv |× partition
(where C an is defined with respect to Kv ⊂ Cp ) defined over Kv such that of C(/Kv )
[ [
φm D(0, 1)− ∩ Kv ∪ ψn A(ρn , 1)− ∩ Kv ,
C(Kv ) =
m n
and the number of the φm and the ψn each are bounded in terms of g and Kv only.
Proof. See [Sto, Prop. 5.3]. The numer of annuli is ≤ 3g − 3, and the number of
disks is in O(qg), where q is the size of the residue field of Kv . q
§ 8. Integration 50
− −
Write Dm = φm (D(0, 1) ∩ Kv ) and An = ψn (A(ρn , 1) ∩ Kv ). It then suffices to
bound # C(K) ∩ Dm and C(K) ∩ An for each m and n. Let V ⊂ Ω1 (C/Kv )
be the space of regular 1-forms that kill C(K) as in Corollary 8.5. Then
ZP
Ab
n o
C(K) ∩ Dm ⊂ P ∈ Dm : ω=0
P0
for every ω ∈ V , and similarly for An . For a disk Dm we have for τ ∈ Kv with
|τ |p < 1
φZ
m (τ ) φZ
m (0) φZ
m (τ ) φZ
m (0) Zτ
Ab Ab Ab Ab
ω= ω+ ω= ω+ φ∗m ω =: h(τ )
P0 P0 φm (0) P0 0
(on a disk there is no difference between the two integrals; this comes from the
definition of logω on U by formal integration of power series on the polydisk D).
Let π be a uniformizer of Kv (i.e., |π|p < 1 and generates the value group of Kv ).
Then the number of zeros of h in Kv ∩ D(0, 1)− ⊂ D(0, |π|p ) is bounded in terms
of the number of zeros of φ∗m ω in D(0, 1)− and |π|p (see Exercises). The number of
zeros of φ∗m ω in D(0, 1)− is the same as the number of zeros of ω in φm (D(0, 1)− ),
which is at most the total number of zeros of ω, which in turn is 2g − 2. This gives
us a bound for #(C(K) ∩ Dm ) that only depends on g and Kv (via |π|p ). Since
the total number of disks Dm is bounded in terms of g and Kv , we get a bound
on the total number of K-points of C contained in some Dm that only depends
on g and Kv .
For annuli, the situation is more complicated. There is the following theorem that
compares the two integrals on an annulus.
8.7. Theorem. Fix a Kv -defined analytic map ψ : A(ρ, 1)− → J an . There is a THM
linear form α : Ω1 (J/Kv ) → Kv such that for any x, y ∈ A(ρ, 1)− we have comparison
of integrals
Zy ψ(y)
Ab
Z on annuli
∗
ψ ω− ω = α(ω) vp (y) − vp (x)
x ψ(x)
8.8. Theorem. Fix d ≥ 1 and g ≥ 3. There is some B = B(d, g) depending only THM
on d and g such that for any number field K with [K : Q] ≤ d and any curve C of uniform
genus g over K, with Jacobian J such that rk J(K) ≤ g −3, we have #C(K) ≤ B. bound on
rational
Proof. We can assume that C(K) 6= ∅; let P0 ∈ C(K) and use it as the base-point points
for an embedding i : C → J as in Corollary 8.5. Fix some prime p; there are then
only finitely many possibilities (up to isomorphism) for the completion Kv at a
p-adic place v. Let B(d, g) be the maximum of the bounds for C(K) obtained by
the reasoning above for g and each Kv . The claim follows. q
For K = Q, one can write down explicit bounds. For hyperelliptic curves (which
allow for quite explicit computations) a possible bound is
(
1 if r = 0 (and g ≥ 3),
#C(Q) ≤ 33(g − 1) +
8rg − 1 if 1 ≤ r ≤ g − 3,
where r = rk J(Q), see [Sto, Thm. 9.1] (this works with Q3 ). In the general case,
one obtains a bound of the shape O(g 2 ), see [KRZB, Thm. 5.1]. For arbitrary d,
one still has a bound of the shape Od (g 2 ), but the implied constant depends
exponentially on d.
Literatur 52
References
[BR] Matthew Baker, Robert Rumely: Potential theory and dynamics on the Berkovich
projective line. Mathematical Surveys and Monographs 159. American Mathematical
Society, Providence, RI, 2010.
[AWS] Matthew Baker, Brian Conrad, Samit Dasgupta, Kiran S. Kedlaya, Jeremy
Teitelbaum: p-adic geometry. Lectures from the 10th Arizona Winter School held at
the University of Arizona, Tucson, AZ, March 10–14, 2007. Edited by David Savitt
and Dinesh S. Thakur. University Lecture Series, 45. American Mathematical Society,
Providence, RI, 2008. xii+203 pages.
[Ber] Vladimir G. Berkovich: Integration of one-forms on p-adic analytic spaces. Annals
of Mathematics Studies 162. Princeton University Press, Princeton, NJ, 2007. vi+156
pages.
[Col] Pierre Colmez: Intégration sur les variétés p-adiques. Astérisque 248, Société
Mathématique de France, Paris, 1998. viii+155 pages.
[KRZB] Eric Katz, Joseph Rabinoff and David Zureick-Brown: Uniform bounds for
the number of rational points on curves of small Mordell-Weil rank, Preprint (2015),
arXiv:1504.00694v2 [math.NT].
[Sto] Michael Stoll: Uniform bounds for the number of rational points on hyperelliptic
curves of small Mordell-Weil rank, Preprint (2015), arXiv:1307.1773v6 [math.NT].