(Algorithms and Computation in Mathematics 26) Oleg Karpenkov (Auth.) - Geometry of Continued Fractions-Springer-Verlag Berlin Heidelberg (2013)
(Algorithms and Computation in Mathematics 26) Oleg Karpenkov (Auth.) - Geometry of Continued Fractions-Springer-Verlag Berlin Heidelberg (2013)
in Mathematics • Volume 26
Editors
Arjeh M. Cohen Henri Cohen
David Eisenbud Michael F. Singer
Bernd Sturmfels
Geometry
of Continued
Fractions
Oleg Karpenkov
Dept. of Mathematical Sciences
University of Liverpool
Liverpool, UK
v
vi Preface
First, I would like to thank Vladimir Arnold, who introduced the subject of con-
tinued fractions to me and who provided me with all necessary remarks and dis-
cussions for many years. Second, I am grateful to many people who helped me with
remarks and corrections related to particular subjects discussed in this book. Among
them are F. Aicardi, T. Garrity, V. Goryunov, I. Pak, E.I. Pavlovskaya, C.M. Series,
M. Skopenkov, A.B. Sossinski, A.V. Ustinov, A.M. Vershik, and J. Wallner. Espe-
cially I would like to express my gratitude to Thomas Garrity and David Kramer
for exhaustively reading through the manuscript and giving some suggestions to
improve the book. Finally, I am grateful to my wife, Tanya, who encouraged and
inspired me during the years of working on this book.
The major part of this book was written at the Technische Universität Graz. The
work was completed at the University of Liverpool. I am grateful to the Technische
Universität Graz for hospitality and excellent working conditions. Work on this book
was supported by the Austrian Science Fund (FWF), grant M 1273-N18.
vii
Contents
ix
x Contents
In the first part of this book we study geometry of continued fractions in the plane.
As usually happens for low dimensions, this case is the richest, and many in-
teresting results from different areas of mathematics show up here. While trying
to prove analogous multidimensional statements, several research groups have in-
vented completely different multidimensional analogues of continued fractions (see,
e.g., Chap. 23). This is due to the fact that many relations and regularities that are
similar in the planar case become completely different in higher dimensions. So we
start with the classical planar case.
The first two chapters are preliminary. Nevertheless, we strongly recommend that
the reader look through them to get the necessary notation. We start in Chap. 1 with
a brief introduction of the notions and definitions of the classical theory of continued
fractions. Further, in Chap. 2, we shed light on the geometry of the planar integer
lattice. We present this geometry from the classical point of view, where one has a
set of objects and a group of congruences acting on the set of objects. This approach
leads to natural definitions of various integer invariants, such as integer lengths and
integer areas.
In Chap. 3 we present the main geometric construction of continued fractions.
It is based on the notion of sails, which are attributes of integer angles in integer
geometry.
In the next three chapters we come to the problem of the description of integer
angles in terms of integer parameters of their sails. We start in Chap. 4 with a con-
struction of a complete invariant of integer angles (it is written in terms of integer
characteristics of the corresponding sails). Further, in Chap. 5, we use these invari-
ants to define trigonometric functions for integer angles in integer geometry. These
functions have many nice properties similar to Euclidean trigonometric functions,
while the others are totally different. For instance, there is an integer analogue of
the formula α + β + γ = π for the angles in a triangle in the Euclidean plane, which
we prove in Chap. 6.
In Chap. 7 we introduce a notion of geometric continued fractions related to
arrangements of pairs of lines passing through the origin in the plane. One can con-
sider these arrangements to be invariant lines of certain real two-dimensional matri-
2
In this chapter we bring together some basic definitions and facts on continued frac-
tions. After a small introduction we prove a convergence theorem for infinite regular
continued fractions. Further, we prove existence and uniqueness of continued frac-
tions for a given number (odd and even continued fractions in the rational case).
Finally, we discuss approximation properties of continued fractions. For more de-
tails on the classical theory of continued fractions we refer the reader to [105].
1
α = a0 + (1.1)
1
a1 +
1
a2 +
.. 1
.+
an
The expression in the right-hand side of the equality is called a continued fraction
expansion of a given number α (or just a continued fraction of α), and denoted by
[a0 ; a1 : · · · : an ]. The numbers a0 , . . . , an are the elements of this continued fraction.
Definition 1.2 A continued fraction is odd (even) if it has an odd (even) number of
elements.
lim [a0 ; a1 : · · · : ak ].
k→∞
We denote it by [a0 ; a1 : · · · ].
Definition 1.4 The number [a0 ; a1 : · · · : ak−1 ] is called the k-convergent (or just
convergent) to the finite or infinite continued fraction [a0 ; a1 : · · · ].
Definition 1.5 When the first element a0 of a continued fraction is an integer and all
other elements are positive integers, we say that this continued fraction is regular.
Let us show how to construct a regular continued fraction for a rational number α.
1
rk−1 = rk−1 + .
1/(rk−1 − rk−1 )
α = [a0 ; a1 : · · · : an−1 : an ],
where an = rn .
1.1 Continued Fractions 5
Remark 1.6 Let ri = pi /qi with positive relatively prime integers pi , qi for i ≥ 1.
Then for every k ≥ 1 we have
pk+1 1 1 qk
= rk+1 = = = .
qk+1 rk − rk pk /qk − pk /qk pk − qk pk /qk
Since rk > 1, its denominator is less than its numerator (i.e., qk+1 < pk+1 = qk ).
Hence the sequence of denominators qk decreases with the growth of k. Therefore,
the algorithm stops in a finite number of steps.
9 1 1
=1+ =1+ .
7 (7/2) 3 + 1/2
The term continued fraction was used for the first time by J. Wallis in 1695, but the
story of continued fractions starts much earlier with the Euclidean algorithm, named
after the Greek mathematician Euclid, who described it in his Elements (Books
VII and X). Actually the algorithm was known before Euclid; it was mentioned
in the Topics of Aristotle. For further historical details we refer to the book [57] by
D.H. Fowler. The goal of the algorithm is to find the greatest common divisor of a
pair of integers. Let us briefly describe the algorithm.
Euclidean Algorithm
Input data. Consider two positive integers p and q.
Goal of the algorithm. To find the greatest common divisor of p and q (usually it
is denoted by gcd(p, q)). We do this in several iterative steps.
Step 1. Set q0 = q. Find integers a0 and q1 with q > r1 ≥ 0 such that
p = a0 q + q1 .
Inductive Step k. Suppose that we have completed k − 1 steps and get the integers
ak−2 and qk−1 . Find integers ak−1 and qk where qk−1 > qk ≥ 0 such that
Remark 1.8 Since the sequence (qk ) is a decreasing sequence of positive integers,
the algorithm always stops.
Remark 1.9 The integers ak−1 and rk at each step are uniquely defined. The Eu-
clidean algorithm actually generates the elements of a continued fraction. We always
have
p
= [a0 ; a1 : · · · : an ].
q
In general one can consider continued fractions with arbitrary real elements. In
Chap. 6 and in Chap. 13 we use continued fractions with arbitrary integer elements
to describe integer triangles and later to generalize to the case of toric varieties. In
Chap. 11 we present some of the geometric ideas behind continued fractions with
arbitrary integer and real coefficients. In the rest of the book we shall deal with
regular continued fractions.
In the case of continued fractions with arbitrary elements, we could encounter
infinity while calculating the value of a continued fraction. For instance, this is the
case for the continued fraction [1; 1 : 0]. To avoid the annoying consideration of
different cases of infinities, we propose to add an element ∞ to the field of real
numbers R and define the following operations:
1 1
∞ + a = a + ∞ = ∞, = ∞, = 0.
0 ∞
Denote the resulting set by R. Notice that we do not add two elements: +∞ and
−∞, but only one: ∞. In some sense we are considering the projectivization of the
real line.
In this notation, for the continued fraction [1 : 1; 0] we have
1 1 1
[1; 1 : 0] = 1 + =1+ =1+ = 1 + 0 = 1.
1 + 1/0 1+∞ ∞
Let us say a few words about the numerators and denominators of rational num-
bers and the elements of their continued fractions. It turns out that the numerator
and the denominator can be expressed as polynomial functions of the elements of
the continued fraction. Namely, there exists a unique pair of polynomials Pk and Qk
in the variables x0 , . . . , xk with nonnegative integer coefficients such that
Pk (x0 , . . . , xk )
= [x0 ; x1 : · · · : xk ], and Pk (0, . . . , 0) + Qk (0, . . . , 0) = 1.
Qk (x0 , . . . , xk )
The first condition defines the polynomials only up to a multiplicative constant, so
the second condition is a necessary normalization condition.
Example 1.10 Let us calculate the above polynomials for the continued fractions
[x0 ], [x0 ; x1 ], [x0 ; x1 : x2 ], . . . . The polynomials are as follows:
P0 (x0 ) = x0 , Q0 (x0 ) = 1;
P1 (x0 , x1 ) = x0 x1 + 1, Q1 (x0 , x1 ) = x1 ;
P2 (x0 , x1 , x2 ) = x0 x1 x2 + x2 + x0 , Q2 (x0 , x1 , x2 ) = x1 x2 + 1;
...
Now the numerator and the denominator are described via polynomials P and Q
as follows. Consider a finite (or infinite) continued fraction [a0 ; a1 : · · · : an ] with
n ≥ k (or [a0 ; a1 : · · · ] respectively). We set
In some sense the numbers pk and qk may be considered partial numerators and
partial denominators of rational numbers. Namely (by Proposition 1.13 below)
these numbers are the numerators and denominators of k-convergents. If k = n,
we get the numerator and the denominator of [a0 ; a1 : · · · : an ].
Let us prove several interesting facts and relations on the numbers pk and qk (and
hence that they are true numerators and denominators for k = n). In Proposition 1.12
below we prove that the integers pk and qk are relatively prime for every k. Then in
Proposition 1.13 we prove an important recurrence relation on both numerators and
denominators, which we will use in several further chapters. After that we deduce
one more useful relation (see Proposition 1.15) and prove Theorem 1.16 on the
convergence of infinite continued fractions.
For the next several propositions we use some additional notation:
Proof Since
p̂k
= [a1 ; a2 : · · · : ak ]
q̂k
we get
pk 1 a0 p̂k + q̂k
= a0 + = .
qk p̂k /q̂k p̂k
Therefore,
pk = λ(a0 p̂k + q̂k ),
qk = λp̂k .
Now we rewrite the second condition for polynomials Pk and Qk :
1 = Pk (0, . . . , 0) + Qk (0, . . . , 0)
= λ a0 Pk−1 (0, . . . , 0) + Qk−1 (0, . . . , 0) + λPk−1 (0, . . . , 0).
It is clear that the last expression represents a positive integer that is divisible by λ.
Hence 1 is divisible by λ, and therefore λ = 1.
and therefore
p2 a 2 + a 0 a 1 a 2 + a 0 a 2 p 1 + p 0
= = .
q2 a1 a2 + 1 a2 q1 + q0
1.2 Convergence of Infinite Regular Continued Fractions 9
pk 1 1
= a0 + = a0 +
qk p̂k /q̂k (ak p̂k−1 + p̂k−2 )/(ak q̂k−1 + q̂k−2 )
a0 (ak p̂k−1 + p̂k−2 ) + ak q̂k−1 + q̂k−2
=
ak p̂k−1 + p̂k−2
ak (a0 p̂k−1 + q̂k−1 ) + (a0 p̂k−2 + q̂k−2 ) ak pk−1 + pk−2
= = .
ak p̂k−1 + p̂k−2 ak qk−1 + qk−2
(The last equality holds by Lemma 1.11.) Therefore, the relations of the system
hold.
From Proposition 1.13 it follows that the partial numerators and denominators are
bounded by the Fibonacci numbers Fi (defined inductively as follows: F1 = F2 = 1,
and Fn = Fn−1 + Fn−2 ).
Corollary 1.14 For regular continued fractions the following estimates hold:
Suppose that the statement holds for k − 2 and k − 1, we prove it for k. Notice
that the qk are all positive and the pk are either all negative or all nonnegative. We
apply Proposition 1.13:
and
qk = ak qk−1 + qk−2 ≥ 1 · Fk + Fk−1 = Fk+1 .
This concludes the proof.
In the next proposition we present another useful expression for partial numera-
tors and denominators.
pk−1 pk (−1)k
− = .
qk−1 qk qk−1 qk
10 1 Classical Notions and Definitions
p0 q1 − p1 q0 = a1 a0 − (a1 a0 + 1) = −1.
Suppose the statement holds for k − 1. Let us prove it for k. By Proposition 1.13
we get
pk−1 qk − pk qk−1 = pk−1 (ak qk−1 + qk−2 ) − (ak pk−1 + pk−2 )qk−1
= −(pk−2 qk−1 − pk−1 qk−2 ) = (−1)k .
Notice that the theorem does not always hold for continued fractions with arbi-
trary real elements.
converges (we leave this statement as an exercise for the reader), the sequence ( pqkk )
is a Cauchy sequence. Therefore, ( pqkk ) converges.
continued fractions,
Theorem 1.17
(i) For every rational number there exist a unique odd and a unique even regular
continued fraction.
(ii) For every irrational number α there exists a unique infinite regular continued
fraction whose k-convergents converge to α.
For instance, 9
7 = [1; 3 : 2] = [1; 3 : 1 : 1] and π = [3; 7 : 15 : 1 : 292 : 1 : 1 : 1 :
2 : · · · ].
Proof of Theorem 1.17 Existence. In Sect. 1.1 we have shown how to construct a
regular continued fraction [a0 ; a1 : · · · : an ] for a rational number α. Notice that if α
is rational but not an integer, then an > 1, and therefore,
One of these continued fractions is odd and the other is even. For an integer α we
always get α = [α] = [α − 1; 1].
In the case of an irrational number α, the algorithm never terminates, generating
the regular continued fraction α = [a0 ; a1 : a2 : · · · ] and the sequence of remainders
rk > 1 such that
α = [a0 ; a1 : · · · : ak−1 : rk ].
Let us show that α = α . From Proposition 1.13 for [a0 ; a1 : · · · : ak−1 : rk ] and
[a0 ; a1 : · · · : ak−1 : · · · ] we have
pn−1 rn + pn−2 pn pn−1 an + pn−2
α= and = .
qn−1 rn + qn−2 qn qn−1 an + qn−2
Proposition 1.18
(i) The sequence of even convergents (p2k /q2k ) is increasing, and the sequence of
odd convergents (p2k+1 /q2k+1 ) is decreasing.
(ii) For every real α and a nonnegative integer k we have
p2k p2k+1
≤α and ≥ α.
q2k q2k+1
Equality holds only for the last convergent in case of rational α.
Proof
(i) By Proposition 1.13 we have
(−1) 1 1
= − .
qm−1 qm−2 qm
Since the sequence of the denominators (qk ) is increasing (by Proposition 1.15),
we have that pm−2 /qm−2 is greater than pm /qm for all even m, and it is less for
all odd m. This concludes the proof of (i).
1.4 Monotone Behavior of Convergents 13
(ii) The sequence of even (odd) convergents is increasing (decreasing) and tends to
α in the irrational case, and we end up with some pn /qn = α in the rational
case. This implies the second statement of the proposition.
In the second statement we show that the larger k is, the better a k-convergent
approximates α.
is strongly decreasing, except for the case of α = [a0 , 1, 1], where this sequence
consists of the following three equivalent elements: (1/2, 1/2, 0).
which is equivalent to
1 1/r2
> ,
a1 + 1/r2 a1 (a1 + 1/r2 )
and further to
a1 r2 > 1.
The only case in which a1 r2 ≤ 1 is a1 = r2 = 1. This is exactly the exceptional case
mentioned in the formulation of the theorem.
Now we proceed with the general case of k ≥ 2. From Proposition 1.13 we have
α − pk−1 = |pk−1 qk−2 − qk−1 pk−2 |
qk−1 qk−1 (qk−1 rk + qk−2 )
1
= . (1.2)
qk−1 (qk−1 rk + qk−2 )
Similarly, we have
1
α − pk = . (1.3)
qk qk (qk rk+1 + qk−1 )
14 1 Classical Notions and Definitions
Let us estimate the expression on the right side of the last equality:
1 1 1
≤ =
qk (qk rk+1 + qk−1 ) qk (qk + qk−1 ) qk (qk−1 ak + qk−2 + qk−1 )
1 1
= ≤
qk (qk−1 (ak + 1) + qk−2 ) qk (qk−1 rk + qk−2 )
1
< . (1.4)
qk−1 (qk−1 rk + qk−2 )
In the last inequality we used the fact that qk > qk−1 , which follows directly from
Proposition 1.13 for k ≥ 2:
Let us substitute the first and the last expressions of inequality (1.4) by equalities
(1.2) and (1.3). We get
α − pk > α − pk−1 .
qk qk−1
Therefore, the sequence is strictly decreasing.
Let c ≥ √1 . Then for every α, the inequality has an infinite number of integer solu-
5
tions (p, q).
Lemma 1.21 Theorem 1.20 is true if for every irrational α, there are infinitely many
integer solutions (p, q) of
α − p ≤ √ 1 .
q 5q 2
Proof It is clear that for rational α = p/q the integer pairs (np, nq) are solutions.
In the irrational case it is enough to prove the theorem for c = √1 . Let us prove that
5
the equality can occur at most twice. Suppose we have
p 1 p̂ 1
α− = ±√ and α − = ±√
q 5q 2 q̂ 5q̂ 2
for some choice of signs. Then
p p̂ 1 1 1
− = √ ± 2 −± 2 .
q q̂ 5 q̂ q
p p̂
= and |q| = |q̂|.
q q̂
Hence equality holds at most for two pairs of integers (p, q) and (−p, −q).
Our next step in the proof of Theorem 1.20 is to give an estimate on the growth
of denominators.
Lemma 1.22 Consider a real number α with regular continued fraction [a0 ; a1 :
· · · ]. Let pk /qk be its k-convergents. Then for an infinite sequence of integers we
have
√
qk 1+ 5
≥ .
qk−1 2
Proof of Theorem 1.20 From Lemma 1.21 it is enough to show that for every irra-
tional α there exist infinitely many pairs of rational numbers satisfying
α − p ≤ √ 1 .
q 5q 2
Consider two consecutive convergents. From Proposition 1.15 we have
pk−1 pk 1
− = .
q q q
k−1 k k−1 qk
In addition, from Proposition 1.18 it follows that α is contained in the segment with
endpoints pk−1 /qk−1 and pk /qk . Hence if the distance between endpoints is small
enough, namely
1 1 1
≤ √ 2 + √ 2,
qk−1 qk 5qk−1 5qk
then either (pk−1 , qk−1 ) or (pk , qk ) is a solution of the above equation. The last
inequality is equivalent to the following:
qk−1 2 √ qk−1
− 5 + 1 > 0.
qk qk
√
±1+ 5
The quadratic polynomial on the left side has two roots: 2 . Hence the inequal-
ity holds when
√
qk−1 1 + 5
≥ .
qk 2
From Lemma 1.22 we have infinitely many k satisfying the last inequality. There-
fore, the number of solutions of the above inequality is also infinite. This concludes
the proof of Theorem 1.20.
It turns out that for certain numbers the estimate of the approximation rate cannot
be essentially improved.
1.5 Approximation Rates of Regular Continued Fractions 17
√
Denote the golden ratio by θ and its conjugate (1 − 5)/2 by θ .
Proof of Proposition 1.23 First of all, let us show that it is enough to check only all
the convergents pk /qk . From Theorem 1.16 it follows that best approximations are
convergents to a number. Let p/q be a rational number such that qk ≤ q < qk+1 .
Therefore,
2
p
2 pk
2 pk
q α − ≥ q α − ≥ qk α − .
q qk qk
Hence if p/q is a solution of inequality (1.5), then pk /qk is a solution of inequality
(1.5) as well. Therefore, if there are infinitely many solutions of inequality (1.5)
then there are infinitely many convergents to the golden ratio among them.
Second, we prove the statement for the convergents. From Proposition 1.13 it
follows that the k-convergent to the golden ratio equals Fk+1 /Fk . Recall Binet’s
formula for Fibonacci numbers via the golden ratio and its conjugate:
k
θk − θ
Fk = √
5
k+1 − θ k+1
k
θ − pk−1 = θ − Fk+1 = θ − θ = θ
qk−1 Fk k k
θk − θ θk − θ
1 − θ 2k
= = √ 1 1 − θ 2k .
k 5Fk2
(θ k − θ )2
2k
Since |θ | < 1, we have |1 − θ | = 1 + o(1) and hence
θ − pk−1 = √ 1 +o 2
1
.
qk−1 2
5qk−1 qk−1
This implies the statement for convergents and concludes the proof of Proposi-
tion 1.23.
18 1 Classical Notions and Definitions
1.6 Exercises
pk−2 pk (−1)k−1 ak
− = .
qk−2 qk qk qk−2
Exercise 1.3 Consider the convergents of a regular continued fraction. Prove that
for every k ≥ 0 we get
1 pk 1
≥ α− > .
qk qk+1 qk qk (qk+1 + qk )
Exercise 1.6 Construct an infinite continued fraction with real coefficients that has
exactly two limit points: 1 and −1.
Chapter 2
On Integer Geometry
In many questions, the geometric approach gives an intuitive visualization that leads
to a better understanding of a problem and sometimes even to its solution. In the next
chapters we give an interpretation of the elements of continued fractions in terms
of integer geometry, with the continued fractions being associated to certain invari-
ants of integer angles. The geometric viewpoint on continued fractions also gives
key ideas for generalizing Gauss–Kuzmin statistics to studying multidimensional
Gauss’s reduction theory, leading to several results in toric geometry.
The notion of geometry in general can be interpreted in many different ways. In
this book we think of a geometry as of a set of objects and a congruence relation,
which is normally defined by some group of transformations. For instance, in Eu-
clidean geometry in the plane, we study points, lines, segments, polygons, circles,
etc., with the congruence relation being defined by the group of all length-preserving
transformations O(2, R) (the orthogonal group). The aim of this chapter is to intro-
duce basic ideas of integer geometry. In order to have a general impression we start
with the simplest, two-dimensional, case. We will need multidimensional integer
geometry only in the second part of the book, so we describe it later, in Chaps. 14
and 15.
We start in Sect. 2.1 with general definitions of integer geometry, and in par-
ticular, define integer lengths, distances, areas of triangles, and indexes of angles.
Further, in Sects. 2.2 and 2.3 we extend the notion of integer area to the case of
arbitrary polygons whose vertices have integer coordinates. In Sect. 2.4 we formu-
late and prove the famous Pick’s formula that shows how to find areas of polytopes
simply by counting points with integer coordinates contained in them. Finally, in
Sect. 2.5 we formulate one theorem in the spirit of Pick’s theorem: it is the so-called
twelve-point theorem.
We consider the integer lattice Z2 in R2 . The objects of integer geometry are integer
points with integer coordinates, integer segments and polygons having all vertices
in the integer lattice, integer lines passing through pairs of integer points, integer
angles with vertices at integer points. The congruence relation is defined by the
group of affine transformations preserving the integer lattice, i.e. Aff(2, Z). This
group is a semidirect product of GL(2, Z) and the group of translations on integer
vectors. We use ∼ = to indicate that two objects are integer congruent.
The congruence relation is slightly different from that in the Euclidean case. We
illustrate this with integer congruent triangles in Fig. 2.1.
Remark 2.1 To avoid confusion we add prefixes for triangles and angles, writing
ABC and ∠ABC respectively.
As long as we have objects and a group of transformations that acts on the objects,
we get invariants—quantities that are preserved by transformations of the objects.
Usually the study of these invariants is the main subject of the corresponding ge-
ometry. For instance, in Euclidean geometry, the invariants are lengths of segments,
areas of polyhedra, measures of angles, etc.
So what are the invariants in integer geometry? There are two different types of
invariants in integer geometry: affine invariants and lattice invariants. Affine invari-
ants are intrinsic to affine geometries in general. Affine transformations preserve
– the set of lines;
– the property that a point on a line is between two other points;
– the property that two points are in one half-plane with respect to a line.
Lattice invariants are induced by the group structure of the vectors in the lattice. The
majority of lattice invariants are indices of certain subgroups (i.e., sublattices).
2.1 Basic Notions and Definitions 21
Recall several notions from group theory. Let H be a subgroup of a group G. Con-
sider an element g ∈ G. The sets
First, we show that for every integer vector g there exists an integer point P ∈
Par(v, w) such that AP ∈ gH . The point P is constructed as follows. Let
g = λ1 v + λ2 w.
22 2 On Integer Geometry
Consider
P = λ1 − λ1 v + λ2 − λ2 w + A.
Since
0 ≤ λ1 − λ1 , λ2 − λ2 < 1,
the point P is inside the parallelogram and AP ∈ gH .
Second, we prove the uniqueness of P . Suppose that for two integer points
P1 , P2 ∈ Par(v, w) we have AP1 ∈ gH and AP2 ∈ gH . Hence the vector P1 P2 is
an element in H . The only element of H of type
αv + βw with 0 ≤ α, β < 1
Then we have six points satisfying the above condition. Therefore, the index of the
sublattice generated by AB and AC is 6.
Now we are ready to define several integer invariants. We start with integer length.
Definition 2.4 The integer length of an integer segment AB is the number of integer
points in the interior of AB plus one. We denote it by l(AB).
For example, the integer lengths of the segments AB and CD are both equal to 2.
Proposition 2.5 Two integer segments are congruent if and only if they have the
same integer length.
bx − ay = 1.
x y
a b
is invertible, and therefore, it is in SL(2, Z). This matrix takes the segment OK
to OB , and thus OK ∼= OB ∼ = OB. Hence all the segments of integer length k are
congruent to the segment OK. Therefore, they are all integer congruent.
Definition 2.6 Consider integer points A, B, and C that do not lie in one line. An
integer distance from the point A to the integer segment (line) BC is the index of
the sublattice generated by all integer vectors AV, where V is an integer point of the
line BC in the integer lattice. We denote it by ld(A, BC).
The points A, B, and C are collinear, we agree to say that the integer distance
from A to BC is zero.
Fig. 2.2 A triangle ABC and the sublattices for calculation of ld(A, BC), lS(ABC), and
lα(∠BAC)
There are two integer lines parallel to AB and lying in the region with boundary
lines AB and AA . Hence, ld(A, BC) = 2 + 1 = 3.
Let us start with the notion of integer area for integer triangles. We will define the
integer areas for polygons later, in the next subsection.
For instance, the triangle in Fig. 2.2 has integer area equal to 6.
As in Euclidean geometry, integer area does not uniquely determine the congru-
ence class of triangles. Nevertheless, all integer triangles of unit integer area are
congruent, since the vectors of the edges of such triangles generate the integer lat-
tice.
An angle is called rational if its vertex is an integer point and both its edges contain
integer points other than the vertex.
2.2 Empty Triangles: Their Integer and Euclidean Areas 25
Definition 2.8 The index of a rational angle ∠BAC, denoted by lα(∠AOB), is the
index of the sublattice generated by all integer vectors of the lines AB and AC in the
integer lattice.
In addition, if the points A, O, and B are collinear, we say that lα(∠AOB) = 0.
Geometrically, the index of an angle ∠BAC is the integer area of the smallest tri-
angle AB C whose edges AB and AC generate the sublattices of lines containing
the corresponding edges. For instance, the angle ∠BAC in Fig. 2.2 has index equal
to 2:
Remark 2.9 We write l, ld, lS, and lα (starting with the letter “l”) to indicate that
all these notions are well defined for any lattice, and not just for the integer lattice.
Definition 2.10 An integer triangle is called empty if it does not contain integer
points other than its vertices.
We show that empty triangles are exactly the triangles of integer area 1 and Eu-
clidean area 1/2. In particular, this means that all empty triangles are integer con-
gruent (as we show later, this is not true in the multidimensional case for empty
tetrahedra).
Denote by S(ABC) the Euclidean area of the triangle ABC. Recall that
1
S(ABC) = det(AB, AC).
2
(Here by det(v, w) we denote the determinant of the (2 × 2) matrix whose first and
second columns contain the coordinates of the vectors v and w respectively.)
Proposition 2.11 Consider an integer triangle ABC. Then the following state-
ments are equivalent:
(a) ABC is empty;
(b) lS(ABC) = 1;
(c) S(ABC) = 1/2.
26 2 On Integer Geometry
Proof (a) ⇒ (b). Let an integer triangle ABC be empty. Then by symmetry, a
parallelogram with edges AB and AC does not contain integer points except for
its vertices as well. Therefore, by Proposition 2.2 there is only one coset for the
subgroup generated by AB and AC. Hence, AB and AC generate the integer lattice,
and lS(ABC) = 1.
(b) ⇒ (c). Let lS(ABC) = 1. Hence the vectors AB and AC generate the integer
lattice. Thus any integer point is an integer combination of them, so
b1 c1 λ1 μ 1 1 0
= .
b2 c2 λ2 μ2 0 1
Since these matrices are integer, both their determinants equal either 1 or −1. Hence
the Euclidean area of ABC coincides with the Euclidean area of the triangle with
vertices (0, 0), (1, 0), and (0, 1) and equals 1/2.
(c) ⇒ (a). Consider an integer triangle of Euclidean area 1/2. Suppose that it has
an integer point in the interior or on its sides. Then there exists an integer triangle
with Euclidean area smaller than 1/2, which is impossible (since the determinant of
two integer vectors is an integer).
Remark 2.12 Proposition 2.11 implies that all empty integer triangles are congru-
ent.
Proposition 2.13
(i) Every integer polygon admits a decomposition into empty triangles.
(ii) Two decompositions of the same polygon into empty triangles have the same
number of empty triangles.
2.3 Integer Area of Polygons 27
Proposition 2.13 introduces a natural extension of the integer area to the case of
integer polygons.
Definition 2.14 The integer area of an integer polygon is the number of empty
triangles in its decomposition into empty triangles.
Since P is not on AB, we have #(APB) < k and #(CPD) < k. Therefore,
by the inductive hypothesis, we get
Proof The statement holds for empty triangles by Proposition 2.11. Now the corol-
lary follows directly from Proposition 2.13 and the definition of integer area.
Remark 2.16 Corollary 2.15 implies that the index of an integer sublattice generated
by vectors AB and AC in the integer lattice equals
det(AB, AC).
We conclude this section with a nice formula that describes a relation between inte-
ger points in a polygon and its Euclidean area.
Theorem 2.17 (Pick’s formula) The Euclidean area S of an integer polygon satis-
fies the following relation:
S = I + E/2 − 1,
where I is the number of integer points in the interior of the polygon and E is the
number of integer points in the edges.
For the integer area one should multiply the right part of the formula by two.
the number of inner integer points equals 4, the number of integer points on the
edges equals 6, and the area equals 6. So,
6 = 4 + 6/2 − 1.
Proof of Theorem 2.17 We prove Pick’s formula by the induction on the area.
Base of induction. If the Euclidean area of an integer polygon is 1/2, then by
Proposition 2.11 the polygon is an empty triangle. For the empty triangle we have
S = 0 + 3/2 − 1 = 1/2.
Induction step. Fix k. Suppose that the statement holds for every integer polygon
of area k /2, where k < k. Let us prove the statement for polygons of area k/2. Let
P be an integer polygon of area k/2. Then it can be decomposed into two integer
polygons P1 and P2 (for instance as in the proof of Proposition 2.13(i)) intersecting
30 2 On Integer Geometry
E + E ∗ = 12.
2.6 Exercises
Exercise 2.1 For any triangle ABC, show that
Exercise 2.2 Let 1 , 2 , and 3 be three parallel integer lines and let Ai ∈ li be
integer points. Suppose also that the lines 1 and 3 are in different half-planes with
respect to the line 2 . Prove that
Exercise 2.4 Find an example of two integer noncongruent triangles that have the
same integer area.
Exercise 2.7 Prove that the index of a subgroup generated by v and w in the integer
lattice is the absolute value of det(v, w).
Continued fractions play an important role in the geometry of numbers. In this chap-
ter we describe a classical geometric interpretation of regular continued fractions in
terms of integer lengths of edges and indices of angles for the boundaries of convex
hulls of all integer points inside certain angles. In the next chapter we will extend
this construction to construct a complete invariant of integer angles. For the geom-
etry of continued fractions with arbitrary elements see Chap. 11.
It is interesting to note that in the case of α = 7/5, shown in Fig. 3.1, the ratios
of the coordinates of extremal points of the convex hulls
are convergents to the number 7/5. This does not happen by chance for 7/5; such
a situation is typical. In next theorem we show a general relation between Ai and
Bi and the convergents for an arbitrary real number α ≥ 1 (we discuss the case of
0 ≤ α < 1 later, in Theorem 3.4).
where pk /qk are convergents. The only exception is for the rational case for the
following reason: the last vertices of the principal parts for both sails coincide with
(pn , qn ), where pn /qn is the last convergent, i.e., α = pn /qn .
Lemma 3.2
(a) The segment with endpoints (p2k−2 , q2k−2 ) and (p2k , q2k ) is in the sail
A0 A1 A2 . . . .
(b) The segment with endpoints (p2k−1 , q2k−1 ) and (p2k+1 , q2k+1 ) is in the sail
B0 B1 B2 . . . .
3.1 Classical Construction 35
Proof We start with item (a). First, notice that both points (p2k−2 , q2k−2 ) and
(p2k , q2k ) are in the convex hull of the sail including A0 A1 A2 . . . , since
p2k−2 p2k
α> and α > .
q2k−2 q2k
Consider the line l passing through the points (p2k−2 , q2k−2 ) and (p2k , q2k ).
From Proposition 1.13 we know that all integer points on l are as follows:
Let us prove that the line l is at unit integer distance to the origin. The inte-
ger distance is equivalent to the index of the sublattice generated by the vectors
(p2k−2 , q2k−2 ) and (p2k−1 , q2k−1 ). From Proposition 1.15 it follows that
i.e., the Euclidean area of the corresponding triangle is 1/2. Hence by Proposi-
tion 2.11 these vectors generate the lattice, and ld((0, 0), l) = 1.
Therefore, there is no integer point in the interior of the band between l and the
line parallel to l and passing through the origin.
It is clear that the point (p2k−2 , q2k−2 ) − (p2k−1 , q2k−1 ) has nonpositive coordi-
nates, and therefore it is not in the cone.
Let us show that the point
is not in the cone. Notice that the point (p2k , q2k ) + (p2k−1 , q2k−1 ) belongs to the
segment with endpoints
(p2k−1 , q2k−1 ) and (p2k+1 , q2k+1 ) = (p2k−1 , q2k−1 ) + a2k+1 (p2k , q2k )
Proof of Theorem 3.1 Actually, Lemma 3.2 almost proves the theorem. We have to
check only the endpoints of broken lines.
First, note that A1 = (α, 1), i.e., A1 = (p0 , q0 ).
Second, (if α ∈/ Z), we have
B1 = (0, 1) + a2 α, 1 = (p1 , q1 ).
Finally, we have to check in the case of rational α whether the segment with
endpoints (pn−1 , qn−1 ) and (pn , qn ) for the last two convergents is in one of the
two sails. The point (pn−1 , qn−1 ) is in one of the two sails by Lemma 3.2. The
36 3 Geometry of Regular Continued Fractions
point (pn , qn ) is in the intersection of the sails. The last thing we have to show
is that the triangle with vertices (0, 0), (pn−1 , qn−1 ), and (pn , qn ) is empty. From
Proposition 1.15 it follows that the Euclidean area of the triangle is 1/2; hence
by Proposition 2.11 the triangle is empty. Therefore, the segment with endpoints
(pn−1 , qn−1 ) and (pn , qn ) is in the sail.
We have found all the segments of the principal parts of both sails, concluding
the proof.
Remark 3.3 In the case of rational α with the principle parts of the corresponding
angles A1 . . . An and B1 . . . Bm , we have the following. If the last element of the odd
continued fraction is 1, then n = m + 1, otherwise n = m.
where pk /qk are convergents. The only exception is when α is rational, in which
case the last vertex of the principal parts for both sails coincide with (pn , qn ), where
pn /qn is the last convergent, i.e. α = pn /qn .
Proof The proof repeats the proof of Theorem 3.1 except for the following differ-
ence. Since a0 = 0, the point A0 should coincide with A1 . This is the explanation
for the shift in indices.
3.2 Geometric Interpretation of the Elements of Continued Fractions 37
In the case of 0 < α < 1, the corollary also holds, the only difference being that
l(Ai Ai+1 ) = a2i+2 instead a2i .
Proof of Corollary 3.5 The corollary follows directly from the explicit formulas for
Ak and Bk of Theorem 3.1, after applying Proposition 1.13.
Theorem 3.1 and Corollary 3.5 lead to an interesting algorithm for constructing
sails for regular continued fractions.
Output. If α is rational, then the algorithm constructs both sails in finite time. If α
is irrational, then the algorithm also calculates both sails but in infinitely many
steps.
We leave the case 0 < α < 1 as an easy exercise for the reader.
In Fig. 3.3 we show the steps of the algorithm for the particular case α = [1; 2 : 2].
38 3 Geometry of Regular Continued Fractions
Proof Let us calculate the index of an integer angle at some vertex of the principal
part of one of the two sails. By Theorem 3.1 it is equivalent to calculate the index of
the angle between a pair of vectors (pi−1 , qi−1 ) and (pi+1 , qi+1 ):
lα ∠(pi−1 , qi−1 )(0, 0)(pi+1 , qi+1 )
= |pi−1 qi+1 − pi+1 qi−1 | = pi−1 (ai qi + qi−1 ) − (ai pi + pi−1 )qi−1
= ai |pi−1 qi − pi qi−1 | = ai .
The second equality follows from Proposition 1.13, while the last follows from
Proposition 1.15.
Edge–Angle Duality From Corollary 3.5 and Theorem 3.6, we get that
lα(∠Ai Ai+1 Ai+2 ) = l(Bi Bi+1 ) and lα(∠Bi Bi+1 Bi+2 ) = l(Ai+1 Ai+2 ).
For the example of α = 7/5 (see Figs. 3.1 and 3.4), we get
lα(∠A0 A1 A2 ) = l(B0 B1 ) = a1 = 2;
lα(∠B0 B1 B2 ) = l(A1 A2 ) = a2 = 2.
3.4 Exercises 39
3.4 Exercises
Exercise 3.2 Describe the edge–angle duality for the case 0 < α < 1.
Exercise 3.3 Construct the sails of continued fractions for rational numbers 3/2,
10/7, 18/13, and 13/18.
Exercise 3.4 Find geometrically all the convergents to the rational numbers 8/5,
9/4, and 17/8.
Exercise 3.5 Is it sufficient to know all indices for the angles in the principal parts
of the sail defined by α > 1 and the indices of angles of its dual sail to reconstruct
α in a unique way? The question is interesting for both of the following cases: if α
is rational and if α is irrational.
Chapter 4
Complete Invariant of Integer Angles
Definition 4.1 Let ∠ABC be a rational angle, where A and C are lattice points
distinct from B. The integer sine of the angle is the following number:
lS(∠ABC)
,
l(AB) l(BC)
Remark 4.2 Notice that the integer sine is well defined, i.e., it does not depend on
the choice of points B and C. We leave the reader to check this as an exercise.
Let us briefly compare sin and lsin. One difference is in the multiplicative con-
stant 1/2. This is due to the fact that an empty triangle has integer area 1 but Eu-
clidean area 1/2, as shown in Proposition 2.11. Yet the difference between sin and
lsin is much stronger, as illustrated in the following proposition.
Proposition 4.3 The integer sine of a rational angle coincides with the index of the
angle.
In particular, this implies that the integer sine takes all possible nonnegative in-
teger values.
Proof of Proposition 4.3 Consider a rational angle ∠ABC with A, B, C not con-
tained on one line. Let A and C be the nearest integer points to B in the open rays
BA and BC, respectively. Then from the definition of integer length, we get
l BA = l BC = 1.
Hence,
lsin ∠AB C = lS ∠AB C .
The integer area of AB C is the index of the sublattice generated by BA and BC
in Z2 . Since the vectors BA and BC generate all integer points of the lines AB and
AC, the integer area of AB C is equivalent to the index of the angle. Therefore,
we get
lsin(∠ABC) = lα(∠ABC).
lsin(∠ABC) = lα(∠ABC) = 0.
In Chap. 3 we studied sails for a certain subset of integer angles. Let us extend the
notion of sails to the case of arbitrary integer angles.
Definition 4.4 Let α be an arbitrary integer angle. Consider the convex hull of the
set of all integer points in α except the origin. The boundary of this convex hull is
called the sail of this angle.
Remark 4.5 Notice that the sail of a lattice angle is a broken line homeomorphic
to R. A sail is either a two-sided infinite broken line or a one-sided infinite broken
line including also a ray at one side or a finite broken line that includes two rays
at both sides. These rays appear when edges of angles contain integer points other
than the vertex of the corresponding angle.
Let us now define a very important characteristic of the sail that we will use in
the future.
Definition 4.6 Consider an arbitrary angle with integer vertex. Let the sail for this
angle be a broken line with sequence of vertices (Ai ). Define:
a2k = l Ak Ak+1 ,
a2k−1 = lsin(∠Ak−1 Ak Ak+1 )
for all admissible indices. The lattice length sine sequence (LLS sequence for short)
for the sail is the sequence (an ).
The LLS sequence can be either finite or infinite on one or both sides.
Remark 4.7 Notice that LLS sequences contain only positive integer elements.
Consider a finite to the left LLS sequence. Regular continued fractions define a
one-to-one correspondence between such LLS sequences and real numbers not
smaller than 1; namely, the sequence (a0 , a1 , . . .) corresponds to the real number
[a0 ; a1 : · · · ]. This continued fraction is an important invariant of integer angles (we
will call it the integer tangent), and we will study this invariant in the next chapter.
Proposition 4.8 Integer length, integer area, index and integer sine of a rational
angle are invariant under the action of the group of integer affine transformations
Aff(2, Z).
44 4 Complete Invariant of Integer Angles
Proof Every integer linear transformation sends subgroups to subgroups and the
corresponding cosets to the corresponding cosets; hence all integer indices are in-
variant under integer linear transformations. Thus, every integer affine transforma-
tion preserves indexes of the corresponding integer lattice subgroups in Z2 as well.
Therefore, the integer area and the index (and hence the integer sine) of a rational
angle are also preserved. The integer length is preserved, since all the inner integer
points map to inner integer points.
Corollary 4.9 The LLS sequence is an invariant of lattice angles with respect to
Aff(2, Z).
Proof First, note that convex hulls are preserved by the elements of Aff(2, Z) (we
leave this as an easy exercise for the reader). Then the statement follows directly
from the fact that the integer length and the index are invariants of Aff(2, Z).
It is interesting to note that the angles ∠ABC and ∠CBA are not necessarily inte-
ger congruent (i.e., there is no integer affine transformation taking one to another).
For example, if we consider
then ∠ABC and ∠CBA are not integer congruent. We will say more about such pairs
of angles in the next section.
Theorem 4.10 Two angles with vertices at integer points are lattice congruent if
and only if they have the same LLS sequences.
Proof From Corollary 4.9 we know that the LLS sequence is an integer invariant of
angles. It remains to prove that if two LLS sequences coincide, then the correspond-
ing angles are integer congruent.
Consider two angles α and β with sails (Ai ) and (Bi ). Let the corresponding
LLS sequences coincide and be equivalent to (ai ). Let the vertices of the angles α
and β be Oα and Oβ respectively. Consider an affine transformation taking Oβ to
Oα , B0 to A0 , and B1 to A1 . This affine transformation is integer, since l(A0 A1 ) =
l(B0 B1 ) = a0 and ld(Oα , A0 A1 ) = ld(Oβ , B0 B1 ). Suppose the angle β is taken to
some angle γ with sail (Ci ) (we already know that the vertex of γ is Oα , C0 = A0 ,
and C1 = A1 ).
Let us prove that the broken lines A0 A1 A2 . . . and C0 C1 C2 . . . coincide by in-
duction. Suppose A0 A1 . . . Ak−1 coincides with C0 C1 . . . Ck−1 . Let us prove that
Ak = Ck .
First, we know that
and
l(Ak−1 Ak ) = l(Ck−1 Ck ) = a2k−2 .
4.3 On Complete Invariants of Angles with Integer Vertex 45
Hence we have
This follows from the simple fact (see Exercise 4.6) that
lS(∠ABC)
ld(A, BC) = .
l(BC)
Notice that ∠Ak−2 Ak−1 Ak and ∠Ck−2 Ck−1 Ck are two parts of convex sails, and
hence the points Ak and Ck are on the other side of the point Oα with respect to
the line Ak−2 Ak−1 = Ck−2 Ck−1 . Hence Ak and Ck are both on a line l1 parallel to
the line Ak−2 Ak−1 containing points at integer distance a2k−2 a2k−3 from the line
Ak−2 Ak−1 .
Second, we have
where Oα Ak−1 = Oα Ck−1 . Again, by convexity we know that the points Ak and
Ck are in a different half space from the point Ak−2 = Ck−2 with respect to the line
Oα Ak−1 . Therefore, Ck and Ak are on a line l2 parallel to Oα Ak−1 .
Since Oα Ak−1 and Ak−2 Ak−1 are not parallel, the intersection of lines l1 and l2
is a point coinciding with both Ak and Ck .
Therefore, the broken lines A0 A1 A2 . . . and C0 C1 C2 . . . coincide.
The fact that . . . A−1 A0 A1 and . . . C−1 C0 C1 coincide follows from the consid-
ered case after performing a GL(2, Z) transformation taking A0 to A1 and A1 to
A0 , namely the transformation
1 0
.
a0 −1
We apply this transformation to both sails (Ai ) and (Ci ) and get that the images of
Ck and Ak coincide for every negative k. Therefore, the whole sails (Ai ) and (Bi )
coincide.
Theorem 4.11 For every sequence of positive integers (odd finite or infinite on one
or both sides) there exists an angle with vertex at the origin whose LLS sequence is
this sequence.
The remaining case is the case of both-side infinite sequences. Here we construct
the angle for the part with nonnegative indices. Then apply to this angle the trans-
formation we used in the proof of Theorem 4.10 and construct the angle for the
remaining part of the sail and get the angle with the prescribed LLS sequence.
4.4 Exercises
Exercise 4.1 Show that the integer sine of a rational angle ∠ABC does not depend
on the choice of integer points B and C on the edges.
Exercise 4.2 Prove that convex sets are taken to convex sets and their boundaries
to boundaries under affine transformations.
Exercise 4.3 Let A = (3, −2), B = (0, 0), and C = (2, 1). Prove that ∠ABC and
∠CBA are not integer congruent.
Exercise 4.4 Prove that if the principal parts of the sails of two angles are integer
congruent, then the angles themselves are integer congruent.
Exercise 4.5 Let d be a positive integer and l an integer line. Prove that all lattice
points lying at integer distance are contained in two lines parallel to l at the same
Euclidean distance from l.
ltan(∠ABC) = [a0 ; a1 : a2 : · · · ].
When the points A, B, and C are collinear (but the points A and C are distinct from
B) we say that ltan(∠ABC) = 0.
lS(ABC)
lsin(∠ABC) = .
l(AB) l(BC)
lsin α
lcos α = .
ltan α
It is clear that the integer sine, integer tangent, and therefore integer cosine are
invariants of Aff(2, Z).
Now we give the inverse function to the integer tangent.
{y = 0 | x ≥ 0} and {y = αx | x ≥ 0}.
We define the zero angle as larctan 0. The angle π is the angle ABC, where A =
(1, 0), B = (0, 0), and C = (−1, 0).
First we show that the integer tangent and arctangent are in fact inverse to each
other.
Proposition 5.4
(i) For every real s ≥ 1, we have ltan(larctan s) = s.
(ii) For every rational or R-irrational angle α, the following holds:
larctan(ltan α) ∼
= α.
Proof
(i) From Corollary 3.5 and Theorem 3.6 we have that the elements of the LLS se-
quence for larctan s coincide with the elements of the regular continued fraction
for s. Hence the statement holds by definition of the integer tangent.
(ii) Both angles larctan(ltan α) and α have the same LLS sequences. Therefore,
they are congruent by Theorem 4.10.
Proposition 5.5
(i) The values of integer trigonometric functions for integer congruent angles co-
incide.
(ii) The lattice sine and cosine of any rational angle are relatively prime positive
integers.
(iii) For every angle α the following inequalities hold:
Equality holds if and only if the lattice vectors of the angle rays generate the
whole lattice.
(iv) (Description of lattice angles.) Two integer angles α and β are congruent if
and only if ltan α = ltan β.
Proof
(i) This is a direct corollary of the fact that integer sine and tangent are defined
only by values of certain indices.
(ii) By Proposition 5.4 it is enough to prove the assertion for angles larctan α for
rational α ≥ 1.
Consider m n ≥ 1, where m and n are relatively prime integers. Then the
sail of the angle will contain the point (n, m). It is also clear that the lat-
tice distance between the point (n, m) and the line y = 0 is m, and hence
n ) = m.
lsin(larctan m
Since ltan(larctan m n) = n n ) = m, we have
m
and lsin(larctan m
lcos(larctan n ) = n.
m
(iii) This is true for all integer arctangent angles. Therefore, it is true for all angles.
(iv) The LLS sequence is uniquely defined by the integer tangent. Therefore, the
statement follows from Theorem 4.10.
Definition 5.6 The integer angle ∠BOA is said to be transpose to the integer angle
∠AOB. We denote it by (∠AOB)t .
It immediately follows from the definition that for any integer angle α, we have
t t
α ∼ = α.
Further, we will use the following notion. Suppose that some arbitrary integers
a, b, and c, where c ≥ 1, satisfy ab ≡ 1(mod c). Then we write
−1
a ≡ b(mod c) .
50 5 Integer Trigonometry for Integer Angles
For the trigonometric functions of transpose integer angles the following rela-
tions hold.
Theorem 5.7 (Trigonometric relations for transpose angles) Let an integer angle α
be not contained in a line. Then
(1) If α ∼
= larctan(1), then α t ∼
= larctan(1).
(2) If α larctan(1), then
−1
lsin α t = lsin α, lcos α t ≡ lcos α(mod lsin α) .
p p
det = −1.
q q
p − p q − q
ξ= .
p −q
Since det(ξ ) = −1, the transformation ξ is integer-linear and changes the ori-
entation. Direct calculations show that the transformation ξ takes the angle
larctant (p/q) to the angle larctan(p/(p − p )). (See the example in Fig. 5.1.)
Since gcd(p, p ) = 1 and p > p − p , the following holds:
lsin(α t ) = p,
lcos(α t ) = p − p .
For the trigonometric functions of adjacent integer angles the following relations
hold.
Theorem 5.9 (Trigonometric relations for adjacent angles) Let α be some integer
angle. Then one of the following holds:
(1) If α is the zero angle, then π − α = ∼ π.
(2) If α is the straight angle, then π − α ∼ = 0.
(3) If α ∼
= larctan(1), then π − α ∼ = larctan(1).
(4) If α is neither zero nor straight nor integer-congruent to larctan(1), then
ltan α
π −α∼ = larctant
,
ltan(α) − 1
−1
lsin(π − α) = lsin α, lcos(π − α) ≡ − lcos α(mod lsin α) .
52 5 Integer Trigonometry for Integer Angles
Remark 5.10 Suppose that an integer angle α is neither zero nor straight. Then the
conditions
lcos(π − α) ≡ (− lcos α(mod lsin α))−1 ,
0 < lcos(π − α) ≤ lsin α,
uniquely determine the value lcos(π − α).
Proof of Theorem 5.9 Consider an integer angle α. Directly from the definitions it
follows that if α ∼
= 0, then π − α ∼
= π , and, if α ∼
= π , then π − α ∼
= 0.
Suppose that ltan α = p/q > 0, where gcd(p, q) = 1. Then by Proposition 5.4(ii)
we have α ∼= larctan(p/q). Therefore,
π −α∼
= π − larctan(p/q).
−1 1
ξ1 = .
0 1
Since det(ξ1 ) = −1, the transformation ξ1 is integer-linear and changes the ori-
entation. Direct calculations show that the transformation ξ1 takes the cone cor-
responding to the angle π − larctan(p/q) to the cone corresponding to the an-
gle larctant (p/(p − q)). Since ξ1 changes the orientation, we have to transpose
larctant (p/(p − q)). (See the example in Fig. 5.2.) Therefore,
ltan α
π −α∼ = larctant
.
ltan(α) − 1
Let A = (1, 0), B = (q, p), and O = (0, 0). Consider the sail of the integer angle
π − larctan(p/q), which is integer-congruent to π − α. Suppose that an integer
point C = (q , p ) of the sail for π − larctan(p/q) is the closest integer point to the
endpoint of the sail B (or equivalently, the segment BC is in the sail and has the
unit integer length). The coordinate p is positive, since p/q > 1. Since the triangle
BOC is empty and the vectors OB and OC define the same orientation as OA
and OB,
p p
det = 1.
q q
Consider a linear transformation ξ2 of the two-dimensional plane,
5.4 Adjacent Integer Angles 53
p − p q − q
ξ2 = .
−p q
p
lcos(π − α) = lcos larctan = p − p .
p − p
Remark 5.11 The statements of Theorem 5.7 and Theorem 5.9 were first intro-
duced in terms of regular continued fractions and so-called zig-zags by P. Popescu-
Pampu [169].
π − (π − α) ∼
= α.
We define right integer angles by analogy with Euclidean angles, using their sym-
metric properties.
It turns out that in integer geometry there exist exactly two integer inequivalent
right integer angles.
Hence,
−1 −1
lcos α(mod lsin α) ≡ − lcos α(mod lsin α) .
First we give the definition of parallel lines. Two integer lines are said to be parallel
if there exists an integer shift of the plane by an integer vector taking the first line to
the second.
Definition 5.15 Consider two distinct integer parallel lines AB and CD, where A,
B, C, and D are integer points. Let the points A and D be in different open half-
planes with respect to the line BC. Then the integer angle ∠ABC is called opposite
interior to the integer angle ∠DCB.
Proposition 5.16 Two integer angles opposite interior to each other are integer-
congruent.
Proof Consider two distinct integer parallel lines AB and CD. Let the points A and
D be in distinct open half-planes with respect to the line BC. Let us prove that
∠ABC ∼ = ∠DCB.
Consider the central symmetry S of the two-dimensional plane at the midpoint
of the segment BC. Let P be an arbitrary integer point. Note that S(P ) = C + PB.
Since the point C and the vector PB are both integer, S(P ) is also integer. Since the
central symmetry is self-inverse, the inverse map S −1 also takes the integer lattice
to itself.
Therefore, the central symmetry S is an integer-affine transformation. Since S
takes the angle ∠ABC to the angle ∠DCB, we obtain ∠ABC ∼ = ∠DCB.
5.7 Exercises
Exercise 5.1 Find both algebraically and geometrically the integer trigonometric
functions for the adjacent and transpose angles to the angles 73 , larctan 94 , larctan 13
5 .
Exercise 5.2 Let α be an arbitrary integer angle. Find expressions in the trigono-
metric functions for the angles π − α t and (π − α)t . Are these two angles integer
congruent?
Exercise 5.3 Can an integer triangle have two integer right angles? What about
three integer right angles?
Chapter 6
Integer Angles of Integer Triangles
In this chapter we study integer triangles, based on the results from previous chap-
ters. We start with the sine formula for integer triangles. Then we introduce integer
analogues of classical Euclidean criteria for congruence for triangles and present
several examples. Further, we verify which triples of angles can be taken as angles
of an integer triangle; this generalizes the Euclidean condition α + β + γ = π for
the angles of a triangle (this formula will be used in Chap. 13 to study toric sin-
gularities). Then we exhibit trigonometric relations for angles of integer triangles.
Finally, we give examples of integer triangles with small area.
Proposition 6.1 (The sine formula for integer triangles) For any integer triangle
ABC the following holds:
l(AB) l(BC) l(CA) l(AB) l(BC) l(CA)
= = = .
lsin(∠BCA) lsin(∠CAB) lsin(∠ABC) lS(ABC)
Proof We have
lS(ABC) = l(AB) l(AC) lsin(∠CAB) = l(BA) l(BC) lsin(∠BCA)
= l(CB) l(CA) lsin(∠ABC).
After inverting the expressions and multiplying by all three integer lengths, we get
the statement of the proposition.
Fig. 6.1 A counterexample to the second criterion of integer congruence for triangles
Recall that the cosine formula for Euclidean triangle ABC with a = |BC|, b =
|AC|, c = |AB|, and α = ∠BAC is
a 2 = b2 + c2 − 2bc cos α.
It seems that to write an integer cosine formula is a hard task. Currently there are no
theorems related to addition in integer trigonometry.
Proposition 6.2 (The first criterion of integer triangle integer congruence) Con-
sider integer triangles ABC and A B C . Suppose that
AB ∼= A B , AC ∼= A C , and ∠CAB ∼ = ∠C A B .
Then A B C ∼
= ABC.
It turns out that the literal generalizations of the second and third criteria from
Euclidean geometry to integer geometry do not hold. The following two examples
illustrate these phenomena.
Example 6.3 (The second criterion of triangle integer congruence does not hold in
integer geometry) In Fig. 6.1 we show two integer triangles ABC and A B C .
We have
AB ∼
= A B , ∠ABC ∼ = ∠A B C ∼
= larctan(1), and
= ∠C A B ∼
∠CAB ∼ = larctan(1).
The triangle A B C is not integer congruent to the triangle ABC, since
lS(ABC) = 4 and lS A B C = 8.
6.3 On Sums of Angles in Triangles 59
Fig. 6.2 A counterexample to the third criterion of integer congruence for triangles
Example 6.4 (The third criterion of triangle integer congruence does not hold in
integer geometry) In Fig. 6.2 we show two integer triangles ABC and A B C .
All edges of both triangles are integer congruent (of length one), but the triangles
are not integer congruent, since lS(ABC) = 1 and lS(A B C ) = 3.
Instead of the second and the third criteria, we have the following additional
criterion.
Proof All the integer lengths of the corresponding edges are the same by the sine
formula. Hence the triangles are integer congruent by the first criterion.
In the following example we show that the additional criterion of integer triangle
integer congruence is not improvable.
Example 6.6 In Fig. 6.3 we give an example of two integer inequivalent triangles
ABC and A B C of the same integer area 4 and congruent angles ∠ABC, ∠CAB,
and ∠A B C , ∠C A B all integer-equivalent to the angle larctan(1), but ABC
A B C .
First, we reformulate the Euclidean criterion in the form of tan functions. A tri-
angle with angles α, β, and γ exists if and only if
tan(α + β + γ ) = 0,
tan(α + β) ∈/ [0; tan α]
(without loss of generality, we suppose that α is acute). The next theorem is a trans-
lation of this condition into the integer case.
Second we give several preliminary definitions.
Let n be an arbitrary positive integer, and let A = (x, y) be an arbitrary integer
point. Denote by nA the point (nx, ny).
We are not yet ready to prove the first statement of this theorem; we shall do
it later, in Chap. 12. We prove the second statement of the theorem after a small
remark.
6.4 Angles and Segments of Integer Triangles 61
Remark 6.9 The statement of Theorem 6.8(i) does not necessarily hold for even
continued fractions of the tangents. For instance, consider an integer triangle with
integer area equaling 7 and all angles integer congruent to larctan 7/3. For the odd
continued fractions 7/3 = [2; 2 : 1] of all angles we have
[2; 2 : 1 : −1 : 2 : 2 : 1 : −1 : 2 : 2 : 1] = 0.
If instead we take the even continued fractions 7/3 = [2; 3], then we have
35
[2; 3 : −1 : 2 : 3 : −1 : 2 : 3] =
= 0.
13
Remark 6.10 In Chap. 13 we discuss the relation on integer angles of integer poly-
gons similar to the case of triangles.
Proof of the second statement of Theorem 6.8 Consider a triangle ABC with ra-
tional angles α, β, and γ (at vertices at A, B, and C respectively). Suppose that
for every k > 1 and every integer triangle KLM, the triangle ABC is not integer
congruent to the k-multiple of KLM. In other words, we have
gcd l(AB), l(BC), l(CA) = 1.
Suppose that S is the integer area of ABC. Then by the definition of integer
sine, the following holds:
⎧
⎨ l(AB) l(AC) = S/ lsin α,
l(BC) l(BA) = S/ lsin β,
⎩
l(CA) l(CB) = S/ lsin γ .
Since gcd(l(AB), l(BC), l(CA)) = 1, we have l(AB) = λ1 and l(AC) = λ2 .
Therefore, the triangle ABC is integer congruent to the triangle A0 B0 C0 of
the theorem.
⎧
⎨ π − larctan( c lcos α−b ) if c lcos α > b,
c lsin α
∠BCA ∼
= larctan(1) if c lcos α = b,
⎩ t c lsin α
larctan ( b−c lcos α ) if c lcos α < b,
⎧ t)
⎪ b lsin(α
⎨ π − larctan ( b lcos(α t )−c ) if b lcos(α ) > c,
t t
∠ABC ∼
= larctan(1) if b lcos(α t ) = c,
⎪
⎩ larctan( b lsin(α t ) )
c−b lcos(α t ) if b lcos(α t ) < c.
Proof We start with proving the formula for the angle ∠BCA. Let α ∼ = larctan(p/
q), where gcd(p, q) = 1. Then CAB ∼ = DOE, where D = (b, 0), O = (0, 0), and
E = (qc, pc). Let us express the angle ∠DEO. Denote by Q the point (qc, 0). If
qc − b = 0, then ∠BCA = ∠DEO ∼ = larctan 1. If qc − b
= 0, then we have
∼ cp ∼ c lsin α
∠QDE = larctan = larctan .
cq − b c lcos α − b
The expression for the angle ∠BCA follows directly from the above expression for
∠QDE, since ∠BCA ∼ = ∠QDE. (See Fig. 6.4; here l(OD) = b, l(OQ) = c lcos α,
and therefore l(DQ) = |c lcos α − b|.)
To obtain an expression for the angle ∠ABC, we consider the triangle BAC.
Calculate the angle ∠CBA and then transpose all angles in the expression. Finally,
the integer length of CB is defined from the integer sine formula.
Let us define certain types of triangles occurring in integer geometry. Since dual
angles are not necessarily congruent, we have more different types than in the Eu-
clidean case.
Definition 6.12 An integer triangle ACB is said to be dual to the triangle ABC.
6.5 Examples of Integer Triangles 63
By the first criterion of integer congruence for integer triangles, the number of
integer congruence classes for integer triangles with bounded integer area is always
finite. In Fig. 6.5 we show the complete list of 33 triangles representing all integer
congruence classes of integer triangles with integer areas not greater than 10. We
enumerate the vertices of the triangle clockwise. Near each vertex of a triangle we
write the tangent of the corresponding rational angle. Inside any triangle we write its
area. We draw dual triangles on the same light gray region (if they are not self-dual).
Integer regular triangles are colored in dark gray, integer isosceles but not integer
regular triangles are white, and the others are light gray.
Integer Triangles of Small Area The above criteria allow us to enumerate all
integer triangles of small integer area up to integer equivalence. In the following
table we write down the numbers N (d) of noncongruent integer triangles of in-
teger area d for d ≤ 20 (here the dual noncongruent triangles are counted as two).
d 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
N(d) 1 1 2 3 2 4 4 5 5 6 4 10 6 8 8 11 6 13 8 14
d
C0 = ,0 .
λ2 q
64 6 Integer Angles of Integer Triangles
Fig. 6.5 List of integer triangles of integer area less than or equal to 10
6.6 Exercises 65
It appears that the growth rate is linear. For instance, for a prime number d, we
always have N(d) ≤ d. Still, in some exceptional cases, when d has many divisors,
it can happen that N (d) > d. For instance, N (240) = 248.
6.6 Exercises
Exercise 6.2 Let ltan(ABC) = 7/5; l(AB) = 3; l(BC) = 5. Find the remaining
angles and the remaining edge.
Exercise 6.3 Prove that in Euclidean geometry there exists a triangle with pre-
scribed angles α, β, and γ if and only if
tan(α + β + γ ) = 0,
tan(α + β) ∈
/ [0; tan α]
(here without loss of generality we suppose that α is acute).
Exercise 6.5 Suppose q has n divisors. Find an upper estimate on N (q) linear in d
and n.
Chapter 7
Continued Fractions and SL(2, Z) Conjugacy
Classes. Elements of Gauss’s Reduction Theory.
Markov Spectrum
In this chapter we study the structure of the conjugacy classes of SL(2, Z). Recall
that SL(2, Z) is the group of all invertible matrices with integer coefficients and unit
determinant. We say that the matrices A and B from SL(2, Z) are integer conjugate
if there exists an SL(2, Z) matrix C such that B = CAC −1 .
Notice that in the case of algebraically closed fields (say in C), every matrix
is conjugate to its Jordan normal form. The situation with SL(n, Z) is much more
complicated, since Z is not a field. A description of integer conjugacy classes in the
two-dimensional case is the subject of Gauss’s reduction theory, where conjugacy
classes are classified by periods of certain periodic continued fractions (for addi-
tional information we refer to [102, 132], and [134]). We present first steps in the
study of multidimensional Gauss reduction theory in Chap. 21, in the second part of
this book.
In this chapter we discuss the main elements of classical Gauss reduction theory
based on the theory of geometric continued fractions studied in previous chapters
(see also [97]). In particular, we formulate several open problems and show relations
to the Markov spectrum.
7 18
Fig. 7.1 Geometric continued fraction of the matrix 5 13
multidimensional case in the second part of the book. Now we give a definition from
the one-dimensional Klein geometric continued fraction. It is slightly different from
the definition of regular continued fractions.
Definition 7.1 Consider a pair of distinct lines l1 and l2 passing through an integer
point O. The lines l1 and l2 divide the plane into four angles. A geometric continued
fraction is the union of the sails of these angles.
Notice that the LLS sequences of the angles between the lines are lattice invari-
ants of a geometric continued fraction. Since the LLS sequence of an angle is its
integer complete invariant (see Theorem 4.10), we have the following corollary.
Corollary 7.2 The LLS sequence is a complete invariant of lattice angles under the
group of lattice affine transformations Aff(2, Z).
Definition 7.3 Consider a real spectrum matrix A. The matrix A has exactly two
distinct eigenlines. The geometric continued fraction defined by these two straight
lines is said to be associated to A.
In Fig. 7.1 we show the geometric continued fraction for the matrix
7 18
.
5 13
Integer lengths of edges are denoted by black digits, and integer sines, by white. The
LLS sequences of all four sails are equivalent to or inverse to
(. . . , 2, 1, 1, 3, 2, 1, 1, . . .).
7.1 Geometric Continued Fractions 69
As we have already seen in Fig. 7.1, the continued fractions of adjacent angles are
equivalent up to a reversal of order. This happens not by chance, such situation is
general. Before showing this, we discuss the notion of sail duality.
Definition 7.4 Two sails are dual with respect to each other if the LLS sequence
of one sail coincides (up to an index shift if the LLS sequences are infinite on both
sides) with the inverted LLS sequence of the second sail.
Proposition 7.5 Let A be a real spectrum matrix with no integer eigenvectors. Then
the sails of the opposite octants are congruent. The sails of the adjacent octants are
dual.
Proof The sails of opposite octants are congruent, since they are taken one to an-
other by the symmetry about the origin.
Let us prove the duality. Let one of the sails for the geometric continued fraction
be a broken line (Ai ). Without loss of generality we may fix coordinates such that
A0 = (1, 0) and A1 = (1, a0 ). Set B0 = (0, 1). Notice that B0 is on the dual sail
(since the sail is defined by the lines y = αx and y = βx, where a0 < α < a0 + 1
and −1 < β < 0 in the chosen coordinates). Denote the remaining points of the dual
sail by Bi , starting from B0 .
Let (ai ) and (bi ) be the LLS sequences for the sails (Ai ) and (Bi ). In Chap. 3
we have already shown the edge–angle duality in the orthant of points with positive
coordinates. In other words, we know that ai = b−i−1 for i ≥ 1.
Let Bi be the point symmetric to Bi with respect to the origin (i ∈ Z). Recall that
a0 a1 + 1 −a1
.
a0 a1 a2 + a0 + a2 −a1 a2 − 1
This transformation takes the point B−1 to the point (0, 1). Now we have edge–angle
duality of A2 A1 A0 . . . and B−1 B0 . . . . Therefore, ai = b−i−1 for i ≤ 1. Hence the
LLS sequences are inverse to each other.
Definition 7.6 The LLS sequence of a matrix A in GL(2, R) is the LLS sequence
for any of its sails up to the choice of a direction of a sequence and zero element.
70 7 Continued Fractions and SL(2, Z) Conjugacy Classes
In the next proposition we list some basic properties of LLS sequences for real
spectrum matrices.
Proposition 7.7
(i) For any sequence of integers infinite in two sides there exists a real spectrum
matrix whose LLS sequence coincides with the given sequence.
(ii) Let real spectrum matrices A and B have the same LLS sequence. Then there
exists a matrix C commuting with A and integer conjugate to B.
Consider now the case of real spectrum matrices in SL(2, Z). It turns out that all
such matrices have characteristic polynomials irreducible over the rational numbers.
The sails of such matrices are called algebraic.
Remark 7.9 Later, in Corollary 7.20, we show that a sail with periodic LLS se-
quence is algebraic.
Proof of Proposition 7.8 Let A be a real spectrum matrix in SL(2, Z). The ma-
trix A preserves its invariant lines and the lattice Z2 . Hence it acts on each of the
sails by shifting the vertices of the corresponding broken line along the broken line.
Therefore, the LLS sequences of algebraic sails are periodic.
In Fig. 7.1, we show the sails of a real spectrum algebraic matrix with the period
of the LLS sequence equal to (2, 1, 1, 3).
Definition 7.10 Let M be a real spectrum SL(2, Z) matrix. Then M acts on the sail
for M as a shift. Hence M also defines a shift of the LLS sequence. The factor of
the LLS sequence with respect to this shift is called the LLS period of M.
Proposition 7.11 Two real spectrum SL(2, Z) matrices M1 and M2 with positive
eigenvalues are integer conjugate if and only if their LLS periods coincide.
7.2 Geometry of Gauss’s Reduction Theory 71
Proof Let M1 and M2 be integer conjugate. Then they have integer congruent ge-
ometric continued fractions, and they define the same shift of these continued frac-
tions. Therefore the LLS periods of M1 and M2 coincide.
Suppose now that the LLS periods of the matrices M1 and M2 coincide. This
means that the corresponding continued fractions have the same LLS sequences,
and therefore they are integer congruent. Therefore, there exists a matrix M2 integer
congruent to M2 whose geometric continued fraction coincides with the geometric
continued fraction of M1 . Suppose v is a vertex of the geometric continued fraction
of the operator M1 . Then
M1 (v) = M2 (v) and M1 M1 (v) = M2 M1 (v) ,
since the matrices M1 and M2 define the same shift of the geometric continued
fraction. Since the vectors v and M1 (v) form a basis of R2 , we have
M1 = M2 .
1 1 0 1 0 1
, , and .
−1 0 −1 0 −1 −1
1 n
for n ≥ 0.
0 1
72 7 Continued Fractions and SL(2, Z) Conjugacy Classes
Case of Real Spectra The case of matrices with real distinct eigenvalues is the
most complicated (we call such a matrix a real spectrum matrix). We use geometric
continued fractions to construct a complete invariant of conjugacy classes of such
matrices. Let us collect some general definitions and theorems related to this case.
There are two similar subfamilies of real spectrum SL(2, Z) matrices: those with
positive eigenvalues and those with negative eigenvalues. The one-to-one corre-
spondence between them is given by associating to a matrix M the matrix −M.
For simplicity, we consider only matrices with positive eigenvalues.
a c
M=
b d
in SL(2, Z) is reduced if d > b ≥ a ≥ 0.
Remark The definition of a reduced matrix is slightly different from the classical
one: a matrix in SL(2, Z) is reduced if and only if it has nonnegative entries that are
nondecreasing downward and to the right. In the framework of geometric continued
fractions it is more convenient to use our definition, which we further generalize to
the multidimensional case in Chap. 21.
We show how to construct a reduced matrix from a given one in Sect. 7.3.2.
It is interesting to note that the LLS period of a reduced matrix (and in particular
its LLS sequence) can be written directly from the coefficients of the matrix.
a c
Theorem 7.14 Consider a reduced matrix M = bd . Suppose b
a = [a1 ; a2 : · · · :
a2n−1 ] and λ = d−1
b . Then its LLS period is
(1, λ).
Corollary 7.16 (On enumeration of reduced matrices) The set of real spectrum re-
duced matrices is in one-to-one correspondence (defined in Remark 7.15) with the
set of finite sequences consisting of an even number of positive integer elements.
Proof It is clear that two distinct reduced matrices have distinct sequences as de-
fined in Remark 7.15. Let us show that the sequence
(a1 , a2 , . . . , a2n−1 , λ)
Remark One can ask, what happens if we consider a matrix constructed from a
sequence with an odd number of elements? It turns out that in this case, we have a
real spectrum GL(2, Z) matrix and negative determinant. The LLS sequence of the
associated geometric continued fraction has this sequence as one of the periods.
So the reduced matrices are “almost” normal forms, since each matrix could have
more than one normal form.
Corollary 7.17 (On almost normal forms) The number of reduced matrices in an
integer conjugacy class coincides with the number of elements in the minimal period
of the corresponding LLS sequence.
Proof In Proposition 7.11 we showed that two matrices are integer conjugate if their
LLS periods are conjugate. From Corollary 7.16 it follows that reduced matrices are
in one-to-one correspondence with finite sequences of even lengths. The number
of sequences that are cyclic permutations of each other exactly coincides with the
number of elements in the minimal period of the sequences, which is also a minimal
period of the corresponding LLS sequence.
Proof Proposition 7.11 is a reformulation of the first part of the theorem. Corol-
lary 7.16 implies that all sequences are realized.
1519 1164
M= .
−1964 −1505
The LLS period of M is (1, 2, 1, 2). Hence there are exactly two reduced matrices
represented by the sequences (1, 2, 1, 2) and (2, 1, 2, 1). The coefficients a and b
for the corresponding reduced matrices are
b 4 b 8
= [1; 2 : 1] = and = [2; 1 : 2] = .
a 3 a 3
We find the elements c and d of the reduced matrices from conditions λ = d−1b
and ad − bc = 1. Finally, we get both reduced matrices integer conjugate to M:
3 8 3 4
and .
4 11 8 11
Corollary 7.20 A sail with a periodic LLS sequence is algebraic (i.e., a sail of some
algebraic real spectrum matrix).
Proof In Theorem 7.14 we constructed the algebraic matrices for all finite se-
quences as periods. Then in Proposition 7.7 we showed that sails with equivalent
LLS sequences are either integer congruent or dual. Therefore, every sail with a
periodic LLS sequence is algebraic.
c = r + λa and d = s + λb.
a r + λa
M= .
b s + λb
They are both positive, and hence the matrix M takes each sail to itself.
Let us consider the sail S whose convex hull contains the point P = (1, 0). De-
note by α the closed integer angle with vertex at the origin O whose edges are
integer rays OP and OM(P ). Consider the sail of α. Denote the broken line of this
sail by Sα .
Now we show that the sail of α is completely contained in the sail of the matrix
M containing P . Denote by Sα∞ the following infinite broken line:
+∞
i
M (Sα ) .
i=−∞
By construction, the convex hull of Sα∞ coincides with the convex hull of the sail S
of the matrix M. Let us prove that Sα∞ coincides with S, i.e., with the boundary of
its convex hull. The broken line Sα∞ is the boundary of the convex hull if the convex
angles generated by every pair of its adjacent edges do not contain the origin. Due
to linear periodicity it is sufficient to check all the angles of one of the periods of
the broken line Sα∞ . For instance, we will examine all the angles whose vertices are
in Sα except for the angle with vertex at M(P ) (the situation is clear for the angles
at vertices M(P ) and P ). All convex angles generated by pairs of adjacent edges
at inner vertices of the sail Sα do not contain the origin by definition. It remains to
check the angle with vertex at P = (1, 0).
Since b/a > 1, the direction of the first edge is (0, 1). Let us choose an integer
point Q on the second edge of this angle such that l(PQ) = 1. By construction, the
triangle OPQ is empty, and hence lS(OPQ) = 1. Thus ld(Q, OP) = 1. Therefore,
Q = (x, −1) for some integer x. Since the set M −1 (Sα ) is convex, the value of x is
determined by the eigendirection
−a + λb + s + (a + λb + s)2 − 4
, −1
2b
as follows:
−a + λb + s + (a + λb + s)2 − 4
x= + 1.
2b
This implies that x is contained in the open interval (λ + 1 + (s − 1)/b, λ + 1 + s/b).
From b ≥ s > 0 we get
x = λ + 1.
Hence for λ ≥ 1, the convex angle at vertex P does not contain the origin. Therefore,
the broken line Sα∞ coincides with the sail.
Let b/a = [a0 ; · · · : a2n−1 ]. In Chap. 3 it was shown that the sail Sα consists of
exactly n+1 integer segments; the integer lengths of the consecutive segments equal
a0 , a2 , . . . , a2n ; the integer sines of the corresponding angles equal a1 , a3 , . . . , a2n−1
(see Corollary 3.5 and Theorem 3.6). The integer sine of the angle at point P equals
7.3 Some Technical Details and Open Questions 77
p r
.
q s
Remark 7.21 During the algorithm we only use conjugations with GL(2, Z) ma-
trices and multiply by −Id. So in the output one should expect a reduced integer
matrix that is conjugate either to M or to −M. Nevertheless, since M, as well as
any reduced matrix, has all positive eigenvalues, the resulting matrix is conjugate
exactly to M.
Remark 7.22 The algorithm works in a finite number of steps, since after each iter-
ation the integer absolute value of q is decreases.
Remark 7.23 To find the LLS period of a matrix M, one should construct the re-
duced matrix M conjugate to M using the above algorithm and then apply Theo-
rem 7.14 to M .
As we have already seen, in the majority of conjugacy classes there is more than one
reduced matrix. Then it is natural to introduce some additional notion of complexity
to reduce the number of the reduced matrices. Let us introduce one important no-
tion of complexity, whose extended definition we will use later in Chap. 21 in the
multidimensional case.
Problem 2 Find a reduced matrix with minimal ς -complexity from a given conju-
gacy class.
Remark The minimal ς -complexity coincides with the minimal value of lsin POQ,
where P is an arbitrary integer point distinct from O, and Q = A(P ).
If the LLS period of M is (a1 , . . . , a2n ), then the above problem is equivalent to
finding the minimal numerator among the numerators of the rational numbers:
[a1 ; · · · : a2n−1 ], [a2 ; · · · : a2n ], [a3 ; · · · : a2n : a1 ], ...,
[a2n ; a1 : · · · : a2n−2 ].
Example 7.25 For instance, if the LLS period of M is (a, b), where a < b, then the
minimum for the numerators is a.
Example 7.26 Let the minimal period of the LLS sequence consists of three ele-
ments: (a, b, c, a, b, c). The numerator of the fraction [a; b : c : a : b] is minimal if
and only if c ≥ a, b.
7.3 Some Technical Details and Open Questions 79
The following example shows that it is not correct to skip the maximal entry of
the sequence.
Example 7.27 Consider the LLS period (1, 4, 5, 4, 1, 4). The minimum of the nu-
merators is attained at the fraction [1; 4 : 5 : 4 : 1], and not at the fraction [4; 1 : 4 :
1 : 4].
Problem 3
(i) Which reduced matrix with a given trace (or with a given LLS sequence) is the
most frequent as N tends to infinity?
(ii) What is the relative probabilities of two sequences representing the same LLS
period as N tends to infinity?
(iii) Is it true that the reduced matrix with minimal ς -complexity is the most fre-
quent among the reduced matrices of a given LLS period?
In Table 7.1 we show some values of #25000 (P ) for matrices with absolute value
of the trace less than 11.
Observe that SL(2, Z) matrices corresponding to the sequence (1, 2) are more
frequent than those corresponding to (1, 1). This is due to the fact that matrices
with LLS periods (1, 1) have integer congruent dual sails. One may enumerate the
matrices with multiplicities equivalent to the number of integer congruent sails and
obtain
4#25000 (1, 1) > 2#25000 (1, 2) + 2#25000 (2, 1).
We conclude with the following question. Denote by GK(P ) the frequency of a
sequence P = (a1 , a2 , . . . , a2n−1 ) in the sense of Gauss–Kuzmin:
80 7 Continued Fractions and SL(2, Z) Conjugacy Classes
Table 7.1 Values of #25000 (P ) for matrices with small absolute value of the trace
Absolute value Notation for classes Period Matrix Value of
of the trace of equivalent matrices P [[p, r][q, s]] #25000 (P )
1 (α1 + 1)α2
GK(P ) = ln ,
ln(2) α1 (α2 + 1)
where
Problem 4 Let
We conclude this chapter with a small observation of a classical subject that in some
sense generalizes the question of complexity of minimal periods.
f (x, y) = ax 2 + bxy + cy 2
√
The set of all possible values of Δ(f )/m(f ) is called the Markov spectrum. For
a general reference on the subject we recommend the book of T.W. Cusick [41].
Proposition 7.28
(i) Let f be a quadratic form with positive discriminant. Consider the geometric
continued fraction defined by two lines, all of whose points satisfy the equation
f (x, y) = 0.
Let S be the set of all vertices in all four sails of this continued fraction. Then
we have
m(f ) = inf f (x, y).
(x,y)∈S
(ii) Let the LLS sequence of the continued fraction be [. . . , a−1 , a0 , a1 , . . .]. Sup-
pose that vi = (xi , yi ) is a vertex of the sail corresponding to the element ai of
the sail (i.e., the integer sine at vertex vi in the sail equals ai ). Then
Δ(f )/m(f ) = ai + [0; ai+1 : ai+2 : · · · ] + [0; ai−1 : ai−2 : · · · ].
82 7 Continued Fractions and SL(2, Z) Conjugacy Classes
Remark 7.29 If the LLS sequence is periodic, the expression of Proposition 7.28(ii)
is
2
1 1 4
Δ(f )/m(f ) = a0 + + − ,
ltan α ltan α t lsin2 α
where the integer angle α is integer congruent to larctan([a1 ; · · · : a2n ]), and where
(a0 , a1 , . . . , a2n ) is some minimal even period of the LLS sequence.
Remark 7.30 Instead of a quadratic form in two variables one can take a form of
degree n in n variables corresponding to the product of n linear real forms. Any
such form defines a point in the multidimensional Markov spectrum. Namely, for
a form f one takes the infimum of the set of absolute values of f for all nonzero
integer points divided by the n-th root of the discriminant of f . There is not much
known about the multidimensional Markov spectrum. We refer the interested reader
to [49–51], and [194] (see also in [72]).
The Markov spectrum is √a closed set that has√a complicated structure. It does not
contain points less than 5. On the segment [ 5, 3] the spectrum has a unique limit
point, which is 3. It was described by A. Markov (see in [135] and [136]). The first
elements in the spectrum in the increasing order are as follows:
√
√ √ 221
5 = (1) , 8 = (2) , = (2, 2, 1, 1) ,
5
√ √
1517 7565
= (2, 2, 1, 1, 1, 1) , = (2, 2, 2, 2, 1, 1) ,
13 29
√
2600
= (2, 2, 1, 1, 1, 1, 1, 1) .
17
All these points are enumerated in Theorem 7.31. All real numbers greater than the
so-called Freiman’s constant
√
221564096 + 283748 462
F= = 4.5278295661 . . .
491993569
are in the Markov spectrum. The segment [3, F ] has not been completely studied.
It has many gaps, i.e., open segments that belong to the complement to the Markov
spectrum.
√ √ The largest gap in the Markov spectrum below 3 is the open segment
( 12, 13). The first gaps, including the maximal one, were calculated by O. Per-
7.4 Minima of Quadratic Forms and the Markov Spectrum 83
ron in [166] and [167]. Nowadays many other gaps are known, but a complete de-
scription of the gaps is unknown. For further information we refer the interested
reader to [38] and [41] (see also [84]).
It is also known that in some neighborhood of 3 the Markov spectrum has
Lebesgue measure zero (for instance, in the paper [31] by R.T. Bumby, it is shown
that the spectrum is of measure zero for the ray x < 3.33440).
m2 v ≡ ±m1 mod m
Remark 7.32 It is interesting to observe that the theory of the Markov spectrum is
based on the study of continued fractions with elements 1, 2, and 3. This is due to
the fact that LLS sequences containing greater elements correspond to the quadratic
forms that contribute to the Markov spectrum above Freiman’s constant.
Finally, let us say a few words about the integer solutions of the equation mentioned
in Theorem 7.31. The equation
x 2 + y 2 + z2 = 3xyz
1, 2, 5, 13, 29, 34, 89, 169, 194, 233, 433, 610, 985, 1325.
84 7 Continued Fractions and SL(2, Z) Conjugacy Classes
Proposition 7.33 Every Markov triple is obtained from (1, 1, 1) by applying a se-
quence operations of the following two types:
– permute the numbers x, y, and z in the triple (x, y, z);
– if (x, y, z) is a Markov triple, then (x, y, 3xy − z) is also a Markov triple.
7.5 Exercises 85
7.5 Exercises
Exercise 7.2 Is it true that two SL(2, R) matrices are integer conjugate if and only
if their geometric continued fractions are integer congruent?
Exercise 7.3 Consider a real spectrum SL(2, Z) matrix. Let its eigen lines be y =
α1,2 x. Then α1 and α2 are conjugate quadratic irrationalities.
Exercise 7.4 Let A be a real spectrum (2 × 2) matrix. Find all the matrices com-
muting with A.
Exercise 7.5 Is the statement of Proposition 7.5 true for rational angles (R-angles,
L-angles)? Find a correct analogue in these cases.
Exercise 7.7 Classify the integer conjugacy classes in the complex spectra case.
103 69
.
100 67
Chapter 8
Lagrange’s Theorem
The aim of this chapter is to study questions related to the periodicity of geomet-
ric and regular continued fractions. The main object here is to prove Lagrange’s
theorem stating that every quadratic irrationality has a periodic continued fraction,
conversely that every periodic continued fraction is a quadratic irrationality. One of
the ingredients to the proof of Lagrange theorem is the classical theorem on integer
solutions of Pell’s equation
m2 − dn2 = 1.
So, there is a strong relation between periodic fractions and quadratic irrationalities.
It is natural to ask what happens in cases of cubic, quartic, etc., irrationalities? How
can one generalize Lagrange’s theorem to the multidimensional case? We give a
partial answer to such questions in Chap. 18.
We start in Sect. 8.1 with the study of so-called Dirichlet groups, which are the
subgroups of GL(2, Z) preserving certain pairs of lines. These groups are closely
related to the periodicity of sails. The structure of a Dirichlet group is induced by
the structure of the group of units in orders (we will discuss this later in more detail
for the multidimensional case in Chap. 17); here we restrict ourselves to the simplest
two-dimensional case. In Sect. 8.2 we take a break and show how to take nth roots
of matrices using Gauss’s reduction theory. In Sect. 8.3 we study the solutions of
Pell’s equation. And finally, in Sect. 8.4 we prove Lagrange’s theorem.
The set Ξ (A) is a group under the operation of matrix multiplication. We call it
the Dirichlet group in dimension 2. We study the multidimensional case later, in
Chap. 17.
In case of A with only real eigenvalues, we are interested in the subgroup of the
Dirichlet group Ξ (A) containing all matrices with positive eigenvalues. This group
is called the positive Dirichlet group and denoted by Ξ+ (A).
Definition 8.1 We say that two Dirichlet groups (or positive Dirichlet groups) Ξ1
and Ξ2 are integer congruent if there exists B ∈ GL(2, Z) such that
A → B −1 AB
is an isomorphism of Ξ1 and Ξ2 .
Proposition 8.2 Two Dirichlet groups (or positive Dirichlet groups) Ξ1 and Ξ2 are
integer congruent if there exist A ∈ Ξ1 with distinct eigenvalues and B ∈ GL(2, Z)
such that B −1 AB ∈ Ξ2 .
Let us study the structure of Dirichlet groups for the matrices of GL(2, Z). There
are three essentially different cases here. The first case is for complex spectrum
matrices. In the second and the third cases we have real spectrum matrices, i.e.,
matrices all of whose eigenvalues are real. In the second case we consider matrices
with rational eigenvalues: the eigenvalues of such GL(2, Z) matrices are ±1. In the
third case we study matrices with irrational eigenvalues:
√
the eigenvalues of such
a±b c
matrices are quadratic irrationalities (i.e., d , where a, b, d are integers and
c > 1 is a square-free integer).
Recall also that the Dirichlet groups are defined only for matrices with distinct
eigenvalues, so we will consider only matrices with distinct eigenvalues.
Complex Spectra Case There are exactly two Dirichlet groups of such type up
to conjugation. The representative matrices of these groups can be chosen from the
group SL(2, Z). We have already seen them in Chap. 7:
0 1 1 1
; .
−1 0 −1 0
The first matrix is the matrix of rotation by the angle π/2. Only its powers commute
with it, and hence the Dirichlet group is isomorphic to Z/4Z. The second matrix
represents the 6-symmetry of a lattice. The related Dirichlet group is isomorphic to
Z/6Z.
8.1 The Dirichlet Group 89
Real Spectra Case I: Rational Eigenvalues Here we have two conjugate classes
of matrices represented by
1 0 1 −1
A1 = ; A2 = .
0 −1 0 −1
The Dirichlet group for Ξ (Ai ) is isomorphic to Z/2Z ⊕ Z/2Z, and it is generated
by ±Ai (for i = 1, 2 respectively).
Real Spectra Case II: Irrational Eigenvalues This is the largest set of irre-
ducible matrices whose conjugacy classes are described via Gauss’s reduction the-
ory. We start with a GL(2, Z) matrix A with distinct irrational eigenvalues (as we
have already mentioned, the eigenvalues are conjugate quadratic irrationalities).
First, note that A2 is in SL(2, Z) and has distinct eigenvalues. Hence without loss
of generality we restrict ourselves to SL(2, Z) matrices. Such matrices have peri-
odic geometric continued fractions with periodic LLS sequences. Due to Gauss’s
reduction theory and Proposition 7.7 there exists a one-to-one correspondence be-
tween integer conjugacy classes of Dirichlet groups and minimal periods of LLS
sequences.
Theorem 8.4 Consider an irreducible real spectrum matrix A ∈ SL(2, Z). Then its
Dirichlet group Ξ (A) is homeomorphic to Z ⊕ Z/2Z.
Proof The proof is straightforward. Any matrix commuting with A has the same
eigenlines, so it either acts as a shift on the sail, or exchanges the sails. The group
of shifts is homeomorphic to Z. Suppose it is generated by Â. If the period is even,
then the dual sails are not congruent, and the group is Z ⊕ Z/2Z generated by Â
and −E. If the period of the LLS sequence of the sail is odd, then one can extract
a square root of  in SL(2, Z). This square root will exchange dual sails, which are
congruent in this case. The Dirichlet group is again Z ⊕ Z/2Z, and it is generated
by Â1/2 and −E.
Corollary 8.6 Consider an irreducible real spectrum matrix A ∈ SL(2, Z). Then its
positive Dirichlet group Ξ+ (A) is homeomorphic to Z.
11 30 11 8 + 2·11
A= = .
15 41 15 11 + 2·15
90 8 Lagrange’s Theorem
The period of the LLS sequence related to this continued fraction is (1, 2, 1, 2, 1, 2).
So there exists a cube root of A in SL(2, Z). This cube root is also reduced and has
period of LLS sequence (1, 2). Hence by Theorem 7.14 it is
1 0 + 2·1 1 2
= .
1 1 + 2·1 1 3
Denote this matrix by Â. The group Ξ (A) is generated by  and −E.
In this section we study how to check whether a certain matrix has roots that are
also integer matrices. We start with SL(2, Z) matrices and further say a few words
about GL(2, Z) case.
q3 r2 q2 p3 + (s2 − p2 )q3
r3 = , s3 = .
q2 q2
Proposition 8.8 The matrix A3 constructed in Step 3 is an nth root of the matrix A2 .
Proof There exists a matrix M ∈ Ξ (A2 ) mapping (1, 0) to (a3 , b3 ). This matrix
corresponds to the sail integer self-congruence related to the shift for the period
(a1 , . . . , ak ) if k is even. If k is odd, we have an integer congruence of dual sails. In
both cases the matrix M is the nth root of A2 contained in Ξ (A2 ).
Since M(1, 0) = (a3 , b3 ), the first row of M coincides with the first row of A3 .
The coefficients in the second rows of A3 and M are uniquely defined by the fact
that A2 M = MA2 and A3 M = MA3 . The formulas are obtained from these matrix
equations (which are four equations in the elements). Hence A3 = M ∈ Ξ (A2 ) ⊂
GL(2, Z), and it is constructed by the formulas in the algorithm.
Remark 8.9 Let now A ∈ GL(2, Z) and det(A) = −1. It is clear that A does not
have integer even roots. Suppose n is odd. Then to find the nth root one should
check the (2n)th root for the matrix A2 in SL(2, Z). The matrix (A2 )1/2n (if it is in
GL(2, Z)) is one of the nth roots of A.
m2 − dn2 = 1,
where n and m are integer variables and d is not the square of an integer. Pell’s
equation has trivial solutions (±1, 0). The problem of finding nontrivial solutions
of Pell’s equation was posed by P. Fermat in 1657. The solution of Pell’s equation
was mistakenly attributed to J. Pell. In fact, W. Brouncker found a method to solve
this equation. The first to publish a strict proof was J.-L. Lagrange, in 1768 (see
[129]).
m2 − dn2 = 1
has positive solutions. In addition, there exists a positive solution (m1 , n1 ) such that
the set of all positive solutions coincides with the set of pairs (mk , nk ) with positive
integer parameter k defined as follows:
k
mk m1 dn1 1
= .
nk n1 m1 0
92 8 Lagrange’s Theorem
q 2 d − p2 = c
has infinitely many integer solutions. Choose among these solutions (m1 , n1 ) and
(m2 , n2 ) such that
m 1 m dn
= Am,n , where Am,n = .
n 0 n m
Notice that Am,n is an integer matrix with characteristic polynomial t 2 − 2mt + 1 in
variable t. Hence, Am,n is in SL(n, Z), and all its eigenvalues are distinct positive
real numbers. √
Consider an angle α√d with vertex at the origin and edges y = ±x/ d for x > 0.
Let us show that Am,n preserves α√d .
8.4 Periodic Continued Fractions and Quadratic Irrationalities 93
x 2 − dy 2 = 0.
Let us find the image of this union with respect to the action of Am,n . The equation
for the image is
(mx + dny)2 − d(nx + my)2 = 0,
which is equivalent (after simplification) to the initial equation
x 2 − dy 2 = 0.
Hence Am,n preserves the union of two lines. There are two possible situations here:
either these lines are invariant lines of Am,n , or Am,n interchanges the lines. Since
points with positive coordinates are taken to points with positive coordinates, the
lines are eigenlines. Finally, since the eigenvalues are positive real numbers, the
rays defining the angle α√d are preserved.
So the matrix Am,n is in the positive Dirichlet group Ξ+ preserving the angle
α√d . It is easy to see that the point (1, 0) is on the sail of α√d (it is the only point
with x ≤ 1 in the angle).
We have shown that every positive solution of Pell’s equation is defined by a shift
of a sail by a matrix of the positive Dirichlet group Ξ+ . From the other point of view,
every matrix of the Dirichlet group preserves the value x 2 − dy 2 (which is 1 in our
case). Recall that Ξ+ is isomorphic to Z. Letting A generate this group (we choose
A such that it takes (1, 0) to the half-plane y > 0), then all the solutions with positive
x-coordinate are of the form Ak (1, 0). We choose only a positive parameter k, in
order to have positive y-coordinates of the solutions. Finally, denoting by (m1 , n1 )
the point A(1, 0), then the matrix
m1 dn1
n1 m1
is in Ξ+ and defines the same shift of the sail (as A), hence equaling A. This con-
cludes the proof of the second statement of the theorem.
ak+h = ak .
Lemma 8.12 For every quadratic irrationality ξ there exists an SL(2, Z) matrix
such that one of its eigenvectors is (1, ξ ).
a11 a12
A= .
a21 a22
for some n
= 0.
Let us find a matrix of SL(2, Z) satisfying this system for some integer n. Since
n is an integer, the coefficients a12 and a21 are integers as well. Since det A = 1, we
have
det A = a11 a22 − a12 a21 = a11 (na11 − c1 ) + n2 c0 c2 = 1.
Therefore,
nc1 ± n2 (c12 − 4c0 c2 ) − 4
a11 = .
2
The coefficient a11 is an integer if and only if there exists an integer m satisfying
m2 = n2 c12 − 4c0 c2 − 4.
Set D = c12 − 4c0 c2 , m = m/2, and n = n/2 and rewrite the equation
m2 − Dn2 = 1.
8.4 Periodic Continued Fractions and Quadratic Irrationalities 95
Proposition 8.13 Let α1 > α2 > α3 be distinct numbers and let α1 be irrational.
Consider two angles α and β defined by pairs of rays in the lines (y = α1 x, y =
α2 x) and (y = α1 x, y = α3 x), where x > 0. The LLS sequences of these two angles
coincide from some element (up to a sequence shift).
Proof of Lagrange’s theorem First, let us show that every periodic continued frac-
tion represents a quadratic irrationality. Since the continued fraction is infinite, the
corresponding number is irrational.
Suppose that the periodic continued fraction for ξ does not have a preperiod, i.e.,
ξ = (a0 ; a1 : · · · : an ) .
Then
pn ξ + pn−1
ξ = [a0 ; a1 : · · · : an : ξ ] = .
qn ξ + qn−1
96 8 Lagrange’s Theorem
(The last equality holds by Theorem 1.13.) Notice that the denominator qn−1 ξ +
qn−2 is nonzero, since ξ is irrational. Therefore ξ satisfies
Set
ξ̂ = (an+1 ; an+2 : · · · : an+m ) .
Then by Theorem 1.13 we have
pn ξ̂ + pn−1
ξ = [a0 ; a1 : · · · : an : ξ̂ ] = .
qn ξ̂ + qn−1
8.5 Exercises
Exercise 8.2 Prove that the two Dirichlet groups of real spectrum matrices with
rational eigenvalues
1 0 1 −1
;
0 −1 0 −1
are not integer congruent.
8.5 Exercises 97
Exercise 8.3 Let A be a (2 × 2) matrix with distinct eigenvalues. Then every matrix
commuting with A has the same eigenlines.
Exercise 8.4 Do the following matrices have a cube root in GL(2, Z):
25 32 89 144
; ?
32 41 144 233
m2 − dn2 = 1
for d = 2, 3, 5.
√
Exercise 8.6 Find an SL(2, Z) matrix with eigenvector (1, 11).
Chapter 9
Gauss–Kuzmin Statistics
It turns out that the frequency of a positive integer k in a continued fraction for
almost all real numbers is equal to
1 1
ln 1 + ,
ln 2 k(k + 2)
1 k
n−1
lim f T x .
n→∞ n
k=0
Consider the measure space of the segment I = {x | 0 ≤ x < 1} with the Borel σ -
algebra Σ and the measure μ̂ defined on a measurable set S as
1 dx
μ̂(S) = .
ln 2 S 1 + x
The coefficient 1/ ln 2 is taken such that the measure of the segment I equals 1.
Here “almost every point” means “except for a subset of zero measure”.
The measure μ̂ is equivalent to the one-dimensional Lebesgue measure λ on the
segment [0, 1] (for more information on measure theory, see [137]). Hence we have
a similar statement in the case of the measure space (X, Σ, μ̂).
102 9 Gauss–Kuzmin Statistics
Let us introduce a transformation whose ergodic properties will form the basis for
the proof of the Gauss–Kuzmin theorem.
We consider the measure space (X, Σ, μ̂) defined in the previous section. Define
the Gauss map T of a segment [0, 1] to itself as follows:
T (x) = {1/x},
where {r} denotes the fractional part r − r (see Fig. 9.1).
Proposition 9.7 The Gauss map T is measure-preserving for the measure space
(X, Σ, μ̂).
Proof The first equality follows directly from the fact that
T [0; b1 : b2 : · · · ] = [0; b2 : b3 · · · ]
and the fact that every real number has a unique regular continued fraction expansion
with the last element not equal to 1.
The second equality is straightforward.
Proof of Proposition 9.7 Consider a measurable set S. From Lemma 9.8 it follows
that
1 dx
μ̂ T −1 (S) =
ln 2 T −1 (S) 1 + x
∞
1 dx
= .
ln 2 T −1 (S)∩[1/(k+1),1/k] 1 + x
k=1
Notice that on each open (i.e., without the boundary points) segment ]1/k,
1/(k + 1)[ the operator T is in one-to-one correspondence with the open segment
]0, 1[. Let us denote the inverse function to T on the segment ]1/k, 1/(k + 1)[ by
−1
T(k) . Therefore,
−1 1 1 −1
T T (S) ∩ , = T T(k) (S) = S,
k+1 k
−1 1
T(k) (x) = .
x +k
Then we have
−1
dT(k) (x)
dx dx
= = −1
T −1 (S)∩[1/(k+1),1/k] 1+x −1
T(k) (S) 1+x −1
T (T(k) (S)) 1 + T(k) (x)
−d(1/(x + k)) dx
= =
S 1 + 1/(x + k) S (x + k)(x + k + 1)
−1
(the negative sign is taken, since the map T(k) : x → x+k
1
changes the orientation).
So we have
∞
−1 1 dx
μ̂ T (S) = .
ln 2 S (x + k)(x + k + 1)
k=1
104 9 Gauss–Kuzmin Statistics
Since the integrated functions are nonnegative, we can change the order of the sum-
mation and the integration operations. We get
∞
−1 1 1
μ̂ T (S) = dx
ln 2 S (x + k)(x + k + 1)
k=1
∞
1 1 1 1 dx
= − dx = = μ̂(S).
ln 2 S x +k x +k+1 ln 2 S x + 1
k=1
Let us consider one example of a measurable set that is invariant under the Gauss
map.
Denote by Ψ the set of all irrational numbers in the segment [0, 1] whose con-
tinued fractions contain only finitely many 1’s. It is clear that
T −1 (Ψ ) = Ψ,
since the operation T −1 shifts elements of continued fractions by one and inserts
the first element.
Proof Denote by Υn the set of all irrational numbers that contain the element 1
exactly at place n. Notice that
Υ1 = [1/2, 1],
9.3 On the Gauss Map 105
Υn+1 = T n (Υ1 )
is measurable.
Denote by Ψ0 the set of all irrational numbers that do not contain an element ‘1’.
Since
∞
Ψ0 = X \ Υn ,
n=1
the set Ψ0 is also measurable. Then
T −n (Ψ0 )
is measurable.
We will prove later that the Gauss map is ergodic, and therefore, Ψ is either of
zero measure or full measure in X.
In terms of k-convergents pk /qk = [0; a1 : · · · : ak ], the expression for T(a−11 ,...,an ) (x)
is as follows (see Proposition 1.13):
106 9 Gauss–Kuzmin Statistics
pn /x + pn−1 pn + pn−1 x
T(a−11 ,...,an ) (x) = = .
qn /x + qn−1 qn + qn−1 x
Lemma 9.12 The measure of a segment I(a1 ,...,an ) satisfies the following inequality:
1
μ̂(I(a1 ,...,an ) ) < .
ln 2(qn + qn−1 )(pn + qn )
Proof We have
1 dx 1 [0;a1 :···:an :1] dx
μ̂(I(a1 ,...,an ) ) = =
ln 2 I(a1 ,...,an ) 1 + x ln 2 [0;a1 :···:an ] 1 + x
1 pn + pn−1 pn
= ln 1 + 1+
ln 2 qn + qn−1 qn
1 1
= ln 1 +
ln 2 (qn + qn−1 )(pn + qn )
1
< .
ln 2(qn + qn−1 )(pn + qn )
The last inequality follows from the concavity of the natural logarithm function.
Lemma 9.13 For any invariant set S of positive measure and any interval I(a1 ,...,an ) ,
ln 2
μ̂(S ∩ I(a1 ,...,an ) ) ≥ μ̂(S)μ̂(I(a1 ,...,an ) ).
2
Proof Since the map T is surjective, we also have
T (S) = S.
1 dx 1 pn + pn−1 x pn + pn−1 x
= d 1+
ln 2 S∩I(a ,...,an ) 1 + x ln 2 S qn + qn−1 x qn + qn−1 x
1
1 dx
=
ln 2 S (qn + qn−1 x)(qn + qn−1 x + pn + pn−1 x)
1 dx
≥
ln 2 · qn (qn + qn−1 + pn + pn−1 ) S 1 + x
1
= μ̂(S)
qn (qn + qn−1 + pn + pn−1 )
1 ln 2
≥ μ̂(S) ≥ μ̂(S)μ̂(I(a1 ,...,an ) ).
(qn + qn−1 )(2pn + 2qn ) 2
The last inequality follows from Lemma 9.12.
9.4 Pointwise Gauss–Kuzmin Theorem 107
Proof of Proposition 9.11 Let S be a measurable subset of the unit interval such that
T −1 (S) = S. Suppose also μ̂(S) > 0.
For any irrational number y = [0; a1 : a2 : · · · ], from Lemma 9.13 we have
Consider x in the segment [0, 1]. Let the regular continued fraction for x be [0; a1 :
· · · : an ] (odd or infinite). For a positive integer k, set
#(k, n)
P̂n,k (x) = ,
n
where #(k, n) is the number of integer elements ai equal to k for i = 1, . . . , n. Define
Theorem 9.14 For every positive integer k and almost every x (i.e., in the comple-
ment of a set of zero measure) the following holds:
1 1
P̂k (x) = ln 1 + .
ln 2 k(k + 2)
Hence, by Birkhoff’s pointwise ergodic theorem, the limit P̂k (x) exists almost ev-
erywhere. Since the transformation T is ergodic, we apply the Birkhoff–Khinchin
ergodic theorem and get
1 1/k
1 dx 1 1
P̂k (x) = χ]1/(k+1),1/k] d μ̂ = = ln 1 + .
0 ln 2 1/(k+1) 1 + x ln 2 k(k + 2)
Let α be some irrational number between zero and one, and let [0; a1 : a2 : a3 : · · · ]
be its regular continued fraction.
Let mn (x) denote the measure of the set of real numbers α contained in the
segment [0, 1] such that T n (α) < x (here T is the Gauss map). In his letters to
P.S. Laplace C.F. Gauss formulated without proofs the following theorem.
ln(1 + x)
lim mn (x) = .
n→∞ ln 2
This theorem is technically complicated. For the proof we refer to the original
manuscripts of R.O. Kuzmin [121] and [122] (see also A.Ya. Khinchin [105]).
Denote by Pn (k), for an arbitrary integer k > 0, the measure of the set of all real
numbers α of the segment [0, 1] such that each of them has the number k at the nth
position. The limit limn→∞ Pn (k) is called the frequency of k for regular continued
fractions and is denoted by P (k).
1 1
P (k) = ln 1 + .
ln 2 k(k + 2)
In this section we switch to the multidimensional case for a while in order to give
some definitions that are similar to the one-dimensional case (we will use these later,
in Chap. 19).
9.6 Cross-Ratio in Projective Geometry 109
The projective linear group (or the group of projective transformations) is the quo-
tient group
PGL(R, n) = GL(R, n)/Z(R, n),
where Z(R, n) is the one-dimensional subgroup of all nonzero scalar transforma-
tions of Rn . The group PGL(R, n) acts on the equivalence classes of vectors in Rn
with respect to Z(R, n). We have
Rn /Z(R, n) = RP n−1 .
Consider the affine part Rn−1 ⊂ RP n−1 . The stabilizer for the affine part is exactly
the group Aff(R, n − 1).
It is clear that the cross-ratio does not depend on the choice of the Euclidean coor-
dinate on the line, and therefore, it a function on the space of ordered 4-tuples of
distinct points in a line.
As we have already noted above, the space Rn−1 can be considered an affine
chart Rn−1 ⊂ RP n−1 . Hence the action of PGL(R, n) is well defined on the closure
of Rn−1 (which is actually RP n−1 ). The projective transformations take planes to
planes and, in particular, lines to lines. So it is natural to ask what happens to the
cross-ratios of four points on a line.
dxdy
.
(x − y)2
110 9 Gauss–Kuzmin Statistics
Proof The density of the infinitesimal cross-ratio coincides with the asymptotic co-
efficient at ε 2 of the cross-ratios of 4-tuples of points:
x, y, x + εdx, y + εdy,
where Z/2Z is the group transposing the lines in geometric continued fractions.
Note that FCF 1 is a 2-fold covering of CF 1 . We call the map of “forgetting” the
order in the ordered collections the natural projection of the manifold FCF 1 to the
manifold CF 1 and denote it by p (i.e., p : FCF 1 → CF 1 ).
Notice that FCF 1 is homeomorphic to the annulus and CF 1 is homeomorphic to
the Möbius band.
The group PGL(2, R) of transformations of RP 1 takes the set of all straight lines
passing through the origin in the plane into itself. Hence, PGL(2, R) naturally acts
on CF 1 and FCF 1 . It is clear that the action of PGL(2, R) is transitive, i.e., it takes
any (framed) continued fraction to any other. Notice that a stabilizer of any geomet-
ric continued fraction is one-dimensional.
Remark 9.21 The name for the invariant forms comes from theory of energies of
knots and graphs in low dimensional topology, where these forms are used as den-
sities for Möbius energies that are invariant under the group of Möbius transforma-
tions in R3 (we refer the interested reader to the book by J. O’Hara [150]).
Proposition 9.22 All Möbius forms of the manifolds CF 1 and FCF 1 are propor-
tional.
Proof Transitivity of the action of PGL(2, R) implies that all Möbius forms of the
manifolds CF 1 and FCF 1 are proportional.
Remark 9.25 The projection p takes the Möbius measures of the manifold FCF 1
to the Möbius measures of the manifold CF 1 . This establishes an isomorphism
between the spaces of Möbius measures for CF 1 and FCF 1 . Since the manifold
of framed continued fractions possesses simpler chart systems, all formulas of the
work are given for the case of the framed continued fraction manifold. To calculate
a measure of some set F of the unframed continued fraction manifold, one should:
take p −1 (F ); calculate the Möbius measure of the obtained set of the manifold of
framed continued fractions, and divide the result by 2.
the second straight lines of the collection respectively. Denote by |v|l the Euclidean
length of a vector v in the coordinates Ol Xl Yl of the chart FCF 1,l . Note that the
chart FCF 1,l is the space R × R minus its diagonal.
Consider the following form in the chart FCF 1,l :
dxl ∧ dyl
ωl (xl , yl ) = .
|xl − yl |2l
Proposition 9.26 The measure μωl coincides with the restriction of some Möbius
measures to FCF 1,l .
Proof Notice that the form ωl (xl , yl ) coincides with the infinitesimal cross-ratio on
the line l. Hence it is invariant under projective transformations of l (on an every-
where dense subset) in the chart FCF 1,l . Therefore, the measure μωl coincides with
the restriction of some Möbius measures to FCF 1,l .
Corollary 9.27 A restriction of an arbitrary Möbius measure to the chart FCF 1,l is
proportional to μωl .
Proof The statement follows from the proportionality of any two Möbius mea-
sures.
Consider now the manifold FCF 1 as a set of ordered pairs of distinct points on
a circle R/πZ (this circle is a one-dimensional projective space obtained from the
unit circle by identifying antipodal points). The doubled angular coordinate ϕ of the
circle R/πZ induced by the coordinate x of the straight line R naturally defines the
coordinates (ϕ1 , ϕ2 ) of the manifold FCF 1 .
1 ϕ1 − ϕ2
ω1 = cot2 dϕ1 ∧ dϕ2 .
4 2
Definition 9.29 The quantity μ1 (CF 1 (F )) is called the relative frequency of the
edge F .
Note that the relative frequencies of edges of the same integer-linear type are
equivalent. Every edge of a one-dimensional continued fraction is at unit integer
distance from the origin. Thus, the integer-linear type of a segment is defined by its
integer length (the number of inner integer points plus unity). Denote the relative
frequency of the edge of integer length k by μ1 ( k ).
1
μ1 k = ln 1 + .
k(k + 2)
(k + 1)(k + 1) 1
= ln = ln 1 + .
k(k + 2) k(k + 2)
This proves the proposition.
(k+1)(k+1)
Remark 9.31 Note that the argument of the logarithm k(k+2) is the cross-ratio
of points (−1, 1), (0, 1), (k, 1), and (k + 1, 1).
9.11 Exercises
Exercise 9.1
(a) Prove that the measure
1 dx
μ(S) =
ln 2 S 1+x
is a probability measure on the segment [0, 1], i.e., μ([0, 1]) = 1.
(b) Find μ([a, b]) for 0 ≤ a < b ≤ 1, where μ is as above.
Exercise 9.2 Ergodicity of the doubling map. Consider the space (S 1 , Σ, λ), where
X is the unit circle, Σ is the Borel σ -algebra, and λ is the Lebesgue measure. Con-
sider the doubling map T : S 1 → S 1 such that
T (ϕ) = 2ϕ.
Exercise 9.4 Prove the μ̂-density theorem from the Lebesgue density theorem.
Exercise 9.5 Recall that Ψ0 is the subset irrational numbers in [0, 1] whose contin-
ued fractions do not contain 1 as an element. Prove by elementary means (without
using ergodic theorems) that
μ̂(Ψ0 ) = 0.
Exercise 9.7 Prove that any two triples of points on a line are projectively equiva-
lent. Find a criterion for two 4-tuples of points on a line to be projectively equivalent.
We say that a rational number p/q (where q > 0) is a best approximation of a real
number α if for any other fraction p /q with 0 < q < q we get
α − p > α − p .
q q
Remark 10.1 By the definition of best approximations, every integer is a best ap-
proximation of every real number α. To be more consistent we should say that the
best approximations with integer denominator are α and α + 1.
Remark 10.2 From the definition of best approximations it is not immediate that
there are no two best approximations with the same denominator (greater than 1).
This is a direct corollary of Theorem 10.3 below.
Theorem 10.3 Let α be a positive real number with regular continued fraction
[a0 ; a1 : · · · ]. Denote by pk /qk its k-convergent and by rk the remainder part
[ak ; ak+1 : · · · ]. A rational number p/q (with q > 1) is a best approximation of
α if and only if there exist k ≥ 0 and an integer x such that
Proposition 10.5 Consider an arbitrary nonzero real number α. Then all its con-
vergents are best approximations for α.
Proof Let pk /qk be the k-convergent to α (we assume that the integers pk and qk
are relatively prime). We prove the proposition by induction on k.
Base of induction. If the denominator qk equals 1, then we have either α for
k = 0 or sometimes α + 1 for k = 1 when a1 = 1. As we have agreed in Re-
mark 10.1, these two numbers are best approximations.
Step of induction. Suppose that the (k − 1)-convergent pk−1 /qk−1 is a best ap-
proximation. Let us prove that the k-convergent pk /qk is a best approximation (we
10.1 Two Types of Best Approximations of Rational Numbers 117
From Theorem 3.1 it follows that A and B are vertices of two different sails for two
angles defined by the ray y = αx together with two positive coordinate rays.
Denote by l1 the line passing through the points O and A, and by l2 the line
passing through the points B and parallel to l1 . Draw the line through B parallel
to the line x = 0 and denote the intersection of this line with the ray OA by C (see
Fig. 10.1).
From Proposition 1.15 it follows that the triangle AOB is empty. Hence, there
are no integer points in the strip between the lines l1 and l2 (which is gray in
Fig. 10.1). Thus every integer point of the triangle BOC is either in the line OC or
coincides with B. Therefore, every integer point (p, q) with qk−1 ≤ q < qk satisfies
p
− α > min pk−1 − α , pk − α = pk − α
q q q q
k−1 k k
(the last equality follows from Proposition 1.19). By induction, pk−1 /qk−1 is a best
approximation. Hence if q < qk−1 , then
p
− α > pk−1 − α > pk − α
q q q
k−1 k
(the second inequality follows from Proposition 1.19). Therefore, the k-convergent
pk /qk is a best approximation. The induction step is complete.
Lemma 10.6 Consider a nonzero real number α > 0 with regular continued frac-
tion [a0 ; a1 : · · · ]. Let pi /qi be its i-convergent for i = 0, 1, 2, . . . . Then for every
k ≥ 1 every best approximation p/q of α satisfying
qk−1 < q ≤ qk
118 10 Geometric Aspects of Approximation
lies on the line passing through the point (pk , qk ) and parallel to the vector
(pk−1 , qk−1 ) (it is the line l2 in Fig. 10.1).
Proof Let A = (pk−1 , qk−1 ) and B = (pk , qk ). Suppose that the lines l1 and l2 are
as above (see Fig. 10.1).
Consider E(p, q). If ld(E, OA) > 1, then the triangle EOA contains at least one
integer point E (p , q ) incident with EOA distinct from E and not contained in
AO. First, it is clear that q < q, since E has the greatest second coordinate among
all the points of the triangle OAE. Secondly, E and E are in the same half-plane
with respect to the line y = αx. Hence,
p
− α > p − α .
q q
qk < q ≤ qk+1
are of the form [a0 ; · · · : ak : m], where m is a positive integer not greater than ak .
Proof From Lemma 10.6 we know that all best approximations satisfying the con-
dition of the corollary are on a line l2 . All integer points of this line are of the form
B + mOA for an integer m, or in other words,
For m ≤ 0 the point B + mOA is not in the first octant. For m > ak the second coor-
dinate of the point B + mOA is greater than qk . Therefore, we have 0 < m ≤ ak .
Proof of Theorem 10.3 By Corollary 10.7 we know that all best approximations are
of the form
[a0 ; · · · : ak : x],
10.1 Two Types of Best Approximations of Rational Numbers 119
where 0 < x ≤ ak+1 . By Proposition 10.5 we get that all the convergents (i.e., the
case x = ak+1 ) are best approximations. It is easy to see that
Recall that
pk rk+1 + pk−1
α= ;
qk rk+1 + qk−1
pk
[a0 ; · · · : ak ] = ;
qk
pk x + pk−1
[a0 ; · · · : ak : x] = .
qk x + qk−1
Therefore,
[a0 ; · · · : ak−1 : ak ] − α = |pk qk−1 − qk pk−1 | = 1
qk (qk rk+1 + qk−1 ) qk (qk rk+1 + qk−1 )
and
[a0 ; · · · : ak : x] − α = |pk qk−1 − qk pk−1 |(rk+1 − x)
(qk x + qk−1 )(qk rk+1 + qk−1 )
rk+1 − x
= .
(qk x + qk−1 )(qk rk+1 + qk−1 )
Here |pk qk−1 − qk pk−1 | = 1 holds by Proposition 1.15. Notice that we do not con-
sider the case x ≥ rk+1 , since then the denominator of the corresponding fraction is
greater than qk+1 . Hence inequality (10.1) is equivalent to the inequality
rk+1 − x 1
< ,
(qk x + qk−1 ) qk
We say that a rational number p/q (where q > 0) is a strong best approximation of
a real number α if for every fraction p /q with 0 < q < q we get
q α − p > |qα − p|.
10.1 Two Types of Best Approximations of Rational Numbers 121
In this subsection we show that all strong best approximations are convergents of
the corresponding number. In particular, every strong best approximation is a best
approximation.
Proof Consider a strong best approximation p/q of α. Then for every rational num-
ber p /q with 0 < q < q we get
q α − p > |qα − p|.
Theorem 10.9 The set of all strong best approximations of a real number α coin-
cides with the set of all convergents of α.
but on the other hand, hqk +qk−1 > qk . Hence for 0 < h < ak+1 , the rational number
of expression (10.2) is not a strong best approximation. Therefore, every strong best
approximation of α is a convergent of α.
For the case of convergents we have
|qk+1 α − pk+1 | = (qk+1 + qk )α − (pk+1 + pk ) − (qk α − pk )
< |qk α − pk | − (qk+1 + qk )α − (pk+1 + pk ) < |qk α − pk |.
The first inequality holds since the integer points (pk+1 , qk+1 ) and (pk+1 +
pk , qk+1 + qk ) are in different half-planes with respect to the line y = αx. Therefore,
all the convergents are strong best approximations.
Example 10.11 Consider the example of α = 47 21 = [2; 4 : 5]. All best approxima-
tions and strong best approximations are shown in the following table.
Best + + + + + +
Strong best + − + − − +
In Sect. 10.2.4 we prove a general estimate for the discrepancy of best approxi-
mations for the arrangements defined by pairs of continued fractions with bounded
elements: it turns out that in this case, the discrepancy is proportional to N −2 , where
N is the size of the corresponding best approximation. Further, we give an example
of an arrangement one of whose continued fractions has a rapid growing sequence
of elements. In this case the discrepancy is asymptotically not better than N −1−ε .
We study best approximations of arrangements whose lines are eigenlines of cer-
tain matrices in SL(2, Z) in Sect. 10.2.5. Finally, in Sect. 10.2.6 we show how to
simplify the calculation of best approximations in practice.
In this section we study arrangements of two distinct lines passing through the ori-
gin.
We start with the notion of Markov–Davenport forms.
f (x, y) = ax 2 + bxy + cy 2
with real coefficients and positive discriminant Δ(f ) = b2 − 4ac. Then the form
f (x, y)
√
Δ(f )
is called the Markov–Davenport form.
Example 10.13 Let us write the Markov–Davenport form for the arrangement of
eigenlines of the Fibonacci operator
1 1
.
1 0
y = −θ x and y = θ −1 x,
124 10 Geometric Aspects of Approximation
√
where θ is the golden ratio, i.e., θ = 1+ 5
2 . The Markov–Davenport form of the
Fibonacci operator is
(y + θ x)(y − θ −1 x) 1
= √ x 2 − xy − y 2 .
−θ − θ −1 5
To construct an approximation theory for arrangements one should choose the set
of all approximations and define the quality of each of the approximations of this
subset. We consider the set of all integer arrangements as the set of approximations:
an arrangement of two lines passing through the origin is called integer if both its
lines contain integer points distinct from the origin. Let us define the quality of the
approximation (which we call the size).
For a vector v = (a, b) denote by |v| the norm max(|a|, |b|).
Remark 10.15 For a generic irrational arrangement the best approximation of any
size greater than 1 is unique. Nevertheless, for particular cases there could be more
than one best approximation of the same size.
Let us now define a natural distance between two arrangements in the plane passing
through the origin.
Take the maximal absolute values of the coefficients of these forms. Suppose that
they equal c1 and c2 . The discrepancy between R1 and R2 is min(c1 , c2 ). We denote
it by ρ(R1 , R2 ).
10.2 Rational Approximations of Arrangements of Two Lines 125
Remark 10.17 In case of arrangements R1 and R2 of ordered lines one should con-
sider the maximal absolute values of the coefficients only of one form
Remark 10.18 The space of all arrangements of two lines passing through the ori-
gin is two-dimensional. Nevertheless, the complexity of approximation by integer
arrangements is more complicated in comparison with simultaneous approximation
of a line in R3 (whose configuration space is also two-dimensional). This happens
because for the approximation of the arrangement of two lines one needs two pairs
of relatively prime integers, while for simultaneous approximation of a vector in R3 ,
a triple of relatively prime integers suffices. Nevertheless, geometric properties of
continued fractions essentially simplify the approximations of arrangements. This is
relevant especially for so-called algebraic arrangements, whose lines are eigenlines
of some SL(2, Z) matrix. In this case all the approximations are constructed from
some periodic sets that are easy to construct.
C1 C2
< ρ(R, RN ) < 2 .
N2 N
126 10 Geometric Aspects of Approximation
This theorem is a reformulation of one of the results of [101]. We begin the proof
with the following two technical lemmas.
Let Rδ1 ,δ2 denote the arrangement of lines y = (αi + δi )x for i = 1, 2.
α12 δ2 − α22 δ1 + α1 δ1 δ2 − α2 δ1 δ2 2
x
(α1 − α2 )(α1 − α2 + δ1 − δ2 )
α2 δ1 − α1 δ2
+ xy
(α1 − α2 )(α1 − α2 + δ1 − δ2 )
δ2 − δ1
+ y2. (10.3)
(α1 − α2 )(α1 − α2 + δ1 − δ2 )
Consider the absolute values of the coefficients at y 2 and at xy for the form
ΦR − ΦRδ1 ,δ2 . By the conditions of the lemma these coefficients are less then ε. We
have
δ2 − δ1
(10.4)
(α − α )(α − α + δ − δ ) < ε;
1 2 1 2 1 2
α2 δ1 − α1 δ2
(10.5)
(α − α )(α − α + δ − δ ) < ε.
1 2 1 2 1 2
(α1 − α2 )2
|δ1 − δ2 | < ε.
1 − ε|α1 − α2 |
Lemma 10.21 Let ε be a positive real number. Suppose that real numbers δ1 and
δ2 satisfy |δ1 | < ε and |δ2 | < ε. Then the following estimate holds:
Proof The statement of the lemma follows directly from the estimate for R − Rδ1 ,δ2 ,
shown in (10.3). For the coefficient at x 2 we have
2
α1 δ2 − α22 δ1 + α1 δ1 δ2 − α2 δ1 δ2 α12 + α22 + |α1 − α2 |ε
(α − α )(α − α + δ − δ ) < (|α − α |)(|α − α | + 2ε) ε.
1 2 1 2 1 2 1 2 1 2
α2 δ1 − α1 δ2 2(|α1 | + |α2 |)
xy < ε.
(α1 − α2 )(α1 − α2 + δ1 − δ2 ) (|α1 − α2 |)(|α1 − α2 | + 2ε)
δ2 − δ1 2
y2 < ε.
(α1 − α2 )(α1 − α2 + δ1 − δ2 ) (|α1 − α2 |)(|α1 − α2 | + 2ε)
Therefore, the statement of the lemma holds by the definition of discrepancy.
Proof of Theorem 10.19 Let us start with the proof of the inequality
C1
< ρ(R, RN ).
N2
Let α1 = [a0 ; a1 : · · · ] and pi /qi = [a0 ; a1 : · · · : ai ] for i = 0, 1, . . . . Without loss
of generality we assume that N > a0 . Assume that k is the greatest integer for which
pk ≤ N and qk ≤ N . Then we have the following estimates:
p
min α1 − |p|≤N, |q|≤N
q
pk+1 1
≥ α1 − ≥
qk+1 qk+1 (qk+1 + qk+2 )
128 10 Geometric Aspects of Approximation
1
≥
(ak+1 + 1)qk ((ak+1 + 1)qk + (ak+1 + 1)(ak+2 + 1)qk )
1 1
≥ · .
(ak+1 + 1)2 (ak+2 + 2) N 2
The second inequality follows from Exercise 1.3 on page 18 (we leave the proof for
the reader). For the third inequality we use the following inequality twice:
C2
ρ(R, RN ) > .
N2
Again we assume that k is the greatest integer for which |pk | ≤ N and qk ≤ N .
For α1 we have the following:
α1 − pk < pk+1 − pk = 1 < ak+1 + 1 < (ak+1 + 1)(|α1 | + 1) .
2
qk qk+1 qk qk qk+1 2
qk+1 N2
The first inequality follows from the fact that if the k-convergent is greater than α,
then the (k + 1)-convergent is less then α, and conversely. Proposition 1.15 implies
the second equality. The third inequality follows from inequality (10.6). In the last
inequality we use the assumption that either qk+1 > N or |pk+1 | > N , in which case
|pk+1 | N
qk+1 > >
|α1 | + 1 |α1 | + 1
(here the first inequality holds since |α − pk+1 /qk+1 | < 1). From conditions of the
theorem the set of all elements ai of the continued fraction for α is bounded. There-
such that for every N there exists an approximation
fore, there exists a constant C2,1
of α1 of magnitude smaller than C2,1 /N 2 .
such that for every N
A similar estimate holds for α2 . There exists a constant C2,2
there exists an approximation of α1 of magnitude smaller than C2,2 /N 2 . Therefore,
Remark 10.22 Now we say a few words about the case of unbounded elements of
continued fractions. Let ε be a small positive real number. If the elements of a con-
tinued fraction (say for α1 ) grow fast enough, then there exists a sequence Ni for
which the approximations RNi are of magnitude (N C)1+ε . Let us illustrate this by the
i
following example.
10.2 Rational Approximations of Arrangements of Two Lines 129
– a0 = 1;
pk
– let a0 , . . . , ak be defined and [a0 ; · · · : ak ] = qk ; then ak+1 = (qk )M−1 .
Let Nk = qk+1
2 + 1 for k = 1, 2, . . . . Then there exists a positive constant C such
that for every integer i we have
C
ρ(R, RNi ) ≥ 1+1/M
.
Ni
From Theorem 10.3 it follows that there is no best approximation whose denomina-
tors q satisfies
qk < q ≤ N k .
C
ρ(R, RNk ) = ρ(R, Rpk /qk ) ≥ 1+1/M
.
Nk
The last inequality follows from Lemma 10.20. This completes the proof.
C
ρ(R, RN ) ≤ ,
N
Fig. 10.2 The arrangement of two lines and its geometric continued 1-, 2-, and 3-fractions
Let us prove a relation between classical geometry of numbers and best simultane-
ous approximations.
Definition 10.24 Define inductively the w-sails for an arbitrary angle C with vertex
at the origin.
– let the 1-sail be the sail of C.
– suppose that all w-sails for w < w0 are defined. Then let the w0 -sail be
0 −1
w
∂ conv C ∩ Z \{O} \
2
w-sail ,
w=1
In Fig. 10.2 we show the geometric continued 1-, 2-, and 3-fractions for a partic-
ular arrangement of two lines. It is interesting to notice that the geometric continued
3-fraction in the figure is homothetic to the geometric continued 2-fraction and to
the geometric continued fraction (which is a 1-fraction). It turns out that it is a gen-
eral fact that the w-continued fraction of any general arrangement R (i.e., whose
lines do not contain integer points except the origin) is homothetic to the geometric
continued fraction of R; the coefficient of homothety is w. We leave this statement
as an exercise for the reader.
10.2 Rational Approximations of Arrangements of Two Lines 131
Definition 10.25 The minimal absolute value of the form ΦR on the integer lattice
minus the origin is called the Markov minimum of ΦR . We denote it by αR .
Recall that Markov minima are studied in the Markov spectrum theory discussed
in Sect. 7.4.
Proof The statement follows directly from the fact that the w-continued fraction of
R is homothetic to the continued fraction of R.
Theorem 10.27 Let R be an algebraic arrangement. Then there exists C > 0 such
that for every N ∈ Z+ the following holds: each of the two lines of the best approx-
imation arrangement RN contains an integer point of some w-sail for w < C.
Notice that
ΦR (x1,N , y1,N ) = (x1,N − α1 y1,N )(x1,N − α2 y1,N )
α1 − α2
x1,N x1,N − α2 y1,N
=
− α1 · y1,N .
y1,N α1 − α2
The first term above is bounded by C̃/N 2 for some constant C̃ that does not depend
on N . Hence,
132 10 Geometric Aspects of Approximation
2
y1,N (x1,N /y1,N ) − α2 (x1,N /y1,N ) − α2
|ΦR (x1,N , y1,N )| ≤ C̃ 2 · .
N α1 − α2 ≤ C̃ α1 − α2
Finally, the last expression is uniformly bounded in N . The same property holds for
the values |ΦR (v2,N )|.
Therefore, by Lemma 10.26 the vectors of v1,N and v2,N for all N are in the
union of a finite number of w-continued fractions for R.
Conjecture 5 We conjecture that for all algebraic arrangements for almost all N ,
the vectors v1,N and v2,N defining RN are in a 1-geometric continued fraction.
Proposition 10.28 Let m and n be two integers. Supposing that |α1 − pq | < δ (or
|α2 − pq | < δ, respectively), then the following holds:
α1 − p > |α1 − α2 | |ΦR (p, q)|
q |α1 − α2 | + δ q2
α2 − p > |α1 − α2 | |ΦR (p, q)| .
q |α1 − α2 | + δ q2
Proof We have
α1 − p = 1 |p − α1 q| = 1 |p − α1 q|(p − α2 q)
q q q p − α2 q
|ΦR (p, q)| |α1 − α2 | |α1 − α2 | |ΦR (p, q)|
= > .
q2 |α1 − α2 + (p/q − α1 )| |α1 − α2 | + δ q2
1 ≤ pi ≤ N and 1 ≤ qi ≤ N.
Also find pj /qj , the convergent satisfying the same property for α2 .
Step 2. Consider the arrangement R of lines
pi pj
y= x and y = x.
qi qj
We know that
pj
α − pi ≤ 1 1
and α − ≤ .
qi qi qi+1 qj qj qj +1
1 1
ρ(R, R) < C max , ,
qi qi+1 qj qj +1
where
max(2, 2(|α1 | + |α2 |), α12 + α22 + |α1 − α2 |)
C= .
(|α1 − α2 |)(|α1 − α2 |)
Step 3. From the definition of best approximations and the result of Step 1 we have
1 1
ρ(R, RN ) ≤ ρ(R, R) < C max , .
qi qi+1 qj qj +1
where C is as in Step 2.
Step 4. Suppose now that the lines of the best approximation RN contains integer
vectors (n1 , m1 ) and (n2 , m2 ) of unit integer length. By Proposition 10.28 we
write the following estimates on the values of the Markov–Davenport form ΦR
at these vectors:
134 10 Geometric Aspects of Approximation
ΦR (m1 , n1 ) |α1 − α2 | + δ1 m1 |α1 − α2 | + δ1
< α1 − < δ1 ;
n21 |α1 − α2 | n1 |α1 − α2 |
ΦR (m2 , n2 ) |α1 − α2 | + δ2 m2 |α1 − α2 | + δ2
< α1 − < δ2 .
n22 |α1 − α2 | n2 |α1 − α2 |
Step 5. This is the last step. Here we should consider all integer arrangements that
contains integer points (m1 , n1 ) and (m2 , n2 ) satisfying the inequalities obtained
in Step 4 for ΦR (mi , ni ) for i = 1, 2. Then one should choose the arrangement
with the smallest discrepancy, which is RN .
Output. In the output we have the best approximation RN .
Remark 10.29 The above procedure of best approximation calculation seriously de-
creases the number of integer points in the square N × N that should be tested to
find RN . This is of a great importance especially for the case of algebraic arrange-
ments. The procedure gives an explicit estimate on the number of w-sails whose
points should be checked to construct best approximations. Since all w-sails are pe-
riodic, all its points are contained in the finite number of orbits with respect to the
corresponding Dirichlet group action.
Example 10.30
Input data. Consider an arrangement of eigenlines of the Fibonacci operator:
0 1
.
1 1
Step 3. Now, following Lemma 10.20 we write the estimates on δ1 and δ2 of the
best approximation RN = Rδ1 ,δ2 :
80.35 18.97
|δ1 | < and |δ2 | < .
N2 N2
10.3 Exercises 135
Step 4. Suppose that the best approximation RN is defined by the two integer vec-
tors (m1 , n1 ) and (m2 , n2 ) of unit integer length. Then we have the following
two estimates:
|ΦR (m1 , n1 )| 80.65 |ΦR (m2 , n2 )| 18.99
< , < .
n21 N2 n22 N2
Step 5 and Output. We have completed the computations for N = 106 , and the an-
swer in this case is the matrix with eigenspaces F29 y − F30 x = 0 and F30 y +
F29 x = 0. We conjecture that for the Fibonacci matrix all the best approxima-
tions are the arrangements of lines Fk−1 y − Fk x = 0 and Fk y + Fk−1 x = 0 for
k = 0, 1, . . . .
10.3 Exercises
Exercise 10.1 Find all best approximations and all strong best approximations for
the numbers 433/186 and 575/247.
Exercise 10.2 Let ABC be an integer empty triangle. Consider a strip bounded by
two lines parallel to AB and passing through the points A and C respectively. Prove
that all integer points on this strip are at its boundary lines.
Exercise 10.3 Consider a positive real number α with a regular continued fraction
[a0 ; a1 : · · · ]. Let x be an integer satisfying 1 ≤ x ≤ ak+1 − 1. Prove that
[a0 ; a1 : · · · : ak : x] − α > [a0 ; a1 : · · · : ak : x + 1] − α .
Exercise 10.4 Let α = [a0 ; a1 : · · · ] be a positive real number. Denote by pk /qk its
k-convergent and by rk the remainder part [ak ; ak+1 : · · · ]. Then the condition
qk+1
rk+1 <
qk
3 5 7 −2
and
2 3 −3 1
Exercise 10.6 Consider an arbitrary angle C with vertex at the origin whose edges
do not contain other integer points. Prove that the w-sail of C is homothetic to the 1-
sail of C and find the coefficient of homothety. What happens for the angles having
integer points on their edges?
Exercise 10.7 Let R be an algebraic arrangement of two lines. Then the following
statements hold.
(a) The Markov–Davenport form ΦR attains only a finite number of values at the
integer points of the corresponding continued fraction. Denote this set by S.
(b) The form ΦR attains the value x at some vertex of the w-sail if and only if
x/w ∈ S.
Exercise 10.8 Find all best approximations of sizes 1, 2, 3, 4, 5 for the arrangement
of the eigenlines of the Fibonacci operator.
Chapter 11
Geometry of Continued Fractions with Real
Elements and Kepler’s Second Law
Remark Without loss of generality, we mostly use only O-broken lines, where O is
the origin.
sgn(ABC)
⎧
⎨ 1, if the pair of vectors (BA, BC) defines the positive orientation,
= 0, if the points A, B, and C are in a straight line,
⎩
−1, if BA, BC defines the negative orientation.
Let us generalize the notion of LLS sequences for sails to an arbitrary O-broken
line.
The sequence (a0 , . . . , a2n−2 ) is called a lattice signed length-sine sequence for the
broken line A0 A1 . . . An , or LSLS sequence, for short.
The continued fraction for the O-broken line A0 A1 . . . An is the element [a0 ; a1 :
· · · : a2n−2 ] of R.
Notice that the LSLS sequence for the sail of some angle coincides with the LLS
sequence of this sail. So we consider LSLS sequences as a natural combinatoric–
geometric generalization of LLS sequences. Note also that an O-broken line is
uniquely defined by its LSLS sequence, the point A0 , and the direction of the vector
A0 A1 .
In Fig. 11.1 we show how to get signs of elements of the LSLS sequence from
the local geometry of a broken line. As an example we consider the O-broken line
with four vertices in Fig. 11.2.
In the next theorem we state that the LSLS sequence is the complete invariant
for the action of the group of proper integer congruences (i.e., integer congruences
preserving the orientation of the plane).
Theorem 11.3
(i) Two O-broken lines are proper integer congruent if and only if their LSLS se-
quences coincide.
11.2 Continued Fractions with Real Coefficients 139
(ii) Every finite sequence of nonzero integers is realizable as an LSLS sequence for
some O-broken line.
We skip the proof of this theorem, since later, in the next section, we introduce a
more general construction that generalize LSLS sequences for O-broken lines. For
further information on O-broken lines and their LSLS sequences we refer to [94]
and [96].
Further in this chapter we work in the plane with a marked point O (say the
origin). For two vectors v and w in the plane we denote the oriented volume of the
parallelogram spanned by the vectors v and w by |v × w|.
In Chap. 3 (see page 37) we gave a geometric algorithm to construct sails for real
numbers given the regular continued fractions of the corresponding numbers. The
algorithm has a sequence of positive integers (a0 , a1 , . . .) as input and its output
is the sails for the number [a0 , a1 , . . .]. In this subsection we slightly modify this
algorithm such that it will construct a broken line starting from an arbitrary sequence
of nonzero real numbers. This algorithm will give us a base to define continued
fractions related to broken lines (we do this later, in the next subsection).
Ak+1 = Ak + a2k Ak Q.
Remark 11.4 The constructed broken line in some sense generalizes geometric con-
struction from the case of sails related to regular continued fractions (see Theo-
rem 3.1 above) to the case of broken lines related to arbitrary continued fractions
(see Theorem 11.10 and Corollary 11.11 below).
11.2 Continued Fractions with Real Coefficients 141
|OA0 × OA1 | = a0 .
Inductive Step. Supposing that the statement holds for k − 1, let us prove it for k.
We start with a2k :
Example 11.6 Let us consider the first interesting case of broken lines with three
vertices. Without loss of generality we fix A0 = (1, 0) and the direction v = (0, 1).
Letting the sequence be (a0 , a1 , a2 ), we have
A1 = (1, a0 ).
142 11 Geometry of Continued Fractions and Kepler’s Second Law
P = (1, 1 + a0 ), Q = (1 + a1 , 1 + a0 + a0 a1 ).
Finally, we have
A2 = (1 + a1 a2 , a0 + a2 + a0 a1 a2 ).
Notice that the coordinates of the point A2 coincide with the denominator and the
numerator of the continued fraction [a0 ; a1 : a2 ].
Let us extend the definition of LLS sequences for sails to the case of broken lines,
basing the definition on the expressions of Proposition 11.5. Consider an arbitrary
broken line A0 . . . An satisfying the following condition: any of its edge is not con-
tained in a line passing through the origin O.
Proposition 11.8 Let a broken line be an O-broken line. Then its LLS sequence
coincides with its LSLS sequence.
Proposition 11.9 Consider two broken lines A0 . . . An and B0 . . . Bn whose LLS se-
quences are (a0 , . . . , a2n ) and (b0 , . . . , b2n ) respectively. Suppose that there exists a
GL(2, R)-operator taking the first broken line to the second one. Let the determinant
of this operator equal λ. Then we have
a2k = λb2k , k = 0, . . . , n,
a2k−1 = λ1 b2k−1 , k = 1, . . . , n.
Proof The statement follows directly from formulas of Proposition 11.5, since the
area of any parallelogram in the definition is multiplied by λ.
11.2 Continued Fractions with Real Coefficients 143
Let us recall the notion of polynomials Pk and Qk from Chap. 1. The polynomials
Pk and Qk are uniquely defined by the following three conditions: first, they are
relatively prime; secondly,
Pk (x0 , . . . , xk )
= [x0 ; x1 : · · · : xk ];
Qk (x0 , . . . , xk )
thirdly,
Pk (0, . . . , 0) + Qk (0, . . . , 0) = 1.
Proof We prove this theorem by induction on the number of vertices in the broken
line, i.e., in n.
Base of induction. If the broken line has two vertices A0 , A1 and its LLS se-
quence is (a0 ), then A1 = (1, a0 ).
Step of induction. Assume that the statement holds for all broken lines with k
vertices. Let us give a proof for an arbitrary broken line with k + 1 vertices. Consider
a broken line A0 . . . Ak . Let its LLS sequence be (a0 , . . . , a2k ).
Let
a a + 1 −a1
T= 0 1 .
−a0 1
The transformation T takes A1 to (1, 0) and the line A2 A1 to the line x = 1. Under
the transformation T the broken line A0 . . . Ak is taken to another broken line, which
we denote by B0 B1 . . . Bk . Since the determinant of T equals 1, then by Proposi-
tion 11.9, the LLS sequence for B0 B1 . . . Bk coincides with (a0 , . . . , a2k ). By the
induction assumption, the statement holds for the broken line B1 . . . Bk . Thus we
have
Bk = Q2k−1 (a2 , . . . , a2k ), P2k−1 (a2 , . . . , a2k ) .
Letting Bk = (q, p), then
Ak = T −1 (Bk ) = p + a1 q, a0 p + (a0 a1 + 1)q .
Therefore, we have
Now we substitute p = P2k−1 (a2 , . . . , a2k ) and q = Q2k−1 (a2 , . . . , a2k ). Let us cal-
culate the value of the sum of the numerator and denominator in the leftmost fraction
at the point O = (0, . . . , 0):
a0 P2k−1 + (a0 a1 + 1)Q2k−1 + (P2k−1 + a1 Q2k−1 ) (O)
= (P2k−1 + Q2k−1 )(O) = 1.
The first equality holds since we take a0 = a1 = 0. The second equation holds by
definition of P2k−1 and Q2k−1 . Therefore,
P2k+1 (a0 , a1 , . . . , a2k ) = a0 P2k−1 (a2 , . . . , a2k ) + (a0 a1 + 1)Q2k−1 (a2 , . . . , a2k ),
Q2k+1 (a0 , a1 , . . . , a2k ) = P2k−1 (a2 , . . . , a2k ) + a1 Q2k−1 (a2 , . . . , a2k ).
This concludes the proof for the broken line A0 . . . Ak . The induction step is com-
plete.
Remark 11.12 Supposing that the sequence of numbers (a0 , a1 , . . . , a2n ) in Corol-
lary 11.11 contains only positive integers, the broken line A0 . . . An coincides with
the sail of the angle defined by rays y = 0 and y = αx in the first quadrant of the
plane. In particular, A0 . . . An is in the boundary of a convex set and its LLS se-
quence coincides with the LLS sequence of the sail for the angle ∠A0 OAn .
Corollary 11.13 Consider two broken lines A0 . . . An and B0 . . . Bm with LLS se-
quences (a0 , a1 , . . . , a2n ) and (b0 , b1 , . . . , b2m ) respectively. Let B0 = A0 and the
vector A0 A1 coincide with the vector ab00 B0 B1 . Suppose that
Then the points An , Bm , and the origin O are contained in one line.
11.2 Continued Fractions with Real Coefficients 145
Proof Consider a transformation in the group SL(2, R) taking the point A0 to (1, 0)
and A1 to some point on the line x = 1. It takes both broken lines to some broken
lines with B0 = A0 = (1, 0) and the points A1 and B1 to some points on the line
x = 1. By Proposition 11.9 this operator does not change the continued fraction.
Now we are in a position to apply Corollary 11.10: the images of the points An , Bm ,
and the origin are on a line. Therefore, the points An , Bm , and O are on a line.
Knowing the continued fraction for a broken line, it is possible to say whether
the corresponding broken line is closed.
Corollary 11.14 (On necessary and sufficient conditions for broken lines to be
closed) Consider a broken line A0 A1 . . . An with the LLS sequence (a0 , a1 ,
. . . , a2n ). Let A0 = (1, 0) and A0 A1 be on the line x = 1. Then the broken line
is closed if and only if
[a0 ; a1 : · · · : a2n ] = 0.
Notice also that for an arbitrary broken line the conditions are almost the same, as
seen from Proposition 11.9.
Example 11.15 Let us write the conditions for a broken line consisting of three
edges to form a triangle. Suppose that the LLS sequence of this broken line is
(a0 , a1 , a2 , a3 , a4 ). Then the conditions are introduced by the following system:
a0 a1 a2 a3 a4 + a0 a1 a2 + a0 a1 a4 + a0 a3 a4 + a2 a3 a4 + a0 + a2 + a4 = 0,
a1 a2 a3 a4 + a1 a2 + a1 a4 + a3 a4 + 1 = 1.
In Fig. 11.4 we show two examples of broken lines with three edges.
Due to Proposition 11.9 it is sufficient to solve this problem only for translations
by a vector.
Let us go even further and consider a differentiable curve instead of a broken line.
In this situation it helps to think of curves as broken lines with infinitesimally small
segments. In some sense the LLS sequence “splits” into a pair of functions related to
the odd and even elements of the LLS sequence. Further, we call these two functions
areal and angular densities. In this section we discuss some basic properties of areal
and angular densities, in particular we shell show that the areal density is the inverse
to the velocity of a body that moves according to the second Kepler law.
Throughout this section we suppose that all curves are of class C 2 and have the
arc-length parameterization.
Proposition 11.17 (Areal density and the Kepler’s second law) Suppose that a body
moves along the curve γ with velocity 1/A. Then the sector area velocity of the body
is constant and equals 1.
Proposition 11.18 Let the vectors Oγ (t) and γ̇ (t) be noncollinear at some mo-
ment t. Then the following holds:
A2 (t)B(t) = κ(t).
A2 (t)B(t)
ε 2
γ (t + ε) = γ (t) + εγ (t) + γ (t) + terms of higher order,
2
ε 2
γ (t − ε) = γ (t) − εγ (t) + γ (t) + terms of higher order.
2
Let us substitute
ε 2
ε 2
γ (t)γ (t + ε) = εx + x , εy + y + terms of higher order,
2 2
2
ε ε 2
γ (t)γ (t − ε) = −εx + x , −εy + y + terms of higher order.
2 2
Hence we have
|γ (t)γ (t − ε) × γ (t)γ (t + ε)| ε 3 (x y − y x ) + 4th order terms
lim 3
= lim
ε→0 ε ε→0 ε3
ε 3 κ(t) + 4th order terms
= lim
ε→0 ε3
= κ(t).
Let us prove the theorem on existence of a curve with a given angular density.
In some sense this theorem is a smooth analogue of the algorithm to construct a
broken line by the elements of the corresponding continued fraction of Sect. 11.2.1.
It is interesting to observe that to reconstruct a curve it is necessary to know the areal
density (which corresponds only to odd elements in the LLS sequences for broken
lines).
Theorem 11.19 Suppose that we know the areal density A(t) smoothly depending
on a parameter t in some neighborhood of t0 , the starting position γ (t0 ), and the
origin O.
– If |A(t0 )| > |Oγ (t0 )|, then there is no finite curve satisfying the above conditions
– If |Oγ (t0 )| > |A(t0 )| > 0, then the curve γ is uniquely defined in some neighbor-
hood of the point γ (t0 ).
Proof In polar coordinates (r, ϕ) with center at O, the curve γ is defined by the
following system of differential equations:
2
r ϕ̇ = A,
ṙ 2 + r 2 ϕ̇ 2 = 1.
This system is equivalent to the union of the following two systems:
ϕ̇ = rA2 , ϕ̇ = rA2 ,
and
2 2
ṙ = 1 − Ar 2 , ṙ = − 1 − Ar 2 .
By the main theorem of ordinary differential equations (e.g., see [8]), each of these
two systems has a finite solution if and only if |r| > |A| > 0.
There are many interesting questions related to the convergence of broken lines to
smooth curves in the context of areal and angular densities. Let us first show one
particular result in this direction.
Let γ (t) be a curve with arc-length parameter t ∈ [0, T ] and densities A(t) and
B(t). In addition we suppose that the vectors Oγ (t) and γ̇ (t) are linearly indepen-
dent for all admissible values of t. For a positive integer n we denote by γn the
broken line V0,n . . . Vn,n with vertices
i
Vi,n = γ T .
n
Suppose that the LLS sequence of γn is (a0,n , . . . , a2n,n ). Consider the following
two functions An and Bn :
Proof This follows directly from the definition of density functions and the proper-
ties of LLS sequences shown in Proposition 11.5.
We continue with two open problems. The first one is in some sense an inverse
problem to the statement of Theorem 1.16.
Notice that it is not clear what kind of limits for the elements of LLS sequence
one should use here.
Finally, it is interesting to have some criterion for the curve to be closed, as was
done for broken lines in Corollary 11.14.
Problem 8 What are the conditions on the functions A(t) and B(t) for the resulting
curve γ to be closed?
Let us write down the areal and angular densities for straight lines, ellipses, and
logarithmic spirals.
2 2
Ellipses and Their Foci Consider again the ellipse xa 2 + yb2 = 1 with a ≥ b > 0.
√
Let now O be the point (− a 2 − b2 , 0), which is one of the foci of the ellipse. Then
the areal and the angular densities are as follows:
√
ab + b a 2 − b2 cos t
A(t) =
a 2 sin2 t + b2 cos2 t
and
a
B(t) = √ .
b a 2 sin2 t + b2 cos2 t(a + cos t a 2 − b2 )2
Remark on Keplerian planetary motion Consider the Sun and some planet that
moves around the Sun. Then the planet moves according to Kepler’s laws:
I. The orbit of the planet is an ellipse with the Sun at one of the two foci of the
ellipse.
II. The motion has constant sector area velocity.
III. The square of the orbital period of the planet is proportional to the cube of the
semimajor axis of its orbit.
2 2
The trajectory of the planet is an ellipse defined by the equation xa 2 + yb2 = 1
with a ≥ b > 0 in some Euclidean coordinates (according to Kepler’s first law), in
addition, the Sun is one of the foci. Then according to Kepler’s second law, the
planet moves with velocity λ/A(t) for all t. The constant λ is defined from Kepler’s
third law:
T 2 Te2
= 3,
a3 ae
where by T we denote the orbital period of our planet, and by Te and ae , respec-
tively, the orbital period and the semimajor axis for Earth. Denote by L the length
of the ellipse for the planet, i.e.,
π/2
b2
L = 4a 1 − 1 − 2 cos2 tdt.
0 a
L
Since T = |λ| 0 |1/A(t)|dt, we have
3/2
Te a
λ = ± L .
0 |1/A(t)|dt ae
Observe that
1
A3 (t)B(t) = ,
b2 + 1
meaning that for logarithmic spirals the function A3 B is constant.
Remark 11.21 When we consider lines, ellipses, and spirals we have certain rela-
tions between densities A and B: the functions A, A/B, and A3 B respectively are
constant. Notice that if A2 B is a constant function, then the curvature is constant,
and hence we get circles. It is natural to study what kind of curves are defined by
certain relations between the densities. For instance, the following question remains
open: what curves do we get if AB (or simply B) is constant?
11.4 Exercises
Exercise 11.2 Draw a broken line on five vertices whose LLS sequence satisfies
(a) all elements are positive;
(b) all elements are negative;
(c) even elements are positive and odd elements are negative;
(d) odd elements are positive and even elements are negative.
Let us start with the following question. Suppose that we have arbitrary numbers a,
b, and c satisfying
a + b = c.
How do we calculate the continued fraction for c knowing the continued fractions
for a and b? It turns out that this question is not a natural question within the theory
of continued fractions. The simplest algorithm works as follows:
– first calculate the rational number representations for a and b;
– add them according to basic school arithmetic;
– write the continued fraction for the sum.
One can hardly imagine any law to write the continued fraction for the sum directly.
The main obstacle here is that the summation of rational numbers does not have
a geometric explanation in terms of the integer lattice. In this chapter we propose
to consider a “geometric summation” of continued fractions, which we consider a
summation of integer angles.
In Sect. 12.1 we start with the notion of extended integer angles. These angles
are the integer analogues of Euclidean angles of the type kπ + ϕ for arbitrary in-
tegers k. We classify extended angles by writing normal forms representing all of
them. Finally, we define the M-sums of extended angles and integer angles. Further,
in Sect. 12.2, we show how the continued fractions of extended angles are expressed
in terms of the corresponding normal forms. Finally, in Sect. 12.3 we give a proof
of Theorem 6.8(i) on the sum of integer angles in integer triangles, which is based
on the techniques introduced in this chapter.
sets S1 and S2 are proper integer congruent if there exists a proper integer affine
transformation of R2 taking the set S1 to the set S2 , which we write as S1 ∼
ˆ S2 .
=
We say that an integer oriented broken line An An−1 An−2 . . . A0 is inverse to the
integer oriented broken line A0 A1 A2 . . . An .
Definition 12.1 Consider an integer point V . Two integer oriented V -broken lines
l1 and l2 (see Definition 11.1) are said to be equivalent if their first and their last
vertices coincide respectively and the closed broken line generated by l1 and the
inverse of l2 is contractible in R2 \ {V }.
Definition 12.2 The equivalence class of integer oriented V -broken lines contain-
ing a broken line A0 A1 . . . An is said to be the extended integer angle for the bro-
ken line A0 A1 . . . An with vertex V (or for short, extended angle). We denote it by
∠(V , A0 A1 . . . An ).
The set of extended angles is invariant with respect to the action of (proper)
integer affine transformations. Therefore, (proper) integer congruence for extended
angles is well defined.
r = {V + λv | λ ≥ 0}.
and call it the intersection number of the ray r and the segment AB.
12.1 Extension of Integer Angles. Notion of Sums of Integer Angles 155
the intersection number of the ray r the V -broken line A0 A1 . . . An and denote it by
#(r, A0 A1 . . . An ).
The number
1
#(r+ , A0 A1 . . . An ) + #(r− , A0 A1 . . . An )
2
is called the revolution number for the extended angle ∠(V , A0 A1 . . . An ).
We denote it by #(∠(V , A0 A1 . . . An )). Additionally, we put by definition
#(∠(V , A0 )) = 0.
Example 12.5 Let O = (0, 0), A = (1, 0), B = (0, 1), and C = (−1, −1). Then
1
# ∠(O, A) = 0, # ∠(O, AB) = ,
4
3
# ∠(O, ABCA) = 1, # ∠(O, ACB) = − .
4
Let us show that the definition of the revolution number is well defined.
Proposition 12.6 The revolution number of any extended angle is well defined.
Proof Consider an arbitrary V -broken line and the corresponding extended angle
∠(V , A0 A1 . . . An ). Let
From the definition we have V = V , A0 = A0 , An = Am , and the broken line
A0 A1 . . . An Am−1 . . . A1 A0 is contractible in R2 \ {V }. This implies that
# ∠(V , A0 A1 . . . An ) − # ∠ V , A0 A1 . . . Am
1
= # r+ , A0 A1 . . . An Am−1 . . . A1 A0 + # r− , A0 A1 . . . An Am−1 . . . A1 A0
2
=0+0=0
(we leave to the reader the proof of the second equality; see Exercise 12.2). Hence,
# ∠(V , A0 A1 . . . An ) = # ∠ V , A0 A1 . . . Am .
Remark Notice that the revolution number of a closed V -broken line coincides with
the degree of a point V with respect to the V -broken line.
Finally, we formulate a nice expression for the rotation number of a closed in-
teger broken line with integer vertices not passing through the origin introduced in
a recent preprint [79] by A. Higashitani and M. Masuda. Recall that the rotation
number about the integer point V of a closed broken line L not passing through
V is the degree of the projection of L to the unit circle along the rays with vertex
at V . We denote the rotation number by RotV (L). Notice that a rotation number for
V -broken lines coincides with the revolution number of the corresponding extended
angles.
Proposition 12.8 Consider a closed integer V -broken line L and enumerate all
of its integer points A1 , . . . , Ad (not only vertices). In addition, set A0 = Ad and
Ad+1 = A1 . Then we have
1 1
d d
RotV (L) = εi + ai ,
4 12
i=1 i=1
In the next theorem we use zero integer angles and their trigonometric functions.
Let A, B, and C be three integer points on a straight line. Suppose that B is distinct
from A and C and that it is not between A and C. We say that the integer angle
∠ABC with vertex at B is zero. Further, we put by definition
Denote by larctan(0) the angle ∠AOA, where A = (1, 0) and O is the origin.
(a0 , . . . , an , a0 , . . . , an , . . . , a0 , . . . , an , b0 , . . . , bm ).
!" #
k times
by
(a0 , . . . , an )k , b0 , . . . , bm .
Theorem 12.10 For every extended angle Φ there exists a unique normal angle
of Definition 12.9 that is proper integer congruent to Φ. (This angle is called the
normal form of Φ.)
(ii) Suppose that the LSLS sequences for the extended angles Φ1 and Φ2 are re-
spectively
(−1, 2, −1, 2)k−1 , −1, 2, −1, m, a0 , . . . , a2n and
(−1, 2, −1, 2)k−1 , −1, 2, −1, 2, a0 , . . . , a2n .
Proof We start the proof with the first statement of the lemma. Without loss of
generality we assume that the vertices of the extended angles Φ1 and Φ2 are at the
origin, say
Φ1 = ∠(O, A0 . . . A2k+n+1 ),
Φ2 = ∠(O, B0 . . . B2k+n+1 ).
Additionally we assume that A0 = B0 = (1, 0) and the points A1 and B2 are on the
lines x = 1 and (m + 1)y = x respectively. These conditions together with the LSLS
sequences uniquely identify the vertices
⎧
⎪
⎪ A2l = ((−1)l , 0), for l < k − 1,
⎪
⎨ A2l+1 = ((−1) , (−1) ),
l l for l < k − 1,
⎪
⎪ A = ((−1) k , 0),
⎪
⎩
2k
A2k+1 = ((−1)k , (−1)k a0 ),
and
⎧
⎪
⎪B2l = ((−1)l , 0), for l < k − 1,
⎪
⎨B
2l+1 = ((−1) (−m − 1), (−1) ), for l < k − 1,
l l
⎪
⎪B2k = ((−1)k , 0),
⎪
⎩
B2k+1 = ((−1)k , (−1)k a0 ).
Hence A2k = B2k and A2k+1 = B2k+1 . Notice that the remaining parts of both LSLS
sequences (i.e., (a0 , . . . , a2n )) coincide. Therefore, the point Al coincides with the
point Bl for l > 2k.
On the one hand, it is clear that the integer oriented broken line
is contractible. On the other hand, Al = Bl for l > 2k. Therefore, the angle Φ1 is
proper integer congruent to Φ2 . This concludes the proof of Lemma 12.11(i).
The proof of Lemma 12.11(ii) is similar, and we leave it as an exercise for the
reader.
Proof of Theorem 12.10 Uniqueness of normal forms. Let us first show that the
extended angles listed in Definition 12.9 are pairwise proper integer noncongruent.
160 12 Extended Integer Angles and Their Summation
Consider the revolution numbers of the extended angles listed in Definition 12.9:
Types
I IIk (k ≥ 0) IIIk (k ≥ 0) IVk (k ≥ 0) Vk (k > 0)
Therefore, the revolution numbers of extended angles distinguish the types of the
angles.
For Types I, IIk , and IIIk the proof of uniqueness is complete, since any such
type contains exactly one extended angle.
Now we prove that any two normal forms of Type IVk are not proper integer
congruent for any integer k ≥ 0. Consider the extended angle
Φ = kπ + larctan [a0 ; a1 : · · · : a2n ]
of Definition 12.9. Assume that the O-broken line for the angle Φ is A0 A1 . . . Am ,
where m = 2|k| + n + 1. Notice that the LSLS sequence for this broken line is
characteristic for Φ. We consider the two cases of odd and even k separately.
If k is even, then the integer angle ∠A0 OAm is proper integer congruent to the
integer angle larctan([a0 ; a1 : · · · : a2n ]). The integer arctangent is a proper inte-
ger affine invariant for Φ. This invariant distinguishes all the extended angles of
Type IVk with even k.
Suppose now that k is odd. Set B = O + A0 O. The integer angle ∠BVAm is
proper integer congruent to larctan([a0 ; a1 : · · · : a2n ]). The integer arctangent is a
proper integer affine invariant for the extended angle Φ, distinguishing extended
angles of Type IVk with odd k.
The proof of uniqueness of normal forms of Type Vk repeats the proof for normal
forms of Type IVk .
Therefore, the extended angles listed in Definition 12.9 are not proper integer
congruent.
Existence of normal forms. Now we prove that every extended angle is proper
integer congruent to one of the normal forms.
Consider an arbitrary extended angle Φ = ∠(V , A0 A1 . . . An ). If #(Φ) = k/2
for some integer k, then Φ is proper integer congruent to an angle of one of the
Types I–III. If #(Φ) = 1/4, then the extended angle Φ is proper integer congruent
to the extended angle defined by the sail of the integer angle ∠A0 VAn of Type IV0 .
Suppose now that #(Φ) = 1/4 + k/2 for some positive integer k. Choose the
broken line defining Φ whose LSLS sequence is of the following form:
(1, −2, 1, −2)k−1 , 1, −2, 1, m, a0 , . . . , a2n , (12.1)
where ai > 0, for i = 0, . . . , 2n. Notice that it is always possible to construct such a
broken line: first, construct the broken line for
(1, −2, 1, −2)k−1 , 1, −2, 1 .
12.1 Extension of Integer Angles. Notion of Sums of Integer Angles 161
Secondly, take the sail of the corresponding angle. The union of the broken line
and the sail is a broken line whose LSLS sequence is exactly as in (12.1). By
Lemma 12.11 the extended angle Φ defined by such an LSLS sequence is proper
integer congruent to the extended angle of Type IVk defined by the sequence
(1, −2, 1, −2)k−1 , 1, −2, 1, −2, a0 , . . . , a2n ,
where ai > 0, for i = 0, . . . , 2n. By Lemma 12.11 the extended angle defined by
this sequence is proper integer congruent to the extended angle of Type Vk defined
by the sequence
(−1, 2, −1, 2)k−1 , −1, 2, −1, 2, a0 , . . . , a2n ,
Corollary 12.12 Two extended angles are proper integer congruent if and only if
they have the same normal form.
Having the list of normal forms, we can extend the trigonometric functions to the
case of extended integer angles.
Definition 12.13 Consider an extended angle Φ with the normal form kπ + ϕ for
some integer (possible zero) angle ϕ and for an integer k.
(i) The integer angle ϕ is said to be associated with the extended angle Φ.
(ii) The numbers ltan(ϕ), lsin(ϕ), and lcos(ϕ) are called the integer tangent, the
integer sine, and the integer cosine of the extended angle Φ.
There exists a canonical embedding of the set of all integer angles into the set of
all extended angles, since every sail is an integer broken line at distance one from
the vertex of the angle.
162 12 Extended Integer Angles and Their Summation
0π + larctan(ltan ϕ)
From Theorem 12.10 it follows that for every integer angle ϕ there exists a unique
extended angle ϕ corresponding to ϕ. Therefore, two integer angles ϕ1 and ϕ2 are
proper integer congruent if and only if the corresponding extended angles ϕ 1 and
ϕ 2 are proper integer congruent.
Φ = ∠(V , A0 A1 . . . An ).
The angle
∠(V , An An−1 . . . A0 )
is said to be opposite to Φ, which we denote by −Φ.
Let us write the normal form for the opposite extended angle.
−Φ ∼
ˆ (−k − 1)π + (π − ϕ).
=
In the next definition we introduce sums of integer and extended angles. A sum of
two angles is not uniquely defined, in contrast to Euclidean geometry.
l
Φi , or equivalently by Φ1 +m1 Φ2 +m2 · · · +ml−1 Φl .
M,i=1
It is clear that M-sums of extended angles are defined in a unique way up to the
set choice of the set M.
Φ +−3 Φ = π + larctan 1,
Φ +−2 Φ = π + larctan 0,
Φ +−1 Φ = 0π + larctan 1,
Φ +0 Φ = 0π + larctan 2,
3
Φ +1 Φ = 0π + larctan .
2
Φ1 = 0π + larctan 2,
3
Φ2 = 0π + larctan ,
2
Φ3 = 0π + larctan 5.
Then
5
Φ1 = 0π + larctan 1 and Φ2 = 0π + larctan .
2
Then
12 13
Φ1 +1 Φ2 = 0π + larctan
= 0π + larctan = Φ2 + 1 Φ1 .
7 5
Remark 12.21 The definition of M-sums of extended angles is canonically lifted to
the case of classes of proper integer congruences of extended angles.
l
αi , or equivalently by α1 +m1 α2 +m2 · · · +ml−1 αl .
M,i=1
Remark 12.23 The definition of M-sums of integer angles is also canonically lifted
to the case of classes of proper integer congruences of integer angles.
Recall that r denotes the maximal integer not greater than r.
Proof Without loss of generality we assume that V is the origin O, A0 = (1, 0), and
1
A0 + sgn A0 O A1 A0 A1 = (1, 1).
a0
(One can get this after a certain integer affine transformation of the plane.)
By Theorem 3.1 the coordinates of the point An are (q, p). This directly implies
the statement of the theorem for the cases p > q > 0, p/q = 0, and p/q = ∞.
Suppose now that q > p > 0. Consider the integer angle ϕ = ∠A0 PAn . Let
B0 . . . Bm be the sail for ϕ. Direct calculation shows that the point
B0 B1
D = B0 +
l(B0 B1 )
coincides with the point (1 + (q − 1)/p, 1). Consider the proper integer linear
transformation T satisfying the following two conditions: first, T takes the point
A0 = B0 to itself, and second, T takes the point D to (1, 1) . These two conditions
uniquely identify T as
1 −(q − 1)/p
T= .
0 1
The transformation T takes the point An = Bm , with coordinates (q, p), to the point
with coordinates (q − (p − 1)/pp, p) . Therefore,
p
ϕ = larctan .
q − (q − 1)/pp
The proof for the case p > 0 and q < 0 repeats the proof described above after
switching to the corresponding adjacent angles.
Finally, the case with p < 0 is centrally symmetric with respect to all the cases
considered above. This concludes the proof of Theorem 12.24.
Theorem 6.8(i) (On sums of integer tangents of angles in integer triangles) Let
(α1 , α2 , α3 ) be an ordered triple of angles. There exists a triangle with consecutive
angles integer congruent to α1 , α2 , and α3 if and only if there exists i ∈ {1, 2, 3}
such that the angles α = αi , β = αi+1(mod 3) , γ = αi+2(mod 3) satisfy the following
conditions:
(a) for ξ = ] ltan α, −1, ltan β[ the following holds ξ < 0, or ξ > ltan α, or ξ = ∞;
(b) ] ltan α, −1, ltan β, −1, ltan γ [ = 0.
Definition 12.25 Let ABC be an integer triangle. Consider all integer points in
the interior of ABC and on its sides lying at unit integer distance from the side AB.
We denote their number by l1 (AB; C) (see Fig. 12.2).
Recall that all integer points at unit integer distance from AB described in the
definition are contained in one straight line parallel to AB. Besides that,
Lemma 12.26 For every integer triangle ABC the following holds:
Proof Consider an arbitrary integer triangle ABC. Suppose that the pair of vectors
(BA, BC) defines the positive orientation of the plane (otherwise, we consider the
triangle ACB). Set
D = A + CB and E = A + CA
BAD ∼
ˆ ABC,
=
and hence
∠BAD ∼
ˆ ∠ABC
= and l1 (BA; D) = l1 (AB; C).
Since EABD is a parallelogram, we get
AED ∼
ˆ BAD ∼
= ˆ ABC.
=
Therefore,
∠DAE ∼
ˆ ∠BCA
= and l1 (DA; E) = l1 (BC; A).
Let A0 . . . An be the sail of the angle ∠CAB with the corresponding LLS se-
quence
(a0 , . . . , a2n−2 ).
Further, let B0 B1 . . . Bm be the sail of ∠BAD (where B0 = An ) with the correspond-
ing LLS sequence
(b0 , . . . , b2m−2 ).
Finally, let C0 C1 . . . Cl be the sail of ∠DAE (where C0 = Bm ) with the correspond-
ing LLS sequence
(c0 , . . . , c2l−2 ).
Consider now the A-broken line
A0 . . . An B1 B2 . . . Bm C1 C2 . . . Cl .
where t and u are certain integers. By the definition of the M-sum of extended
angles this sequence defines the extended angle
Denote by π+ the open half-plane bounded by the straight line AC and containing
the point B. Let also π− denote the complement of π+ in the plane. Since
An,−2 = A + An A,
the integer points An,k for k less than or equal to −2 are in the closed half-plane π− .
Since
An,−1 = A + An An ,
the integer points An,k for k greater than or equal to −1 are in the open half-
plane π+ .
The intersection of the parallelogram AEDB and the open half-plane π+ contains
exactly l(AB) points of the described set: these points are the points An,k with
−1 ≤ k ≤ l(AB) − 2.
Since
BAD ∼
ˆ ABC,
=
the number of points An,k in the closed triangle BAD is l1 (AB; C): these points
are exactly the points An,k with k satisfying
l(AB) − l1 (AB; C) − 1.
The point An,k0 is contained in the segment B0 B1 of the sail of the integer angle
∠BAD (see Fig. 12.4). Since
∠BAD ∼
ˆ ∠ABC,
=
12.3 Proof of Theorem 6.8(i) 169
we have
Therefore,
α +u β +v γ = π.
Then there exists a triangle with three consecutive integer angles proper integer
congruent to α, β, and γ respectively.
Proof Let
O = (0, 0), A = (1, 0), and D = (−1, 0).
Choose the integer points
From the construction, the pair of vectors (OB, OC) defines the positive orientation,
and ∠BOC ∼ ˆ β. Since
=
α +u β +v γ = π and α +u β ∼
ˆ ∠AOC,
=
we have ∠COD ∼
ˆ γ.
=
170 12 Extended Integer Angles and Their Summation
Define
B = (q1 p2 , p1 p2 ) and C = (q2 p1 , p1 p2 )
and consider the triangle B OC . The coincidence of angles
implies
∠B OC ∼
ˆ β.
=
Since β
= 0, the points B and C are distinct and the straight line B C does not
coincide with the straight line OA. Since the second coordinates of both points B
and C equal q1 q2 , the straight line B C is parallel to the straight line OA. Therefore,
by Proposition 5.16 we have
∠C B O ∼
= ∠AOB = ∠AOB ∼
=α = ∠C OD = ∠COD ∼
and ∠OC B ∼ = γ.
So, the consecutive integer angles of the triangle B OC are integer congruent
to α, β, and γ .
Let α, β, and γ be nonzero integer angles satisfying the conditions (a) and (b) of
Theorem 6.8(i). From the second condition
ltan(α), −1, ltan(β), −1, ltan(γ ) = 0,
it follows that
α +−1 β +−1 γ = kπ.
Since all three tangents are positive, we have either k = 1 or k = 2.
By the first condition,
Hence
α +−1 β = 0π + ϕ,
for some integer angle ϕ, and hence k ≤ 1. Together with the above, this implies
that k = 1. Therefore, by Lemma 12.27 there exists a triangle with three consecutive
integer angles integer congruent to α, β, and γ , respectively.
Let us prove the converse statement. First we prove condition (b) of Theo-
rem 6.8(i). We do it by reductio ad absurdum. Suppose that there exists a triangle
ABC with consecutive integer angles
such that
⎧
⎪
⎨] ltan α, −1, ltan β, −1, ltan γ [
= 0,
] ltan β, −1, ltan γ , −1, ltan α[
= 0,
⎪
⎩
] ltan γ , −1, ltan α, −1, ltan β[
= 0.
This system of inequalities and Lemma 12.26 imply that at least two of the integers
are nonnegative.
Without loss of generality we suppose that
l(AB) − l1 (AB; C) − 1 ≥ 0,
l(BC) − l1 (BC; A) − 1 ≥ 0.
are nonnegative and the last one is positive, either r > 0 or r = ∞. On the other
hand, by Lemma 12.26 and by Theorem 12.24, we have that r = 0/−1 = 0. We
arrive at a contradiction. Hence the second condition always holds.
Now we prove that condition (a) holds. Consider a triangle ABC with consec-
utive integer angles
We have proven the second condition, so without loss of generality we assume that
ltan(α), −1, ltan(β), −1, ltan(γ ) = 0.
Since
α +−1 β +−1 γ = π,
we have
α +−1 β = 0π + ϕ
for some integer angle ϕ. The last expression for ϕ implies the first condition of the
theorem.
This concludes the proof of Theorem 6.8(i).
172 12 Extended Integer Angles and Their Summation
12.4 Exercises
Exercise 12.1 Find the revolution number of the O-broken line with vertices
#(r, A0 A1 . . . An ) = 0.
Exercise 12.4
(a) Prove that every extended angle possesses a unique opposite extended angle.
(b) Prove that the M-sum is uniquely defined by the ordered sequence of extended
angles and the set of integers M.
Exercise 12.5 Calculate the normal forms for the following angles
5
(2π + larctan 1) +−1 3π + larctan ;
2
4 7
4π + larctan +0 3π + larctan ;
3 3
5
(−5π + larctan 119) +2 3π + larctan .
3
Exercise 12.6 Find an example of a triple of integer angles that are not the angles
of an integer triangle.
Exercise 12.7 Is it true that for an arbitrary pair of integer angles (α, β) there exists
an integer angle γ such that these three angles are angles of some integer triangle?
Chapter 13
Integer Angles of Polygons and Global Relations
for Toric Singularities
In Chap. 6 we proved a necessary and sufficient criterion for a triple of integer an-
gles to be the angles of some integer triangle. In this chapter we prove the analogous
statement for the integer angles of convex polygons. Further, we discuss an applica-
tion of these two statements to the theory of complex projective toric surfaces. We
refer the reader to the general theory of toric surfaces in the works of V.I. Danilov
[43], G. Ewald [55], W. Fulton [59], T. Oda [149], and A. Trevisan [199]. In this
book we do not consider the multidimensional case (we refer the interested reader
to the paper of H. Tsuchihashi [200]).
In Sect. 13.1 we formulate and prove a theorem on integer angles of convex
polygons. After a brief introduction of the main notions and definitions of complex
projective toric surfaces (Sect. 13.2) we discuss two problems related to singular
points of toric varieties using integer geometry techniques.
We start with a theorem on necessary and sufficient conditions for integer angles to
be the angles of some convex integer polygon.
Proof Condition (i) ⇒ Condition (ii). Suppose that there exists an integer convex
polygon A1 A2 . . . An with prescribed consecutive integer angles α1 , . . . , αn . We as-
sume that the pair of vectors (A2 A3 , A2 A1 ) defines the positive orientation of the
Define
∠Bi OBi+1 , if i = 1, . . . , n − 1,
βi =
∠Bn OB1 , if i = n.
Consider the broken line that is the union of all the consecutive sails for angles
β1 , . . . , βn . This broken line defines an extended angle proper integer congruent
to 2π + 0. The LSLS sequence for this broken line contains exactly n − 1 new
elements (together with all the elements of the LLS sequences for the sails of the
angles β1 , . . . , βn ). Denoting these numbers by m1 , . . . , mn−1 , we get
∠Bi OBi−1 ∼
ˆ π − αi−1 ,
= for i = 2, . . . , n, and ∠B1 OBn ∼
ˆ π − αn .
=
Denote by P the integer point
n
O+ OBi .
i=1
Since αi
= 0, the angle π − αi is not contained in a line (i = 1, . . . , n). Hence,
the origin O is an interior point of the convex hull of the points Bi for i = 1, . . . , k.
Therefore, there exists an integer s such that the integer triangle Bs PBs+1 contains
the origin O and in addition, O is not in the edge Bs Bs+1 (here Bn+1 = B1 and
B0 = Bn ). This implies that
O = λ1 OP + λ2 OBi + λ3 OBi+1 ,
Fig. 13.1 Two integer noncongruent polygons with proper integer congruent integer angles
Remark 13.2 On the one hand, Theorem 13.1 is an extension of Theorem 6.8(i).
On the other hand, the direct generalization of Theorem 6.8(ii) is false: the integer
angles do not uniquely determine the proper integer affine homothety types of integer
convex polygons. See an example in Fig. 13.1.
We start with the definition of complex projective toric surfaces associated to integer
convex polygons.
The closure of the set Ω in the natural topology of CP m is called the complex
projective toric surface associated with the polygon P and denoted by XP .
P= and Q =
176 13 Integer Angles of Polygons and Global Relations for Toric Singularities
x1 x3 = x0 x4 .
Notice that the integer area coincides with the degree of the resulting surface. This
is a general fact of toric geometry.
Remark 13.5 One can generalize the described construction to the case of polyhedra
of arbitrary dimensions. Notice that in the multidimensional case this construction
may have nonisolated singular points. In this situation there is a standard procedure
to resolve them (via certain ς -processes). The resulting variety is the toric variety
corresponding to the polyhedron.
Remark 13.6 One of the main properties of toric surfaces (and toric varieties in
general) is as follows. Every toric surface admits a huge group of isomorphisms.
While here we discuss only toric surfaces, similar statements hold for toric varieties
of arbitrary dimension. Consider a toric surface XP , parameterized as XP (t0 , t1 , t2 ).
Consider the multiplicative group of complex numbers C∗ = C \ {0}. Every toric
surface XP admits a natural action of the group C∗ × C∗ . An arbitrary element
(a, b) of the group C∗ × C∗ acts on the toric surface XP as follows:
f(a,b) : XP → XP , f(a,b) XP (t0 , t1 , t2 ) = XP (at0 , bt1 , t2 ).
Ãi = (0 : · · · : 0 : 1 : 0 : · · · : 0),
Theorem 13.8
(i) The set XP is a complex projective complex-two-dimensional surface with iso-
lated algebraic singularities;
(ii) the complex projective toric surface contains the points Ãi for i = 0, . . . , n
(where n + 1 is the number of vertices of the convex polygon);
13.2 Toric Surfaces and Their Singularities 177
P=
In the appropriate affine charts these three singularities are defined by the equation
xy = z3 .
For the proof of Theorem 13.8 we refer to classical textbooks on toric geome-
try ([43, 55, 59, 149], etc.). Nevertheless, we would like to outline some ideas and
remarks related to the proof. The first item is a classical statement of algebraic ge-
ometry.
Outline of the proof of item (ii) Consider an arbitrary vertex Ai of the polygon P .
Let li be some strictly supporting line for the polygon at Ai (i.e., li ∩ P = {Ai }).
Consider a vector v orthogonal to li and directed away from the polygon. Assume
that v = (λ1 , λ2 ). Consider a curve in RP m ⊂ CP m with real parameter t:
λ x +λ y λ x +λ y
t 1 0 2 0 : t 1 1 2 1 : · · · : t λ1 xm +λ2 ym |t ∈ R .
Outline of the proof of item (iii) To detect all singular points of XP we use the action
of the group C∗ × C∗ introduced in Remark 13.6. Notice that the group takes sin-
gular points to singular points and nonsingular points to nonsingular points. There
are two-dimensional, one-dimensional, and zero-dimensional orbits of the action of
178 13 Integer Angles of Polygons and Global Relations for Toric Singularities
C∗ × C∗ on XP . Since all the singular points of XP are isolated, they are all con-
tained in the zero-dimensional orbits, i.e., at points where C∗ × C∗ acts trivially. It
can be only the points with a single nonzero coordinate
(0 : · · · : 0 : 1 : 0 : · · · : 0).
The toric surface XP contains only singular points with ones at places 0, . . . , n. That
is exactly the points Ã0 , . . . , Ãn .
Remarks on items (iv) and (v) As stated in Theorem 13.8(v), singularities of toric
surfaces are in one-to-one correspondence with integer affine types of nonoriented
sails of integer angles. Therefore, toric singularities are classified by the LLS se-
quences of corresponding integer angles up to switching the order of the sequence.
We call the corresponding LLS sequence the continued fractions of the toric singu-
larity. Therefore, the pair of rational numbers
ltan α, ltan α t
We begin this subsection with triangles. The corresponding toric surfaces have at
most three toric singularities. The relations on these singularities are described by a
reformulation of Theorem 6.8(i).
13.3 Relations on Toric Singularities of Surfaces 179
Remark Let us say one more time that one should use only odd continued fractions
for c1 , c2 , and c3 in the statement of the above proposition.
When n > 3 we have the following weaker statement. It follows directly from
Theorem 13.1.
Notice that all integers of M are greater than −2. In addition, at least one of the
elements of M is equal to −1.
Finally, we discuss the realizability of toric surfaces with a prescribed set of toric
singularities.
Φ = α1 +1 α2 +1 · · · +1 αn .
It is clear that one of the LSLS sequences for Φ is obtained by adding the LLS
sequences of the angles αi and the number 1 taken n − 1 times. All the elements
of this LSLS sequence are positive integers, and hence Φ is of the form ϕ + 0π for
some integer angle ϕ.
If ϕ ∼
= larctan 1, then we have
Consider the sail for the angle π −ϕ. Suppose that the sequence of all its consecutive
integer points (not only vertices) is (B0 , . . . , Bs ), where the order coincides with the
order of the sail. Then we have
∠Bi OBi+1 ∼
ˆ larctan 1
= for every i = 1, . . . , s.
Therefore, k = n + s + 3, and
Example 13.14 Let us construct a projective toric surface having a unique toric sin-
gularity with the sail pair (7/5, 7/3). Consider α ∼
=ˆ larctan(7/5). First of all, we
draw the angle π − α and its adjacent angle π − (π − α) (which is α). Further,
we subdivide the half-plane in the complement to the union of α and π − α in a
standard way into two angles β and γ (such that the resulting LSLS sequence ends
in (−2, 1, −2, 1)).
Finally, we construct the hexagon P all of whose angles are adjacent to the angles
of the obtained decomposition.
The toric surface XP has a unique singular point; its sail pair is (7/5, 7/3).
182 13 Integer Angles of Polygons and Global Relations for Toric Singularities
13.4 Exercises
Exercise 13.2 Write the toric surfaces for the following polygons
P= , Q= , and R = .
Enumerate all their singularities and find the sail pairs of the corresponding singu-
larities.
Exercise 13.3 Find a toric surface with a unique toric singularity whose sail pair is
(5/3, 5/2).
Exercise 13.4 Prove that the sequence M in Corollary 13.11 has at least one nega-
tive integer. Show that all negative numbers are equal to −1.
We say that a vector (point) is integer if all its coordinates are integers. A plane in
Rn is called integer if its integer vectors form a sublattice whose rank equals the
dimension of the plane. We say that a polyhedron is integer if all its vertices are
integer. A cone in Rn is said to be integer if its vertex is integer and each of its
edges contains integer points distinct from the vertex.
The integer congruence relation is defined as in the two-dimensional case by the
group of affine transformations preserving the set of integer points, i.e., Aff(n, Z).
This group is a semidirect product of GL(n, Z) and the group of translations on
integer vectors. We use “∼=” to indicate that two objects are integer congruent.
As in the planar case described in Chap. 2, the integer invariants in the multidimen-
sional case are easily defined in terms of indices of certain sublattices. We first give
a geometric description of the index of sublattices in terms of the number of integer
points in certain parallelepipeds.
Proof The proof of this statement is similar to the proof of the planar one. Let H be
a subgroup of Z2 generated by v1 , . . . , vk . Define
n $
Par(v1 , . . . , vk ) = λi vi 0 ≤ λi < 1, i = 1, . . . , n .
i=1
First, we prove that for every integer vector g there exists an integer point P
lying in the set Par(v1 , . . . , vk ) such that AP ∈ gH . Let
n
g= μi v i .
i=1
Then consider
n
P =A+ μi − μi vi .
i=1
αv + βw with 0 ≤ α, β < 1
Let us start with the notion of integer volume for integer simplices. We will define
the integer volumes for polyhedra in the next subsection.
We postpone the calculation of the volume of simplices for a while (see Exam-
ple 14.32 below).
First, we give the definition of an integer angle for two linear subspaces.
Definition 14.4 Consider two integer linear spaces L1 and L2 that are not con-
tained one in another. The integer angle between L1 and L2 is the index of the
sublattice generated by all integer vectors of L1 ∪ L2 in the integer lattice of the
space Span(L1 , L2 ). Denote it by lα(L1 , L2 ).
In case one space is a subspace of the other we agree to say that the integer angle
between them is zero.
In order to calculate integer angles via integer volumes one can use the following
proposition.
Proposition 14.5 Consider two integer linear spaces L1 and L2 that are not con-
tained one in another. Let the sets of independent integer vectors
lV(u1 , . . . , uk , v1 , . . . , vl , w1 , . . . , wm ) lV(w1 , . . . , wm )
lα(L1 , L2 ) = .
lV(u1 , . . . , uk , w1 , . . . , wm ) lV(v1 , . . . , vl , w1 , . . . , wm )
188 14 Basic Notions and Definitions of Multidimensional Integer Geometry
Proof First, let us change the basis (wi ) of L1 ∩ L2 to the basis (w i ) that generates
the integer sublattice in L1 ∩ L2 . The value of the formula stays unchanged, since
the numerator and the denominator are both divided by lV2 (w1 , . . . , wm ).
Second, we extend the basis (wi ) with integer vectors ui to the basis of (ui , wi )
that generates the integer lattice of L1 . The value of the formula stays unchanged,
since the numerator and the denominator are both divided by lV(u1 , . . . , uk , w 1 ,
. . . , w m ).
Third, we extend the basis (w i ) with integer vectors v i to the basis of (v i , wi )
that generates the integer lattice of L2 . For similar reasons the value of the formula
stays unchanged.
From the definition we have
⎧
⎪
⎪lV(u1 , . . . , uk , v 1 , . . . , v k , w1 , . . . , wm ) = lα(L1 , L2 ),
⎪
⎨lV(u , . . . , u , w , . . . , w ) = 1,
1 k 1 m
⎪
⎪lV(v , . . . , v , w , . . . , w ) = 1,
⎪
⎩
1 k 1 m
lV(w 1 , . . . , wm ) = 1.
For these vectors the statement of the proposition is clearly holds. This concludes
the proof.
Finally, we give a definition of the integer angle between arbitrary (affine) planes.
Definition 14.6 Consider two integer planes π1 and π2 . Let L1 and L2 be the spaces
of vectors corresponding to π1 and π2 . The integer angle between π1 and π2 is the
index lα(L1 , L2 ). We denote it by lα(π1 , π2 ).
Definition 14.7 Consider two disjoint integer planes π1 and π2 . Let L1 and L2 be
the spaces of vectors corresponding to π1 and π2 , and let a = A1 A2 be a vector such
that A1 ∈ π1 and A2 ∈ π2 . The integer distance between π1 and π2 is the index of
the sublattice generated by all integer vectors of L1 ∪ L2 ∪ Span(a) in the integer
lattice of the space Span(L1 , L2 , a). Denote it by ld(π1 , π2 ).
In the case of an intersecting subspace of another we agree to say that the integer
distance between them is zero.
Remark 14.8 One of the simplest useful invariants in three-dimensional integer ge-
ometry is the distance between an integer point p and an integer plane π , i.e., the
index of the lattice generated by the integer vectors joining p and all integer points
of π in the whole integer lattice of Z3 . Integer distance has a nice geometric de-
scription. Draw all integer planes parallel to π . One of them contains the point p
(let us call it πp ). The integer distance ld(p, π) is the number of integer planes in
14.2 Integer and Euclidean Volumes of Basis Simplices 189
the region bounded by the lines πp and π plus one. Let us illustrate by the following
example:
There are three integer planes parallel to π and lying between πp and π . Hence,
ld(p, π) = 3 + 1 = 4.
Proposition 14.9 Consider two disjoint integer planes π1 and π2 . Let L1 and L2
be the spaces of vectors corresponding to π1 and π2 , and let a be a vector with
one integer endpoint in π1 and one integer endpoint in π2 . Suppose that the sets of
independent integer vectors
Definition 14.10 An integer polyhedron is called empty if it does not contain any
integer points other than its vertices.
190 14 Basic Notions and Definitions of Multidimensional Integer Geometry
Remark 14.12 In Chap. 2 we showed that every empty triangle has integer area 1
(and Euclidean area 1/2). This is no longer true in the multidimensional case; for
instance, see Example 14.33. We describe the situation in the multidimensional case
in the next chapter.
n
ej = λij Ai A0 ,
j =1
with integers λij for 1 ≤ i, j ≤ n. Denote by M1 the matrix where at the j th place
of the ith column is λij . Let also M2 be the matrix whose ith column is filled with
the coordinates of the vector A0 Ai for i = 1, . . . , n. Then
M1 · M2 = Id,
where Id is the identity matrix. Since these matrices are integer, both their determi-
nants equal either 1 or −1. Hence the Euclidean volume of the simplex A0 A1 . . . An
1
coincides with the Euclidean volume of the coordinate simplex and equals n! .
(c) ⇒ (a). Consider an integer simplex A0 A1 . . . An of Euclidean volume n! . Sup-
1
Proposition 14.13 For an arbitrary simplex S of full dimension we have the follow-
ing formula:
lV(S) = V P (S) .
Proof Let us prove this statement by induction on the integer volume of simplices.
Base of induction. The statement for simplices of integer volume 1 follows di-
rectly from Proposition 14.11.
Step of induction. Let the statement hold for all simplices of integer volume less
then N (N > 1). Let us prove it for an arbitrary simplex S = v0 v1 . . . vn of lattice
volume N .
Consider a polyhedron P (S). By Proposition 14.11, since lV(S) > 1, there is
an integer point in P (S) distinct from the vertices. Without loss of generality we
suppose that this point is not on any facet parallel to v1 , . . . , vn . Hence
d(v0 , v1 . . . vn ) > 1.
Let us say a few words about the determinant being well defined. Recall that L
can be mapped by a linear (not necessarily integer) mapping to Zk ⊂ Rk . In Propo-
sition 14.11 we showed that all simplices whose edges generate Zk have the same
Euclidean volume. Hence all simplices whose edges generate L also have the same
Euclidean volume.
In the following theorem we establish a relation between the integer volume of a
simplex S and the Euclidean volume of the corresponding parallelepiped P (S).
Theorem 14.15 Let S be a k-dimensional integer simplex in Rn and let L(S) denote
the integer lattice of the k-dimensional integer plane containing S. Then we have
V (P (S))
lV(S) = .
det(L(S))
Using the result of Theorem 14.15 we extend the definition of integer area to general
polyhedra.
As in the two-dimensional case, we have the additivity property for lattice area.
k
lV(P ) = lV(Pk ).
i=1
Theorem 14.18 For every convex integer polyhedron P (not contained in a hyper-
plane) there exists a decomposition into integer empty simplices.
Proof Let us give a sketch of the proof. The proof is based on induction on the
number of integer points in P ∈ Rn .
Base of induction. If P has only n + 1 integer points inside, then it is an empty
simplex.
Step of induction. Suppose now that the statement holds for every k < m. Let us
prove it for m. Consider an integer polyhedron P . Suppose that it contain points
p1 , . . . , pm . Let f1 , . . . , fs be all the (n − 2)-dimensional faces of P . We fix one
integer point of P , say some pi , and consider all pyramids with vertex at this point
and bases in faces f1 , . . . , fs . These pyramids subdivide P into integer polyhedra.
If P contains at least n + 2 points, then it is possible to choose pi such that this
subdivision contains at least two smaller polyhedra (each of them contains a smaller
number of integer points). By induction all the smaller polyhedra are decomposable;
hence P is decomposable itself.
Theorem 14.19 For every convex integer polyhedron P in R3 there exists a decom-
position of 4P into integer tetrahedra congruent to the basis tetrahedron.
194 14 Basic Notions and Definitions of Multidimensional Integer Geometry
(Recall that nP is the polyhedron whose vertices have all the coordinates n-times
greater than the coordinates of P .)
Idea of the proof By Theorem 14.18 it is enough to find a proof for the list of empty
tetrahedra, which is known due to White’s theorem (see Corollary 15.3 below). It
turns out that for every empty tetrahedron T ∈ R3 , the tetrahedron 4T admits a
decomposition into integer tetrahedra congruent to the basis tetrahedron. We skip
here the technical details related to explicit constructions of these decompositions.
The Grassmann algebra (or exterior algebra) of the vector space Rn is the alge-
bra over R with multiplication (usually denoted by ∧), generated by elements 1,
e1 , . . . , en , where e1 , . . . , en is the basis of Rn . The relations defining the algebra
are
ei ∧ ej = −ej ∧ ei ;
e i ∧ 1 = 1 ∧ ei = e i ;
1 ∧ 1 = 1.
This algebra is denoted by Λ(Rn ), with the operation ∧ called the wedge product.
Let r be a positive integer. Denote by Λk (Rn ) the vector space generated by the
elements ei1 ∧ · · · ∧ eik , where i1 , . . . , ik ∈ {1, . . . , n}. In addition we put Λ0 Rn = R.
The space Λk (Rn ) is called the kth exterior power of R and its elements are called
k-forms on R.
Proposition 14.20
(i) The dimension of the kth exterior power of Rn is as follows:
n n
, if k ≤ n,
dim Λ R = k
k
0, if k > n.
(ii) Any element of the exterior algebra can be written as a sum of k-forms, so we
have
Λ Rn = Λ 0 R n ⊕ Λ 1 R n ⊕ · · · ⊕ Λ n R n .
14.4 Lattice Plücker Coordinates and Calculation of Integer Volumes 195
Let us define a natural basis of the space Λ(Rn ) for an arbitrary k ≤ n. Let I and
J be two strictly increasing sequences of k elements of the set {1, . . . , n}. We say
that I > J if there exists s ≤ k such that it = jt for t < s and is > js . We call this
ordering lexicographic.
Example 14.22 The dimension of the space Λ2 (R4 ) is 6. The lexicographic basis
of Λ2 (R4 ) is as follows:
(e1 ∧ e2 , e1 ∧ e3 , e1 ∧ e4 , e2 ∧ e3 , e2 ∧ e4 , e3 ∧ e4 ).
The set of all k-dimensional subspaces of Rn is called the linear Grassmannian, and
denoted by Gr(k, Rn ). Denote by P (Λk (Rn )) the projectivization of the rth exte-
rior power Λk (Rn ), i.e., we consider all forms up to a multiplication by a nonzero
constant.
1 1 1 1 1 1
= det e ∧ e2 + det e ∧ e3 + det e ∧ e3 .
1 2 1 1 5 1 2 5 2
1 1 1 1 1 1
det : det : det = (1 : 4 : 3).
1 2 1 5 2 5
Remark 14.27 It is clear that the lattice Plücker embedding is well defined (since
different generators of the lattice define the same k-form up to sign, which is also
preserved for generators of positive orientation).
N
v1 ∧ · · · ∧ vk = λi ω i .
i=1
Remark Notice that the lattice Plücker coordinates of lattices are affine and the
arbitrary Plücker coordinates of subspaces are projective.
If L is a sublattice of the integer lattice Zn , then all its lattice Plücker coordinates
are integers. Let us now show how to write the integer volume of integer simplices
in terms of lattice Plücker coordinates.
Theorem 14.30 Let s1 s2 . . . sk+1 be a simplex in Rn and let (p1 , . . . , pN ) be the lat-
tice Plücker coordinates for the lattice generated by vectors si sk+1 for i = 1, . . . , k.
Then
lV(s1 s2 . . . sk+1 ) = gcd(p1 , . . . , pN ).
Proof Let
N
v1 ∧ · · · ∧ vk = pi ω i
i=1
(as usual, (ω1 , . . . , ωN ) is the lexicographic basis associated to a chosen basis
(e1 , . . . , en )). Consider an arbitrary A ∈ GL(n, Z). We have
N
N
A(v1 ∧ · · · ∧ vk ) = A pi ω i = pi A(ωi ).
i=1 i=1
Since all k-forms A(ωi ) have integer coefficients and all the numbers pi are divis-
ible by gcd(p1 , . . . , pN ), the greatest common divisor for the lattice Plücker coor-
198 14 Basic Notions and Definitions of Multidimensional Integer Geometry
gcd(q1 , . . . , qN ) = gcd(p1 , . . . , pN ).
Proof of Theorem 14.30 The statement holds for all lattices in the plane spanned by
e1 , . . . , ek . In this case the first lattice Plücker coordinate is the Euclidean volume
of the parallelogram P (s1 , s2 , . . . , sk+1 ), which coincides with the integer volume
of the simplex s1 s2 . . . sk+1 (see Corollary 14.15). All the other lattice Plücker coor-
dinates are zero.
An arbitrary k-dimensional plane can be taken to the coordinate k-plane by some
GL(n, Z) transformation. By Lemma 14.31, the gcd(p1 , . . . , pN ) is a GL(n, Z) in-
variant. So in this case it is also equal to the integer volume of the simplex.
The lattice Plücker coordinates are (0, 12, 6, 18), and their greater common divisor
is 6. Hence the integer volume of S equals 6.
Example 14.33 Consider the following two tetrahedra (see in Fig. 14.1):
T1 = conv (0, 0, 0), (1, 0, 0), (0, 1, 0), (0, 0, 1) ;
T2 = conv (0, 0, 0), (1, 0, 0), (0, 1, 0), (1, 1, 2) .
14.5 Ehrhart Polynomials as Generalized Pick’s Formula 199
These tetrahedra are both empty, but they have distinct areas:
The emptiness of the tetrahedra means that they have only four integer points as ver-
tices, and hence they have the same number of integer points in all the corresponding
faces.
The following estimate relates the number of integer vertices in the polyhedron
and its Euclidean volume.
Theorem 14.34 (S.V. Konyagin, K.A. Sevastyanov [115]) The number of ver-
tices of an n-dimensional convex integer polyhedron P is bounded from above by
c(n)V (P )(n−1)/(n+1) , where c(n) is a constant not depending on P .
So the generalization should have a different nature. Let us say a few words on
Ehrhart polynomials, which to some extent generalize Pick’s formula.
So for the integer variable t there exist rational numbers a0 , . . . , ad such that
a0 = 1;
1
ad−1 = lV(F );
2 · (d − 1)!
F
lV(P )
ad = .
d!
The sum in the formula for ad−1 is taken over all faces of P of codimension 1.
The other coefficients of the Ehrhart polynomial do not have a simple combina-
torial interpretation now. We refer the interested reader to a recent book by M. Beck
and S. Robins [18].
Remark 14.38 Let us study the Ehrhart polynomials for integer polygons in the
plane. Consider a polygon P , let I be the number of integer points in the interior of
P , and let E be the number of integer points in the union of edges of P . Then we
have
L(P , t) = A(P ) · t 2 + E/2 · t + 1.
By Definition 14.35 we have
L(P , 1) = # P ∩ Z2 = I + E.
A(P ) = I + E/2 − 1.
Example 14.39 Let us write the Ehrhart polynomials for the tetrahedra T1 and T2 in
Example 14.33:
1 11
L(T1 , t) = x 3 + x 2 + x + 1;
6 6
1 3 5
L(T2 , t) = x + x 2 + x + 1.
3 3
14.6 Exercises
Exercise 14.1 Calculate the integer distance and the integer angle between the fol-
lowing two planes in R4 :
x = 0, x + y = 0,
and
y = 0, z + t = 4.
14.6 Exercises 201
Exercise 14.4 Prove that any two integer k-dimensional planes in Rn are integer
congruent.
Exercise 14.5 Find the Plücker coordinates of the sublattice generated by the fol-
lowing vectors:
Exercise 14.6 Find and prove the expression for the leading coefficient of the
Ehrhart polynomial.
Recall that an integer polyhedron is called empty if it does not contain integer points
other than its vertices. In this chapter we give the classification of empty tetrahedra
and the classification of pyramids whose integer points are contained in the base
of pyramids in R3 . Later in the book we essentially use the classification of the
mentioned pyramids for studying faces of multidimensional continued fractions.
In particular, the describing of such pyramids simplifies the deductive algorithm
of Chap. 20 in the three-dimensional case. We continue with two open problems
related to empty objects in lattices. The first one is a problem of description of empty
simplices in dimensions greater than 3. The second is the lonely runner conjecture.
We conclude this chapter with a proof of a theorem on the classification of empty
tetrahedra.
Theorem 15.1 (White’s theorem [207]) Let ADBA be an empty tetrahedron. Then
all integer points in P (ADBA ) except the vertices are contained in one of the planes
passing through centrally symmetric edges distinct from the edges of the tetrahe-
dron.
Remark 15.2 For every parallelepiped there exist exactly three planes as in the the-
orem. See an example with four inner integer points in Fig. 15.1.
Fig. 15.1 If ADBA is empty, then all integer points are in one of the dark gray three planes. We
give an example with four points inside
Corollary 15.3
(i) Classification of marked empty tetrahedra. An integer empty tetrahedron with
a marked vertex is integer congruent to exactly one of the following marked
tetrahedra:
– a tetrahedron with vertices (0, 0, 0), (1, 0, 0), (1, 0, 1), and (1, 1, 0);
ξ
– T1,r of the list “T-W,” where r ≥ 2, 0 < ξ ≤ r/2, and gcd(ξ, r) = 1 (see
Fig. 15.2).
The point (0, 0, 0) is a marked vertex for all the tetrahedra in the list.
(ii) Classification of empty tetrahedra. Any empty tetrahedron is integer congruent
ξ ν (without marked
to an empty marked tetrahedron. The tetrahedra T1,r1 and T1,r 2
vertex) are integer congruent if and only if r1 = r2 and for r1 > 1, at least one
of the following four equations holds:
Remark 15.4 Classifications of empty tetrahedra and empty marked tetrahedra co-
incide only for r = 1, 2, 3, 4, 5, 6, 8, 10, 12, 24. (This is the case when the square
of every element of the multiplicative group (Z/mZ)∗ equals ±1. Hence all the
summands in the multiplicative group are either Z/4Z or Z/2Z. There are 15 such
groups. Only nine of them satisfy the condition. In addition we have the case r = 1.)
We give all the proofs for the statements of this section in Sect. 15.4.
We say that a pyramid is marked if the vertex of the pyramid is specified. (So every
tetrahedron corresponds to four marked pyramids, while all the other pyramids are
in one-to-one correspondence with marked pyramids.) A pyramid in R3 is called
15.2 Classification of Completely Empty Lattice Pyramids 205
Fig. 15.2 The T-W list of empty pyramids. The origin (0, 0, 0) is the marked vertex for all the
pyramids
integer if its vertex is an integer point and its base is an integer polygon. An integer
pyramid is called completely empty if every integer point inside the pyramid is either
contained in the base or coincides with the vertex of the pyramid.
An integer pyramid is called one-story (multistory) if the integer distance from
the vertex to the base is 1 (≥ 1). One-story integer empty pyramids can have an arbi-
trary polygon as a base, since there is no geometric obstacle to constructing arbitrary
pyramids. The multistory case is more interesting, as described in the following the-
orem.
We skip the complete technical proof Theorem 15.5 (for a complete proof we
refer to [91]).
206 15 On Empty Simplices, Pyramids, Parallelepipeds
From Proposition 2.11 it follows that all empty triangles are integer congru-
ent to the coordinate triangle (n = 2). In Corollary 15.3 all distinct empty three-
dimensional tetrahedra are listed (n = 3). For the rest of the cases (n ≥ 4), the prob-
lem is open. In the case n = 4, we would like to mention one interesting recent
result. To formulate it we introduce a notion of the width of integer polyhedra.
are respectively 1, 2, and 3. These triangles have minimal possible integer area
among the triangles of the given width: the areas are 1, 3, and 7 respectively.
The last example means that every empty tetrahedron is contained between two
neighboring parallel integer planes (see Fig. 15.3).
Almost all four-dimensional empty integer simplices have the following property.
15.3 Two Open Problems Related to the Notion of Emptiness 207
Theorem 15.9 (M. Barile et al. [16]) Up to possibly a finite number of exceptions,
every empty simplex in dimension 4 has width 1 or 2.
To conclude this subsection we observe once more that the complete classifi-
cation of empty simplices in R4 is still not known, while the situation in higher
dimensions is entirely open.
In this subsection we discuss the lonely runner conjecture, which is also known as
view-obstruction problem for n-dimensional cubes. We start with a general formu-
lation of the conjecture.
This conjecture was introduced by J.M. Wills [208] in 1967. Later it was consid-
ered by T.W. Cusick (in [37, 39], and [40]) as the geometric view-obstacle problem.
The cases k = 2, 3 are easy exercises. The conjecture holds for k = 4 (U. Betke and
J.M. Wills [19], T.W. Cusick [39]), k = 5 (T.W. Cusick and C. Pomerance [42], see
also in [20]), k = 6 (T. Bohman et al. [21]; see also in [20]), and k = 7 (J. Barajas
and O. Serra [13] and [14]). For k ≥ 8 it is not known whether this conjecture is
true.
Let us describe the question in geometric terms. Without loss of generality we
study the conjecture only for the first runner. We say that the starting point is the
zero point of a circle. It is usual to put the speed of the first runner equal to zero.
We associate each runner that travels with nonzero speed with a point on a circle.
The configuration space of all (k − 1)-tuples on the circle is the (k − 1)-dimensional
torus T k−1 (which is the direct product of k − 1 copies of circles S 1 = R/Z). We
mark the point (0, . . . , 0) on this torus as the first runner (who is actually not running
but standing). While the time goes from zero to infinity, the configuration of (k − 1)-
runners travels linearly in the torus T k−1 . The question is whether this trajectory is
always far from the middle point of the torus (1/2, 1/2, . . . , 1/2).
To simplify the picture it is good to consider the universal covering space of
T k−1 , which is Rk−1 . Here the standing runner is associated with an integer lattice
Zk−1 . Now we reformulate the lonely runner conjecture as a view-obstacle problem.
coordinate axes. Suppose the hunter is not willing to shoot in the directions having
at least one zero coordinate. Then he will shoot into some tree. It is supposed that
the bullet has zero size.
Finally, we give a discrete lattice version of the lonely runner conjecture.
Remark 15.10 Let us say a few words about the inverse question of meeting of all
the runners. Suppose k runners having distinct constant speeds start at a common
point and run laps on a circular track. Then for every ε > 0 and T > 0, there is a
time t > T at which all the runners are at arc distance at most ε away from each
other. We leave this as an exercise to the reader.
15.4.1 IDC-System
Definition 15.11 Let A, B, C, and D be integer points that span the entire space
R3 (the collection of points is ordered). Let us construct a system of coordinates
related to these points. Let b, c, and d be the integer distances from the points B, C,
and D to the planes of the faces ACD, ABD, and ACD, respectively. Consider the
point A as the origin. Set the coordinates of B, C, and D equal to (b, 0, 0), (0, c, 0),
and (0, 0, d) respectively. The coordinates of the other points in R3 are defined by
linearity. We say that this system of coordinates is an integer-distance coordinate
system with respect to ABCD or (IDC-system for short).
integer point in the old system is an IDC-node. The converse is true if and only if
the vectors AB, AC, and AD generate the integer lattice. The coordinates of integer
IDC-nodes coincide with the integer distances to the coordinate planes in the IDC-
system.
If a and b are two integer vectors, denote by La,b the integer sublattice generated by
a and b. Consider the set of all cosets in Z3 /La,b . Every left coset of this quotient
group is contained in a plane parallel to the plane spanned by the vectors a and b. In
addition, every such plane contains only finitely many left cosets, their number being
equal to the index of the sublattice La,b in the integer sublattice in the subspace
Span(a, b).
We leave the proof of this lemma as a simple exercise for the reader.
Corollary 15.13 All integer distances from the vertices of an integer (three-
dimensional) tetrahedron with empty faces to the opposite faces are equal.
Proof Consider an integer tetrahedron OABC with empty faces. Suppose that
ld(A, OBC) = n.
of the parallelepiped are empty. Consider all the integer planes parallel to OBC
that divide the parallelepiped into two nonempty parts. Since ld(A, OBC) = n, the
number of these planes equals n − 1.
By Lemma 15.12 every such integer plane contains exactly one left coset of the
lattice Z3 /LOB,OC . Since the faces of the parallelepiped are empty, the intersec-
tion of this class with a parallelepiped is one point in the interior. Hence the paral-
lelepiped P (OABC) contains exactly n − 1 integer points.
Suppose that ld(B, OAC) = m, and ld(B, OAC) = k. Then for similar reasons
the parallelepiped contains m − 1 and k − 1 interior integer points, respectively.
Therefore, we have k = m = n.
For the proof of Theorem 15.1 we need the following lemma. Let ld(B, ACD) = r
for some r > 1. Consider the IDC-system with respect to ADBA (with integer-
distance coordinates (x, y, z)). Denote by B , C, C , and D the vertices with coor-
dinates (0, r, r), (r, r, 0), (r, r, r), and (r, 0, r), respectively.
Lemma 15.14 There is a unique integer node in the interior of the intersection of
the plane x + y + z = r + 1 and the parallelepiped (here we restrict ourselves to the
case r > 1).
0 ≤ x ≤ r, 0 ≤ y ≤ r, and x + y + z = r + 1.
Since the point (0, 0, r + 1) is not integer (r + 1 is not divisible by r), we have
z0 ≤ r.
Suppose that z0 ≤ 0. In this case the vector KC is integer; hence the point
K = A + KC is an integer node (see Fig. 15.4). Notice that the point K is in the
tetrahedron ADBA and it is distinct from the vertices of the tetrahedron. So ADBA is
not empty, contradicting the conditions of the lemma. Therefore, we obtain z0 > 0.
Hence we get 0 ≤ x0 ≤ r, 0 ≤ y0 ≤ r, and 0 ≤ z0 ≤ r. Since the edges of the
shifted parallelogram do not pass through the vertices of the parallelepiped, the
point K is in the interior of the parallelepiped.
15.4 White’s Theorem and the Empty Tetrahedra Classification Theorems 211
We prove the statement of the theorem by induction on the number of points inside
the parallelepiped (or equivalently on the value of ld(A , ABD)).
Base of induction. Suppose that ld(A , ABD) = 2. Then the unique integer node
inside the parallelepiped is the center of symmetry of the parallelepiped. It is in the
intersection of all planes passing through opposite edges.
Step of induction. Suppose that the statement of the theorem holds for all empty
tetrahedra with integer distance from the vertices to the opposite edges less than r.
Let us prove the theorem for an arbitrary empty tetrahedron A ABD with
ld A , ABD = r.
Case 1. The Integer Nodes Are in the Plane A CD The plane A CD is defined
by the equation
r − x0
x+ z = r.
z0
212 15 On Empty Simplices, Pyramids, Parallelepipeds
The intersection of the plane A CD with the plane z = 1 is the line (x1 , t, 1), where t
is a linear parameter of the line and x1 is a constant equal to r − r−x0
z0 . This plane con-
tains integer nodes only if x1 is integer. Let us estimate it (assuming that y0 ≤ z0 ).
On the one hand,
r − x0 r − (r + 1 − z0 − y0 ) 1 y0
x1 = r − =r − =r + −1−
z0 z0 z0 z0
1
≥r −2+ > r − 2.
z0
On the other hand, x1 < r. Since x1 is an integer and r − 2 < x1 < r, we have
x1 = r − 1. Therefore, there exist integer nodes with coordinates (r − 1, t, 1), and
hence by Lemma 15.12 one of them (with coordinates (r − 1, t0 , 1), for some integer
t0 ) is contained in the parallelepiped P (ADBA ). So the rest of the integer nodes of
the parallelepiped P (ADBA ) have the coordinates (r − k, (k · t0 mod r), k), and
hence they are all in the parallelogram A B CD.
Case 2. The Integer Nodes Are in the Plane A BC The plane A BC is defined
by the equation
r − y0
y+ z = r.
z0
The intersection of the plane A BC with the plane z = 1 is the line (t, y1 , 1), where t
is a linear parameter of the line and y1 is a constant equal to r − r−y0
z0 . This plane con-
tains integer nodes only if x1 is integer. Let us estimate it (assuming that x0 ≤ z0 ).
On the one hand,
r − y0 r − (r + 1 − z0 − x0 ) 1 x0
y1 = r − =r − =r + −1−
z0 z0 z0 z0
1
≥r −2+ > r − 2.
z0
The Integer Nodes Are not in the Plane B BD Let us show that the case of the
plane B BD is an empty case. This plane is defined by the equation
r − y0 − x0
x +y + z = r.
z0
15.4 White’s Theorem and the Empty Tetrahedra Classification Theorems 213
The intersection of the plane B BD with the plane z = 1 is the line ( a+t a−t
2 , 2 , 1),
where t is a linear parameter for the line and a is a constant equivalent to r −
r−x0 −y0
z0 . Such a line contains integer nodes only if a is integer. On the one hand,
r − y0 − x0 z0 − 1
a=r − =r − > r − 1.
z0 z0
On the other hand, we have a < r. Therefore, there are no integer nodes in the
intersection of the parallelepiped P (ADBA ) (and of the parallelepiped P (ADBA )
as well) with the plane z = 1, which is impossible.
Therefore, the first two cases enumerate all realizable positions of integer nodes.
In both cases the integer nodes are in one of the planes mentioned in the formulation
of the theorem. Hence, the statement holds for an arbitrary pyramid ABCDA with
ld(A , ABD) = r. The proof of theorem is completed by induction.
Remark 15.15 Notice that the third case in the proof is empty, since we assume that
the third coordinate of A is not less than the first and the second coordinates.
The parameter ξ is relatively prime to r, since there would otherwise exist integer
ξ r−ξ
points on the edges distinct from the vertices. The tetrahedra T1,r and T1,r are
integer congruent. Therefore, we restrict the consideration of the tetrahedra to the
case 0 < ξ ≤ r/2.
Emptiness of the tetrahedra. All the tetrahedra are empty, since all integer points
in P (ABDA ) are contained in one of the diagonal planes that does not intersect the
tetrahedron in the interior of the parallelepiped.
214 15 On Empty Simplices, Pyramids, Parallelepipeds
Any two marked tetrahedra in the list are not integer congruent. Let us introduce
the following integer invariant to distinguish the tetrahedra. Consider an arbitrary
marked tetrahedron ABDA with vertex A and the related three-sided angle with
vertex at A and a section ABD. By Lemma 15.14 this plane contains a unique
integer point (denoted by K) on the plane parallel to ABD at integer distance r + 1
from A . Integer distances to the faces of the three-sided angle are 1, ξ , and r − ξ for
some ξ . The unordered collection [1, ξ, r − ξ ] is an integer invariant of the marked
tetrahedron A ABD. This invariant (together with ld(A , ABD) = r) distinguishes all
distinct tetrahedra in the list of the corollary.
Proof of Corollary 15.3(ii) We prove the same three statements as in the proof of
Corollary 15.3(i).
Completeness of the list. Emptiness of the tetrahedra. By Corollary 15.3(i) every
empty tetrahedron is integer congruent to at least one of the marked tetrahedra in
the list of Corollary 15.3(i). We have already shown that all the tetrahedra in this list
are empty.
Integer congruence of tetrahedra in the list. The affine group of symmetries of
tetrahedra S4 includes the affine group of symmetries of marked tetrahedra S3 . For
each of the four angles of a tetrahedron we construct the integer point uniquely
defined by the conditions of Lemma 15.14. Direct calculation shows that the inte-
ger 4-tuples of distances from these points to four planes of the faces of an empty
tetrahedron are as follows
where r is the volume of the tetrahedron and the pair (ξ, ν) satisfies ξ · ν ≡
(1 mod r). So this unordered 4-tuple of unordered 4-tuples is an invariant of empty
ξ ν , T r−ξ , and T r−ν in the list of Corollary 15.3(i)
tetrahedra. The tetrahedra T1,r , T1,r 1,r r
are integer congruent; all the other pairs of empty tetrahedra in the list have distinct
4-tuples of unordered 4-tuples of distances, and hence they are not congruent.
15.5 Exercises
Exercise 15.1 Find triangles of width 4 with the smallest possible integer area.
Exercise 15.2 Prove that the width of all empty 3-dimensional tetrahedra is 1.
Exercise 15.3 Find proofs for the lonely runner conjecture for 2 and 3 runners.
Exercise 15.5 Write the Ehrhart polynomial for all empty tetrahedra.
Exercise 15.6 Prove the equivalence of the lonely runner conjectures in the classi-
cal, view-obstacle, and discrete forms.
Chapter 16
Multidimensional Continued Fractions
in the Sense of Klein
In the first part of this book we studied continued fractions from the geometric point
of view of sails and their integer invariants. In this chapter we present a natural
generalization of sails to the multidimensional case by F. Klein.
16.1 Background
Now we formulate the classical concept of the sail, which is the cornerstone of
the theory of multidimensional continued fractions in the sense of Klein.
Definition 16.2 The boundary of the set A-hull(C) is called the sail of this cone.
16.2 Some Notation and Definitions 217
Definition 16.3 The set of all sails for all the cones C1 , . . . , CN is called the n-
dimensional continued fraction associated to the given n + 1 hyperplanes in Rn+1
passing through O in general position.
Definition 16.5 Assume that π is a supporting hyperplane for a convex set S and
that S is not a subset of π . Then we say that the set Fπ = π ∩ S is a face of S.
The dimension of Fπ is the dimension of a face. If Fπ is a point or a segment
(including rays), we say that the face is either a vertex or an edge.
Remark 16.6 Notice that it is not necessary that the faces of A-hulls coincide with
the faces of their closures. One of the illustrations of this is the cone of Exam-
ple 16.20 below, shown in Fig. 16.4. The shaded area is not a face of A-hull(C), but
it is a face of its closure. The face of the A-hull(C) is the dashed ray.
In the most interesting cases the set A-hull(C) is closed (see Theorem 16.24). So
the following rather tautological proposition is of interest in studying the faces of
sails.
Consider a rational simplicial cone C in Rn+1 with vertex at the origin. The sail of
C has finitely many compact faces of all dimensions. In fact, all compact faces are
contained in the pyramid OA1 A2 . . . An+1 , where Ai is the first integer point on the
ith edge distinct from O. We call sails of rational cones finite. If a multidimensional
continued fraction has a finite sail, then all the other sails of this continued fraction
are also finite; we call this continued fraction finite.
Example 16.9 In Fig. 16.1 we give an example of a sail for a simplicial cone defined
by the three vectors (3, 2, 3), (1, −4, 3), and (−2, 1, 3).
We recall that the classification of two-dimensional cones (i.e., planar angles) and
therefore the corresponding geometric one-dimensional sails is provided by LLS
sequences for both the finite and infinite cases in Chap. 4. The problem of describing
multidimensional (in particular two-dimensional) finite sails is relatively new and
currently has no consistent answer. Let us study the two-dimensional sails that have
a unique compact face. It is clear that this face is triangular, since the cone has three
edges.
Theorem 16.10 Let the sail for a simplicial cone C in R3 have a unique compact
face. Then this face falls into one of the following two cases.
16.4 On a Generalized Kronecker’s Approximation Theorem 219
(i) If the integer distance from the origin to the triangular compact face is one,
then every integer triangle is realizable as this face. Two sails whose unique
compact faces are at integer distance one are integer congruent if and only if
their triangular faces are integer congruent.
(ii) Suppose that the face is at an integer distance to the origin greater than 1. Then
the integer congruence classes of corresponding sails are in one-to-one corre-
spondence with the list “T-W” of completely empty pyramids shown in Fig. 15.2
(here each pyramid is associated to a simplicial cone centered at a marked ver-
tex of the pyramid and having edges containing edges of the pyramid). The
compact faces of the sails are exactly the bases of such pyramids.
Proof If the integer distance from the face to the vertex is one, then the statement
is straightforward. If the distance is greater than one, then the statement follows
directly from Theorem 15.5 on the classification of integer multistory completely
empty marked pyramids.
Problem 12 Describe all finite two-dimensional sails (and the corresponding con-
tinued fractions).
The first tasks here are to investigate sails having two, three, four, etc., compact
faces.
Proof The proof is straightforward. Consider two convex sets S and T . Let u, v ∈
S ⊕ T . Then u = s1 + t1 and v = s2 + t2 , where s1 , s2 ∈ S and t1 , t2 ∈ T . Hence for
every λ ∈ [0, 1] we have
λu + (1 − λ)v = λ(s1 + t1 ) + (1 − λ)(s2 + t2 )
= λs1 + (1 − λ)s2 + λt1 + (1 − λ)t2 .
The two summands in the last expression are in S and T respectively. Hence the
sum is in S ⊕ T . Hence for any two points of S ⊕ T the segment connecting these
points is also in S ⊕ T . Therefore, the set S ⊕ T is convex.
Definition 16.12 Let u ∈ Rn . We call the intersection of all integer vector subspaces
of Rn containing u the integer approximation space of u. It is denoted by Ru .
Example 16.14
√ √
(i) Consider the vector v = [ 2, 3]. The integer approximation space of v is
two-dimensional; the vector v is affinely√irrational.
(ii) The integer approximation space of [1, 2] is also two-dimensional (i.e., it is
R2 ). Nevertheless, this vector is not affinely irrational, since its end is contained
√ x√= 1.
in the integer line
(iii) Let now u = [ 2, 2]. Then Ru is the line y = x, and u is affinely irrational.
(iv) Finally, for the vector w = [1, 1] we have that Rw is the line y = x. The integer
affine hyperplanes in Rw are just integer points. The vector w is integer itself,
so w is not affinely irrational.
Proof The set S is formed by the intersections of the integer affine hyperplanes in
Ru with the line = {λu | u ∈ R}. No such integer hyperplane π contains the line
, since otherwise, π would contain Ru ; hence π intersects in at most one point.
Since there are countably many integer planes, S is countable.
Lemma 16.18 Consider u ∈ Rn and let dim Ru = d. Suppose that the integer lattice
Lu = Ru ∩ Zn has an integer basis (e1 , . . . , ed ).
(i) For any nonzero λ, we have Ru = Rλu .
222 16 Multidimensional Continued Fractions in the Sense of Klein
Proof Lemma 16.18(i) holds, since the linear spaces defined by u and λu coincide.
Let us prove Lemma 16.18(ii). Let u be affinely irrational and let w be as in
the statement of the lemma. Let us first prove that Rw = Ru . A hyperplane of Ru
containing w is integer if and only if a hyperplane shifted by u − w containing w is
integer. Hence Rw = Ru and w is affinely irrational.
Finally we prove Lemma 16.18(iii). Consider an arbitrary integer subspace R
containing ũ. Then the span of R and ed contains u. Hence Ru is in the span of Rũ
and ed . Therefore, Rũ coincides with the span of e1 , . . . , ed−1 .
Now we prove that ũ is affinely irrational by reductio ad absurdum. Suppose
that it is contained in some integer hyperplane of Span(e1 , . . . , ed−1 ) defined by
a linear equation f = 0. Then u satisfies two independent linear equations f = 0
and xd = 0. Hence u is contained in some integer plane π of codimension 2 in Ru .
Hence u is contained in the integer hyperplane Span(π, O) (where O is the origin)
of Ru , and hence Ru is not the integer approximation space of u. We arrive at a
contradiction. The proof is complete.
where for a vector r = (r1 , . . . , rd ), its quotient part is denoted by {r}, i.e.,
{r} = r1 − r1 , . . . , rd − rd .
It is clear that Δ(k1 , k2 ) belongs to the unit coordinate cube. Since this cube is
compact, there exist integers k̂1 > k̂2 > 0 such that
√
Δ(k̂1 , k̂2 ) < 3 ε.
2
Notice that Δ(k̂1 , k̂2 )
= 0, since (k̂1 − k̂2 )u ∈
/ Zn . For every nonnegative m, the ball
B(x0 , ε) ⊕ mΔ(k̂1 , k̂2 )
does not contain integer points, since this ball is obtained from the ball
B(x0 , ε) ⊕ (k̂1 − k̂2 )u
by a shift on the integer vector
x0 + k̂1 u − x0 + k̂2 u,
where r is the vector floor function, which is equal to r − {r}. Hence the set
B(x0 , ε) ⊕ mΔ(k̂1 , k̂2 ) | m ∈ Z+
does not contain integer points. By construction, this set contains an (ε/2)-tubular
neighborhood of a ray with vertex at x0 and direction Δ(k̂1 , k̂2 ), i.e., the set
T = B(x0 , ε/2) ⊕ tΔ(k̂1 , k̂2 ) | t ∈ R+ ;
see Fig. 16.3 (left). Now we are interested in the points of intersections of the ray
{x0 + tΔ(k̂1 , k̂2 ) | k ∈ R+ } with the integer planes parallel to the span of the vec-
tors e1 , . . . , ed−1 . Let the first and second intersections with these planes be y0 and
y0 + v. Then all the intersections are in the set {y0 + kv | k ∈ Z+ }. By construction,
the set
B(y0 , ε/2) ⊕ {kv | k ∈ Z+ }
224 16 Multidimensional Continued Fractions in the Sense of Klein
does not contain any integer point. Let y 0 and v be the point and the vector whose
first d − 1 coordinates coincide with the coordinates of y0 and v, respectively, and
the last coordinate is zero. Since the last coordinate of y0 and v are integers, the set
B(y 0 , ε/2) ⊕ {kv | k ∈ Z+ }
does not contain any integer point.
By Lemma 16.18(ii) it follows that Ru = RΔ(k̂1 ,k̂2 ) . Further, by Lemma 16.18(i),
we get RΔ(k̂1 ,k̂2 ) = Rv . Since the last coordinate of Rv equals 1, we are in the situa-
tion of Lemma 16.18(iii). Hence the vector v is affinely irrational and
Rv = Span(e1 , . . . , ed−1 ).
Hence y0 is in Rv . Then by the induction assumption for (d −1)-dimensional integer
approximation spaces, the set
B(y 0 , ε/2) ⊕ {kv | k ∈ Z+ }
contains infinitely many integer points. We arrive at a contradiction. Therefore, the
statement of Theorem 16.16(v) is true for an arbitrary vector u with dim Ru = d.
Theorem 16.16(iii) is an easy corollary of Theorem 16.16(v): namely, for an ar-
bitrary u take a scalar λ such that λu is affinely irrational (due to Proposition 16.15),
and hence Theorem 16.16(iii) for u follows from Theorem 16.16(v) for λu.
Theorem 16.16(i) and (ii) follow directly from Theorem 16.16(iii), and Theo-
rem 16.16(iv) follows from Theorem 16.16(i).
All these prove the correctness of the step of induction and conclude the proof of
the multidimensional Kronecker’s approximation theorem.
Therefore, we get
A-hull(F ) ⊕ F ⊂ A-hull(C) ∩ F.
226 16 Multidimensional Continued Fractions in the Sense of Klein
Proof Since x is an interior point of A-hull(C), there exists a ball with center at x
contained in A-hull(C). Consider a point x1
= x in this ball such that x − x1 ∈ C.
Then the cone {x} ⊕ C is in the interior of the cone {x1 } ⊕ C. By Proposition 16.19
in the case of F = C we have
A-hull(C) = A-hull(C) ∩ C = A-hull(C) ⊕ C.
Hence,
{x1 } ⊕ C ⊂ A-hull(C).
Therefore, the cone {x} ⊕ C is contained in the interior of the A-hull of the
cone C.
Hence all points of the ray r between x and xr are interior points of the set
A-hull(C), and hence they are not on the sail (which is the boundary of A-hull(C)).
Therefore,
∂ A-hull(C) ∩ r = {xr }.
Therefore, the projection is injective.
We have shown that there is a one-to-one projection sending the sail to Sx \ Cx .
On the one hand, the projection of the sail is continuous. On the other hand, the
inverse to the projection is also continuous (this follows directly from the convexity
of the sail; we leave this as an exercise for the reader). The set Sx \ Cx is homeo-
morphic to an open (d − 1)-dimensional disk (and hence homeomorphic to Rd−1 ),
and therefore, the sail is homeomorphic to Rd−1 .
In this book we mostly discuss sails for cones that are either integer (i.e., each edge
has an integer point distinct from the origin) or in general position. The combina-
toric structure of the sails for integer cones is rather simple and was discussed in
Sect. 16.3 above. Let us examine the situation for sails in general position.
It turns out that for cones in general position the set A-hull(C) is closed.
Proof Consider an arbitrary face F of the cone C. From Proposition 16.19 we have
A-hull(C) ∩ F = A-hull(F ) ⊕ F = ∅ ⊕ F = ∅.
Hence the intersection of A-hull(C) with the union of all faces is empty as well.
Proof
(i) We prove the statement by contradiction. Assume that C is in general posi-
tion. The hyperplane π divides the cone into two connected components, one
of which, say C1 , does not contain integer points in the interior (since it is a
supporting plane of the of A-hull(C)). By construction, the set C1 is not com-
pact, and hence its intersection with one of the edges of the cone C is a ray.
We assume that this edge contains a vector u whose approximation space is Rn .
Now consider any interior point x of the set C1 . On the one hand, there exists a
positive ε such that the tubular neighborhood of the edge shifted to x:
B(x, ε) ⊕ {ku | k ∈ R+ },
is completely contained in C1 . Therefore, by construction,
B(x, ε) ⊕ {ku | k ∈ R+ } ∩ Zn = ∅.
On the other hand, by the multidimensional Kronecker’s approximation theo-
rem (Theorem 16.16(iii)), this tubular neighborhood contains infinitely many
integer points. We arrive at a contradiction. Hence Lemma 16.26(i) holds.
(ii) Let π ∩ C be compact. Consider a sequence of points pi ∈ A-hull(C) that con-
verges to p. By definition, A-hull(C) is the convex hull of a subset of inte-
ger points. Hence by Carathéodory’s theorem, for every i there exists an inte-
ger simplex Δi ⊂ A-hull(C) containing the point pi . Since π is a supporting
plane, the distance from one of the vertices of Δi to the plane π is smaller than
|p − pi |. Since Zn is discrete and π ∩ C is compact, there exists N such that
for every i > N every simplex Δi has an integer vertex in π ∩ C. Since π ∩ C
is compact, there are only finitely many integer points in π ∩ C. Hence we can
choose an infinite subsequence p̃i in the sequence pi all of whose simplices Δ̃i
have the same integer vertex v1 in the plane π .
If v1 = p, then we are done. In case v1
= p, for every Δ̃i we can choose a
vertex wi
= v1 such that the distance from wi to the plane π tends to zero as
i tends to infinity. This is due to the compactness of π ∩ C. Therefore, there
exists Ñ such that all wi belong to π ∩ C for i > N . Since the set
(π ∩ C) ∩ Zn
230 16 Multidimensional Continued Fractions in the Sense of Klein
Remark 16.28 Notice that not every extremal vertex of a convex set S is a vertex
of S. For example, consider the union of the half-disk {x 2 + y 2 ≤ 1 | y ≥ 0} and the
square with vertices (1, 0), (1, −2), (−1, −2), and (−1, 0). Here the points (±1, 0)
are extremal vertices and yet they are not vertices (i.e., 0-dimensional faces).
Now we formulate the main structural theorem on cones in general position (see
also [125]).
To conclude this section we mention just one result on the quasipolyhedral structure
of sails.
Theorem 16.31 A convex set P is quasipolyhedral if and only if the following three
conditions are satisfied:
– P is closed;
232 16 Multidimensional Continued Fractions in the Sense of Klein
We conclude this chapter with a collection of some results and questions on two-
dimensional faces of continued fractions. We distinguish two essentially different
situations: questions related to faces that are a unit integer distance from the origin,
and questions related to faces that are integer distance from the origin greater than
one.
All two-dimensional faces of the sails of continued fractions are convex integer
polygons. If the integer distance to the origin is 1, then the only restriction for a
polygon P to be a face of an n-dimensional continued fraction is whether it is pos-
sible to inscribe P in some (n + 1)-gon Q (not necessarily integer!) such that the
convex hull of all integer points in Q coincides with P .
Proof It is clear that every integer n-gon P can be inscribed in an m-gon Q, for
m ≥ n, such that the convex hull of all integer points in Q coincides with P . We
leave the details of the proof to the reader.
Proof Suppose that the statement is false. Let there exist a two-dimensional con-
tinued fraction that has one of the compact faces (say F ) integer equivalent to a
quadrangle with vertices (−1, 0), (−a − 1, 1), (−1, 2), (b − 1, 1) for some integers
b ≥ a ≥ 1. Let π be the plane of the face. Let us introduce integer coordinates on
the plane π such that the coordinates of the vertices of the face F are exactly (a, 0),
(0, 1), (−b, 0), and (0, −1). Notice that a point in this plane is integer (as a point in
R3 ) if and only if it is integer in the new coordinates in π .
Every two-dimensional simplicial cone defining a two-dimensional continued
fraction has exactly three faces of dimension 2. So the intersection of the edges of
the cone with the plane π is a triangle, which we denote by T . Since F is a face of
the corresponding continued fraction we have, first, that the face F is contained in
the interior of the triangle T and, second, that the set T \ F does not contain any
integer points. Notice that the points (1, 1) and (1, −1) are not in F , and the point
(1, 0) is in F . In addition, the points (1, 0), (1, 1), and (1, −1) are contained in a
straight line. This implies that the open angle (denoted by ϕ) with vertex O(0, 0)
and edges passing through the points (1, 1) and (1, −1) respectively, contains at
least one vertex of the triangle T (see Fig. 16.6).
For two adjacent angles to ϕ and for the opposite angle ϕ, a similar statement
holds. Therefore, each of these four nonintersecting angles has a vertex of T , which
is impossible, since T is a triangle. We have arrived at a contradiction.
All the questions discussed in this subsection are related to the two-dimensional
case. Similar questions are also interesting for the multidimensional case as well,
though it is clear that the complexity rises with the dimension.
234 16 Multidimensional Continued Fractions in the Sense of Klein
16.6.2 Faces with Integer Distance to the Origin Greater than One
Corollary 16.36
(i) The list of faces at integer distance greater than 1 for k-dimensional contin-
ued fractions for k > 2 coincides with the list of bases of multistory marked
pyramids.
(ii) If k = 2, the list of all admissible faces coincides with the list of bases of trian-
gular multistory marked pyramids.
Proof Existence of all admissible faces follows directly from Theorem 15.5 and
Proposition 16.33.
Quadrangular faces are not realizable in the two-dimensional case due to Propo-
sition 16.34.
16.7 Exercises
Exercise 16.2 Construct the sail for the cone with vertex at the origin and generated
by integer vectors (1, 1, 1), (−1, 2, 4), and (2, 1, 4).
Exercise 16.3
(a) What is the sum of a round disk and a quadrangle?
16.7 Exercises 235
(b) Suppose the sum of a convex k-gon and a convex l-gon is an m-gon. For which
triples (k, l, m) is this possible?
(c) Suppose the sum of a convex k-dimensional polyhedron and an l-dimensional
polyhedron is an m-dimensional polyhedron. For which triples (k, l, m) is this
possible?
Exercise 16.4 Let C be a simplicial cone with the vertex at the origin and let F be
one of its faces. Consider a point p in F and a vector v of F . Prove that for every
positive integer N there exists an integer point pn = (a1 (N ), . . . , an (N )) in C \ F
such that
v pN − p 1
−
|v| |p − p| < N .
N
Let us study the structure of Dirichlet groups, which is actually the main reason for
the periodicity of algebraic multidimensional sails (see Chap. 18). The simplest case
of two-dimensional Dirichlet groups was studied in Chap. 8 in the first part of this
book. In this chapter we study the general multidimensional case.
We start with Dirichlet’s unity theorem on the structure of the group of units in
orders in Sect. 17.1. Further, in Sect. 17.2 we describe the relation between Dirich-
let groups and groups of units. In Sects. 17.3 and 17.4 we show how to calculate
bases of the Dirichlet groups and positive Dirichlet groups respectively. Finally, in
Sect. 17.5 we briefly discuss the LLL-algorithm on lattice reduction which helps to
decrease the computational complexity in many problems related to lattices (includ-
ing the calculation of Dirichlet group bases).
Example 17.4 Let K be an algebraic number field. The set of all numbers in K
whose minimal polynomial has integer coefficients is an order in K. This order is
often called maximal, since every other order of K is contained in this order.
Definition 17.7 Denote by Γ (A) the set of all integer matrices commuting with A.
(i) The Dirichlet group Ξ (A) is the subset of invertible matrices in Γ (A).
(ii) The positive Dirichlet group Ξ+ (A) is the subset of Ξ (A) that consists of all
matrices with positive real eigenvalues.
The matrices of Γ (A) form a ring with standard matrix addition and multiplica-
tion. (As a group, Γ (A) is isomorphic to Zn+1 .)
17.2 Dirichlet Groups and Groups of Units in Orders 239
sending p(A) to p(ξ ) for every element p of the quotient ring Q[x]/χA (x). Since
Ξ (A) is contained in Q[A], we have an embedding:
In the proofs of Proposition 17.8 and Theorem 17.9 we use the following notation:
PA = p ∈ Q[x]/χA (x) | p(A) ∈ Mat(n, Z) .
Proof First, the set PA described above is closed under addition and multiplication,
since if p1 (A) and p2 (A) are integer matrices, then (p1 + p2 )(A) and (p1 p2 )(A)
are also integer matrices. Hence hA,ξ (Γ (A)) is closed under addition and multi-
plication. Since Q[A] is a ring (even a field), the addition and multiplication in
hA,ξ (Γ (A)) satisfy all ring axioms. Hence hA,ξ (Γ (A)) is a ring.
Second, as an additive group, hA,ξ (Γ (A)) is a subgroup of Mat(n, Z) isomorphic
to the free abelian group Zn×n , which implies that it is a free abelian group with at
most n2 generators, and hence it is a finite module.
Third, this module contains the identity matrix Id.
And finally, the elements Id, ξ, ξ 2 , . . . , ξ n−1 are linearly independent over Q,
and hence the set hA,ξ (Γ (A)) is a complete module. Therefore, by definition,
hA,ξ (Γ (A)) is an order.
Proof Recall that hA,ξ is a one-to-one map between Ξ (A) and hA,ξ (Ξ (A)).
Letting p(ξ ) ∈ hA,ξ (Γ (A)) be a unit, i.e., it is invertible, then there exists an
element q ∈ P (A) such that p(ξ ) · q(ξ ) = 1. Hence q = p −1 in the quotient ring
Q[x]/χA (x), and
p(A) · q(A) = Id.
Therefore, the matrix p(A) is invertible in Mat(n, Z), and hence it is in Ξ (A).
Suppose now that p(A) ∈ Ξ (A) has an inverse in Ξ (A) that is q(A). Let us
restrict these matrices to the eigenspace corresponding to ξ . We have directly p(ξ ) ·
q(ξ ) = 1. Thus, p(ξ ) is a unit.
Hence, the Dirichlet group Ξ (A) is isomorphic to the multiplicative group of
units in the order hA,ξ (Γ (A)), and the isomorphism is given by hA,ξ .
From Dirichlet’s unit theorem (Theorem 17.6) and Theorem 17.9 we have the
following description of the Dirichlet group Ξ (A).
Ξ (A) = G ⊕ Zs+t−1 .
A polynomial is called cyclotomic if its roots are all distinct primitive nth roots of
unity for some integer n.
A cyclotomic polynomial is an irreducible polynomial. It is usually denoted by
Φn . These polynomials form only a very small subset of all polynomials. For in-
stance, for all n > 2, the degrees of Φn are even, hence if we study matrices with
irreducible characteristic polynomials in odd dimensions, then we do not have any
orders containing roots of unity other than ±1. In fact, the degree equals the Euler
totient φ(n) (the number of positive integers less than or equal to n that are coprime
to n). The sequence of Euler totients (φ(n)) converges to infinity as n tends to in-
finity. Hence for each even d there are only finitely many cyclotomic polynomials
of any fixed degree d.
If the Dirichlet group Ξ (A) related to orders does not have complex roots of
unity, then we directly have
In these cases, for the positive Dirichlet group, we have Ξ+ (A) = Zs+t−1 , since
all symmetries about the plane have negative eigenvalues and every subgroup of a
free abelian group is free abelian.
In the special case that all the eigenvalues of A are real we have the following
statement (we call such a matrix a real spectrum matrix).
Proposition 17.12 For every integer irreducible real spectrum matrix A the set
Γ (A) forms an additive group isomorphic to Zn+1 .
242 17 Dirichlet Groups and Lattice Reduction
Proof In the diagonal basis, the group of all matrices commuting with A is iso-
morphic to Rn+1 by addition. Hence the set of all integer matrices forms an integer
lattice in this (n + 1)-dimensional subspace. So the group is isomorphic to Zk with
k ≤ n + 1.
The matrices Id, A, A2 , . . . , An are linearly independent over Q since the char-
acteristic polynomial of A is irreducible over Q. Hence, k = n + 1.
Corollary 17.13 The group Γ (A) is the intersection of the integer lattice Zn×n ⊂
Mat(n, R) with the space Span(Id, A, A2 , . . . , An ).
Applying the LLL-algorithm described below to the sublattice generated by the ma-
trices Id, A, A2 , . . . , An , one constructs a reduced basis. This will decrease the Eu-
clidean lengths of basis vectors generating the sublattice (see Sect. 17.5 for the full-
rank lattice algorithm and Remark 17.18 for the non-full rank lattice algorithm). In
general it is not necessary to use the LLL-algorithm here, so that one can proceed
directly to Step 3.
1 λi
k−1
Mk = Mk + Mi,
dk di
i=1
where di is the integer distance from the ith point of the set M 1 , . . . , M k−1 , Mk to
the span of the remaining points, and for integers λi we have 0 ≤ λi < di . Since
ld M k , Span(M 1 , . . . , M k−1 ) = 1,
The distances di are calculated by the formula of Theorem 14.9, and the inte-
ger volumes in coordinates are the greatest common divisors of the corresponding
modified Plücker coordinates (see Theorem 14.30). The numbers λi are found by
exhaustive search.
So in n + 1 steps we obtain a basis of Γ (A) that is the integer lattice in the space
Span(Id, A, A2 , . . . , An ) according to Corollary 17.13.
as in Sect. 17.2.2. By Theorem 17.9 this map is an isomorphism between the ring
Ξ (A) and image hA,ξ (Ξ (A)), which is the multiplicative group of units. In the book
[22] the authors show how to construct a basis for the units of an order and a number
ρ such that the norms of all its elements are bounded from above by ρ (here the norm
is the standard norm on the order considered as Rs+t ). The method of constructing
the constant ρ is standard and quite technical, so we omit it. Now according to [22]
we construct a basis by enumeration of all vectors of the set h(Ξ (A)) inside the ball
Bρ (O), where Bρ (O) is a ρ-neighborhood of the origin. The preimage (i.e., h−1 )
of this basis gives us a basis of the group of invertible elements in the ring Ξ (A),
and hence it gives a basis of the subgroup Ξ+ (A).
Remark 17.14 The constant ρ is extremely large (it equals the exponent of some
polynomial of the coefficients of the matrix A). An effective algorithm for this step
can be found in the book [35] by H. Cohen. Using this algorithm one finds the basis
of units in polynomial time (with respect to the coefficients of the matrix A).
below and refer to [23] and [35] for further information. In this subsection we denote
the Euclidean length of a vector v by |v|.
i−1 bi , bj∗
bi∗ = bi − μij bj∗ , where μij =
bj∗ , bj∗
j =1
(by v, w we denote the inner product of two vectors). Actually, the vectors
b1∗ , . . . , bn∗ are the resulting vectors in the Gram–Schmidt orthogonalization process
(and μi,j are the coefficients that arise in the orthogonalization process).
Theorem 17.16 (A.K. Lenstra, H.W. Lenstra, and L. Lovász [130]) Let b1 , b2 , . . . ,
bn be a reduced basis for a lattice L in Rn . Then we have
(i) |bj |2 ≤ 2i−1 |bi∗ |2 for 1 ≤ j ≤ i ≤ n,
%
(ii) d(L) ≤ ni=1 |bi | ≤ 2n(n−1)/4 d(L),
(iii) |b1 | ≤ 2n(n−1)/4 d(L)1/n .
Proof From the size reduced conditions and the Lovász conditions, we have
∗ 2 ∗ 1 ∗ 2
b ≥ 3 − μ 2 b
i i,i−1 bi−1 ≥
4 2 i−1
2 i−1
2 2 i−1
2 2
|bi |2 = bi∗ + μ2i,j bj∗ ≤ bi∗ + 2i−j −2 bi∗ = 2i−2 +1/2 ≤ 2i−1 bi∗ .
j =1 j =1
&n
d(L) = det b1∗ , b2∗ , . . . , bn∗ = bi∗ ,
i=1
the last equality holding since the vectors bi∗ are mutually orthogonal. Now the
inequalities of the second and third items of Theorem 17.16 follow directly from
the inequalities of the first item and the fact that |bi∗ | < |bi |.
LLL-Algorithm
Input data. As input we have a basis of the lattice bi . The additional orthogonal-
ization data (bi∗ , μi,j ) is defined by bi as above. The algorithm is iterative. At
each iteration we replace the basis bi by a new one. Whenever we change the
basis bi we should update the orthogonalization data (bi∗ , μi,j ).
Goal of the algorithm. To calculate a reduced basis of the lattice generated by the
basis bi .
Parameters and conditions of the iteration process. An iteration has one param-
eter k ∈ {1, 2, . . . , n + 1}. For the first iteration we put k = 2. The algorithm
terminates when k = n + 1. After each iteration we have the value of k and the
following conditions satisfied:
(ik ) For 1 ≤ j ≤ i < k, |μi,j | ≤ 12 .
(iik ) For 1 < i < k, |bi∗ + μi,i−1 bi−1
∗ |2 ≥ 3 b∗ .
4 k−1
Description of the iteration. We have the basis bi and the parameter k ∈
{1, 2, . . . , n} (if k = n + 1, the algorithm terminates). First, if k > 1, we replace
bk by [bk ]k−1 and respectively update (bi∗ , μi,j ). Now |μk,k−1 | ≤ 12 . Second, we
choose one of two cases to continue.
246 17 Dirichlet Groups and Lattice Reduction
Theorem 17.17 (A.K. Lenstra, H.W. Lenstra, and L. Lovász [130]) Let L ⊂ Zn be
a lattice with basis b1 , b2 , . . . , bn , and let B ∈ R, B ≥ 2, be such that |bi |2 ≤ B for
1 ≤ i ≤ n. Then the number of arithmetic operations needed by the basis reduction
in the described LLL-algorithm is O(n4 log B), and the integers on which these
operations are performed each have binary length O(n log B).
We are not going to give a proof here, since it is quite technical. The interested
reader is referred to the original manuscript [130].
Remark 17.18 The algorithm works also for the case of lattices of rank l < n. In
this case it terminates when k reaches l + 1.
17.6 Exercises
Exercise 17.1 Prove that Γ (A), considered as a group with matrix addition, is iso-
morphic to Zn+1 .
Exercise 17.2 Find all isomorphism types of Dirichlet groups for matrices in
SL(n, Z) whose characteristic polynomials are irreducible over Q, where
(a) n = 2;
(b) n = 3;
(c) n = 4.
Exercise 17.3 Prove that cyclotomic polynomials are irreducible over Q and of
even degree (except for (x ± 1)).
17.6 Exercises 247
Exercise 17.4 Find the generators and relation of the Dirichlet group Ξ (A), where
⎛ ⎞
0 1 0
(a) A = ⎝0 0 1 ⎠ ;
1 2 −4
⎛ ⎞
0 1 0
(b) A = ⎝0 0 1⎠ .
1 −4 0
Chapter 18
Periodicity of Klein Polyhedra. Generalization
of Lagrange’s Theorem
Proposition 18.2 Let A and B be matrices of GL(n + 1, R) with distinct real eigen-
values. The continued fractions associated to A and B are integer congruent if and
only if there exists a matrix X ∈ GL(n + 1, Z) such that XAX −1 commutes with B.
Proof Let A and B be matrices of GL(n + 1, R) with distinct real irrational eigen-
values and suppose that their continued fractions are integer congruent. Since
the continued fractions are integer congruent, there exists a linear integer lattice-
preserving transformation of the space that maps the continued fraction of A to the
continued fraction of B. Under such a transformation the matrix A is conjugated by
some integer matrix X with unit determinant. All eigenvalues of the matrix XAX −1
are distinct and real (since conjugations preserve the characteristic polynomial of
the matrix). Since the invariant cones of the first continued fraction map to the in-
variant cones of the second one, the sets of eigendirections for the matrices B and
XAX −1 coincide. Hence these matrices are simultaneously diagonalizable in some
basis, and therefore, they commute.
Let us prove the converse. Suppose that there exists X ∈ GL(n + 1, Z) such that
à = XAX −1
commutes with B. Note that the eigenvalues of the matrices A and à coincide.
Therefore, all eigenvalues of the matrix à (just as for the matrix B) are real and
distinct. Let us consider a basis in which the matrix à is diagonal. Simple verifi-
cation shows that the matrix B is also diagonal in this basis. Hence the matrices Ã
and B define the same orthant decomposition of Rn+1 , and thus the same continued
fraction as well.
Definition 18.5 By a fundamental domain of the sail we mean the union of some
faces that contains exactly one face from each equivalence class.
In the top picture we have a fragment of the sail. In the middle there are two triangles
of two different orbits (white ones and dark ones). Finally at the bottom we show
one of the possible fundamental domains with respect to the action of the positive
Dirichlet group.
As we have already seen in the first part of the book (Chap. 7), the LLS sequence
is a complete invariant of one-dimensional sails. The situation in the multidimen-
sional case is much more complicated, since the combinatorial structure of sails
is no longer as simple as in the one-dimensional case. A complete invariant is not
known even for two-dimensional sails in R3 . Further, in this section we discuss
only periodic two-dimensional continued fractions in R3 , recalling that such sails
are homeomorphic to R2 by Theorem 16.21.
Let us discuss the case of R3 . It is not very hard to get an invariant that distin-
guishes all sails. For instance, if we know
– integer affine types of all faces in the sail;
252 18 Periodicity of Klein Polyhedra. Generalization of Lagrange’s Theorem
Goal of the algorithm. To construct the sail (actually some of its finite part, say a
certain fundamental domain for periodic sails).
Step 1. From the third condition we construct the pyramid at v0 of the sail.
Step 2. From the first condition we find all polygonal faces adjacent to v0 .
Inductive step. In previous steps we have found several faces of the sail. By the
first condition identify all integer affine types of pyramids that are adjacent to
the constructed set of faces (at least in two edges each). By the second condition
construct all the faces in new pyramids.
Output. In a finite or countable number of steps we build the whole sail (up to
integer congruence).
Remark 18.7 In the case of algebraic irrationalities all the integer affine types of the
faces and pyramids would repeat periodically. Hence we need to store only a finite
amount of data regarding a single period.
This data set is far from forming a complete invariant of sails. It has many mon-
odromy conditions, so the majority of cases are nonrealizable. So the natural ques-
tion here is as follows: how to find whether a certain data set is realizable?
Recently, V.I. Arnold proposed a weaker invariant to distinguish the periodic
sails.
1 1
.
1 2
254 18 Periodicity of Klein Polyhedra. Generalization of Lagrange’s Theorem
Remark 18.10 Unfortunately, the generalized golden ratios do not always give the
simplest periodic continued fractions in all dimensions, since the characteristic poly-
nomials are not always irreducible. For instance, there are nonconstant factors if the
dimension is k = 4, 7, 10, 12, 13, 16, 17, 19 for k ≤ 20. Further, we study the case
k = 3, which corresponds to an irreducible matrix. The case of the simplest matrices
for k = 4 was studied in [92].
Consider the continued fraction associated to the generalized golden ratio matrix
⎛ ⎞
1 1 1
M = ⎝1 2 2⎠ .
1 2 3
The torus decomposition corresponding to this matrix is homeomorphic to the fol-
lowing one:
Here the segment AB is identified with the segment DC and the segment AD with the
segment BC. In the picture we show only homeomorphic types of the faces (without
lattice structure). The integer affine types of the corresponding faces are given in the
next picture; both triangles have integer affine types of the simplest triangle:
18.4 Three Single Examples of Torus Decompositions in R3 255
The integer distance from the triangle ABD to the origin equals 2, and from the
triangle BCD it equals 1.
This torus decomposition was found by E.I. Korkina [118], G. Lachaud [125],
and A.D. Bryuno and V.I. Parusnikov [27] approximately at the same time.
Example 18.11 The second example was studied by A.D. Bryuno and V.I. Parus-
nikov [27]. They constructed the continued fraction that is associated to the follow-
ing matrix:
⎛ ⎞
1 1 1
M = ⎝1 −1 0⎠ .
1 0 0
The positive Dirichlet group Ξ+ (M) is generated by the following two matrices:
X = M 2, Y = 2Id − M 2 .
Here the segment AB is identified with the segment DC and the segment AD with the
segment BC. The triangle ABD has the integer affine type of the simplest triangle.
The triangle BCD has the integer affine type of the triangle with vertices (−1, −1),
(0, 1), and (1, 0).
The integer distance from the triangle ABD to the origin equals 2, and from the
triangle BCD it equals 1.
256 18 Periodicity of Klein Polyhedra. Generalization of Lagrange’s Theorem
Example 18.12 The third example was given by V.I. Parusnikov [159]. This contin-
ued fraction is associated to the following matrix:
⎛ ⎞
0 1 0
M = ⎝0 0 1 ⎠.
1 1 −3
The positive Dirichlet group Ξ+ (M) is generated by the following two matrices:
X = M 2, Y = 3Id − 2M −1 .
Here the segment AB is identified with the segment DC, and the polygonal line
AGD with the polygonal line BEC (the point G is identified with the point E). All
triangles have the integer affine type of the simplest triangle. The pentagon BEFDG
has the integer affine type of the pentagon with vertices (−1, 0), (−1, 1), (0, 1)
(1, 0), and (1, −1):
The integer distances from the triangle ABC and the pentagon BEFDG to the origin
equal 1, from the triangle CDF to the origin equals 2, and from the triangle CFE
equals 3.
The continued fractions constructed in Examples 18.9, 18.11, and 18.12 are
also known as the continued fractions corresponding to the first, the second, and
the third Davenport forms. Several other single torus decompositions for two-
dimensional continued fractions were investigated in the works [160, 162], and
[161] by V.I. Parusnikov.
For a matrix A we denote by A the sum of absolute values of all the coefficients
for the matrix. Then we have the following theorem (for more information see [90]).
(i) A ≥ 4.
(ii) If A = 5 (there are 48 possible matrices), then the corresponding continued
fraction is integer congruent to the generalization of the regular fraction for
the golden ratio. This is shown in Example 18.9.
(iii) For the case A = 6 there are 912 different possible matrices. Consider the
associated two-dimensional continued fractions:
– 480 continued fractions are integer congruent to the one in Example 18.9;
– 240 of them are integer congruent to the continued fraction of Example
18.12;
– 192 of them are integer congruent to the continued fraction of Example
18.11.
We continue with several infinite series of torus decompositions. The first two in-
finite series were calculated by E.I. Korkina in [118]. The first of them is shown
below.
Example 18.14 The continued fractions of this series are associated to the following
matrices for a ≥ 0:
⎛ ⎞
0 0 1
Ma = ⎝1 0 −a − 5⎠ .
0 1 a+6
The positive Dirichlet group Ξ+ (Ma ) is generated by the following two matrices:
X a = Ma , Ya = (Ma − Id)2 .
Here the segment AB is identified with the segment DC and the segment AD with
the segment BC. The integer distance from the triangle ABD to the origin equals
a + 2, and from the triangle BCD it equals 1.
258 18 Periodicity of Klein Polyhedra. Generalization of Lagrange’s Theorem
The positive Dirichlet group Ξ+ (Ma,b ) is generated by the following two matrices:
−2 −1 −1
Xa,b = Ma,b , Ya,b = Ma,b Ma,b − (b + 1)Id .
Here the segment AB is identified with the segment DC and the segment AD with
the segment BC. Both triangles have the same integer affine type of the triangle with
vertices (0, 0), (0, 1), and (b + 1, 0) (b = 5 in the picture):
The integer distance from the triangle ABD to the origin equals a + 2, and from the
triangle BCD it equals 1.
Proposition 18.16 The continued fractions associated to the following two matrices
are integer congruent (for integers a ≥ 0):
⎛ ⎞ ⎛ ⎞
0 1 0 0 0 1
Ma,0 = ⎝0 0 1 ⎠, Ma = ⎝1 0 −a − 5⎠ .
1 1+a −a − 2 0 1 a+6
Proof The matrices (Id − Ma,0 )−1 and Ma are conjugate by the matrix X in the
group SL(3, Z):
⎛ ⎞
−1 −1 −2
X=⎝ 0 0 −1⎠
1 0 −1
(i.e., Ma = X −1 (Id − Ma,0 )−1 X). Therefore, the corresponding continued fractions
are integer congruent by Proposition 18.2.
The positive Dirichlet group Ξ+ (Ma ) is generated by the following two matrices:
−1
Xa = Ma−2 , Ya = 2Id − Ma−2 .
Here the segment AB is identified with the segment DC and the segment AD with
the segment BC. All four triangles have integer affine types of the simplest triangle:
260 18 Periodicity of Klein Polyhedra. Generalization of Lagrange’s Theorem
The integer distance from the triangles ABD, BDE, and BCE to the origin are equal
to a + 2, a + 1, and 1, respectively.
Example 18.18 The series of this example depend on two integer parameters a > 0
and b ≥ 0. The continued fractions of the series are associated to matrices
⎛ ⎞
0 1 0
Ma,b = ⎝0 0 1 ⎠.
1 (a + 2)(b + 2) − 3 3 − (a + 2)(b + 3)
Here the segment AB is identified with the segment DC and the polygonal line AFD
with the polygonal line BEC (the point F is identified with the point E). The integer
affine types of the faces ABF, CGE, CDG, BDF, and DBEC are as follows:
The integer distance from the triangles ABF, BFD, CDG, CEG to the origin equals
1, 1, 2 + 2a + 2b + ab, and 3 + 2a + 2b + ab respectively. The quadrangle DBEC
is at unit distance from the origin.
Let us consider the family of transpose Frobenius matrices (or transpose companion
matrices)
⎛ ⎞
0 1 0
Am,n := ⎝0 0 1 ⎠,
1 −m −n
where m and n are integers; see Fig. 18.2.
Remark 18.19 Suppose that the characteristic polynomial χAm,n (x) is irreducible
over Q. Then the operator of multiplication by the element x in the field
Q[x]/(χAm,n (x) ) in the natural basis {1, x, x 2 } has the matrix Am,n .
Let us observe some basic properties. Denote by Ω the subset of irreducible RS-
matrices of this family.
In addition it is necessary to subtract all the matrices of types: Aa,−a and Aa,a+2 ,
a ∈ Z (these are the matrices with reducible characteristic polynomials).
Proposition 18.21 The two-dimensional continued fractions for the cubic irra-
tionalities constructed by the matrices Am,n and A−n,−m are integer congruent.
Remark 18.22 There are some other integer congruences of continued fractions
in Ω. For instance, the matrices A0,−a and A−2a,−a 2 have integer congruent contin-
ued fractions (since the matrices A20,−a and A−2a,−a 2 are integer conjugate).
Observe that continued fractions of transpose Frobenius matrices do not cover all
possible integer congruence classes of continued fractions for arbitrary RS-matrices
in SL(3, Z). For instance, the continued fraction of the following matrix is not inte-
ger congruent to that of a Frobenius matrix:
⎛ ⎞
1 2 0
A=⎝ 0 1 2⎠
−7 0 29
(we discuss similar questions later, in Chap. 21). So the following question is of
interest.
262 18 Periodicity of Klein Polyhedra. Generalization of Lagrange’s Theorem
Problem 15 How often are continued fractions integer congruent to those of the
set Ω?
Finally, in Fig. 18.2 we give a table (introduced in [90]) whose squares are filled
with torus decompositions of the sails for continued fractions associated to the ma-
trices of Ω. Integer affine types of all the faces of the torus decomposition for one of
the sails for Am,n is shown in the square at the intersection of the row with number n
and the column with number m. If one of the roots of the characteristic polynomial
for the matrix equals 1 or −1, then the corresponding square is marked with the sign
∗ or # respectively. The squares that correspond to the matrices whose characteristic
polynomial has two complex conjugate roots are shaded in light gray.
Remark Notice that some of the series constructed above are clearly seen in
Fig. 18.2.
Problem 17 (V.I. Arnold) Describe all torus decompositions that are realizable by
periodic two-dimensional continued fractions.
Only a few examples are known in this direction. Let us give several trivial ex-
amples of torus decompositions that do not correspond to sails.
Example 18.23 The following torus decomposition is not realizable by sails of pe-
riodic continued fractions. There is one integer point in the interval AD, which we
denote by E:
18.7 Some Problems and Conjectures 263
Here the polygonal line AEB maps to the polygonal line DFC under one of the
matrices of the group SL(3, Z) that preserves the sail. Since AEB is a segment, DFC
is also a segment. Therefore, the points B, C, F , and D lie in the same plane. And
hence BF is not an edge of some face.
Now we describe the first nontrivial example of torus decomposition, which was
given by E.I. Korkina in the work [118].
Example 18.24 Consider the simplest torus triangulation. It consists of two triangles
with the simplest integer affine type of the triangle (0, 0), (0, 1), and (1, 0).
264 18 Periodicity of Klein Polyhedra. Generalization of Lagrange’s Theorem
The integer distances to both faces are 1. This decomposition is not realizable for
periodic sails.
This example (together with many other ones) allows us to formulate the follow-
ing conjecture.
Conjecture 18 For every torus decomposition for the continued fraction of a cubic
irrationality there exists some face with integer distance to the origin greater than
one.
On the other hand, every known (periodic) sail has a face with the integer distance
to the origin equalling one.
Conjecture 19 For every torus decomposition for the continued fraction of a cubic
irrationality there exists some face whose integer distance to the origin equals one.
Example 18.25 Consider the torus decomposition consisting of one face with inte-
ger affine type of the simplest parallelogram with the vertices (0, 0), (0, 1), (1, 1),
and (1, 0).
The integer distances to this face can be chosen arbitrarily. This decomposition is
not realizable for periodic sails of cubic irrationalities.
It seems that a torus decomposition with a single rectangular face is not realizable
in general. Here we conjecture a stronger statement.
Conjecture 20 The torus decomposition for every sail of every cubic irrationality
contains a triangular face.
The next question is related to continued fractions of the same cubic extension.
18.8 Generalized Lagrange’s Theorem 265
Problem 21 (V.I. Arnold) Classify continued fractions that corresponds to the same
cubic extensions of the field of rational numbers.
Almost nothing is known here (even in the one-dimensional case). For example,
even the finiteness of the number of possible continued fractions associated to the
same extension is unknown. (For properties of cubic extensions of rational numbers
see in the work of B.N. Delone and D.K. Faddeev [52].) Here is a nice example
showing that a characteristic polynomial does not distinguish two-dimensional con-
tinued fractions.
Example 18.26 The following two matrices having the same characteristic polyno-
mial x 3 + 11x 2 − 4x − 1 (and hence the same cubic extension of Q) define integer
noncongruent continued fractions:
⎛ ⎞3 ⎛ ⎞
0 1 0 0 1 0
⎝0 0 1 ⎠ , ⎝0 0 1 ⎠.
1 1 −2 1 4 −11
Definition 18.27 Let C be a cone and v a vertex of the sail for C. An edge star of
v is the union of all edges of the sail adjacent to v; we denote it by Stv .
We say that a star is regular if v is in the interior of the cone C.
Consider a star Stv and a point p (not necessary integer) in the complement
to Stv . We say that Stv is a regular star with respect to p if
– v is in the interior of conv(Stv , p).
– the set conv(Stv , p) \ (conv(Stv ) ∪ {p}) does not contain integer points.
Define
Γ (Stv ) = {p|Stv is regular star with respect to p}.
Theorem 18.28 Let Stv be a regular star. Then the set Γ (Stv ) is bounded.
Consider an arbitrary regular star Stv in Rd . Suppose Γ (Stv ) is not closed. Then
there exists a (unit) direction ξ that is an accumulation points of unbounded direc-
tions, i.e., for every neighborhood U ⊂ S d−1 of ξ and every N > 0 there exists a
point p ∈ Γ (Stv ) with direction in U and |vp| > N . Hence the half-prism
conv(Stv ) ⊕ {λξ |λ ≥ 0}
does not contain integer points in the interior except for the points of conv(Stv ).
There are two different cases to consider here.
The first case is v being inside the half-prism. Then either ξ is rational and the
cone contains integer points, or ξ is irrational and then the half-prism contains in-
teger points by the multidimensional Kronecker’s approximation theorem (Theo-
rem 16.16). We have arrived at a contradiction.
In the second case we have v on the boundary of the half-prism. This mean that
ξ is in the plane of some face Std−1 v of Stv . Let L be the integer hyperplane contain-
ing Std−1
v . The direction ξ is also an accumulation points of unbounded directions
in Γ (Std−1
v ), since the intersections of pyramids with direction close to ξ with L are
also unbounded. This contradicts with induction assumption.
Hence the set Γ (Stv ) is bounded.
Corollary 18.29 The set Γ (Stv ) contains finitely many integer points.
Definition 18.30 Let C be a cone in Rn and let SC be its sail. A two-sided infinite
sequence (vi ) of vertices in the sail SC in Rn is called a chain of SC if for every
integer k we have
– vk and vk+1 are connected by an edge of the sail;
– the points vk , . . . , vk+n−1 are affinely independent (i.e., they span the (n − 1)-
dimensional affine plane).
Definition 18.31 Let C be a cone in Rn , SC its sail, and (vi ) a chain in SC . Consider
an arbitrary subset U of Aff(n, Z). We say that the chain (vi ) is U -periodic if there
exists a period t such that for every i there exists Ai ∈ U satisfying
Remark 18.33 A stronger similar statement holds for the class of all affine transfor-
mations whose corresponding linear transformations satisfy:
– all eigenvalues are distinct from 1;
– all pairs of complex conjugate eigenvalues are distinct and distinct from the ab-
solute values of real eigenvalues.
To avoid technical details we restrict ourselves to the class of affine transforma-
tions with irreducible RS-matrix in GL(3, Z), i.e., to Hn . For more information on
the general case we refer to [70].
Lemma 18.34 Let (vi ) be an Hn -periodic chain of vertices for some sail S in Rn .
Then there exists a transformation A in Hn0 establishing a nontrivial shift of stars,
i.e., there exists a positive integer T such that for every k we have
AT (Stvk ) = Stvk+T .
All these points are in Γ (StvN ), since Ak1 (StvN −k ) = Stk . Since the stars Stv1 and
StvN are integer congruent, the set Γ (StvN ) contains exactly N points. Therefore,
268 18 Periodicity of Klein Polyhedra. Generalization of Lagrange’s Theorem
j
there exist distinct i and j such that Ai1 (O) = A1 (O). Hence Ai−j (O) = O, mean-
ing that Ai−j is a linear transformation in Hn and thus in Hn0 .
The transformation Ai−j ∈ Hn0 establishes a nontrivial shift of the union of all
the stars (Stvi ) along itself.
Am (v) = αk λm
k ek + smaller terms;
m
1
A−m (v) = αl el + smaller terms.
λl
Therefore, the sequences of rays defined by the directions of Am (v) and A−m as
m → +∞ converge to the rays with direction ek and el respectively. (Here to define
the convergence we consider the topology of the unit sphere. A ray is represented
by a point of intersection of this ray with the unit sphere.)
By the above, the set (Am (wi )) should contain the tetrahedron T for every inte-
ger m. Therefore, the limiting cone at +∞ also contains T . Since the tetrahedron T
is (n − 1)-dimensional and the hyperspace of T does not contain the origin, all the
rays in the limit has linearly independent directions. Therefore, the rays are distinct
and (possibly after some permutations of vectors ei ) the matrix of vectors wi in the
basis (ei ) is lower triangular:
⎛ ⎞
α1,1 0 ··· 0 0
⎜ α2,1 α1,2 ··· 0 0 ⎟
⎜ ⎟
⎜ .. .. . . .. ⎟
⎜ . . . . .
. . ⎟
⎜ ⎟
⎝αn−1,1 αn−1,2 · · · αn−1,n−1 0 ⎠
αn,1 αn,2 ··· αn,n−1 αn,n
For the same reasons, all the rays of the limiting cone at −∞ are independent.
Hence for every i there exists a vector wj that is contained in the span of e1 , . . . , ei .
18.9 Littlewood and Oppenheim Conjectures 269
This implies that all the coefficients under the diagonal are zeros. Thus, the matrix
is diagonal, and all vectors wi are eigenvectors of A. Therefore, A(C) = C (recall
that by Lemma 18.34 the transformation A is in Hn0 ).
inf mmαmβ = 0.
m∈Z+
Then there exists an irreducible RS-matrix A ∈ SL(n, Z) such that the planes
Li (p) = 0 for i = 1, . . . , n are the invariant planes of A.
Remark It is known that for n = 3 the Oppenheim conjecture implies the Littlewood
conjecture (see, for instance, [34]).
We also say that the determinant of an edge star Stv is the Euclidean volume of
the convex hull conv(Stv ).
270 18 Periodicity of Klein Polyhedra. Generalization of Lagrange’s Theorem
Theorem 18.36 (O. German [70]) Let n ≥ 3. Consider n linearly independent lin-
ear forms L1 , . . . , Ln in Rn . Denote by C the cone
C = p ∈ Rn | Li (p) = 0, i = 1, . . . , n .
Conjecture 24 Let the faces and edge stars of a cone C have uniformly bounded
determinants. Then the sail SC is algebraically periodic (i.e., it is a sail of some
irreducible RS-matrix of SL(n, Z)).
18.10 Exercises
Exercise 18.1 Suppose that a polynomial p with integer coefficients is irreducible
over Q. Prove that all the roots of p are distinct.
Exercise 18.2 Check that the Dirichlet groups for Examples 18.9, 18.11, and 18.12
are correctly calculated.
Exercise 18.4 Prove that the torus decomposition consisting of a single square of
integer area 2 is not realizable as a sail of a periodic two-dimensional continued
fraction.
Exercise 18.5 Consider arbitrary integers m and n. Suppose that the matrix Am,n is
an irreducible RS-matrix. Prove that the matrix A−n,−m is also an irreducible RS-
matrix and that the continued fraction associated to A−n,−m is integer congruent to
the continued fraction associated to Am,n .
Exercise 18.6 Consider a cone C and its sail SC in Rn . Let (vi ) be some chain of
the sail SC . Prove that each n − 1 consecutive edges of the chain are edges of a face
in SC .
Chapter 19
Multidimensional Gauss–Kuzmin Statistics
This implies that the sets FCF n and CF n admit natural structures of smooth mani-
folds induced by the Cartesian product of n + 1 copies of RP n . Note also that FCF n
is an ((n + 1)!)-fold covering of CF n . We call the map p : FCF n → CF n of “for-
getting” the order in the ordered collections the natural projection of the manifold
FCF n to the manifold CF n .
The group PGL(n + 1, R) takes the set of all straight lines passing through the ori-
gin of (n + 1)-dimensional space into itself (for the definition of PGL(n + 1, R) and
cross-ratios we refer to Chap. 9). Hence, PGL(n + 1, R) naturally acts on the mani-
folds CF n and FCF n . Furthermore, the action of PGL(n + 1, R) is transitive, i.e., it
takes any (framed) continued fraction to any other. Note that for any n-dimensional
(framed) continued fraction, the subgroup of PGL(n + 1, R) taking this continued
fraction to itself is of dimension n.
Definition 19.2 Let M be a smooth enough arcwise connected manifold and let ω
be a volume form on it. Denote by μω the measure on M defined as follows: for
every open measurable set S ⊂ M,
μω (S) = ω.
S
Let us write down explicitly the Möbius forms for the manifold of framed n-
dimensional continued fractions FCF n for arbitrary n.
Consider Rn+1 with standard metrics on it. Let π be an arbitrary hyperplane of
the space Rn+1 with chosen Euclidean coordinates OX 1 . . . Xn , and assume that
π does not pass through the origin. We call the set of all collections of n + 1
ordered straight lines intersecting the plane π the chart FCF n,π of the manifold
FCF n . Let the intersection of π with the ith plane be a point with coordinates
(x1,i , . . . , xn,i ) on π . For an arbitrary tetrahedron A1 . . . An+1 in the plane π , we
denote by Vπ (A1 , . . . , An+1 ) its oriented Euclidean volume (with respect to the
orientation induced by the coordinates OX 1,1 . . . Xn,1 X1,2 . . . Xn,n+1 of the chart
FCF n,π ).
Remark 19.4 The chart FCF n,π is everywhere dense in (Rn )n+1 .
Proposition 19.5 The measure μωπ is a restriction of a Möbius measure to FCF n,π .
Aj Ai
f ij = , i, j = 1, . . . , n + 1; i
= j.
|Aj Ai |π
It is clear that the vectors f ij and f j i are parallel to the vector Aj Ai . The collection
(f ij ) is a basis in the tangent space to FCF n,π , which depends continuously on the
point in the chart. By dvij we denote the 1-form corresponding to the coordinate
along the vector f ij of FCF n,π .
Let us rewrite the form ωπ in new coordinates:
274 19 Multidimensional Gauss–Kuzmin Statistics
ωπ (x1,1 , . . . , xn,n+1 )
&
n+1
Vπ (Ai , A1 , . . . , Ai−1 , Ai+1 , . . . An+1 )
Δvij Δvj i
|Ai Aj |2
for the infinitesimally small Δvij and Δvj i is the infinitesimal cross-ratio of the
four points Ai , Aj , Ai + Δvj i f j i , and Aj + Δvij f ij on the straight line Ai Aj .
Therefore, the form ωπ is invariant under the action of the transformations (of the
everywhere dense set) of the chart FCF n,π that are induced by the projective trans-
formations of the hyperplane π . Hence the measure μωπ coincides with the restric-
tion of some Möbius measure to FCF n,π .
Corollary 19.6 A restriction of an arbitrary Möbius measure to the chart FCF n,π
is proportional to μωπ .
Proof The statement follows from the proportionality of any two Möbius mea-
sures.
Let us fix an origin Oij for the straight line Ai Aj . The integral of the form dvij
(respectively dvj i ) for the segment Oij P defines the coordinate vij (vij ) of the point
P contained in the straight line Ai Aj . As in the one-dimensional case, consider a
projectivization of the straight line Ai Aj . Denote the angular coordinates by ϕij and
ϕj i , respectively. In these coordinates,
Definition 19.8 The value μ(CF n (F )) is called the relative frequency of a face F .
Definition 19.9 We say that two faces of a sail are integer-linear congruent if there
exists an SL(n, Z)-transformation sending one face to the other.
Remark 19.10 It some sense the structure of a face includes the origin and the poly-
hedron of this face. Of course, if two faces are integer-linear congruent, the corre-
sponding polyhedra are integer congruent. The converse is not true. For instance,
two empty triangles (that are integer congruent) can be integer distance one, and
integer distance two to the origin. Such faces are not integer-linear congruent.
Problem 25 Find faces of n-dimensional continued fractions with the highest rel-
ative frequencies. Is it true that for every constant C there exist only finitely many
pairwise integer noncongruent faces whose relative frequencies do not exceed C?
Find the corresponding asymptotics (with respect to C tending to infinity).
Remark 19.12 Let us say a few words about the following particular question of
V.I. Arnold: which faces in two-dimensional sails are more frequent, triangular or
quadrangular? For instance, in the case of R3 , there are two natural models of ran-
dom polyhedra. The first one is to choose k random points in space and to construct
their convex hull. For the second method, one should consider the intersection of k
random half-spaces. In the first method, a random convex hull has only triangular
faces, while in the second the average number of vertices of faces is four (Schläfli–
Saharov theorem). So it is interesting to understand whether two-dimensional sails
are “closer” to random polyhedra in the first sense (in this case they have more
triangles) or in the second sense (then they should have more quadrangles).
The following conjecture connects the notions of relative frequencies and fre-
quencies in the sense of Arnold.
Consider the space R3 with the standard metric on it. Let π be an arbitrary plane in
R3 not passing through the origin and with fixed system of Euclidean coordinates
Oπ Xπ Yπ . Let FCF 2,π be the corresponding chart of the manifold FCF 2 . For an
arbitrary triangle ABC of the plane π , we denote by Sπ (ABC) its oriented Euclidean
area in the coordinates Oπ X1 Y1 X2 Y2 X3 Y3 of the chart FCF 2,π . Denote by |v|π the
Euclidean length of a vector v in the coordinates Oπ X1 Y1 X2 Y2 X3 Y3 of the chart
FCF 2,π . Consider the following form in the chart FCF 2,π :
a triangle ABC in the plane π by three straight lines l1 , l2 , and l3 , where l1 passes
through B and C, l2 passes through A and C, and l3 passes through A, and B. Define
the straight line li (i = 1, 2, 3) in π by the equation (after first making a translation
of π in such a way that the origin is taken to some inner point of the triangle)
ai x + b i y = 1
So, we reduce the computation of relative frequency for the face F (i.e., the value
of μ2 (CF 2 (F ))) to the computation of the measure μω2 (p −1 (CF 2 (F ))). Consider
some plane π in R3 not passing through the origin. By Corollary 19.7 we have
μω2 p −1 CF 2 (F ) = μωπ p −1 CF 2 (F ) ∩ (FCF 2,π ).
Finally, the computation should be made for the set μωπ (p −1 (CF 2 (F ))) ∩ (FCF 2,π )
in dual coordinates ai , bi (see Proposition 19.13).
Remark 19.14 In coordinates ai , bi the computation of the value of the relative fre-
quency often reduces to the estimation of the integral on the disjoint union of a finite
number of six-dimensional Cartesian products of three triangles in coordinates ai ,
bi (see Proposition 19.13). The integration over such a simple domain significantly
increases the speed of approximate computations. In particular, the integration can
be reduced to the integration over some 4-dimensional domain.
Explicit calculation of relative frequencies for the faces seems not to be realizable.
Nevertheless, it is possible to make approximations of the corresponding integrals.
Usually, the greater the area of the polygon, the smaller its relative frequency. The
most complicated approximation calculations correspond to the simplest faces, such
as an empty triangle.
In Fig. 19.1 we examine examples of the following faces:
– triangular faces (0, 0, 1), (0, 1, 1), (1, 0, 1), and (0, 0, 1);
– triangular face (0, 2, 1), (2, 0, 1);
278 19 Multidimensional Gauss–Kuzmin Statistics
Fig. 19.1 The points painted in light gray correspond to the points at which the rays defining the
two-dimensional continued fraction can intersect the plane of the chosen face
– quadrangular face (0, 0, 1), (0, 1, 1), (1, 1, 1), (1, 0, 1).
Denote by An the triangle with vertices (0, 0, 1), (n, 0, 1), and (0, n, 1). Denote
by Bn the square with vertices (0, 0, 1), (n, 0, 1), (n, n, 1), and (0, n, 1).
19.5 Exercises
Exercise 19.1 Show that relative frequencies of integer-linear congruent faces are
equivalent.
280 19 Multidimensional Gauss–Kuzmin Statistics
Exercise 19.3 For every face F of Table 19.1 draw the intersection set of the plane
spanned by F with all the rays that define two-dimensional continued fractions hav-
ing F as a face (see examples in Fig. 19.1).
Exercise 19.4 Calculate up to the second digit the relative frequencies for the faces
of types VII1 and X1 in Table 19.1.
Chapter 20
On Construction of Multidimensional Continued
Fractions
In the first part of this book we saw that the LLS sequences completely determine
all possible sails of integer angles in the one-dimensional case. The situation in the
multidimensional case is much more complicated. Of course, the convex hull algo-
rithms can compute all the vertices and faces of sails for finite continued fractions,
but it is not clear how to construct (or to describe) vertices of infinite sails of multi-
dimensional continued fractions in general. What integer-combinatorial structures
could the infinite sails have? There is no single example in the case of aperiodic in-
finite continued fractions of dimension greater than one. The situation is better with
periodic algebraic sails, where each sail is characterized by its fundamental domain
and the group of period shifts (i.e., the positive Dirichlet group).
In this chapter we show the main algorithms that are used to construct examples
of multidimensional continued fractions (finite, periodic, or finite parts of arbitrary
sails). We begin with some definitions and background. Further, we discuss one
inductive and two deductive algorithms to construct continued fractions. Finally, we
demonstrate one of the deductive algorithms on a particular example.
periodic continued fraction. Hence we are faced with the problem of finding a good
algorithm that enumerates all the faces for this domain.
There was no algorithm for constructing multidimensional continued fractions
until T. Shintani’s work [185] in 1976. The algorithm was further developed by
R. Okazaki [153] in 1993. The authors showed the so-called inductive method for
constructing fundamental domains of multidimensional continued fractions. The al-
gorithm produces the fundamental domain face by face, verifying that each new face
does not lie in the same orbit with some face constructed previously. Applying the
algorithm, one finds the fundamental domain in finitely many steps. In some sense
this is similar to the classical construction of one-dimensional continued fractions.
Using inductive algorithms, E. Thomas and A.T. Vasques obtained several fun-
damental domains for the two-dimensional case in [196]. Later, E. Korkina in [117–
119] and G. Lachaud in [123] produced an infinite number of fundamental domains
for periodic algebraic two-dimensional continued fractions.
For a polytope F and its hyperface E we say that the collection (E, F ) is a hyper-
flag. Every (n − 1)-dimensional face of a sail for an (n − 1)-dimensional continued
fraction is called a hyperface of the sail.
Remark 20.1 The algorithm is relatively slow in Step 2, but the advantage is that
one does not need to construct the basis of the positive Dirichlet group. If we work
with nonalgebraic sails, then this algorithm constructs a finite domain of a sail (here
in Step 2, we only construct an adjacent face).
Consider an arbitrary invariant cone C. Let us find some integer point in C. Shift the
basis unit parallelepiped completely into the cone C. There exists an integer point
lying inside the shifted parallelepiped. The coordinates of this point coincide with
integer parts of coordinates for one of the 2n vertices of this parallelepiped.
Having found some integer point P of the invariant cone C, let us construct some
vertex of the sail of C. Consider some integer plane π passing through the origin
such that
π ∩ C = O,
where O is the origin. Suppose that the integer distance from the point P to this
plane is equal to d. Now we look through all the simplices obtained as intersections
of the cone C with planes parallel to π at integer distances to the origin equal to
1, . . . , d. Suppose that the first simplex containing integer points lies in the plane at
integer distance d ≤ d. The convex hull of all points of this simplex coincides with
some faces of the sail. All vertices of this face are vertices of the sail. Choose one
of them.
face of the sail of dimension k. The face Fk+1 consists of all integer points in the
tetrahedral intersection of the plane spanned by Fk and s with the cone C.
So we get the face Fk+1 . Proceed further until we get a hyperface Fn .
20.1.4 Step 1b, 4: How Decompose the Polytope into Its Faces
Proof The condition of the first item means exactly that all points are in one half-
plane with respect to the hyperplane containing the tetrahedron pi1 . . . pir .
The condition of the second item enumerates the points contained in the plane
of T .
Remark 20.3 Using Proposition 20.2 one constructs all hyperfaces of a polyhe-
dron P . Further decomposition into faces is done iteratively, applying the hyperface
search to already constructed faces.
Suppose that we know a hyperface F and one of its hyperfaces E. Let us construct
the hyperface FE adjacent to F via E.
20.2 Deductive Algorithms to Construct Sails 285
Consider two ordered bases (ei1 ) and (ei2 ) with matrices M1 and M2 in the standard
coordinate basis. It is clear that (ei1 ) is integer congruent to (ei2 ) if and only if
In inductive algorithms one constructs faces of the sail one by one inductively. There
is another approach to constructing faces of the sail for the cone C. First, one should
286 20 On Construction of Multidimensional Continued Fractions
construct an approximation of the sail itself, and, second, choose the faces of the
approximation that are the faces of the sail. This method is especially useful in
the algebraic periodic case. Suppose that we are given an integer irreducible real
spectrum matrix A ∈ SL(n + 1, Z). To compute some fundamental domain of a sail
of the continued fraction associated to A it is sufficient to do the following:
1. Compute a convex hull approximation of the sail. Namely, take a large enough
convenient set of integer points and find its convex hull.
2. Make a conjecture on some fundamental domain. Here we need to guess a set of
faces that might form a fundamental domain. We do this by finding a repeatable
pattern in the geometry of the faces.
3. Prove the conjecture if possible.
4. If you cannot prove the conjecture, start with 1, but with a larger convenient set
of points.
In this situation the following two questions are relevant:
How can one find a convenient set of integer points for the approximation of the
sail?
How can one test whether the conjecture of a fundamental domain of the sail is
true?
Let us give answers to these questions.
In the first step we approximate the cone in the following way. The cone is de-
fined by its edges, so we take a good approximation of all the edges, and get a cone
approximating our cone.
In the second step we should check that certain polyhedra are faces of the sail
for the cone. So we are solving the following problem: starting with a cone C with
vertex at the origin and an integer polyhedron F , find out whether F is a face of the
sail for C. Denote by πF an integer hyperplane containing F .
(i) Check that the intersection of the cone C with πF is a simplex, find all the
integer points inside it, and check that their convex hull coincides with F .
(ii) Find the integer distance d from the origin to F . Check that all integer planes
parallel to πF and having the origin and F in distinct half-planes (there are
exactly d − 1 such planes) do not have integer points in the intersection with
the cone C.
In the third step we find out which faces of the constructed part are shifted by
matrices of the group Ξ+ (A) to each other. This was discussed in Sect. 20.1.5.
As an example, we discuss one particular case of two-dimensional sails later, in
Sect. 20.2.4.
Step 4. Make a conjecture on a fundamental domain of the sail (using the results of
Step 2 and Step 3).
Step 5. Test the resulting conjecture from Step 4.
Output. A fundamental domain of an operator A in the given invariant cone.
Remark 20.4 It is assumed that the fundamental domain conjectured in Step 4 and
the basis A1 , . . . , An of the group Ξ+ (A) satisfy the following conditions:
(i) the closure of the fundamental domain is homeomorphic to the disk;
(ii) the operators with matrices A1 , . . . , An define the gluing of this disk to the n-
dimensional torus.
Remark 20.5 Note that all deductive algorithms are not algorithms in the strict
sense. One needs to choose some basis of Ξ+ (A) in the right way, produce a good
conjecture, and then test it. Even the algorithmic recognition of the period for the
given picture of a sail approximation is thought to be a hard problem. That is why
this “algorithm” cannot be completed with certainty by some computer program.
But on the other hand, this deductive algorithm is effective in practice. All of the ex-
amples listed in the article [89] were produced using this algorithm. The examples
of this paper generalize and expand almost all known periods of the sails calculated
before.
General questions concerning the lattice bases (Steps 1 and 2) were discussed in
Chap. 17 for the calculation of a vertex of the sail (Step 3); see Sect. 20.1.3. Step 5
is general for both deductive algorithms. It will be discussed in the next subsection.
So it remains to describe Step 4.
Suppose that we know some vertex V of the sail in the invariant cone (from Step 3)
and a basis A1 , . . . , An for the group Ξ+ (A) (from Step 2). Let us briefly discuss
how to produce a conjecture on a fundamental domain of the sail. First, we compute
20.2 Deductive Algorithms to Construct Sails 289
the set of integer points that contains all vertices of some fundamental domain of
the sail. Second, we show how to choose an infinite sequence of special polyhedron
approximations for the sail. Finally, using a picture of these approximations, we
formulate a conjecture on a fundamental domain of the sail.
Proposition 20.7 Let V be a vertex of the sail of the n-dimensional continued frac-
tion of an (n + 1)-algebraic irrationality. Then there exists a fundamental domain
of the sail such that all vertices of this domain are contained in the convex hull H
of the origin and of the 2n distinct points of the form
n
& ε
Vε1 ,...,εn = Ai i (V ),
i=1
Proof Consider the polyhedral cone C with vertex at the origin and base at the
convex polyhedron with vertices Vε1 ,...,εn . We take the union of all images of this
polyhedral cone under the actions of the operators with matrices
&
n
Am1 ,...,mn = Am i
i ,
i=1
We skip the classical description of the computation of the convex hull for the
integer points contained in the polyhedron H . Denote the vertices of this convex
hull by Vr for 0 < r ≤ N . Here N is the total number of such points.
The defined set contains approximately, but fewer than, mn N points. (Since we
calculated some image points for the boundary of H several times, we do not
290 20 On Construction of Multidimensional Continued Fractions
know the exact number of points.) The number N is fixed for the given genera-
tors A1 , . . . , An while m varies. We should try to make a conjecture with the least
possible m.
Remark 20.9 For all the examples listed in the paper [89] (and for the example
of the last section) it was sufficient to take m = 2 to produce the corresponding
conjectures.
Remark 20.10 The “quality” of the approximation strongly depends on the choice
of the basis of Ξ+ (A).
Remark 20.11 Note that though the algorithm works for n + 1 arbitrary linearly
independent matrices of Ξ+ (A), it is not necessary to find generators of Ξ+ (A).
Suppose that we know matrices A1 , . . . , An that generate only some full-rank sub-
group of the group Ξ+ (A). Let the index of this subgroup be equal to k. Then we
are faced with the following two problems. Firstly, the number N will be approx-
imately k times greater than in the previous case. Secondly, one should also find a
conjecture on generators of the group Ξ+ (A).
In this section we explain how to test conjectures for the case of two-dimensional
periodic continued fractions (for more information we refer to [95]). The test con-
sists of seven stages. We prove here that these seven stages are sufficient for the
verification of whether the conjecture produced is true. The complexity of these
stages is polynomial in the number of all faces.
20.2.4.1 Brief Description of the Test Stages and Formulation of the Main
Results
Suppose that we have a conjecture on some fundamental domain D for some sail of a
two-dimensional periodic continued fraction associated to some integer irreducible
real spectrum matrix A. Let B1 , B2 be the basis of the group Ξ+ (A). Let pk (for k =
0, 1, 2) be the number of k-dimensional faces of the fundamental domain D. Denote
by Fi (i = 1, . . . , p2 ) the two-dimensional faces, i.e., polygons. All vertices and
edges adjacent to each face are known. It is also conjectured that the fundamental
domain D and the basis B1 , B2 satisfy the following conditions:
(i) the closure of the fundamental domain is homeomorphic to the two-dimensional
disk;
(ii) B1 and B2 define the gluing of this disk to the n-dimensional torus T 2 (the
fundamental domain D is in one-to-one correspondence with this torus).
20.2 Deductive Algorithms to Construct Sails 291
Test of the Conjecture Our test consists of the following seven stages:
1. test of condition (i);
2. test of condition (ii);
3. calculation of all integer distances from the origin to the two-dimensional planes
containing faces Fi and verification of their positivity;
4. test the nonexistence of integer points inside the pyramids with vertices at the
origin and bases at Fi (here the integer points of faces Fi are permitted);
5. test the convexity of dihedral angles (for all edges of the fundamental domain);
6. verification that all stars of the vertices are regular;
7. test whether all vertices of D are in the same invariant cone.
Let us give several technical definitions related to conditions 5 and 6.
Definition 20.12
(i) A dihedral angle is called well-placed if the origin is contained in the corre-
sponding opposite dihedral angle.
(ii) Consider an arbitrary polyhedral surface P and its vertex v. The n-star of the
vertex v in P is the union of all faces of P of dimension not greater than n
containing v.
Let us now define the regular stars at vertex v of a fundamental domain. First, we
construct a 2-star Rv of the conjectured sail using the algorithm of sail reconstruc-
tion (see page 252). Second, let p : W → T 2 be the universal covering of the torus
obtained by gluing the fundamental domain via the shift operators. Take any vertex
w ∈ W corresponding to the vertex v. Denote the 2-star at w ∈ W by Rw . Denote
the canonical projection of Rw to Rv by ξ .
If v
= (a, 0, 0) for some positive a, then we set v m = (1/m, 0, 0) for m ∈ Z+ . (If
v = (a, 0, 0), then we set v m = (0, 1/m, 0).) A 2-star at v is called regular if for the
sequence of rays lm with vertex at the origin and passing through the point v + v m
there exists a positive k such that for every m ≥ k the following holds: the set
ξ −1 (lm ∩ Ru )
Theorem 20.13 The described conjecture test for the fundamental domain D re-
quires every than
C(p0 + p1 + p2 )4
additions, multiplications, and comparisons of two integers, where C is a universal
constant that does not depend on the pi .
We skip the proof of this technical theorem here and refer the interested reader
to [95].
292 20 On Construction of Multidimensional Continued Fractions
Remark 20.14 Note that here we do not take into account the complexity of addi-
tions, multiplications, and comparison of two large integers. We think of any such
operation as a single operation (as a unit of time). There are some known bounds
for the number of digits of the integers that are linear with respect to the coefficients
of the matrix A. So the complexity should be multiplied by some polynomial in the
coefficients of A.
Theorem 20.15 Let the set of faces D satisfy the following conditions:
(1) condition (i);
(2) condition (ii);
(3) positivity of all integer distances from the origin to the two-dimensional planes
containing faces Fi ;
(4) there are no integer points inside the pyramids with vertices at the origin and
bases at Fi (here integer points on the faces Fi are permitted);
(5) all dihedral angles are convex;
(6) all stars of the vertices are regular;
(7) all vertices of D are contained in the same invariant cone.
Then D is a fundamental domain of some sail of the continued fraction associated
to the matrix A.
Let us prove that these seven stages are sufficient for the test.
Lemma 20.16 For any face of the polygonal surface U , the map π is welldefined
and injective on it.
20.2 Deductive Algorithms to Construct Sails 293
Let x ∈ R3 \ O. Denote by Nx the tetrahedral angle with vertex at the origin and
base with vertices x, B1 (x), B1 B2 (x), and B2 (x). Notice that
Bn,m (Nx ) \ O
n,m∈Z
is one of eight invariant cones of the continued fraction associated to A that con-
tains x. Note that from conditions 1, 2, and 6 it follows that all points of D are
contained in one open invariant cone, which we denote by C.
Lemma 20.17 Let x be some point of the open invariant cone C. Then the union of
all faces of D is contained in a finite union of solid angles of the type Bn,m (Nx ).
Proof By Dirichlet’s unit theorem it follows that for every interior point a of the
open invariant cone C there exists an open neighborhood satisfying the follow-
ing condition. The neighborhood can be covered by four solid angles of the type
Bn,m (Nx ) when a belongs to an edge of some Bk,l (Nx ), by two solid angles when a
belongs to the face of some Bk,l (Nx ), and by one solid angle in the remaining cases.
In every case, the neighborhood can be covered by some finite union of solid angles
of type Bn,m (Nx ).
Consider a covering of D by such neighborhoods that correspond to each point of
the closure of D. Since the closure of D is closed and bounded in R3 , it is compact.
Hence this covering contains some finite subcovering. Therefore, the union of all
faces of D is contained in the finite union of solid angles of type Bn,m (Nx ).
Corollary 20.18 Let x be contained in the open invariant cone C. Then the solid
angle Nx contains only points from a finite number of fundamental domains of the
type Bn,m (D).
Proof From the last lemma it follows that D is contained in the finite union
/ l
k=1 Bnk ,mk (Nx ) (for some positive l). Then the solid angle Nx can contain only
points of the fundamental domains B−nk ,−mk (D) for 1 ≤ k ≤ l.
294 20 On Construction of Multidimensional Continued Fractions
Lemma 20.19 The map π bijectively takes the polygonal surface U to the set
S 2 ∩ C.
Since every ball centered at the origin contains only a finite number of vertices of U
(and they do not form a sequence tending to the boundary of the invariant cone C),
the polyhedral surface U divides the space R3 into two connected components. De-
note by H the connected component of the complement to U that does not contain
the origin.
Proof Suppose that some plane passing through the origin intersects the polygonal
surface U and does not contain any vertex of U . By Lemma 20.19 such a plane
intersects U at some piecewise-connected broken line with an infinite number of
edges. The complement of the plane of this broken line consists of two connected
components. By assumption all vertices of this broken line are contained in open
edges of U . By condition 5 all dihedral angles of U are well-placed. Thus the angle
at every vertex of intersection of H with our plane is less than a straight angle.
Hence by the previous lemma the intersection is convex.
Consider the set of all planes that pass through the origin, intersect U , and do not
contain vertices of U . This set is dense in the set of all planes passing through the
origin and intersecting U . Therefore, by continuity it follows that the intersection of
H with any plane passing through the origin (and intersecting U ) is convex.
Now we prove that the set H is convex. Let x1 and x2 be some points of H .
Consider the plane that spans x1 , x2 , and the origin. This plane intersects U , since
x1 is in H and the origin is not in H . By the above, the intersection of H with this
plane is convex. Hence the segment with endpoints x1 and x2 is contained in H .
Thus H is convex (by the definition of convexity).
How Does One Test the Conjecture for Fundamental Domains of Multidi-
mensional Continued Fractions? The verification of conditions (i) and (ii) is
straightforward and is omitted. If these conditions hold, we check whether all the
n-dimensional faces of the fundamental domain are faces of the sail. This can be
done in the following way.
Suppose that the integer distances from the origin to the planes of faces Fi are
equal to di (i = 1, . . . , p, where p is the number of all n-dimensional faces). Our
conjecture is true if and only if for all i = 1, . . . , p the following conditions hold:
(a) For every integer d < di consider the plane parallel to the face Fi with integer
distances to the origin equal to d. The intersection of our invariant cone with
this plane does not contain any integer point.
(b) For d = di the convex hull of all integer points in the intersection coincides with
face Fi .
The verification of conditions (a) and (b) is quite complicated from the algorithmic
point of view.
We conclude this section with the important inverse question of constructing pe-
riodic continued fractions.
Problem 28 (V.I. Arnold) Does there exist an algorithm to decide whether a given
type of fundamental domain is realizable by a periodic continued fraction?
The answer to this question is unknown even for the two-dimensional periodic
continued fractions.
20.3 An Example of the Calculation of a Fundamental Domain 297
Then the following set of faces forms one of the fundamental domains:
(1) the vertex A;
(2) the edges AB, AD, and BD;
(3) the triangular faces ABD and BDC.
Proof of Theorem 20.21 Steps 1 and 2. We omit the first and the second steps (these
steps are classical; see [35]) and here write down the result. The following two
matrices generate the group Ξ+ (A):
−1
Xa,b = A−2m,n , Ya,b = A−1
m,n Am,n − (b + 1)Id ,
do not have any integer solution for 0 < α < 1, the integer distance from ABD to
the origin is equal to one. There are exactly three integer points (A, B, and D) in
the intersection of the plane and the invariant cone. We leave the proof of this as an
exercise for the reader.
Step 4. The conjecture of the fundamental domain was produced in the statement
of this theorem.
Step 5. It remains to test the conjectured fundamental domain. For the test we
need some extra points:
−1
E = Xa,b (B)
= 1, −ab − a − 2b − 2, a 2 b2 + 2a 2 b
+ 4ab2 + a 2 + 8ab + 4b2 + 5a + 7b + 5 ;
F = Ya,b (B) = (−a − 2, 1, 0);
−1
H = Xa,b (F ) = 0, −b − 1, ab2 + 2ab + 2b2 + a + 4b + 3 .
1. (Test of condition (i)). It can be shown in the usual way that the faces have com-
mon edge BD, and the edges intersect only at vertices. This implies that all ad-
jacencies are correct, and that only one or two faces are adjacent to each edge.
The closure of the boundary is a closed broken line ABCDA, homeomorphic to
the circle.
2. (Test of condition (ii)). Direct calculations show that the operator with matrix
Xa,b takes the segment AB to DC (the point A goes to the point D and B to
C). The operator with matrix Ya,b takes AD to BC (A goes to B and D to C).
Obviously no other points glue together. The Euler characteristic of the obtained
surface equals 2 − 3 + 1, i.e., zero, and the surface is orientable.
3. (Calculation of all integer distances from the origin to the two-dimensional
planes containing faces.) The integer distance to the plane of ABD equals
⎛ ⎞
1 0 b 2 + 2b + 1
1 b+1
· ⎝ 0 0 b + 1 ⎠ = = 1.
b+1 a+1 1 1 b+1
The integer distance from the origin to the plane of BDC equals
⎛ ⎞
0 b2 + 2b + 1 b − a − 1
1
· ⎝0 b+1 1 ⎠ = ab + 2b + a + 2 = a + 2.
b+1 1 b+1
1 0
4. (Test on nonexistence of integer points inside the pyramids with vertices at the
origin and bases at the faces.) Since the integer distance from the origin to the
plane containing ABD equals one, the pyramid corresponding to ABD does not
contain integer points different from O and the points of the face ABD.
20.3 An Example of the Calculation of a Fundamental Domain 299
The face BDC is integer-linear congruent to the face with vertices (1, a + 1,
−a − 2), (b + 2, a + 1, −a − 2), (1, a + 2, −a − 2) of the list “T-W” (see
Fig. 15.2). The corresponding transformation taking BCD to the face of the list
“T-W” is as follows:
⎛ ⎞
b+1 b−a−1 b−a
⎝ 1 1 1 ⎠.
0 −1 −1
By Theorem 15.5 the pyramid corresponding to BDC does not contain integer
points different from O and the points of the face BDC.
5. (Test on convexity of dihedral angles.) Let us first consider the edge BD. This
edge is adjacent to the faces ABD and BDC. The face ABD is contained
in the plane fABD (x, y, z) = 0, and the face BDC is contained in the plane
fBDC (x, y, z) = 0, where
Since a ≥ 0, the inequalities hold. Thus the dihedral angle associated with
the edge BD is well-placed. Since the cases of dihedral angles associated to the
edges AB (and the faces ADB and AEB) and BC (and the faces BDC and CBF)
can be verified in the same way, we omit their descriptions. This concludes the
test of Condition 5.
6. (Verification that all 2-stars of the vertices are regular.) There is only one vertex
in the torus decomposition. Any lift of this point to the universal covering W is
adjacent to six edges and six faces. Consider a vertex of the universal covering
that maps to the point B. The corresponding 2-star maps to six edges BC, BD,
BA, BE, BH, and BF and to six faces BCD, BDA, BAE, BEH, BHF, and BFC,
where
−1
H = Xa,b (F ) = 0, −b − 1, ab2 + 2ab + 2b2 + a + 4b + 3 .
300 20 On Construction of Multidimensional Continued Fractions
We will check that for every sufficiently small positive ε, a ray lε with vertex at
the origin and passing through the point Pε = (ε, 0, 1) intersects exactly one of
the faces of 2-stars.
First we will check that for every sufficiently small positive ε, the ray lε in-
tersects the triangle BCF, or equivalently, that the ray lε is contained in the tri-
hedral angle with vertex at the origin O and base in the triangle BCF. The two-
dimensional face of the trihedral angle containing B, C, and O can be defined
by fABO = 0, and the two-dimensional face of the trihedral angle containing B,
F , and O can be defined by fBFO = 0; the two-dimensional face of the trihedral
angle containing C, F , and O can be defined by fCFO = 0, where
For every sufficiently small positive ε, the ray lε is contained in the dihedral
angle defined above if the following conditions hold: the points Pε and F are
in the same closed half-space with respect to the plane fBCO = 0, the points Pε
and C are in the same closed half-space with respect to the plane fBFO = 0, and
the points Pε and B are in the same closed half-space with respect to the plane
fCFO = 0. Since the points Pε and B are close to each other for sufficiently small
ε, they are in the same closed half-space with respect to the plane fCFO = 0. We
now check the remaining two conditions:
fBCO (Pε ) · fBCO (F ) ≥ 0, (−b − 1)ε ≥ 0,
⇔
fBFO (Pε ) · fBFO (C) ≥ 0, (b + 1)ε ≥ 0.
Since b, ε ≥ 0, the first inequality does not hold. Thus for every sufficiently small
positive ε, the ray lε does not intersect the triangle BCF.
The cases of the triangles BCD, BDA, BAE, BEH, and BHF are similar to
those described above and are omitted here.
The ray lε (for any sufficiently small positive ε) intersects the bijective im-
age of the 2-star of the vertex at exactly one point contained in the edge AB.
Therefore, all 2-stars associated to the vertices are regular.
7. (Test that all the vertices of D are in the same invariant cone.) The test of the
seventh stage for this theorem is trivial, since D contains exactly one vertex.
20.4 Exercise
B = CAC −1 .
The natural problem here is as follows: describe the set of integer conjugacy classes
in SL(n, Z). In this section we give an answer to this question for matrices whose
characteristic polynomials are irreducible over the field of rational numbers.
Gauss‘s reduction theory gives a complete geometric description of conjugacy
classes for the case n = 2, as we have already discussed in Chap. 7. In the multi-
dimensional case the situation is more complicated. It is relatively simple to check
whether two given matrices are integer conjugate (see, for instance, [4] and [73]),
but to distinguish conjugacy classes is a much harder task. In this chapter we de-
scribe a generalization of Gauss’s reduction theory to the multidimensional case
(see also [99]). We study questions related to the three-dimensional case in more
detail.
There exists an alternative algebraic approach to the study of SL(n, Z) conjugacy
classes, which we do not touch in this book. There one starts with GL(n, Q) con-
jugacy classes and splits them into GL(n, Z) conjugacy classes. This reduces the
problem to certain problems related to orders of algebraic fields that are defined by
the roots of the characteristic polynomial of the corresponding matrices.
in the QR-algorithm for the eigenvalue problem (see [53, 198], and [154]). Fur-
ther, in [99] they were considered in the framework of the multidimensional
Gauss reduction theory as a multidimensional analogue of reduced matrices.
In Sect. 21.2 we introduce a natural notion of Hessenberg complexity for
Hessenberg matrices. Further we show that each integer conjugacy class of
SL(n, Z) has only finitely many distinct Hessenberg matrices with minimal
complexity (Theorem 21.8).
II. Complete invariant of integer conjugacy classes. In Sect. 21.3 we introduce
a complete geometric invariant of a Dirichlet group. It is a periodic multidi-
mensional continued fractions in the sense of Klein–Voronoi (Theorem 21.23).
Klein–Voronoi continued fractions naturally generalize Klein continued frac-
tions. Further, we deduce the complete invariants of integer conjugacy classes
of GL(n, Z) matrices. We show that the conjugacy classes are represented by
periodic shifts of Klein–Voronoi periodic continued fractions (Theorem 21.24).
III. Calculation of reduced matrices. In Sect. 21.4 we introduce a technique to con-
struct reduced matrices in any integer conjugacy class that has minimal Hessen-
berg complexity in this class.
IV. Solution of Diophantine equations related to certain decomposable forms. In
Sect. 21.5 we study the integer points at the level sets of the decomposable
forms (we have already discussed this question in Chap. 7 in connection with
the Markov spectrum). As a technical detail we introduce a notion of multi-
dimensional w-continued fractions similar to its one-dimensional version of
Chap. 10.
V. Integer conjugacy classes of SL(3, Z). We examine the structure of integer con-
jugacy classes of SL(3, Z) via Klein–Voronoi continued fractions. The main re-
sults concern only the operators with a pair of complex conjugate eigenvectors.
It turns out that in this case, Hessenberg matrices distinguish corresponding
conjugacy classes asymptotically (Theorem 21.48). The analogous statement
does not hold for the case of operators with three real eigenvalues. In general it
is much less known for the case of three real eigenvalues.
In Sect. 21.2.1 we begin with necessary definitions and notation related to Hessen-
berg matrices. Further, in Sect. 21.2.2 we discuss algorithmic aspects of construc-
tion of perfect Hessenberg matrices integer congruent to a given one. In Sect. 21.2.3
we show that every integer conjugacy class with irreducible characteristic polyno-
mial has a finite nonzero number of ς -reduced matrices. Further in Sect. 21.2.4 we
study the set of perfect Hessenberg matrices. We think of this set as a “book” whose
“pages” are enumerated by Hessenberg types. Since the matrices from the same
page are distinguished by their polynomials, any two matrices on one page are not
integer congruent. We conclude this section with a brief discussion of ς -reduced
2-dimensional matrices of the same Hessenberg type.
21.2 Hessenberg Matrices and Conjugacy Classes 303
We study only the simplest general case of SL(n, Z) matrices whose characteris-
tic polynomials are irreducible over Q. Every such matrix is integer conjugate to a
perfect Hessenberg matrix, and each conjugacy class of SL(n, Z) contains infinitely
many perfect Hessenberg matrices. To reduce the number of such matrices, we in-
troduce a natural notion of complexity.
&
n−1
|aj +1,j |n−j
j =1
a2,1 = · · · = an,n−1 = 1.
In the literature such matrices are known as Frobenius matrices (or companion ma-
trices). The elements of the last column of any Frobenius matrix are exactly the
coefficients of the characteristic polynomial multiplied alternately by ±1.
This matrix is a perfect Hessenberg matrix of type 1, 2|2, 3, 5. Its Hessenberg
complexity is 22 · 5 = 20.
Remark Let us say a few words about the relation between reduced matrices in-
troduced in Chap. 7 and ς -reduced matrices in SL(2, Z). On the one hand, every
reduced real spectrum matrix is a perfect Hessenberg matrix. On the other hand,
every ς -reduced matrix is also reduced, although there are certain reduced matrices
that are not ς -reduced.
Later, in Theorem 21.8, we show that the number of ς -reduced matrices is finite
for every integer conjugacy class. Some classes contain more than one ς -reduced
perfect Hessenberg matrix.
In this subsection we show how to construct perfect Hessenberg matrices that are
integer congruent to a given one. The main ingredients of the algorithm are in the
proof of the following proposition.
In other words, for a given integer operator A, every integer vector of unit integer
length is uniquely extended to the basis of the integer lattice in which the operator
A has a perfect Hessenberg matrix.
Since the characteristic polynomial of M is irreducible, the set of all spaces Vi forms
a complete flag in Rn . All the vectors Aj (v) are integer vectors; hence every space
Vi contains a full-rank integer sublattice.
Let us inductively construct an integer basis {ẽi } of a vector space Rn such that:
– for every i the vectors ẽ1 , . . . , ẽi generate the sublattice Zn ∩ Vi ;
– the matrix of A is perfect Hessenberg in the basis {ẽi }.
Base of induction. Let ẽ1 = v. Since l(ẽ1 ) = 1, the vector ẽ1 generates Zn ∩ V1 .
Step of induction. Suppose that we have already constructed ẽ1 , . . . , ẽk forming
a basis of Zn ∩ Vk . Let us now calculate ẽk+1 . Choose an integer vector gk+1 of the
space Vk+1 a unit integer distance to the space Vk . Since ẽ1 , . . . , ẽk generate Zn ∩Vk ,
the vectors ẽ1 , . . . , ẽk , gk+1 generate the sublattice Zn ∩ Vk+1 . Since A(ẽk ) ∈ Vk+1 ,
we have
k
A(ẽk ) = qi,k ẽi + ak+1,k gk+1 .
i=1
For i = 1, . . . , k, we define bi,k and ai,k as integer quotients and remainders of the
following equations:
Then we have
k
k
A(ẽk ) = |ak+1,k | sign(ak+1,k )gk+1 + bi,k ẽi + ai,k ẽi
i=1 i=1
(where sign is the sing function over the real numbers). Finally, we get
k
ẽk+1 = sign(ak+1,k )gk+1 + bi,k ẽi .
i=1
Both M and M̂ are matrices of A in two different integer bases, which means they
are integer conjugate. The integers ai+1,i are nonzero (i = 1, . . . , n − 1), since the
characteristic polynomial of A is irreducible. Therefore, ς(M̂) > 0. Denote by C
the transition matrix to the basis {ẽi }. Then C(ẽ1 ) = v and M̂ = CMC −1 (where M̂
is a perfect Hessenberg matrix).
Uniqueness. Let C1 and C2 satisfy the conditions of the theorem. Suppose that
C1 and C2 are the transition matrices to the bases {ẽi } and {êi } respectively. Then
For i > 1 the equality ẽi = êi follows from the uniqueness of the choice of the coef-
ficients in the proof of existence. Hence the bases {ẽi } and {êi } coincide. Therefore,
C1 = C 2 .
k
A(ẽk ) = qi,k ẽi + ak+1,k gk+1 .
i=1
21.2 Hessenberg Matrices and Conjugacy Classes 307
Find bi,k and ai,k (i = 1, . . . , k) as integer quotients and reminders of the equa-
tions
qi,k = bi,k · |ak+1,k | + ai,k .
Put
k
ẽk+1 = sign(ak+1,k )gk+1 + bi,k ẽi .
i=1
Output data. The perfect Hessenberg matrix CMC −1 , where C is the transition
matrix to {ẽk }.
Note that the basis {ẽi } is constructed in n steps.
Later, in the proof of Theorem 21.31, we use the following corollary.
Proof Each step of the algorithm produces the same data for v and B(v), due to the
fact that A and B commute. Therefore, (M|v) = (M|B(v)).
It turns out that ς -reduced Hessenberg matrices are good for the classification of
conjugacy classes in SL(n, Z). Although they do not form complete invariants of
conjugacy classes, they are present in each class and their number is finite. We give
an explicit construction of all ς -reduced Hessenberg matrices conjugate to a given
one in Sect. 21.4.3 via Klein–Voronoi continued fractions.
Theorem 21.8 Let M be an SL(n, Z) matrix. Then the number of ς -reduced Hes-
senberg matrices integer conjugate to M is finite and greater than zero.
x n + cn−1 x n−1 + · · · + c1 x + c0 ,
308 21 Gauss Reduction in Higher Dimensions
Since ς(M)
= 0, the product in the parentheses is nonzero. Hence ak+1,n is uniquely
defined by ck and the elements ai,j of the first n−1 columns. Therefore, all elements
of M are defined by the Hessenberg type of M and the characteristic polynomial
of M.
Proof of Theorem 21.8 Existence. By Proposition 21.6 there exist perfect Hessen-
berg matrices integer conjugate to M. The set of values of Hessenberg complexity
is discrete and bounded from below; hence there exists a perfect Hessenberg matrix
M̃ integer conjugate to M and with minimal possible Hessenberg complexity. By
definition the matrix M̃ is a ς -reduced.
Finiteness. Let c be the Hessenberg complexity of M̃. By definition the Hessen-
berg complexity of all the other ς -reduced Hessenberg matrices integer conjugate to
M equals c. The number of Hessenberg types whose Hessenberg complexity equals
c is finite. Notice that the integer conjugate matrices have the same characteris-
tic polynomials. From Proposition 21.9 there exists at most one integer Hessenberg
matrix with a given Hessenberg type and a given polynomial. Therefore, the number
of ς -reduced Hessenberg matrices integer conjugate to M is finite.
In this subsection we discuss the structure of Hessenberg matrices with given Hes-
senberg type. We begin with a particular example.
Example 21.10 Let us examine the Hessenberg type 0, 1|1, 0, 2. The set of all
matrices of this Hessenberg type is a two-parameter family with parameters m and
n:
⎧⎛ ⎞ ⎛ ⎞ ⎛ ⎞ ⎫
⎨ 0 1 1 0 0 0 0 0 1 ⎬
H 0, 1|1, 0, 2 = ⎝1 0 0⎠ + m ⎝0 0 1⎠ + n ⎝0 0 0⎠ m, n ∈ Z .
⎩ ⎭
0 2 1 0 0 0 0 0 2
Define
⎛ ⎞
0 1 n+1
H0,1|1,0,2 (m, n) = ⎝1 0 m ⎠.
(1,0,1)
0 2 2n + 1
21.2 Hessenberg Matrices and Conjugacy Classes 309
(1,0,1)
The discriminant of H0,1|1,0,2 (m, n) equals
The set of matrices with negative discriminant for the given family coincides with
the union of the sets of integer solutions of the following quadratic inequalities:
2m ≤ −n2 − n − 2 and 2n ≤ m2 + m.
(1,0,1)
In Fig. 21.1 we represent a matrix H0,1|1,0,2 (m, n) by the square in the inter-
section of the mth column and the nth row. Matrices with reducible characteris-
tic polynomials correspond to black squares. Light gray squares represent matrices
with three real eigenvalues. Matrices shown as white squares have a pair of complex
conjugate eigenvalues.
It turns out that the general case is similar. Consider a Hessenberg type
and denote by Mk (Ω) the matrix whose first n − 1 columns are equal to zero and
the last one equals to vk (Ω). Denote also by σ (Ω) the (n − 1)-dimensional simplex
with vertices
O (the origin), v1 , . . . , vn−1 .
Theorem 21.11
(i) The set H (Ω) is not empty if and only if lV(σ (Ω)) = 1.
(ii) Let M0 ∈ H (Ω). Then H (Ω) is an integer affine (n − 1)-dimensional sublattice
in the lattice of all integer (n × n) matrices, i.e.,
$
n−1
H (Ω) = M0 + ci Mi (Ω) c1 , . . . , cn−1 ∈ Z .
i=1
Notice that
A Span Sgn−1 = Span σ (Ω) and A(gn ) = v.
Sufficient condition. Suppose that both conditions of the lemma hold. Then the
operator A preserves the integer lattice (generated by g1 , . . . , gn ). Therefore, M ∈
SL(n, Z).
Proof of Theorem 21.11 (i) Suppose that lV(σ (Ω)) = 1. Choose an integer vector
v at unit integer distance to the plane Span(σ (Ω)). By Lemma 21.12 both matrices
21.2 Hessenberg Matrices and Conjugacy Classes 311
defined by Ω and vectors ±v are in GL(n, Z). One of them is in SL(n, Z). Con-
versely, if H (Ω) contains an SL(n, Z) matrix, then by Lemma 21.12 the integer
volume of σ (Ω) equals 1.
Statement (ii) is straightforward, since the determinant of a matrix is an additive
function with respect to the operation of vector addition in the last column.
Let us say a few words about ς -reduced 2-dimensional matrices in families H (Ω).
We start with the following example.
3 3m + 2
.
4 4m + 3
Their Hessenberg complexity equals 4. Almost all matrices of this family have
irreducible, over Q, characteristic polynomials with two real roots (except for
m = −1, −2). For m = 0, 1, 2, 3, . . . the LLS-periods of the matrices are
(2, 2), (1, 2, 1, 1), (1, 2, 1, 2), (1, 2, 1, 3), ..., (1, 2, 1, m), ....
The matrices are ς -reduced for m ≥ 2. For m = −3, −4, −5, . . . the corresponding
LLS-periods are
(4, 1), (4, 2), (4, 3), (4, 4), ..., (4, −2 − m), ....
The example of Hessenberg type 3, 4 shows that almost all matrices of this
Hessenberg type are ς -reduced. That is actually the case for all Hessenberg types in
SL(2, Z).
Theorem 21.14 Almost all matrices of a given Hessenberg type in SL(2, Z) are
ς -reduced.
This follows from Theorem 21.31 below and direct calculation of complexities
for all vertices of the period; we skip the proof here.
In Theorem 21.48 we prove similar behavior for SL(3, Z) matrices having two
complex conjugate eigenvalues (see also Conjecture 29).
312 21 Gauss Reduction in Higher Dimensions
where spec(B) is the spectrum of B and S 1 is the complex unit circle. In fact, T l (A)
is an abelian group with the operation of matrix multiplication.
For a vector v in Rn , we denote by TA (v) the orbit of v with respect to the
action of the group of operators T l (A). If v is in general position with respect to the
operator A (i.e., it does not lie in invariant planes of A), then TA (v) is homeomorphic
to the l-dimensional torus. For a vector of an invariant plane of A the orbit TA (v) is
also homeomorphic to a torus of positive dimension not greater than l, or to a point.
21.3 Complete Geometric Invariant of Conjugacy Classes 313
Example 21.15 Suppose that A is a real spectrum matrix, i.e., l = 0. Since all its
eigenvectors are real, T 0 (A) consists only of the unit operator and TA (v) = {v}.
Example 21.16 Now consider the case of a pair complex conjugate eigenvalues,
i.e., l = 1. The group T 1 (A) corresponds to elliptic rotations in the invariant plane
of A corresponding to complex eigenvalues. Such rotations are parameterized by
the angle of rotation. A general orbit of TA (v) is an ellipse around the (n − 2)-
dimensional invariant subspace corresponding to real eigenvalues. Every orbit in
the invariant subspace spanned by real eigenvalues consists of one point.
Definition 21.17 A point p in the cone π+ is said to be π -integer if the orbit TA (p)
contains at least one integer point.
Consider all (real) hyperplanes invariant under the action of the operator A. There
are exactly k such hyperplanes. In the above coordinates, the ith of them is defined
by the equation xi = 0. The complement to the union of all invariant hyperplanes in
the cone π+ consists of 2k arcwise connected components C1 (A), . . . , C2k (A).
Definition 21.18 The convex hull of all π -integer points except the origin contained
in an arcwise connected component Ci (A) is called a factor-sail of Ci (A). The set
of all factor-sails is said to be the factor-continued fraction for the operator A. We
denote it by Ŝi (A).
Definition 21.19 The union of all orbits TA (∗) in Rn represented by the points in
the factor-sail of Ci (A) is called the sail of TA (Ci ), which we denote by Si (A). The
union of all sails is said to be the continued fraction for the operator A in the sense
of Klein–Voronoi (see Fig. 21.2 below). We denote it by KVCF(A).
Fig. 21.2 A three-dimensional example (Example 21.22): a the cone π+ and the eigenplane; b the
continued factor-fraction; c a sail of the continued fraction
This operator has one real and two complex conjugate eigenvalues. Therefore, the
cone π+ for A is a two-dimensional half-plane. In Fig. 21.2a the half-plane π+ is
shaded in light gray and the invariant plane corresponding to the pair of complex
eigenvectors is in dark gray. The vector shown in Fig. 21.2a with endpoint at the
origin is an eigenvector of A.
In Fig. 21.2b we show the cone π+ . The invariant plane separates π+ into two
parts. The dots on π+ are the π -integer points. The boundaries of the convex hulls
in each part of π+ are two factor-sails. One factor-sail is taken to another by the
induced action of − Id.
Finally, in Fig. 21.2c we show one of the sails. Three orbit-vertices shown in
the figure correspond to the vectors (1, 0, 0), (0, 1, 0), and (0, 0, 1): the large dark
points (0, 1, 0) and (0, 0, 1) are visible at the corresponding orbit-vertices.
The positive Dirichlet group Ξ+ (A) in our example is homeomorphic to Z, and
is generated by A. The group Ξ (A) is homeomorphic to Z ⊕ Z/2Z with generators
A and − Id. The operator A takes the point (1, 0, 0) and its orbit-vertex to the point
(0, 1, 0) and the corresponding orbit-vertex. Therefore, every fundamental domain
of the continued fraction for the operator A contains one orbit-vertex and one ver-
tex edge. For instance, we can choose the orbit-vertex corresponding to the point
(1, 0, 0) and the orbit-edge corresponding to the “tube” connecting orbit-vectors for
the points (1, 0, 0) and (0, 1, 0).
316 21 Gauss Reduction in Higher Dimensions
One of the main properties of Klein–Voronoi continued fractions is that they classify
all Dirichlet groups.
Remark If the characteristic polynomial of a matrix is irreducible over Q, then all its
eigenvectors are distinct, so all matrices of Proposition 21.23 possess Klein–Voronoi
continued fractions.
Proof of Theorem 21.23 Supposing that Ξ (A) = Ξ (B) then A and B commute.
Hence they have the same eigenvectors (since they do not have multiple eigenvalues)
and the same orbits: TA = TB . In addition, we choose the same cone π+ for both A
and B. Therefore, by definition, the Klein–Voronoi continued fractions for A and B
coincide.
Let us prove the converse statement. Assume that the Klein–Voronoi continued
fractions for A and B coincide. Suppose that A √ has real eigenvectors g1 , . . . , gk
and complex conjugate eigenvectors gk+2j −1 ± −1gk+2j for j = 1, . . . , l, where
k + 2l = n. Consider the coordinate system corresponding to the basis {gi }:
OX 1 X2 . . . Xk Y1 Z1 Y2 Z2 . . . Yl Zl .
i=1 j =1
Similarly we define the form ΦB for the operator B. Note that A preserves the form
ΦA up to a multiplicative scalar: a simple calculation shows that for every v ∈ Rn
we have
ΦA A(v) = det(A)ΦA (v).
The same is true for B and ΦB .
The Klein–Voronoi continued fraction for A (which is also KVCF(B)) asymp-
totically coincides with the set ΦA = 0 (and ΦB = 0 respectively) at infinity, and
hence the matrices A and B have all the same invariant subspaces. In particular,
their one-dimensional real eigenspaces corresponding to real eigenvectors and their
two-dimensional eigenspaces (we denote them by π1 , . . . , πl ) defined by pairs of
complex conjugate roots coincide. This implies that A and B commute if and only
if they commute for the vectors of the invariant planes π1 , . . ., πl .
Let us show that operators A and B restricted to a plane πi (i = 1, . . . , l) com-
mute. For a vertex v of KVCF(A) we set
Tv = KVCF(A) ∩ v + π1 , . . . , πl .
21.3 Complete Geometric Invariant of Conjugacy Classes 317
From construction, Tv coincides with the orbit TA (v). Since the space spanned by
all planes π1 , . . . , πl is invariant for both A and B, the set Tv coincides with the
orbit TB (v), and hence TA (v) = TB (v).
Consider an arbitrary matrix C of the set TA . Since all eigenvalues of C have
unit modulus and C is diagonalizable in the eigenbasis of A, C preserves ΦA up to
a scalar det(C) = ±1. Therefore, |ΦA | is constant on TA (v). For the same reason,
|ΦB | is constant on TA (v) = TB (v). Therefore, by linearity, ΦA = c · ΦB for some
constant c
= 0. Hence, A preserves ΦB up to a multiplicative scalar.
Consider now the plane πj for some 1 ≤ j ≤ l and take coordinates (yj , zj ). It
is clear that if an operator preserves ΦB (v) up to a multiplicative scalar, then its
restriction to the plane πj preserves the form
yj2 + zj2
up to a multiplicative factor. There are two types of matrices that preserve the set of
level sets of this form:
a b −a b
and
−b a b a
with arbitrary real parameters a and b. The matrices of the second family have two
real eigenvalues in πj , which is by the above not the case for B. Therefore, both A
and B are from the first family. All matrices of the first family commute. Hence A
commutes with B in planes πj for 1 ≤ j ≤ l.
Therefore, A and B are both diagonalizable in a certain complex basis. Hence A
and B commute. Therefore, Ξ (A) = Ξ (B).
In this subsection we fix some integer basis in Rn and identify operators with
matrices in this basis. On the one hand, from Theorem 21.23 it follows that the
Klein–Voronoi continued fraction for A identifies the Dirichlet group Ξ (A) in a
unique way. On the other hand, the operator A defines a shift of KVCF(A) along
itself, which we denote by PA . It is clear that distinct operators of Ξ (A) de-
fine inequivalent shifts. So every A ∈ GL(n, Z) is uniquely identified with a pair
(KVCF(A), PA ). The group GL(n, Z) naturally acts on pairs (KVCF(A), PA ) by
left multiplication on the first factor and by conjugation on the second factor. We
have the following important tautological theorem.
Remark 21.25 The important consequence of this theorem is that all geometric
GL(n, Z)-invariants of the Klein–Voronoi continued fraction for A (such as integer
distances and integer volumes of certain integer point configurations of KVCF(A))
are invariants of the conjugacy class of A. Fundamental domains of Klein continued
fractions and their invariants in the real spectrum three-dimensional case (n = k = 3,
l = 0) were studied in works [89, 90, 118, 119, 123, 125], etc. Currently there is not
much known about the construction of Klein–Voronoi continued fractions in the
nonreal spectrum case.
i=0
Proposition 21.27 Consider A ∈ SL(n, Z) and let B ∈ Ξ (A). Then for an arbitrary
point v we have
ΔA (v) = ΔA B(v) .
OX 1 X2 . . . Xk Y1 Z1 Y2 Z2 . . . Yl Zl
i=1 j =1
Therefore,
k k
& & l α & &
l
2
ΔA (t1 , . . . , tn ) = α ti (tk+2j −1 tt+2j ) = l xi yj + zj2 .
4
i=1 j =1 i=1 j =1
Proof Denote by Vk the plane spanned by vectors v, A(v), A2 (v), . . . , Ak−1 (v).
Let us inductively show that
k
&
A (ẽ1 ) =
k
ai,i+1 ẽk+1 + vk , where vk ∈ Vk .
i=1
The second summand in the last expression is in Vm+1 . We have completed the
induction step.
Therefore,
&
n−1
ΔA (ẽ1 ) = |ai+1,i |n−i = ς(M).
i=1
21.4 Algorithmic Aspects of Reduction to ς -Reduced Matrices 321
Corollary 21.30 Consider an operator A with matrix M. Let v be any primitive in-
teger vector. Consider the matrix (M|v) constructed by the algorithm of Sect. 21.2.2.
Then we have
ς(M|v) = ΔA (v).
Suppose that the minimal absolute value of ΦA on the set of integer points except
the origin equals m0 .
Consider a cone π+ in the plane z1 = · · · = zl = 0 and choose the coordinates
(x1 , . . . , xk , y1 , . . . , yl ) in it. Consider a projection of Rn to the cone along the orbits
TA (∗). Since the MD-characteristic is constant on TA (u) for every u, the projection
of the MD-characteristic is well defined. Denote it by Φ̃A . In the chosen coordinates
of π+ , the function Φ̃A is written as follows:
k
& & l
2
α xj y j .
j =1 j =1
322 21 Gauss Reduction in Higher Dimensions
This function is convex in every orthant of the cone π+ . Since every factor-sail is
the boundary of a certain convex hull in each orthant, all minima of the convex
function Φ̃A restricted to the convex hulls are attained at the boundary, i.e., at π -
integer points of factor-sails. Therefore, all integer minima of ΦA on Zn \ {O} are
at vertices of the Klein–Voronoi continued fraction.
By Corollary 21.30, the Hessenberg complexity ς(M|v) coincides with the MD-
characteristic ΔA (v). Since every matrix integer conjugate to M has a presentation
in the form (M|v) and all integer minima of the MD-characteristic are attained at
vertices of the Klein–Voronoi continued fraction, every ς -reduced operator M̃ is
represented as (M|v0 ) for some vertex v0 ∈ KVCF(A). By Corollary 21.7, for every
B ∈ Ξ (A) we have
M|B(v0 ) = (M|v0 ).
Hence a vector v0 can be chosen in the fundamental domain U . This concludes the
proof.
are integer conjugate but do not coincide. The reason for this is as follows. Con-
sider the Klein–Voronoi continued fraction of A with matrix M1 . It contains in-
teger vertices p1 = (1, 0, 0) and p2 = (0, 1, −1). It turns out that p1 and p2 are
not in the same orbit of the Dirichlet group but have the same MD-characteristic,
namely 3. Hence we get two distinct integer conjugate ς -reduced Hessenberg ma-
trices: M1 = (M1 |(1, 0, 0)) and M2 = (M1 |(0, 1, −1)).
As we have already proved, the ς -reduced Hessenberg matrix for the operator A
is constructed starting from some vertex in a fundamental domain of the Klein–
Voronoi multidimensional continued fraction. We use this property to find all ς -
reduced Hessenberg matrices in a given integer conjugacy class of matrices.
Step 1. Find a fundamental domain of the Klein–Voronoi continued fraction for the
operator A (see Remark 21.33 below).
Step 2. Take all vertices of the fundamental domain constructed in Step 1 and
find among them all vertices with minimal value of the MD-characteristic (say
v1 , . . . , vk ).
Step 3. By Theorem 21.31(i) and (ii) all ς -reduced matrices integer conjugate to
M are (M|v1 ), . . . , (M|vk ). They are all constructed by the algorithm described
in Sect. 21.2.2.
Output. A list of all ς -reduced matrices integer conjugate to M.
Remark 21.33 Currently, Step 1 is the most complicated. The real spectrum case of
matrices with all eigenvalues being real has been well studied. For the algorithms
of constructing multidimensional continued fractions in this case, see Chap. 20 and
the papers by R. Okazaki [153], T. Shintani [185], J.-O. Moussafir [142], and the
author [95]. E. Korkina in [119] and [118], G. Lachaud in [123, 125], A.D. Bruno
and V.I. Parusnikov in [27, 161, 162], and [163], and the author in [89] and [90]
have produced a large number of fundamental domains for periodic algebraic two-
dimensional continued fractions (see also the site [25] by K. Briggs). Some funda-
mental domains in the three-dimensional case are found, for instance, in [92]. The
case with complex conjugate eigenvalues which we study in the next section, is new.
Example 21.34 Let us consider the example of the operator A defined by the matrix
⎛ ⎞
−2 −4 −3
⎝1 2 2 ⎠.
−1 −1 3
The characteristic polynomial of this operator has three distinct real roots. There-
fore, the Klein–Voronoi continued fraction consists of eight sails. The compositions
of operators − Id, A, and 2 Id +A−1 define integer congruences for all these sails,
and hence all ς -reduced operators are written from vertices of one sail. Consider
a sail containing the point (1, 0, 0). There are exactly three distinct orbits of the
Dirichlet group containing the vertices in this sail. They are defined by the follow-
ing points:
(0, 0, 1), (1, 0, 0), and (3, −1, 1).
(We skip all calculations of convex hulls corresponding to the sail.) The MD-
characteristic of these vectors are respectively 1, 2, and 4. So the minimum of the
MD-characteristic (which is 1 in this case) is attained on the vertices of the orbit of
the Dirichlet group containing (0, 0, 1). Therefore, there exists a unique ς -reduced
Hessenberg matrix, which is
⎛ ⎞
0 0 1
⎝1 0 1 ⎠ .
0 1 −3
324 21 Gauss Reduction in Higher Dimensions
The perfect Hessenberg matrices for the vertices (1, 0, 0) and (3, −1, 1) are respec-
tively
⎛ ⎞ ⎛ ⎞
0 1 −1 1 0 −1
⎝1 0 0 ⎠ and ⎝2 0 3 ⎠ ,
0 2 −3 0 1 −4
and their ς -complexities are 2 and 4.
ΔA (v) = N. (21.1)
21.5.1.1 Definition
Definition 21.35 We define the n-sails for an arbitrary simplicial cone C with ver-
tex at the origin inductively.
– let the 1-sail be the sail of C.
– suppose that all w-sails for w < w0 are defined. The convex hull of all π -integer
points except the origin and except for the π -integer points for all w-factor-sails
with w < w0 contained in an arcwise connected component Ci (A) is called the
w0 -factor-sail of Ci (A). We denote it by Ŝw0 ,i (A).
21.5 Diophantine Equations Related to the Markov–Davenport Characteristic 325
The set of all w-factor-sails for a given operator is called the w-factor-continued
fraction for the operator A.
Definition 21.36 The union of all orbits TA (∗) in Rn represented by the points in
the w-factor-sail of Ci (A) is called the w-sail of TA (Ci ); we denote it by Sw,i (A).
The union of all sails is said to be the w-continued fraction for the operator A in
the sense of Klein–Voronoi. We denote it by KVCF w (A).
Remark 21.38 The w-sails for arbitrary w > 0 are not necessarily homothetic to the
1-sail, as it was in the one-dimensional case of Sect. 10.2.5.
OX 1 X2 . . . Xk Y1 Z1 Y2 Z2 . . . Yl Zl
i=1 j =1
for some nonzero constant c. Consider an arbitrary ray r with vertex at the ori-
gin and not contained in any invariant subspace of A. Assume that its direction is
(λ1 , . . . , λn ). Then at points of this ray the function ΔA is as follows:
k
& & l
2 n
ΔA (λ1 t, . . . , λn t) = c λi λk+2j −1 + λk+2j t .
2
(21.2)
i=1 j =1
326 21 Gauss Reduction in Higher Dimensions
It is clear that the ray r intersects the w-sail first and only thereafter intersects the
(w + 1)-sail. Notice that the expression on the right side of Eq. (21.2) is proportional
to t n (with respect to t), which increases as t increases. Therefore, the value of ΔA
at the point of intersection of the ray r with the (w + 1)-sail is strictly greater than
the value at the point of intersection with the corresponding w-sail.
Notice that every possible limit value of ΔA on the (w + 1)-sail is attained at
some point of its fundamental domain (since the closure of a fundamental domain
is compact). In particular, the infimum of |ΔA | at the (w + 1)-sail is attained as a
minimum at some point. Therefore, αw+1 > αw .
The following general techniques helps to write all the solutions of Eq. (21.1).
ΔA (v) = N.
Remark 21.40 By Corollary 21.39 it is enough to check only the vertices of the
w-sails for w ≤ N .
21.6 On Reduced NRS-Matrices in SL(3, Z) 327
In this section we study the case of Hessenberg SL(3, Z) matrices with two complex
conjugate and one real eigenvalue. It turns out that the majority of such matrices
of the same Hessenberg type are ς -reduced. We start in Sect. 21.6.1 with some
notation and definitions. In Sect. 21.6.2 we formulate a supplementary theorem on
the parabolic structure of the set of nonreal spectrum matrices, whose proof we give
in Sect. 21.6.5. Further, in Sect. 21.6.3 we formulate the main result on asymptotic
uniqueness of ς -reduced matrices, whose proof is in Sect. 21.6.6. In Sect. 21.6.4 we
examine some particular examples of families of matrices with fixed Hessenberg
type. All the results of this section are new.
Definition 21.41 Let Ω = a11 , a21 |a12 , a22 , a32 . Consider v = (a13 , a23 , a33 )
such that the determinant of the matrix (aij ) equals 1. Define
⎛ ⎞
a11 a12 a11 m + a12 n + a13
HΩv (m, n) = ⎝a21 a22 a21 m + a22 n + a23 ⎠ .
0 a32 a32 n + a33
It is clear that
H (Ω) = HΩv (m, n) | m ∈ Z, n ∈ Z .
In this context, to choose a vector v means to choose the origin O in the plane
H (Ω). So the set H (Ω) has the structure of a two-dimensional plane (see also
Sect. 21.2.4 above). We denote by OMN the coordinate system corresponding to the
2 a .
parameters (m, n). Notice that the Hessenberg complexity of this family is a12 23
v
Let DiscrΩ (m, n) denote the discriminant of the characteristic polynomial of
HΩv (m, n). Then the set NRS(Ω) is defined by the following inequality in variables
n and m:
DiscrvΩ (m, n) < 0.
328 21 Gauss Reduction in Higher Dimensions
Example 21.42 In Fig. 21.3 we show the subset NRS(0, 1|0, 0, 1). For this exam-
ple we choose v = (0, 0, 1).
The set NRS(0, 1|0, 0, 1) in Fig. 21.3 “looks like” the intersection of Z2 with the
union of the convex hulls of two parabolas. Let us formalize this in a general state-
ment.
Consider the matrix HΩv (0, 0) = (aij ) and define b1 , b2 , and b3 such that the
characteristic polynomial of this matrix in the variable t is
−t 3 + b1 t 2 − b2 t + b3 .
p1,Ω (m, n) = m − α1 n2 − β1 n − γ1 ;
where
⎧ a32 ⎧ a32 a21
⎪ ⎪
⎪ α1 = − 4a21 ,
⎪
⎪ ⎪ α2 = 4b3 ,
⎪
⎪
⎪
⎪ ⎪
⎪
⎨ a11 − a22 − a33 ⎨ b2
β1 = , β2 = − ,
⎪
⎪
2a21 ⎪
⎪ 2b3
⎪
⎪ ⎪
⎪
⎪
⎪ 4b2 − b12 ⎪
⎪ b2 − 4b1 b3
⎩ γ1 = , ⎩ γ2 = 2 .
4a21 a32 4a21 a32 b3
In the real plane OMN of the family HΩv (m, n) we consider the interior of the
circle of radius R with center at the origin (0, 0) and denote it by BR (O). For a real
number t we set
Λt = (m, n) | p1,Ω (m, n) − t p2,Ω (m, n) − t < 0 .
Theorem 21.43 For every positive ε there exists R > 0 such that the following in-
clusions hold:
Recall that a ray is said to be integer if its vertex is integer and it contains integer
points distinct from the vertex.
Definition 21.44 An integer ray in H (Ω) is said to be an NRS-ray if all its integer
points correspond to NRS-matrices. A direction is said to be asymptotic for the set
NRS(Ω) if there exists an NRS-ray with this direction.
As stated in Theorem 21.43, for every Hessenberg type Ω the set NRS(Ω) almost
coincides with the union of the convex hulls of two parabolas. This implies the
following statement.
Proposition 21.45 Let Ω = a11 , a21 |a12 , a22 , a32 . There are exactly two asymp-
totic directions for the set NRS(Ω), defined by the vectors (−1, 0) and (a11 , a21 ).
m,n m,n
By R1,Ω,v (t), or respectively by R2,Ω,v (t), we denote the tth element in the corre-
sponding family.
m,n m,n
Remark 21.47 The families R1,Ω,v and R2,Ω,v coincide with the sets of all integer
points of certain rays with directions (−1, 0) and (a11 , a21 ) respectively. Conversely,
from Proposition 21.45 it follows that the set of integer points of every NRS-ray
m,n m,n
coincides either with R1,Ω,v or with R2,Ω,v for some integers m and n.
(1,0,0)
Fig. 21.5 The family of Hessenberg matrices H0,1|1,0,2 (m, n)
Example 21.49 Every NRS-ray for the Hessenberg type 0, 1|0, 0, 1 contains only
ς -reduced perfect matrices. Experiments show that every NRS-ray for 0, 1|1, 0, 2
contains at most one ς -nonreduced matrix (see Fig. 21.5).
In this subsection we study several examples of families NRS(Ω) for the Hessenberg
types:
0, 1|0, 0, 1, 0, 1|1, 0, 2, 0, 1|1, 1, 2, and 1, 2|1, 1, 3.
In Figs. 21.5, 21.6, and 21.7 the dark gray squares correspond to ς -nonreduced
matrices. We also fill with gray the squares corresponding to ς -reduced Hessenberg
matrices that are the nth powers (n ≥ 2) of some integer matrices.
(1,0,0)
Hessenberg Perfect NRS-Matrices H0,1|0,0,1 (m, n) The Hessenberg com-
plexity of all these matrices is 1, and therefore, they are all ς -reduced (see the family
in Fig. 21.3 on page 328).
332 21 Gauss Reduction in Higher Dimensions
(1,0,1)
Fig. 21.6 The family of Hessenberg matrices H0,1|1,1,2 (m, n)
(0,0,−1)
Fig. 21.7 The family of Hessenberg matrices H1,2|1,1,3
(1,0,0)
Hessenberg Perfect NRS-Matrices H0,1|1,0,2 (m, n) The Hessenberg com-
plexity of these matrices equals 2. Experiments show that 12 such matrices are ς -
nonreduced (see the family in Fig. 21.5). It is conjectured that all other Hessenberg
matrices of NRS(0, 1|1, 0, 2) are ς -reduced.
(1,0,1)
Hessenberg Perfect NRS-Matrices H0,1|1,1,2 (m, n) The Hessenberg com-
plexity of these matrices equals 2. We have found 12 ς -nonreduced matrices in
21.6 On Reduced NRS-Matrices in SL(3, Z) 333
the family. It is conjectured that all other Hessenberg matrices of NRS(0, 1|1, 1, 2)
are ς -reduced. See Fig. 21.6.
(0,0,−1)
Hessenberg Perfect NRS-Matrices H1,2|1,1,3 (m, n) This is a more compli-
cated example of a family of Hessenberg perfect NRS-matrices, with their com-
plexity equaling 12. We have found 27 ς -nonreduced matrices in the family. It is
conjectured that all other Hessenberg matrices of NRS(1, 2|1, 1, 3) are ς -reduced.
See Fig. 21.7.
We begin the proof with several lemmas, but first a small remark.
(1,0,0)
Lemma 21.50 The planar curve Discr0,1|0,0,1 (m, n) = 0 is contained in the do-
main defined by the inequalities
2
(m − 4n + 3)(n2 + 4m + 3) ≥ 0,
(m2 − 4n − 3)(n2 + 4m − 3) − 72 ≤ 0.
(1,0,0)
Remark Lemma 21.50 implies that the curve Discr0,1|0,0,1 (m, n) = 0 is contained
in some tubular neighborhood of the curve
2
m − 4n n2 + 4m = 0.
Thus, we have
(1,0,0)
Discr0,1|0,0,1 (m, n) − m2 − 4n + 3 n2 + 4m + 3
and
(1,0,0)
Discr0,1|0,0,1 (m, n) − m2 − 4n − 3 n2 + 4m − 3 + 72
Lemma 21.51 For every Ω = a11 , a21 |a12 , a22 , a32 there exists an affine (not nec-
essarily integer) transformation of the plane OMN taking the curve DiscrvΩ (m, n) =
(1,0,0)
0 to the curve Discr0,1|0,0,1 (m, n) = 0.
Proof Let HΩv (0, 0) = (ai,j ). Note that every matrix HΩv (m, n) is rational conjugate
to the matrix
(1,0,0)
H0,1|0,0,1 m ,n ,
where
m = a23 a32 − a11 a33 + a12 a21 − a22 a33 − a11 a22 + a21 a32 m − a11 a32 n,
n = a11 + a22 + a33 + a32 n,
by the matrix
⎛ 2 +a a
⎞
1 a11 a11 12 21
v
XΩ = ⎝0 a21 a11 a21 + a21 a22 ⎠ .
0 0 a21 a32
(1,0,0)
Therefore, the curve DiscrvΩ (m, n) = 0 is mapped to the curve Discr0,1|0,0,1 (m,
n) = 0 bijectively. In OMN coordinates this map corresponds to the following affine
transformation:
m a a m − a11 a32 n a a − a11 a33 + a12 a21 − a22 a33 − a11 a22
"→ 21 32 + 23 32 .
n a32 n a11 + a22 + a33
Proof of Theorem 21.43 Consider a family of matrices HΩv (−p1,Ω (0, t) + ε, t) with
real parameter t. Direct calculations show that for ε
= 0 the discriminant of the
matrices for this family is a polynomial of the fourth degree in the variable t, and
1
DiscrvΩ −p1,Ω (0, t) + ε, t = a21 a32
5
εt 4 + O t 3 .
4
21.6 On Reduced NRS-Matrices in SL(3, Z) 335
Therefore, there exists a neighborhood of infinity with respect to the variable t such
that the function DiscrvΩ (−p1,Ω (0, t) + ε, t) is positive for positive ε in this neigh-
borhood, and negative for negative ε.
Hence for a given ε there exists a sufficiently large N1 = N1 (ε) such that for ev-
ery t > N1 , there exists a solution of the equation DiscrvΩ (m, n) = 0 at the segment
with endpoints
−p1,Ω (0, t) + ε, t and −p1,Ω (0, t) − ε, t
a11 a21
HΩv t − a11 p2,Ω (t, 0) − ε, −a21 p2,Ω (t, 0) − ε .
2 + a2
a11 2 + a2
a11
21 21
For the same reasons, for a given ε there exists a sufficiently large N2 = N2 (ε)
such that for every t > N2 there exists a solution of the equation DiscrvΩ (m, n) = 0
on the segment with endpoints
a11 a21
t − a11 p2,Ω (t, 0) − ε, −a21 p2,Ω (t, 0) − ε and
2 + a2
a11 a 2 + a2
21 11 21
a11 a21
t − a11 p2,Ω (t, 0) + ε, −a21 p2,Ω (t, 0) + ε
2 + a2
a11 2 + a2
a11
21 21
Then by Lemma 21.51, the curve DiscrvΩ (m, n) = 0 is contained in some tubular
neighborhood of the curve
outside some ball centered at the origin. Finally, by Viète’s Theorem, the intersec-
tion of the curve DiscrvΩ (m, n) = 0 with each of the parallel lines
a11 + a21
t : n−m=t
a21
contains at most four points. Therefore, there exists a sufficiently large T such that
for any t ≥ T , the intersection of the curve DiscrvΩ (m, n) = 0 and t contains ex-
actly 4 points corresponding to the branches of the parabolas p1,Ω (m, n) = 0 and
p2,Ω (m, n) = 0 lying in Λ−ε \ Λε .
From Lemma 21.50 and Lemma 21.51 it follows that the set
a11 + a21
n − m < T ∩ DiscrvΩ (m, n) = 0
a21
is compact.
Therefore, there exists R = R(ε, N1 , N2 , T ) such that in the complement to the
ball BR (O) we have
Λε ⊂ NRS(Ω) ⊂ Λ−ε .
The proof of Theorem 21.43 is complete.
Consider an NRS-operator A and any integer point p distinct from the origin O.
Denote by ΓA0 (p) the convex hull of the union of two orbits corresponding to the
points p and A(p). For every integer k we denote by ΓAk (p) the set Ak (ΓA0 (p)).
Proposition 21.52 Let p be a nonzero integer point. Then there exists a fundamen-
tal domain of the Klein–Voronoi continued fraction for A with all (integer) orbit-
vertices contained in the set ΓA0 (p).
Lemma 21.53 Let p be a nonzero integer point. Then one of the Klein–Voronoi
sails for A is contained in the set ΓA (p).
Proof Notice that the set ΓA (p) is a union of orbits. Let us project ΓA (p) to the half-
plane π+ . The set ΓA (p) projects to the closure of the complement of the convex
hull for the points ΓA (Ak (p)) ∩ π+ for all integer k in the angle defined by the
eigenspaces. Since all points Ak (p) are integer, their convex hull is contained in the
convex hull of all points corresponding to integer orbits in the angle. Hence the set
ΓA (p) ∩ π+ contains the projection of the sail. Therefore, the set ΓA (p) contains
one of the sails.
Proof of Proposition 21.52 Since − Id exchanges the sails, one can choose a fun-
damental domain entirely contained in one sail. Let this sail contain a point p.
By Lemma 21.53 the set ΓA (p) contains this sail. Therefore, all orbit-vertices of
a fundamental domain for the Klein–Voronoi continued fraction can be chosen in
ΓA0 (p).
We begin with the case of matrices of Hessenberg type Ω0 = 0, 1|0, 0, 1. Such
matrices form a family H (Ω0 ) with real parameters m and n:
⎛ ⎞
0 0 1
H0,1|0,0,1 (m, n) = ⎝1 0 m⎠ .
(1,0,0)
0 1 n
Remark Lemma 21.55 means that the corresponding domain tends to be flat as t
m,n
tends to infinity. Recall that R1,Ω 0 ,v0
(t)(1, 0, 0) = (0, 1, 0) for every t.
Proof Let us find the asymptotics of eigenvectors and eigenplanes for operators
m,n m,n
R1,Ω 0 ,v0
(t) as t tends to +∞. Denote a real eigenvector of R1,Ω 0 ,v0
(t) by e(t).
Direct calculations show that there exists μ
= 0 such that
e(t) = μ (1, 0, 0) + O t −1 .
m,n
Consider the invariant real plane of the operator R1,Ω 0 ,v0
(t) (it corresponds to a pair
of complex conjugate eigenvalues). Note that this plane is a union of a certain family
m,n
of closed orbits of R1,Ω 0 ,v0
(t). Direct calculations show that any orbit in this family
is an ellipse with semiaxes λgmax (t) and λgmin (t) for some positive real number λ,
where
λ|gmax (t)|
= |t|1/2 + O |t|−1/2 .
λ|gmin (t)|
First, since
1
(1, 0, 0) − e(t) = O |t|−1 ,
μ
the minimal semiaxis of the orbit-vertex TR m,n (1, 0, 0) is asymptotically not
1,Ω0 ,v0 (t)
greater than O(t −1 ). Therefore, the length of the maximal semiaxis is asymptot-
ically not greater than some function of type O(|t|−1/2 ). Hence, the orbit of the
point (1, 0, 0) is contained in the (C1 |t|−1/2 )-ball of the point (1, 0, 0), where C1 is
a constant that does not depend on t.
Second, we have
1
(0, 1, 0) − gmax (t) = O |t|−1 .
t
Thus, the length of the maximal semiaxis of the orbit-vertex TR m,n (1, 0, 0) is
1,Ω0 ,v0 (t)
asymptotically not greater than some function 1+O(t −1/2 ). Hence, the length of the
minimal semiaxis is asymptotically not greater than some function O(|t|−1/2 ). This
implies that the orbit of the point (0, 1, 0) is contained in the (C2 |t|−1/2 )-tubular
neighborhood of the segment with vertices (0, 1, 0) and (0, −1, 0), where C2 is a
constant that does not depend on t.
21.6 On Reduced NRS-Matrices in SL(3, Z) 339
is contained in the C-tubular neighborhood of the triangle with vertices (1, 0, 0),
(0, 1, 0), (0, −1, 0), where C = |t|−1/2 max(C1 , C2 ). This concludes the proof of
the lemma.
Let us now extend the statement of Proposition 21.52 to the general case of Hes-
senberg matrices.
m,n
Corollary 21.56 Let Ω = a11 , a21 |a12 , a22 , a32 and let R1,Ω,v be an NRS-ray.
Then for every ε > 0 there exists C > 0 such that for every t > C the convex hull of
the union of two orbit-vertices
The operator X defines two linear functions l1 and l2 on two variables m and n with
coefficients depending only on a11 , a21 , a12 , a22 , and a32 satisfying
t
, l2 (m, n) = XHΩ (m − t, n)X −1 .
(1,0,0)
H0,1|0,0,1 l1 (m, n) −
a21 a32
m,n
Therefore, the ray R1,Ω,v after a change of coordinates and a rescaling is taken to
m̃,ñ
the ray R1,Ω 0 ,v0
of matrices with Hessenberg type Ω0 = 0, 1|0, 0, 1 for certain m̃
and ñ.
Lemma 21.55 implies the following. For every ε > 0 there exists a positive con-
stant such that for every t greater than this constant the convex hull of the union of
two orbit-vertices
is contained in the ε-tubular neighborhood of the triangle with vertices (1, 0, 0),
(0, 1, 0), (0, −1, 0).
340 21 Gauss Reduction in Higher Dimensions
If we reformulate the last statement for the family of operators in the old coordi-
nates, then we get the statement of the corollary.
m,n
Proof of Proposition 21.54 Notice that the operator R1,Ω,v (t) takes the point
(1, 0, 0) to the point (a11 , a21 , 0). Therefore, the convex hull of the union of two
orbit-vertices
TR m,n (1, 0, 0) and TR m,n (a11 , a21 , 0)
1,Ω,v (t) 1,Ω,v (t)
(we denote it by W (t)) coincides with the set ΓR0m,n (1, 0, 0).
1,Ω,v (t)
From Proposition 21.52 it follows that there exists a fundamental domain for
a Klein–Voronoi continued fraction with all its orbit-vertices contained in W (t).
Choose a sufficiently small ε0 such that the ε0 -tubular neighborhood of the triangle
with vertices
does not contain integer points distinct from the points of the triangle. From Corol-
lary 21.56 it follows that for a sufficiently large t the set W (t) is contained in
the ε0 -tubular neighborhood of the triangle. This implies the statement of Propo-
sition 21.54.
Now let us study the remaining case of the rays of matrices with asymptotic direc-
tion (a11 , a21 ). We recall that Ω0 = 0, 1|0, 0, 1.
m,n
Proposition 21.57 Consider an NRS-ray R2,Ω,v . Then there exists a constant
C > 0 such that for every t > C all orbit-vertices of one of the fundamental domains
m,n
for the Klein–Voronoi continued fraction of the matrix R2,Ω,v (t) are contained in
the set of all integer orbits corresponding to the integer points in the convex hull of
(1, 0, 0), (−1, 0, 0), and (a11 , a21 , 0).
Proof First, notice that the Klein–Voronoi continued fractions for the operators A
and A−1 coincide.
Second
where
⎛ ⎞
0 −1 −n − t
X = ⎝−1 0 −m ⎠ .
0 0 −1
Thus, in the new coordinates we obtain the equivalent statement for the ray
−n,−m
R1,Ω 0 ,v0
(t). Now Lemma 21.58 follows directly from Lemma 21.55.
m,n
Corollary 21.59 Let Ω = a11 , a21 |a12 , a22 , a32 and let R2,Ω,v be an NRS-ray.
Then for every ε > 0 there exists C > 0 such that for every t > C the convex hull of
the union of two orbit-vertices
THΩv (m+a11 t,n+a21 t) (1, 0, 0) and THΩv (m+a11 t,n+a21 t) (a11 , a21 , 0)
is contained in the ε-tubular neighborhood of the triangle with vertices (1, 0, 0),
(−1, 0, 0), and (a11 , a21 , 0).
Remark We omit the proofs of Corollary 21.59 and Proposition 21.57, since they
repeat the proofs of Corollary 21.56 and Proposition 21.54.
This triangle contains only finitely many integer points, all of which have the last
coordinate equal to zero. The value of the MD-characteristic for the points of type
(x, y, 0) equals
(a21 x − a11 y)a32
2 2
y t + C,
where the constant C does not depend on t, but only on x, y, and the elements ai,j
of the matrix HΩv (m, n).
So, for any point (x, y, 0), the MD-characteristics linearly increase with re-
spect to the parameter t. The only exceptions are the points of type λ(1, 0, 0) and
μ(a11 , a21 , 0) (for integers λ and μ). The values of the MD-characteristics are con-
stant in these points with respect to t.
Since there are finitely many integer points in the triangle (1, 0, 0), (a11 , a21 , 0),
and (−a11 , −a21 , 0), for sufficiently large t the MD-characteristic at points of the
m,n
triangle attains its minimum at (1, 0, 0) or at (a11 , a21 , 0). Since R1,Ω,v (t) takes
the point (1, 0, 0) to the point (a11 , a21 , 0), the values of the MD-characteristics at
(1, 0, 0) and at (a11 , a21 , 0) coincide.
Therefore, for sufficiently large t the matrix
(1,0,0)
Ha11 ,a21 |a12 ,a22 ,a32 (m − t, n)
is always ς -reduced and there are no other ς -reduced matrices integer conjugate to
the given one. This implies both statements of Theorem 21.48 for every NRS-ray
with asymptotic direction (−1, 0).
Step 4.2. The case of NRS-rays with asymptotic direction (a11 , a21 ). This case is
similar to the case of NRS-rays with asymptotic direction (−1, 0), so we omit the
proof here.
Proof of Theorem 21.48 is complete.
Ω
0, 1|0, 0, 1 0, 1|1, 0, 2 0, 1|1, 1, 2 0, 1|1, 0, 3 0, 1|1, 1, 3 0, 1|1, 2, 3
ς(Ω) 1 2 2 3 3 3
#(Ω) 0 12 12 6 10 10
Ω
0, 1|2, 0, 3 0, 1|2, 1, 3 0, 1|2, 2, 3 1, 2|0, 0, 1 0, 1|1, 0, 4 0, 1|1, 1, 4
ς(Ω) 3 3 3 4 4 4
#(Ω) 14 10 10 94 6 8
Ω
0, 1|1, 2, 4 0, 1|1, 3, 4 0, 1|3, 0, 4 0, 1|3, 1, 4 0, 1|3, 2, 4 0, 1|3, 3, 4
ς(Ω) 4 4 4 4 4 4
#(Ω) 10 8 10 12 8 8
RS-Matrices Now we say a few words about real spectrum matrices (i.e., about
SL(3, Z)-matrices with three distinct real roots). Mostly we consider the family
H (0, 1|1, 0, 2); the situation with the other Hessenberg types is similar.
Recall that
⎛ ⎞
0 1 n+1
H0,1|1,0,2 (m, n) = ⎝1 0 m ⎠.
(1,0,1)
0 2 2n + 1
This matrix is of Hessenberg type 0, 1|1, 0, 2, and its Hessenberg complexity
(1,0,1)
equals 2. Hence H0,1|1,0,2 (m, n) is ς -reduced if and only if it is not integer conju-
gate to some matrix of unit Hessenberg complexity. All such matrices are of Hes-
senberg type 0, 1|0, 0, 1.
(1,0,1)
In Fig. 21.8 we show all matrices H0,1|1,0,2 (m, n) within the square
−20 ≤ m, n ≤ 20.
The square in the intersection of the mth column with the nth row corresponds to
(1,0,1)
the matrix H0,1|1,0,2 (m, n). It is colored in black if the characteristic polynomial
has rational roots. The square is colored in gray if the characteristic polynomial is
irreducible and there exists an integer vector (x, y, z) with the coordinates satisfying
−1000 ≤ x, y, z ≤ 1000,
such that the MD-characteristic of H0,1|1,0,2 (1, 0, 1)(m, n) equals 1 at (x, y, z).
All the rest of the squares are white.
344 21 Gauss Reduction in Higher Dimensions
−10 ≤ m, n ≤ 10,
and found that all of them are ς -reduced (such squares are contained inside the big
black square shown on the figure).
We show a boundary broken line between the NRS- and RS-squares in gray.
Remark In Fig. 21.8 the NRS-domain is easily visualized. It almost completely con-
sists of white squares. On the other hand, the RS-domain contains a relatively large
number of black squares. This indicates a significant difference between the RS and
NRS cases.
21.8 Exercises 345
Remark On the one hand, Proposition 21.60 implies the existence of rays en-
tirely consisting of ς -reduced matrices. On the other hand, in contrast to the NRS-
matrices, there are some rays consisting entirely of ς -nonreduced RS-matrices. For
instance, all matrices corresponding to the integer points of the lines
(1) m = n;
(2) m = n + 2;
(3) m = −n;
(4) m = −n − 2;
(5) n = 3m − 4;
(6) m = 3n + 6
are not ς -reduced (we do note state that the list of such lines is complete).
So Theorem 21.48 does not have a direct generalization to the RS-case and we
end up with the following problem.
It is likely that almost all Hessenberg matrices are ς -reduced (except for some
measure-zero subset).
21.8 Exercises
Exercise 21.1 Let π be an integer k-dimensional plane in Rn . Describe the locus
of all integer points a unit integer distance to π .
Exercise 21.3 Let A ∈ GL(n, R) preserve the level sets of the form x 2 + y 2 . Prove
that there exists α such that either
(1,0,1)
Exercise 21.7 Prove that all matrices H0,1|1,0,2 (m, n) corresponding to integer
points of the lines
(1) m = n;
(2) m = n + 2;
(3) m = −n;
(4) m = −n − 2;
(5) n = 3m − 4;
(6) m = 3n + 6
are ς -reduced. Are there some other similar lines in the plane (m, n)?
For an arbitrary matrix A of GL(n, R), we denote by CGL(n,R) (A) its centralizer,
i.e., the set of all the elements of GL(n, R) commuting with A.
We say that a matrix is regular if all its eigenvalues are distinct (but not nec-
essarily real). In this chapter we discuss only the most studied case of centralizers
defined by regular matrices. Notice that if a matrix A is regular, then the centralizer
of CGL(n,R) (A) is commutative.
Definition 22.1 Let A be a regular matrix in GL(n, R). We call the centralizer
CGL(n,R) (A) the maximal commutative subgroup of GL(n, R) (or MCRS-group for
short).
Remark 22.2 Notice that if A is a regular matrix in GL(n, Z), then the Dirichlet
group Ξ (A) is a subgroup of the centralizer CGL(n,R) (A). We would like just to
mention that Dirichlet groups usually play an important role in the study of best
approximations of the corresponding MCRS-groups.
Remark 22.3 The space Simpln is not compact for n > 1. It admits a transitive right
action of GL(n, R) and possesses an invariant measure μn . This measure is usually
called the Möbius measure, and is similar to the Möbius measure for the space of
all multidimensional continued fractions discussed in Chap. 19.
22.1 Rational Approximations of MCRS-Groups 349
Remark 22.5 Notice that Markov–Davenport forms are closely related to MD-
characteristics studied in the previous section. Namely, if A ∈ A is an operator with
distinct eigenvalues, then the Markov–Davenport form is proportional to the MD-
characteristic ΔA .
1 1
.
1 0
The Fibonacci operator has two eigenlines,
y = −θ x and y = θ −1 x,
√
1+ 5
where θ is the golden ration 2 . So the Markov–Davenport form of the Fibonacci
operator is
(y + θ x)(y − θ −1 x) 1
−1
= √ −x 2 + xy + y 2 .
θ −θ 5
Proof All regular operators of A have the same set of eigenlines. Hence the forms
Li are uniquely defined by the MCRS-group up to a multiplication by a scalar and
permutations. Every scalar multiplication is normalized by the determinant in the
denominator, while a sign of a permutation is not caught by the denominator.
350 22 Approximation of Maximal Commutative Subgroups
Proposition 22.8 All the coefficients of the Markov–Davenport form are either si-
multaneously real, if there is an even number of minimal invariant planes of the
corresponding MCRS-group, or simultaneously complex.
Notice that in the case of C1 there are four primitive vectors (or units): they are
±1 and ±I .
For a complex vector v = (a1 + I b1 , . . . , an + I bn ), denote by |v| the norm
5 6
max ai2 + bi2 .
i=1,...,n
It is clear that the minimum of the norm | ∗ | among the Gaussian vectors in a Gauss-
ian one-dimensional subspace is attained at a primitive Gaussian vector.
Now we are ready to define rational MCRS-groups.
Take two maximal absolute values of the coefficients of these forms (separately).
The minimum of these two maximal values is considered the distance between A1
and A2 , which we call the discrepancy and denote by ρ(A1 , A2 ).
0 −1
with eigenvectors (I, 1) and (−I, 1),
1 0
1 1
with eigenvectors (1, 2) and (1, −2).
4 1
These operators have distinct eigenvalues, and therefore, they define MCRS-groups
(denote them by A1 and A2 respectively). Both MCRS-groups are rational, where
the operators of the first one have real eigenvalues, and where the operators of the
second have pairs of complex conjugate eigenvalues. Direct calculations show that
Remark 22.14 There is another important class of MCRS-groups that would also be
interesting to consider in this context. An MCRS-group is said to be algebraic if it
contains regular operators of GL(n, Z). It would be interesting to develop approxi-
mation techniques of MCRS-groups by algebraic MCRS-groups.
p
ν A = q.
q
Let us calculate the discrepancy between two MCRS-groups A[α1 ] and A[α2 ]
for arbitrary positive real numbers α1 and α2 ,
x(y − α1 x) x(y − α2 x)
ΦA[α1 ] − ΦA[α2 ] = − = (α2 − α1 )x 2 ,
1 1
x(y − α1 x) x(y − α2 x)
ΦA[α1 ] + ΦA[α2 ] = + = 2xy − (α2 + α1 )x 2 .
1 1
Since α1 > 0 and α2 > 0, we have
ρ A[α1 ], A[α2 ] = |α1 − α2 |.
Notice that
ν A[a, b, c] = max(a, b, c)
and
b2 + c2 3 b 2 c 2 1 2 1 2
ΦA[a,b,c] (x, y, z) = I − x + x y + x z − xy − xz .
2a 2 a a 2 2
b b c c b2 + c2 b 2 + c 2
= min max − , − , − ,
a a a a 2a 2 2a 2
b b c c b2 + c2 b 2 + c 2
max + , + , + .
a a a a 2a 2 2a 2
In the case of positive a, b, c, a , b , and c , we simply have
b b c c b2 + c2 b 2 + c 2
ρ A[a, b, c], A a , b , c = max − , − , − .
a a a a 2a 2 2a 2
354 22 Approximation of Maximal Commutative Subgroups
Remark 22.15 For the case of classical simultaneous approximation one takes a
slightly different distance between the vectors:
b b c c
ρ A[a, b, c], A a , b , c = max − , − .
a a a a
So the described approximation model is not exactly the classical model of simulta-
neous approximation.
This operator can be thought of as the simplest nonreal spectrum operator from a
geometric point of view. Denote the real eigenvalue of the operator B by ξ1 and its
complex conjugate eigenvalues by ξ2 and ξ3 . Notice that
Let us approximate the eigenline corresponding to ξ1 . Let vξ1 be the vector in this
eigenline having the first coordinate equal to 1. In this setting we have
The set of best approximations with size not exceeding 106 contains 48 elements.
All these elements are of type B ni (1, 0, 0), where (ni ) is the following sequence:
We conjecture that the set of best approximations coincides with the set of points
B k (1, 0, 0), where k = 4 or k ≥ 6; the approximation rate in this case is CN −3/2 .
The structure of the set of best approximations is usually closely related to the cor-
responding Dirichlet group. In this case the Dirichlet group Ξ (B) is homeomorphic
to Z ⊕ Z/2Z; the generators are B and −Id.
22.2 Simultaneous Approximation in R3 and MCRS-Group Approximation 355
Consider now the algebraic real spectrum operator of the two-dimensional golden
ratio
⎛ ⎞
3 2 1
G = ⎝2 2 1⎠ .
1 1 1
As we have already seen in Chap. 18, this operator has the simplest continued frac-
tion in the sense of Klein. The Dirichlet group Ξ (G) is generated by the following
two operators:
⎛ ⎞ ⎛ ⎞
1 1 1 0 1 0
H1 = ⎝1 1 0⎠ and H2 = ⎝1 −1 1 ⎠ .
1 0 0 0 1 −1
Notice that G = H12 and H2 = (H1 − Id)−1 . The operator H1 here is the three-
dimensional Fibonacci operator. Denote the eigenvalues of H1 by ξ1 , ξ2 , and ξ3
such that
|ξ1 | > |ξ2 | > |ξ3 |.
Let us approximate the eigenline corresponding to ξ1 . Denote by vξ1 the vector
of this eigenline having the last coordinate equal to 1. Notice that
The set of best approximations of size not exceeding 106 contains 41 elements;
40 of them (except for the third best approximation) are contained in the set
m n
H1 H2 (1, 0, 0) | m, n ∈ Z .
In the following table we show the consecutive pairs of powers (mi , ni ) for the these
41 best approximations except for the third one. So in the column i we get m = mi ,
n = ni for the approximation H1mi H2ni (1, 0, 0).
i
1 2 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22
mi 1 2 3 3 4 4 5 5 6 6 6 7 7 8 8 9 9 10 10 11 11
ni 1 1 2 1 2 1 3 2 3 2 1 3 2 3 2 4 3 4 3 5 4
i
23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41
mi 11 12 12 13 13 14 14 15 15 15 16 16 17 17 18 18 19 19 19
ni 3 4 3 5 4 5 4 6 5 4 5 4 6 5 6 5 7 6 5
356 22 Approximation of Maximal Commutative Subgroups
The third best approximation is A[3, 2, 1]; it is not of the form H1m H2n (1, 0, 0).
We conjecture that the set of all best approximations except A[3, 2, 1] is contained
in the set of all points of type H1m H2n (1, 0, 0), with the approximation rate in this
case being CN −3/2 .
We conclude this chapter with the following general remark.
Remark 22.16 In the above two examples we have several algebraic artifacts (such
as missing B 5 (1, 0, 0) as a best approximation for the first example and an addi-
tional best approximation A[3, 2, 1] for the second example). This is not a surprise,
since we do not approximate the triples of eigenvectors simultaneously but a certain
eigenvector together with two vectors
(0, 1, I ), (0, 1, −I ).
22.3 Exercises
Exercise 22.1
(a) Prove that if an operator A is regular, then the centralizer of A ∈ GL(n, R) is
commutative.
(b) Describe all matrices whose centralizers are commutative.
Exercise 22.3 Let G be a commutative subgroup of GL(n, R). Suppose that for
every matrix A ∈ GL(n, R) \ G there exists a matrix à ∈ G such that A and à do
not commute. Is it true that there exists a matrix B ∈ G such that G = CGL(n,R) (B)?
Exercise 22.4 Prove that the space Simpl2 is homeomorphic to the Möbius band.
Exercise 22.5 Find the first 10 best approximations A[a, b, c] for the minimal and
the middle eigenvectors of the three-dimensional golden ratio G.
The main idea of the Minkowski approach is to define extremal nodes of a lattice
Γ with respect to a family of convex sets that can be chosen with certain freedom
(for instance, vertices of sails considered in previous chapters of this book can be
considered tetrahedral local minima). In this section we describe the classical ap-
proach that works with coordinate parallelepipeds.
For simplicity, in this section we consider only sets in R3 in general position (in
the sense that no two vertices of the lattice are in a plane parallel to some coordinate
plane).
Definition 23.2 Let S be an arbitrary subset of Rn≥0 and let Ŝ be the set of all
its Voronoi relative minima. A finite subset T ⊂ Ŝ is called minimal if the paral-
lelepiped Π(T ) contains no points of Ŝ \ T except for the origin. We denote the set
of all minimal k-element subsets of Ŝ ⊂ S by Mk (S).
The Voronoi graph is the graph whose vertices and edges are respectively vertices
and edges of the Minkowski–Voronoi complex.
The Minkowski graph is the graph whose vertices are edges are respectively faces
and edges of the Minkowski–Voronoi complex (two vertices in the Minkowski graph
are connected by an edge if and only if the corresponding faces in the Minkowski–
Voronoi complex have a common edge).
So in some sense Minkowski and Voronoi graphs are dual to each other.
There are only five Voronoi relative minima for the set S0 , namely the vectors
s1 , . . . , s5 . The Minkowski–Voronoi complex contains 5 vertices, 6 edges, and 5
faces. Its vertices are
Definition 23.6 Let S be an arbitrary subset of R3≥0 . The Minkowski polyhedron for
S is the boundary of the set
S ⊕ R3≥0 = s + r | s ∈ S, r ∈ R3≥0 .
In other words, the Minkowski polyhedron is the boundary of the union of copies
of the positive octant shifted by vertices of the set S.
Proposition 23.8 The Minkowski–Voronoi tessellation for a nice set S has the com-
binatorial structure of the Minkowski–Voronoi complex MV(S).
Remark 23.9 Here we do not specify what the word “nice” means. We say only that
it includes finite sets and complete lattices considered below.
Example 23.10 Consider the set S0 as in Example 23.5. In Fig. 23.1 we show the
Minkowski polyhedron (on the left) and the corresponding Minkowski–Voronoi tes-
sellation (on the right). The local minima of the function x + y + z on the Minkowski
polyhedron for S0 are the relative minima f1 , . . . , f5 . They identify the faces of the
complex MV(S0 ). The local maxima of the function x + y + z on the Minkowski
polyhedron for S0 are v1 , . . . , v5 , corresponding to vertices of the complex MV(S0 ).
The vertices v1 , . . . , v5 are as follows:
Fig. 23.1 The Minkowski polyhedron (on the left) and the corresponding Minkowski–Voronoi
tessellation (on the right)
Γ = {m1 v1 + m2 v2 + m3 v3 : m1 , m2 , m3 ∈ Z},
Definition 23.11 We say that an edge {s1 , s2 } of the complex MV(|Γ |) with vertices
{s1 , s2 , s3 } and {s1 , s2 , s4 } is nondegenerate if at least one of the triples (s1 , s2 , s3 )
and (s1 , s2 , s4 ) generates the lattice Γ .
Suppose that {s1 , s2 , s3 } generates the lattice. Then we equip the edge {s1 , s2 }
with a matrix that sends s1 and s2 to themselves and takes s3 to s4 .
So if both triples (s1 , s2 , s3 ) and (s1 , s2 , s4 ) generate the lattice Γ , then the cor-
responding edge is equipped with a pair of matrices inverse to each other.
Finally, we have all the tools that are needed to give a definition of the
Minkowski–Voronoi continued fraction.
We refer to [139] and [75, 76] for more details related to general minimal systems
and to the papers [71] and [201] concerning the arbitrary case.
Let us give a small remark on the periodic algebraic case.
362 23 Other Generalizations of Continued Fractions
From Dirichlet’s unit theorem it follows that the multiplicative subgroup of di-
agonal matrices with positive diagonal elements preserving the lattice is isomorphic
to Z2 . The diagonal matrices of this group preserve both the Minkowski–Voronoi
complex and all translation matrices for the corresponding equipped edges. There-
fore, the Minkowski–Voronoi continued fraction in this case is doubly periodic.
Remark 23.14 Let us say a few words about the two-dimensional case. Consider a
complete lattice Γ ⊂ R2 . The Minkowski–Voronoi complex for Γ is a broken line.
Each edge ei is equipped with a matrix
0 ±1
.
1 ai
Consider now sails of lattices in each of the four coordinate octants for the lattice Γ
(i.e., we consider the boundaries of the convex hulls for points of Γ but not of Z3 ).
Then the integers ai correspond to the integer lengths of these sails.
Remark 23.13 above). So for the infinite case, we propose a simpler version of Prob-
lem 32.
3 (for k ∈ Z).
– All finite edges are straight segments of one of the directions kπ
– Each vertex that does not contain infinite edges is a vertex of the following 8
types:
Example 23.16 Consider the lattice Γ171 generated by the following three vectors:
Notice that the first integer point on the z-axis is (0, 0, 171). The diagram of
the Minkowski–Voronoi tessellation for |Γ171 | satisfying the conditions of Theo-
rem 23.15 is shown in Fig. 23.2.
364 23 Other Generalizations of Continued Fractions
Definition 23.17 Let pq11 and pq22 be two rational numbers. We assume that p1 is
relatively prime to q1 and p2 is relatively prime to q2 . The rational number
p1 p 2 p 1 + p 2
⊕ =
q1 q2 q1 + q2
p1 p2
is called the Farey sum of q1 and q2 . (Note that for every integer m we consider the
fraction m1 .)
There is a nice way to construct the set of rational numbers using the operation
of Farey addition. We do it inductively in countably many steps.
In other words, we put between every two numbers of the sequence (ai,k ) their
Farey sum.
Example 23.18 The sequences for the first three steps are as follows:
Remark 23.19 The part of the sequence (ai,k ) contained between the elements 0
and 1 is sometimes called the kth Farey sequence.
23.2 Farey Addition, Farey Tessellation, Triangle Sequences 365
Remark 23.20 The denominators appearing in the Farey sequences form Stern’s
diatomic sequence, which is defined as follows:
a1 = 1;
a2i = ai , for i ∈ Z+ ;
a2i+1 = ai + ai+1 , for i ∈ Z+ .
1, 1, 2, 1, 3, 2, 3, 1, 4, 3, 5, 2, 5, 3, 4, 1, . . .
There is a simple way to construct this tessellation. We start with the ideal tri-
angle with vertices 0, 1, and ∞. Reflecting this triangle with respect to the axes
l(0, 1), l(0, ∞), and l(1, ∞), we get three new triangles with vertex sets {1, 2, ∞},
{−1, 0, ∞}, and {0, 12 , 1} respectively. Add all of them to the tessellation. Continue
iteratively to reflect the obtained picture with respect to the edges of new triangles.
366 23 Other Generalizations of Continued Fractions
In countably many steps we construct all ideal triangles of the tessellation. We show
the Farey tessellation in Fig. 23.3.
Proposition 23.22
(i) The set of vertices of all ideal triangles in the Farey tessellation coincides with
the subset of the points on the absolute whose coordinates are rational or ∞.
(ii) An ideal triangle with vertices p < q < r is in the Farey tessellation if and
only if there exist i and k such that p = ai,k , r = ai+1,k , and q = p ⊕ r (i.e.,
q = a2i+1,k+1 ).
The continued fractions for the vertices of ideal triangles of the Farey tessellation
has the following surprising regularity.
Proposition 23.23 For every ideal triangle T in the Farey tessellation there exists
a sequence of integers a0 , a1 , . . . , an+1 such that the vertices of T have the coordi-
nates p, q, and p ⊕ q, where
It turns out that the continued fraction of a real number itself can be interpreted in
terms of the Farey tessellation. Let us explain this.
It is clear that the Farey tessellation is invariant with respect to the shift on the
vector (1, 0), so for simplicity we restrict ourselves to the case of real numbers α
satisfying 0 ≤ α < 1.
In the upper half-plane model one can imagine the Farey tessellation as a pyra-
mid. We say one more time that we consider now only the part of the tessellation
contained in the band 0 ≤ x < 1. On the top of the pyramid there is the ideal triangle
with vertices 0, 1, and ∞. Suppose that we are at this triangle. It is permitted either
23.2 Farey Addition, Farey Tessellation, Triangle Sequences 367
Fig. 23.4 Descent in the Farey pyramid. The descent sequence is LLRR, and hence the exit point
is 37 = [0; 2 : 3]
to exit the pyramid at vertex 1 or to descend to the adjacent triangle with vertices 0,
1
2 , and 1. To fix notation, we say that we then descend left.
Suppose now that we are at some ideal triangle of the tessellation with vertices
x1 < x2 < x3 . Then it is permitted either to exit the pyramid at vertex x2 or to
descend to one of the neighboring triangles in the tessellation: either to descend left
to the triangle with vertices x1 < x4 < x2 or to descend right to the triangle with
vertices x2 < x5 < x3 for the appropriate point x4 or x5 respectively (see Fig. 23.4).
In a finite or infinite number of steps we descend to some point α of the absolute.
At each step of the descent we go either to the right or to the left, so it is natural to
define the descent sequence whose elements are letters L (if we descend to the left)
and R (if we descend to the right). If the sequence is nonempty, the first is always L.
Let us denote by a1 the number of letters L standing before the first letter R.
Denote by a2 the number of letters R standing before the next letter L, etc.
a1 = 3; a2 = 2; a3 = 4; a4 = 2; a5 = 2.
Theorem 23.25 In the above notation the traveler will either exit the pyramid at
the rational number
[0; a1 : a2 : · · · : an−1 : an + 1]
368 23 Other Generalizations of Continued Fractions
[0; a1 : a2 : · · · ]
on the absolute.
In this regard we would like to mention the works [143–147] by V.V. Nikulin on
discrete reflection groups in hyperbolic spaces.
for k ∈ Z+ . Now let us define the map T that maps each triangle n to the trian-
gle . For every point (x, y) of the triangle n , define
y 1 − x − ky
T (x, y) = , .
x x
Definition 23.27 Consider an arbitrary point (x, y) of the triangle . Let T k (x, y) ∈
n . Then we put ak = n, for k = 1, 2, . . . . The sequence
(a0 , a1 , a2 , . . .)
Remark 23.28 In [138] the authors proved that the triangle map T : → is er-
godic with respect to the Lebesgue measure.
p1 pn r1 rn
T= ,..., and S = ,..., .
q1 qn s1 sn
p1 + r1 pn + rn
T ⊕S = ,..., .
q1 + s1 qn + sn
p2 p 3 pn
T̂ = , ,..., .
p1 p1 q1
370 23 Other Generalizations of Continued Fractions
The map T defines a natural nested partition of the triangle into smaller triangles.
This partition contains triangles of the following type.
Our next goal is to write an explicit formula for such triangles. We begin a brief
geometric description of triangle sequences. One can consider the triangle sequence
for (α, β) as a method to construct a sequence of integer vectors (Ck ) whose el-
ements are “almost orthogonal” to the vector (1, α, β), i.e., the dot product of the
vectors in the sequence with the point (1, α, β) in R3 is small. Set
⎛ ⎞ ⎛ ⎞ ⎛ ⎞
1 0 0
C−3 = ⎝0⎠ , C−2 = ⎝1⎠ , C−1 = ⎝0⎠ .
0 0 1
Suppose that the triangle sequence for (α, β) is (ai ). Then we set by definition
Define also
Xk = Ck × Ck+1 ,
where v × w denotes the cross product of two vectors v and w in R3 .
Theorem 23.31 The vertices of the triangle (a0 , . . . , an ) are X̂n−1 , X̂n , and
Xn +̂ Xn−2 .
A triangle sequence does not always converge to a unique number. However, it does
so under certain conditions.
23.2 Farey Addition, Farey Tessellation, Triangle Sequences 371
d(X̂n−1 , X̂n+1 )
λn = ,
d(X̂n−1 , Xn +̂ Xn−2 )
Theorem 23.32 Assume that (an ) is a triangle sequence of nonzero elements. Then
the triangle sequence describes a unique pair (α, β) when
∞
&
(1 − λn ) = 0.
n=N
Theorem 23.33 Consider a sequence (an ). Suppose that there exists N such that
an > 0 for any n > N and
∞
&
(1 − λn ) > 0.
n=N
Then the triangle sequence (an ) does not correspond to a unique point.
We refer to [12] for the proof of the above two theorems. Weak convergence is
discussed in [138].
One of the most challenging properties that one thinks of while generalizing con-
tinued fractions to the multidimensional case is algebraic periodicity. If a triangle
sequence is periodic, then the corresponding pair of real numbers correspond to cer-
tain solutions of a cubic equation with integer coefficients. Although the converse
statement is not true, there is the following nice result announced in [44], which
seems to improve the situation.
For simplicity we embed the triangle in R3 , simply by adding the first coordi-
˜ The vertices of
nate 1 and denoting the resulting triangle by . ˜ are
⎛ ⎞ ⎛ ⎞ ⎛ ⎞
1 1 1
v1 = ⎝0⎠ , v2 = ⎝1⎠ , v3 = ⎝1⎠ .
0 0 1
372 23 Other Generalizations of Continued Fractions
Definition 23.35 Let us define the map Tσ,τ0 ,τ1 for an arbitrary triple of column per-
mutations σ, τ0 , τ1 . Consider a pair (x, y) ∈ . Suppose that (x, y) ∈ k (σ, τ0 , τ1 ).
Then we define
−1 t
(a, b, c) = (1, x, y) · BF0,σ,τ F −k B −1 .
0 1,σ,τ1
b c
Tσ,τ0 ,τ1 (x, y) = , .
a a
Remark 23.36 We have actually defined 216 different maps corresponding to dif-
−1
ferent triples of permutations. Notice that the matrix (BF0,σ,τ F −k B −1 )t is in
0 1,σ,τ1
SL(3, Z). The triangle partition maps Tσ,τ0 ,τ1 are called TRIP maps.
Conjecture 36 (T. Garrity) For each positive integer d, there is a finite num-
ber of multidimensional continued fraction algorithms, so that for every d-tuple
(ξ1 , . . . , ξd ) with Q(ξ1 , . . . , ξd ) a degree-d algebraic number field, there is a multi-
dimensional continued fraction algorithm in the family spanned by the initial algo-
rithms such that it generates a periodic output.
Remark 23.39 Farey fractions give rise to a thermodynamic approach to real num-
bers (for more information we refer to [56, 110, 111], and [61]). Triangle sequences
are used to generalize this approach to the case of pairs of real numbers [62].
Remark 23.40 There are several other generalizations in the spirit of triangle se-
quences. For instance, the classical Minkowski ?(x) function (see [75, 76, 140, 197,
203], etc.) was generalized to the two-dimensional case by O.R. Beaver and T. Gar-
rity in [17] (see also [158]).
Example 23.43 Let us consider two bricks R(6, 11) and R(22, 3). We show their
decompositions in Fig. 23.5 simultaneously. Each of the bricks is subdivided into
six smaller rectangles labeled by Roman numerals. The rectangle with label n is
shifted to the rectangle with label n . For instance, the gray rectangle labeled by V
is taken to the gray rectangle labeled by V . Here all the translation vectors are in
the linear space x + 2y = 0.
In the next two subsections we answer the following two natural questions:
(a) Which coordinate bricks are (asymptotically) Π -congruent?
23.3 Coordinate Rectangular Bricks and O’Hara’s Algorithm 375
(b) How does one construct the decompositions and translations showing the
(asymptotic) Π -congruence of such boxes?
Theorem 23.45
(i) Two coordinate bricks of the same dimension are asymptotically Π -congruent
if and only if they have the same volume.
(ii) Two relatively rational rectangular coordinate boxes are Π -congruent if and
only if they have the same volume.
In addition we have the following simple formula to calculate the equation of the
plane π .
The answer to the second question (on construction of decompositions and transla-
tions) is given by O’Hara’s algorithm, which comes from the theory of general par-
titions (we refer the interested reader to [3, 151, 152, 155, 156]). In this subsection
we consider only the geometric representation of the algorithm related to continued
fractions. In some sense the idea of this algorithm is an algorithm-definition, giv-
ing a constructive description of all the translations Ti : for an arbitrary point of the
first coordinate brick the algorithm constructs the image of this point in the second
coordinate brick. The resulting decomposition is a decomposition into boxes, all of
whose edges are parallel to coordinate vectors.
376 23 Other Generalizations of Continued Fractions
Remark 23.47 It turns out that the result w does not depend on the choice of the
coordinate j in any recursive step, but entirely on the input data v (see [151] and
[152] for more details).
Definition 23.48 Let P and Q be two coordinate bricks of the same volume in Rn .
We define a bijection ϕP ,Q : P → Q at every v ∈ P as follows:
ϕP ,Q (v) = w,
Definition 23.50 Let P and Q be two coordinate bricks of the same volume in
Rn . Consider a natural decomposition of the coordinate brick P into regions on
which the function ϕP ,Q is linear. Define a decomposition of Q as the image of the
decomposition of P via the map ϕP ,Q . We say that these decompositions of P and
Q are associated to the bijection ϕP ,Q .
23.3 Coordinate Rectangular Bricks and O’Hara’s Algorithm 377
Here we notice again that in the case of relatively rational coordinate bricks P
and Q, one has π -congruence instead of asymptotic π -congruence.
Remark 23.52 In the special case of rectangular bricks R(a, b) and R(b, a) with
relatively prime positive integers a and b and the space π = {x + y = 0}, O’Hara’s
algorithm is a version of the Euclidean algorithm. Geometrically the elements of the
continued fraction for b/a correspond to the numbers of equivalent squares in the
corresponding layers of the decomposition of R(a, b).
Example 23.53 We illustrate this with the example of a = 9 and b = 25. The as-
sociated decompositions of R(9, 25) and of R(25, 9) consist of two large squares,
one average square, three small squares, and two very small squares. This can be
read from the regular continued fraction: 25 9 = [2; 1 : 3 : 2]. In Fig. 23.6 we show
the layers of equivalent squares as bold rectangles.
Remark 23.54 Notice that after a coordinate rescaling the decomposition remains
combinatorially the same. Therefore, we can always choose the coordinate scale to
work with R(1, b) and R(b, 1), where π = {x + y = 0} for some b > 0.
378 23 Other Generalizations of Continued Fractions
From the above two remarks it follows that the notion of partitions in the two-
dimensional case correlates with the notion of regular continued fractions. So we
conclude with the following natural problem.
Step 0. Put v0 = v.
Recursive Step k. We have constructed vk−1 . If vk ∈ Δ̂, then the algorithm termi-
nates. If vk ∈
/ Δ̂, the recursive step consists of three substeps:
Substep k.1: perform an operation of subtraction S;
Substep k.2: invert the result via the operation of inversion T ;
Substep k.3: write the (k − 1)th element of the continued fraction based on
Substep k.1 and Substep k.2.
We conclude the step by putting
vk = T ◦ S(vk−1 ).
Output. If the algorithm stops (say at step m + 1), then the last element vm is
considered a generalized greatest common divisor. The corresponding contin-
ued fraction is generated in the iterative steps; it is calculated by parameters of
subtractions S and inversions T .
First we write the classical Euclidean algorithm in the framework of the general
algorithmic scheme. We will then give several examples of generalizations.
x
S : (x, y) "→ x − y, y ;
y
T : (x, y) "→ (y, x);
Δ̂ = (x, 0) | x ∈ Z+ .
In Substep k.3 we remember the element ak−1 = pqkk , where vk−1 = (pk−1 , qk−1 )
is the vector defined in Step k − 1. The resulting continued fraction is
[a0 ; a1 : a2 : · · · ].
Example 23.56 (Jacobi–Perron algorithm ([85, 164, 165])) Historically, the first al-
gorithmic generalization of regular continued fractions is the Jacobi–Perron algo-
rithm (see also [176]). We put
Δ = (x0 , x1 , . . . , xn ) | x0 ≥ xi ≥ 0, 1 ≤ i ≤ n ;
x0 x2 xn
S : (x0 , x1 , . . . , xn ) "→ x0 − , x1 , x2 − , . . . , xn − ;
x1 x1 x1
380 23 Other Generalizations of Continued Fractions
where vk−1 = (x0,k−1 , x1,k−1 , . . . xn,k−1 ) is the vector in Rn+1 defined in Step k − 1.
The resulting generalized continued fraction here is the sequence of n-dimensional
vectors
(a0 , a1 , a2 , . . .).
Remark 23.57 If the Jacobi–Perron algorithm terminates, one actually can remove
the zero coordinate x1 = 0 and continue the algorithm in the real space whose di-
mension is one less (finally we end up at n = 1).
We remember the permutation that was used in Substep k.2. Indeed, we always ex-
change the first element with some element ak−1 ∈ {0, 1, . . . , n}. So we can choose
a sequence of integers
(a0 , a1 , a2 , . . .)
as the generalized p-subtractive continued fraction.
Remark 23.59 The first subtractive algorithm was introduced in [26] by V. Brun,
where he considered the case p = 1. Later, E.S. Selmer in [183, 184] studied the
subtractive algorithm for the case p = n.
Example 23.60 (Fully subtractive algorithm ([178, 179])) In the fully subtractive
algorithm we also fix p and consider
Δ = (x0 , x1 , . . . , xn ) | x0 ≥ x1 ≥ · · · ≥ xn ≥ 0 ;
S : (x0 , x1 , . . . , xn ) "→ (x0 − xp , x1 − xp , . . . , xp−1 − xp , xp , . . . , xn );
23.4 Algorithmic Generalized Continued Fractions 381
We remember the permutation that was used in Substep k.2, denoting it by ak−1 .
The generalized continued fraction is the sequence of permutations (a0 , a1 , a2 , . . .).
Problem 38 (Jacobi’s last theorem) Let K be a totally real cubic number field. Con-
sider arbitrary elements y and z of K satisfying 0 < y, z < 1 such that 1, y, and z
are independent over Q. Is it true that the Jacobi–Perron algorithm generates an
eventually periodic continued fraction with starting data v = (1, y, z)?
This problem is open not only for the Jacobi–Perron algorithm but also for other
similar algorithms. The converse statement is true.
Let us briefly explain how to generate convergents for a vector v with a generalized
continued fraction (a0 , a1 , . . .).
This is usually done by inverting the map T ◦ S. First of all, notice that the values
in the image of the map (T ◦ S)−1 are enumerated by the possible values of the
elements of the corresponding continued fractions, so we can write (T ◦ S)−1 a to
indicate which of the images should be taken.
One starts with some vector w0 . It is natural to choose w0 either (1, 0, . . . , 0) or
the generalized greatest common divisor (in case of existence). Then the vectors
w1 = (T ◦ S)−1
a0 (w0 ), w2 = (T ◦ S)−1
a1 (w1 ), w3 = (T ◦ S)−1
a2 (w2 ), ...
x1 xn
p = (x0 , . . . , xn ) "→ p̃ = ,..., .
x0 x0
Now arises the question of the quality of approximation of the vector ṽ by the vectors
w̃i . For further information on approximation aspects we refer to [180].
382 23 Other Generalizations of Continued Fractions
Equation (23.1) holds for any continued fraction whose sequence of convergents
converges to a real number. Here we use dots in matrices to denote the limit of the
ratios of determinants of the corresponding (n + 1) × (n + 1) matrix and n × n
matrix as n tends to infinity. We will also do the same in the above formula. In the
case of Eq. (23.1) we have an even stronger statement for n-convergents:
a11 a12 0 · · · 0 0
−1 a22 a23 · · · 0 0
0 −1 a33 · · · 0 0
.
· · · .. · ·
0 0 0 · · · an−1,n−1 an−1,n
a12 0 0 0 ··· −1 an,n
a11 + = .
a23 a22 a23 · · · 0 0
a22 + an−1,n−1
··· + −1 a33 · · · 0 0
an,n .
· · .. · ·
0 0 · · · an−1,n−1 an−1,n
0 0 ··· −1 an,n
23.5 Branching Continued Fractions 383
Definition 23.63 Consider a real number x and a sequence of real numbers ai . Sup-
pose that
−a1 −a2 −a3 −a4 · · ·
−1 −a1 −a2 −a3 · · ·
0 −1 −a1 −a2 · · ·
0 0 −1 −a1 · · ·
· · · · · ·
x= ,
−a1 −a2 −a3 · · ·
−1 −a1 −a2 · · ·
0 −1 −a1 · · ·
· · · · · ·
i.e., the limit from the right exists and equals x. Then the expression from the right
is called the Hessenberg continued fraction for x.
The most interesting case here occurs when a sequence (ai ) has only finitely
many nonzero entries.
Proposition 23.64 Let (ai ) be a sequence of real numbers with finitely many
nonzero elements, assuming that the element an is the last nonzero element. Suppose
that the Hessenberg continued fraction for the sequence (ai ) converges to some real
number x. Then x satisfies the equation
Proposition 23.65 Suppose that a Hessenberg continued fraction for a real number
x is defined by a sequence (a1 , a2 , a3 , 0, 0, . . .). Then we have
a2 − a3
a3
a2 −
a1 − · · ·
a1 − a2 − · · ·
a1 −
a1 − · · ·
x = −a1 + a3 .
a2 − a2 − · · ·
a1 −
a1 − · · ·
a1 − a3
a2 −
a1 − · · ·
a1 − a2 − · · ·
a1 −
a1 − · · ·
384 23 Other Generalizations of Continued Fractions
Definition 23.66 Consider a real number xk and a two-sided sequence of real num-
bers (ai ). Suppose that
−ak −ak+1 −ak+2 −ak+3 · · ·
−ak−1 −ak −ak+1 −ak+2 · · ·
−ak−2 −ak−1 −ak −ak+1 · · ·
−ak−3 −ak−2 −ak−1 −ak · · ·
· · · · . . .
xk =
−ak −ak+1 −ak+2 · · ·
−ak−1 −ak −ak+1 · · ·
−ak−2 −ak−1 −ak · · ·
−ak−3 −ak−2 −ak−1 · · ·
· · · . . .
−ak−1 −ak −ak+1 −ak+2 · · ·
−ak−2 −ak−1 −ak −ak+1 · · ·
−ak−3 −ak−2 −ak−1 −ak · · ·
−ak−4 −ak−3 −ak−2 −ak−1 · · ·
· · · · . . .
: ,
−ak−1 −ak −ak+1 · · ·
−ak−2 −ak−1 −ak · · ·
−ak−3 −ak−2 −ak−1 · · ·
· · · . . .
i.e., the limit from the right exists and equals xk . Then the expression from the right
is called the Nikiporetz continued fraction for x.
Proposition 23.67 Let (ai ) be a two-sided sequence of real numbers with finitely
many nonzero elements. We assume that all elements with negative indices are zeros,
a0 = 1, and the element an is the last nonzero element. Suppose that the Nikiporetz
continued fraction for the sequence (ai ) converges to some real number xk . Then xk
satisfies the equation
Definition 23.70 Consider a real number x and a collection of real numbers ai,j .
Suppose that
a11 a12 a13 a14 · · ·
−1 a22 a23 a24 · · ·
0 −1 a33 a34 · · ·
0 0 −1 a44 · · ·
· · · · · · ·
x= ,
a22 a23 a24 · · ·
−1 a33 a34 · · ·
0 −1 a44 · · ·
· · · · · ·
386 23 Other Generalizations of Continued Fractions
i.e., the limit from the right exists and equals x. Then the expression from the right
is called the quasideterminant for x.
We mention once more that the notion of quasideterminant in the case of non-
commutative elements ai,j is more complicated; see [65] and [66].
In this section we explain how continued fractions are used in topology for the
classification of rational knots and links. For additional information and references
we refer to [103] and [104].
Recall that a knot is a smooth embedding of a circle R/Z into R3 ; a link is a smooth
embedding of several circles into R3 .
Definition 23.74 We say that a 2-tangle is a rational tangle if there exists a smooth
deformation of the three-dimensional ball under which the 2-tangle evolves to the
trivial 2-tangle.
Let us now define the addition, multiplication by −1, and inversion operations
on 2-tangles.
The Sum Operation The sum of two tangles T1 and T2 is the tangle T1 + T2
defined as follows:
T 1 + T2 = .
Ti = .
As we will see later, in Theorem 23.76, the isotopy class of a tangle T ( pq ) does
not depend on the choice of the integer elements of the continued fractions repre-
senting the fraction pq .
The next theorem shows that the set of rational tangles is in a natural one-to-one
correspondence with the rational numbers. This theorem is a reformulation of the
Conway theorem from [36].
Theorem 23.76
(i) Two tangles T ([a0 ; a1 : a2 : · · · : an ]) and T ([b0 ; b1 : b2 : · · · : bm ]) for contin-
ued fractions with integer coefficients are isotopic if and only if
[a0 ; a1 : a2 : · · · : an ] = [b0 ; b1 : b2 : · · · : bm ].
p
(ii) For every rational tangle T there exists a rational number q such that T is
isotopic to T ( pq ).
To get a knot from a tangle one should use the following closing operation.
Each tangle diagram has two endpoints on the line y = 0. Connect them by some
curve that does not intersect the diagram. Do the same for the two endpoints on the
line y = 1. See an example of a tangle T ( 32 ) and the corresponding trefoil knot in
Fig. 23.8.
Theorem 23.77 (H. Schubert [175]) Let pq11 and pq22 be two rational numbers satis-
fying gcd(p1 , q1 ) = gcd(p2 , q2 ) = 1. Then the corresponding knots or links K( pq11 )
and K( pq22 ) are isotopic if and only if the following two conditions hold:
(a) p1 = p2 ;
(b) either q1 ≡ q2 (mod p1 ) or q1 q2 ≡ 1(mod p1 ).
23.6 Continued Fractions and Rational Knots and Links 389
Remark 23.78 The classification of rational knots arises in the topology of three-
dimensional manifolds. Namely, the 2-fold branch coverings spaces of S 3 along the
rational knots and links represent lens spaces; see the book [182] for more details.
1. F. Aicardi, Symmetries of quadratic form classes and of quadratic surd continued fractions. I.
A Poincaré tiling of the de Sitter world. Bull. Braz. Math. Soc. (N.S.) 40(3), 301–340 (2009)
2. F. Aicardi, Symmetries of quadratic form classes and of quadratic surd continued fractions.
II. Classification of the periods’ palindromes. Bull. Braz. Math. Soc. (N.S.) 41(1), 83–124
(2010)
3. G.E. Andrews, The Theory of Partitions. Cambridge Mathematical Library (Cambridge Uni-
versity Press, Cambridge, 1998). Reprint of the 1976 original
4. H. Appelgate, H. Onishi, The similarity problem for 3 × 3 integer matrices. Linear Algebra
Appl. 42, 159–174 (1982)
5. V. Arnold, Arithmetics of binary quadratic forms, symmetry of their continued fractions
and geometry of their de Sitter world. Bull. Braz. Math. Soc. (N.S.) 34(1), 1–42 (2003).
Dedicated to the 50th anniversary of IMPA
6. V.I. Arnold, Statistics of integral convex polygons. Funct. Anal. Appl. 14(2), 79–81 (1980).
Russian version: Funkc. Anal. Prilozh. 14(2), 1–3 (1980)
7. V.I. Arnold, A-graded algebras and continued fractions. Commun. Pure Appl. Math. 42(7),
993–1000 (1989)
8. V.I. Arnold, Ordinary Differential Equations. Springer Textbook (Springer, Berlin, 1992).
Translated from the third Russian edition by Roger Cooke
9. V.I. Arnold, Higher-dimensional continued fractions. Regul. Chaotic Dyn. 3(3), 10–17
(1998). J. Moser at 70 (in Russian)
10. V.I. Arnold, Preface, in Pseudoperiodic Topology. Am. Math. Soc. Transl. Ser. 2, vol. 197
(Am. Math. Soc., Providence, 1999), pp. ix–xii
11. V.I. Arnold, Continued Fractions (Moscow Center of Continuous Mathematical Education,
Moscow, 2002) (in Russian)
12. S. Assaf, L.-C. Chen, T. Cheslack-Postava, B. Cooper, A. Diesl, T. Garrity, M. Lepinski,
A. Schuyler, A dual approach to triangle sequences: a multidimensional continued fraction
algorithm. Integers 5(1), A8 (2005)
13. J. Barajas, O. Serra, The lonely runner problem with seven runners, in Fifth Conference
on Discrete Mathematics and Computer Science, University of Valladolid. Ciencias (Val-
ladolid), vol. 23 (Secr. Publ. Intercamb. Ed., Valladolid, 2006), pp. 111–116 (in Spanish)
14. J. Barajas, O. Serra, The lonely runner with seven runners. Electron. J. Comb. 15(1), R48
(2008)
15. I. Bárány, A.M. Vershik, On the number of convex lattice polytopes. Geom. Funct. Anal.
2(4), 381–393 (1992)
16. M. Barile, D. Bernardi, A. Borisov, J.-M. Kantor, On empty lattice simplices in dimension 4.
Proc. Am. Math. Soc. 139(12), 4247–4253 (2011)
17. O.R. Beaver, T. Garrity, A two-dimensional Minkowski ?(x) function. J. Number Theory
107(1), 105–134 (2004)
18. M. Beck, S. Robins, Computing the Continuous Discretely: Integer-Point Enumeration in
Polyhedra. Undergraduate Texts in Mathematics (Springer, New York, 2007)
19. U. Betke, J.M. Wills, Untere Schranken für zwei diophantische Approximations-Funktionen.
Monatshefte Math. 76, 214–217 (1972)
20. W. Bienia, L. Goddyn, P. Gvozdjak, A. Sebő, M. Tarsi, Flows, view obstructions, and the
lonely runner. J. Comb. Theory, Ser. B 72(1), 1–9 (1998)
21. T. Bohman, R. Holzman, D. Kleitman, Six lonely runners. Electron. J. Comb. 8(2), R3
(2001). In honor of Aviezri Fraenkel on the occasion of his 70th birthday
22. A.I. Borevich, I.R. Shafarevich, Number Theory. Pure and Applied Mathematics, vol. 20
(Academic Press, New York, 1966). Translated from the Russian by Newcomb Greenleaf
23. P. Borwein, Computational Excursions in Analysis and Number Theory. CMS Books in
Mathematics, vol. 10 (Springer, New York, 2002)
24. A.J. Brentjes, Multidimensional Continued Fraction Algorithms. Mathematical Centre
Tracts, vol. 145 (Mathematisch Centrum, Amsterdam, 1981)
25. K. Briggs, Klein polyhedra. http://keithbriggs.info/klein-polyhedra.html
26. V. Brun, Algorithmes euclidiens pour trois et quatre nombres, in Treizième congrès des math-
ématiciens scandinaves, tenu à Helsinki 18–23 août 1957 (Mercators Tryckeri, Helsinki,
1958), pp. 45–64
27. A.D. Bryuno, V.I. Parusnikov, Klein polyhedra for two Davenport cubic forms. Math. Notes
- Ross. Akad. 56(3–4), 994–1007 (1995). Russian version: Mat. Zametki 56(4), 9–27 (1994)
28. J. Buchmann, A generalization of Voronoı̆’s unit algorithm. I. J. Number Theory 20(2), 177–
191 (1985)
29. J. Buchmann, A generalization of Voronoı̆’s unit algorithm. II. J. Number Theory 20(2),
192–209 (1985)
30. G. Bullig, Zur Kettenbruchtheorie im Dreidimensionalen (Z 1). Abh. Math. Semin. Hansis-
che Univ. 13, 321–343 (1940)
31. R.T. Bumby, Hausdorff dimensions of Cantor sets. J. Reine Angew. Math. 331, 192–206
(1982)
32. V.A. Bykovskiı̆, Connectivity of Minkowskii Graph for Multidimensional Complete Lattices
(Dalnauka, Vladivostok, 2000)
33. V.A. Bykovskiı̆, O.A. Gorkusha, Minimal bases of three-dimensional complete lattices. Sb.
Math. 192(1–2), 215–223 (2001). Russian version: Mat. Sb. 192(2), 57–66 (2001)
34. J.W.S. Cassels, H.P.F. Swinnerton-Dyer, On the product of three homogeneous linear forms
and the indefinite ternary quadratic forms. Philos. Trans. R. Soc. Lond. Ser. A 248, 73–96
(1955)
35. H. Cohen, A Course in Computational Algebraic Number Theory. Graduate Texts in Mathe-
matics, vol. 138 (Springer, Berlin, 1993)
36. J.H. Conway, An enumeration of knots and links, and some of their algebraic properties,
in Computational Problems in Abstract Algebra, Oxford, 1967 (Pergamon, Oxford, 1970),
pp. 329–358
37. T.W. Cusick, View-obstruction problems. Aequ. Math. 9, 165–170 (1973)
38. T.W. Cusick, The largest gaps in the lower Markoff spectrum. Duke Math. J. 41, 453–463
(1974)
39. T.W. Cusick, View-obstruction problems in n-dimensional geometry. J. Comb. Theory, Ser.
A 16, 1–11 (1974)
40. T.W. Cusick, View-obstruction problems. II. Proc. Am. Math. Soc. 84(1), 25–28 (1982)
41. T.W. Cusick, M.E. Flahive, The Markoff and Lagrange Spectra. Mathematical Surveys and
Monographs, vol. 30 (Am. Math. Soc., Providence, 1989)
42. T.W. Cusick, C. Pomerance, View-obstruction problems. III. J. Number Theory 19(2), 131–
139 (1984)
43. V.I. Danilov, The geometry of toric varieties. Russ. Math. Surv. 33(2), 97–154 (1978). Rus-
sian version: Usp. Mat. Nauk 33(2), 85–134 (1978)
References 393
123. G. Lachaud, Polyèdre d’Arnol’d et voile d’un cône simplicial: analogues du théorème de
Lagrange. C. R. Acad. Sci., Sér. 1 Math. 317(8), 711–716 (1993)
124. G. Lachaud, Sails and Klein polyhedra, in Number Theory, Tiruchirapalli, 1996. Contemp.
Math., vol. 210 (Am. Math. Soc., Providence, 1998), pp. 373–385
125. G. Lachaud, Voiles et Polyhedres de Klein (Act. Sci. Ind., Hermann, 2002)
126. M. Laczkovich, G. Szekeres, Tilings of the square with similar rectangles. Discrete Comput.
Geom. 13(3–4), 569–572 (1995)
127. J.C. Lagarias, Best simultaneous Diophantine approximations. I. Growth rates of best ap-
proximation denominators. Trans. Am. Math. Soc. 272(2), 545–554 (1982)
128. J.C. Lagarias, Geodesic multidimensional continued fractions. Proc. Lond. Math. Soc. (3)
69(3), 464–488 (1994)
129. J.-L. Lagrange, Solution d’un problème d’arithmétique, in Œuvres de Lagrange, vol. 1
(Gauthier-Villars, Paris, 1867–1892), pp. 671–732
130. A.K. Lenstra, H.W. Lenstra Jr., L. Lovász, Factoring polynomials with rational coefficients.
Math. Ann. 261(4), 515–534 (1982)
131. P. Levy, Sur les lois de probabilité dont dependent les quotients complets et incomplets d’une
fraction continue. Bull. Soc. Math. Fr. 57, 178–194 (1929)
132. J. Lewis, D. Zagier, Period functions and the Selberg zeta function for the modular group,
in The Mathematical Beauty of Physics, Saclay, 1996. Adv. Ser. Math. Phys., vol. 24 (World
Scientific, River Edge, 1997), pp. 83–97
133. K. Mahler, Lectures on Diophantine Approximations. Part I: g-adic Numbers and Roth’s
Theorem (University of Notre Dame Press, Notre Dame, 1961). Prepared from the notes by
R.P. Bambah of my lectures given at the University of Notre Dame in the Fall of 1957
134. Y.I. Manin, M. Marcolli, Continued fractions, modular symbols, and noncommutative geom-
etry. Sel. Math. New Ser. 8(3), 475–521 (2002)
135. A. Markoff, Sur les formes quadratiques binaires indéfinies. Math. Ann. 15(3–4), 381–406
(1879)
136. A. Markoff, Sur les formes quadratiques binaires indéfinies. Math. Ann. 17(3), 379–399
(1879)
137. P. Mattila, Geometry of Sets and Measures in Euclidean Spaces: Fractals and Rectifiability.
Cambridge Studies in Advanced Mathematics, vol. 44 (Cambridge University Press, Cam-
bridge, 1995)
138. A. Messaoudi, A. Nogueira, F. Schweiger, Ergodic properties of triangle partitions. Monat-
shefte Math. 157(3), 283–299 (2009)
139. H. Minkowski, Généralisation de la théorie des fractions continues. Ann. Sci. Éc. Norm. Sup.
(3) 13, 41–60 (1896)
140. H. Minkowski, Gesammelte Abhandlungen (Am. Math. Soc., Chelsea, 1967), pp. 293–315
141. J.-O. Moussafir, Sails and Hilbert bases. Funct. Anal. Appl. 34(2), 114–118 (2000). Russian
version: Funkc. Anal. Prilozh. 34, (2) 2000, 43–49
142. J.-O. Moussafir, Voiles et Polyédres de Klein: Geometrie, Algorithmes et Statistiques. Doc-
teur en sciences thèse, Université Paris IX—Dauphine, 2000
143. V.V. Nikulin, On the classification of arithmetic groups generated by reflections in
Lobachevskiı̆ spaces. Izv. Akad. Nauk SSSR, Ser. Mat. 45(1), 113–142 (1981)
144. V.V. Nikulin, On the classification of arithmetic groups generated by reflections in
Lobachevskiı̆ spaces. Izv. Akad. Nauk SSSR, Ser. Mat. 45(1), 240 (1981)
145. V.V. Nikulin, Quotient-groups of groups of automorphisms of hyperbolic forms by subgroups
generated by 2-reflections. Algebro-geometric applications, in Current Problems in Mathe-
matics, vol. 18 (Akad. Nauk SSSR, Vsesoyuz. Inst. Nauchn. i Tekhn. Informatsii, Moscow,
1981), pp. 3–114
146. V.V. Nikulin, Discrete reflection groups in Lobachevsky spaces and algebraic surfaces, in
Proceedings of the International Congress of Mathematicians, Berkeley, CA, 1986, vol. 1
(Am. Math. Soc., Providence, 1987), pp. 654–671
147. V.V. Nikulin, Finiteness of the number of arithmetic groups generated by reflections in
Lobachevskiı̆ spaces. Izv. Math. 71(1), 53–56 (2007). Russian version: Izv. Ross. Akad.
References 397
174. A.L. Schmidt, Ergodic theory of complex continued fractions, in Number Theory with an
Emphasis on the Markoff Spectrum, Provo, UT, 1991. Lecture Notes in Pure and Appl. Math.,
vol. 147 (Dekker, New York, 1993), pp. 215–226
175. H. Schubert, Knoten mit zwei Brücken. Math. Z. 65, 133–170 (1956)
176. F. Schweiger, The Metrical Theory of Jacobi-Perron Algorithm. Lecture Notes in Mathemat-
ics, vol. 334 (Springer, Berlin, 1973)
177. F. Schweiger, Ergodic properties of multi-dimensional subtractive algorithms, in New Trends
in Probability and Statistics, Palanga, 1991, vol. 2 (VSP, Utrecht, 1992), pp. 91–100
178. F. Schweiger, Invariant measures for fully subtractive algorithms. Anz. Österreich. Akad.
Wiss. Math.-Natur. Kl. 131, 25–30 (1995)
179. F. Schweiger, Fully subtractive algorithms. Österreich. Akad. Wiss. Math.-Natur. Kl.
Sitzungsber. II 204, 23–32 (1996)
180. F. Schweiger, Multidimensional Continued Fractions. Oxford Science Publications (Oxford
University Press, Oxford, 2000)
181. F. Schweiger, Multidimensional continued fractions—new results and old problems. Preprint
(2008). http://www.cirm.univ-mrs.fr/videos/2008/exposes/287w2/Schweiger.pdf
182. H. Seifert, W. Threlfall, Seifert and Threlfall: A Textbook of Topology. Pure and Applied
Mathematics, vol. 89 (Academic Press, New York, 1980). Translated from the German edi-
tion of 1934 by Michael A. Goldman, with a preface by Joan S. Birman, with “Topology of
3-dimensional fibered spaces” by Seifert, translated from the German by Wolfgang Heil
183. E.S. Selmer, Continued fractions in several dimensions. Nord. Mat. Tidskr. 9, 37–43 (1961)
184. E.S. Selmer, Continued fractions in several dimensions. Nord. Mat. Tidskr. 9, 95 (1961)
185. T. Shintani, On evaluation of zeta functions of totally real algebraic number fields at non-
positive integers. J. Fac. Sci., Univ. Tokyo, Sect. 1A, Math. 23(2), 393–417 (1976)
186. V.I. Shmoı̆lov, Nepreryvnye Drobi. Tom I. Periodicheskie Nepreryvnye Drobi (Merkator,
Lviv, 2004)
187. V.I. Shmoı̆lov, Nepreryvnye Drobi. Tom II. Raskhodyashchiesya Nepreryvnye Drobi (Merka-
tor, Lviv, 2004)
188. V.I. Shmoı̆lov, Nepreryvnye Drobi. Tom III. Iz Istorii Nepreryvnykh Drobei (Merkator, Lviv,
2004)
189. V.Ya. Skorobogatko, Ideas and results of the theory of branching continued fractions and
their application to the solution of differential equations, in General Theory of Boundary
Value Problems (Naukova Dumka, Kiev, 1983), pp. 187–197
190. V.Ya. Skorobogatko, Teoriya Vetvyashchikhsya Tsepnykh Drobei i Ee Primenenie v Vychis-
litelnoi Matematike (Nauka, Moscow, 1983)
191. B.F. Skubenko, Minima of a decomposable cubic form of three variables. Zapiski nauch.
sem. LOMI (1988)
192. B.F. Skubenko, Minima of decomposable forms of degree n of n variables for n ≥ 3. Zapiski
nauch. sem. LOMI (1990)
193. J. Stoer, R. Bulirsch, Introduction to Numerical Analysis, 3rd edn. Texts in Applied Mathe-
matics, vol. 12 (Springer, New York, 2002). Translated from the German by R. Bartels, W.
Gautschi and C. Witzgall
194. H.P.F. Swinnerton-Dyer, On the product of three homogeneous linear forms. Acta Arith. 18,
371–385 (1971)
195. G. Szekeres, Multidimensional continued fractions. Ann. Univ. Sci. Bp. Rolando Eötvös
Nomin., Sect. Math. 13, 113–140 (1970)
196. E. Thomas, A.T. Vasquez, On the resolution of cusp singularities and the Shintani decompo-
sition in totally real cubic number fields. Math. Ann. 247(1), 1–20 (1980)
197. R.F. Tichy, J. Uitz, An extension of Minkowski’s singular function. Appl. Math. Lett. 8(5),
39–46 (1995)
198. L.N. Trefethen, D. Bau, Numerical Linear Algebra (Society for Industrial and Applied Math-
ematics, Philadelphia, 1997)
199. A. Trevisan, Lattice polytopes and toric varieties. Master’s thesis, Leiden University, 2007.
http://www.math.leidenuniv.nl/scripties/trevisan.pdf
References 399
200. H. Tsuchihashi, Higher-dimensional analogues of periodic continued fractions and cusp sin-
gularities. Tohoku Math. J. 35(4), 607–639 (1983)
201. A.V. Ustinov, Minimal vector systems in 3-dimensional lattices and analog of Vahlen’s the-
orem for 3-dimensional Minkowski’s continued fractions. Sovrem. Probl. Mat. 16, 103–128
(2012)
202. A.M. Vershik, Statistical mechanics of combinatorial partitions, and their limit configura-
tions. Funct. Anal. Appl. 30(2), 90–105 (1996)
203. P. Viader, J. Paradís, L. Bibiloni, A new light on Minkowski’s ?(x) function. J. Number
Theory 73(2), 212–227 (1998)
204. G. Voronoı̆, A generalization of the algorithm of continued fractions. Ph.D. thesis, Warsaw,
1896 (in Russian)
205. G.F. Voronoı̆, On a Generalization of the Algorithm of Continued Fraction. Collected Works
in Three Volumes (USSR Ac. Sci, Kiev, 1952) (in Russian)
206. D. Wells, The Penguin Dictionary of Curious and Interesting Numbers (Penguin, Middlesex,
1986)
207. G.K. White, Lattice tetrahedra. Can. J. Math. 16, 389–396 (1964)
208. J.M. Wills, Zwei Sätze über inhomogene diophantische Approximation von Irrationalzahlen.
Monatshefte Math. 71, 263–269 (1967)
209. E. Wirsing, On the theorem of Gauss-Kusmin-Lévy and a Frobenius-type theorem for func-
tion spaces. Acta Arith. 24, 507–528 (1973/1974). Collection of articles dedicated to Carl
Ludwig Siegel on the occasion of his seventy-fifth birthday, V
210. C. Yannopoulos, Zur Kettenbruchtheorie im Dreidimensionalen. Math. Z. 47, 105–110
(1940)
211. R.T. Živaljević, Rotation number of a unimodular cycle: an elementary approach (2012).
arXiv:1209.4981
Index
Integer pseudo-regular, 63
affine type, 251 self-dual, 63
angle, 20 vector, 185
adjacent, 51 volume, 187, 192
associated to an extended angle, 161 Integer-linear congruence of faces, 275
between planes, 188 Inverse broken line, 154
between spaces, 187
irrational, 47 K
L-irrational, 47 k-form, 194
LR-irrational, 47 Kepler planetary motion, 150
opposite interior, 55 Knot, 386
R-irrational, 47 n-bridge, 386
right, 54 n-bridge representation of, 386
transpose, 49 rational, 386
approximation space, 220
arctangent, 48 L
area, 24, 27 Lexicographic
arrangement, 124 basis, 195
broken line, 137 ordering, 195
cone, 185
LLS period, 70
congruence, 20, 185
LLS sequence, 43
proper, 138, 154
for a broken line, 142
conjugate matrices, 67, 301
of a matrix, 69
cosine, 161
LSLS sequence, 138
distance, 23
between planes, 188
extended angle, 154 M
corresponding to an integer angle, 162 M-sum
normal form, 157 of extended integer angles, 163
opposite, 162 of integer angles, 164
geometry, 20 Marked pyramid, 204
lattice, 21 Markov
length, 22 Diophantine equation, 83
line, 20 minimum, 131
parallel lines, 55 number, 83
plane, 185 spectrum, 81
point, 20, 185 multidimensional, 82
polygon, 20 tripple, 84
polyhedron, 185 Markov–Davenport
pyramid, 205 characteristic, 319
completely empty, 205 form for an MCRS-group, 349
one-story, multistory, 205 Matrix
ray, 329 complex spectrum, 71
segment, 20 Frobenius (companion), 304
sine, 42, 161 transpose, 261
sublattice, 21 Hessenberg, 301, 303
affine, 21 ς -reduced, ς -nonreduced, 304
subspace, 220 perfect, 303
tangent, 43, 47, 161 irreducible, 250
triangle NRS-, 327
dual, 62 real spectrum, 72, 241
integer isosceles, 63 reduced, 72
integer regular, 63 regular, 348
pseudo-isosceles, 63 RS-, 250, 327
404 Index