1 Cassels - An Introduction To Geometry of Numbers PDF
1 Cassels - An Introduction To Geometry of Numbers PDF
An Introduction
to the Geometry
of Numbers
Springer
J.W.S. Cassels
University of Cambridge
Department of Pure Mathematics
and Mathematical Statistics
16, Mill Lane
CB2 ISB Cambridge
United Kingdom
This work is subject to copyright. All rights are reserved. whether the whole or part of the material is
concerned. specifically the rights of translation. reprinting. reuse of illustrations. recitation. broadcasting.
reproduction on microfilm or in any other way. and storage in data banks. Duplication of this publication or
parts thereof is permitted only under the provisions of the German Copyright Law of September 9.1965. in
its current version. and permission for use must always be obtained from Springer-Verlag. Violations are
liable for prosecution under the German Copyright Law.
The use of general descriptive names. registered names. trademarks etc. in this publication does not imply,
even in the absence of a specific statement. that such names are exempt from the relevant protective laws and
regulations and therefore free for general use.
An Introduction to the
Geometry of Numbers
Geschiiftsfuhrende Herausgeber:
This work is subject to copyright. All rights are reserved, whether the whole or part of the material is
concerned, specifically those of translation, reprinting, re-use of illustrations, broadcasting, reproduction
by photocopying machine or similar means, and storage in data banks.
Under § 54 of the German Copyright Law where copies arc made for other than private use, a fee is payable
to the publisher, the amount of the fee to be determined by agreement with the publisher.
@ by Springer-Verlall Berlin· Heidelberg 1959, 1971. Library of Congress Catalog Card Number
75-154801.
Preface
Of making many bookes there is no
end, and much studie is a weari-
nesse of the flesh.
Ecclesiastes XII, 12.
of the integers u1 , ... , U" ? It may well be that one has trivially
1(0, ... ,0) =0, for example when l(x1 , ••• , xn) is a homogeneous form;
and then one excludes the set of values U 1 = U 2 = ... = Un = 0. (The
"homogeneous problem".)
In general one requires estimates which are valid not merely for
individual functions I but for whole classes of functions. Thus a typical
result is that if
(1 )
is a positive definite quadratic form, then there are integers u1 , u 2 not
°
both such that
(2)
where
u~ + UI U 2 + u~ ~ 1
for all pairs of integers uI , u2 not both zero; and here D = !.
Of course the positive definite binary quadratic forms are a par-
ticularly simple case. The result above was known well before the birth
of the Geometry of Numbers; and indeed we shall give a proof sub-
stantially independent of the Geometry of Numbers in Chapter II, § 3.
But positive definite binary quadratic forms display a number of argu-
ments in a particularly simple way so we shall continue to use them as
examples.
P3. The result just stated could be represented graphically. An
inequality of the type
t(X I , x 2) ~ k,
where t(xl , x2) is given by (1) and k is some positive number, represents
the region fA bounded by an ellipse in the (Xl' x 2)-plane. Thus our
result above states that fA contains a point (ul , u 2 ), other than the
origin, with integer coordinates provided that k~ (4D/W.
A result of this kind but not so precise follows at once from a
fundamental theorem of MINKOWSKI. The 2-dimensional case of this
states that a region fA always contains a point (ul , u 2) with integral
co-ordinates other than the origin provided that it satisfies the following
three conditions.
(i) fA is symmetric about the origin, that is if (Xl' X 2) is in fA then so
is (- Xl' - x 2 ).
(ii) fA is convex, that is if (Xl' x 2) and (YI' Y2) are two points of fA
then the whole line segment
GGGO
the point (~I-UI' ~2-U2) must
be in .'7. Hence, by the symmetry
of .'7, the point (UI-~I' U2-~2) IS
GGGO
in .'7. Finally, the mid-point of
(UI-~I' U2-~2) and (~I' ~2) is m
Y) because of convexity, that is
G GGG
(iu l , iu 2 ) is in .'7, and (ul , u 2 )
is in [Jl, as required. I t is clear
that .'7(ul , u 2) overlaps .'7(u~, u~)
when and only when .'7 overlaps
GGGG
Y'(u l - u~, u 2 - u~).
To prove MINKOWSKI'S theorein,
it is thus enough to show that when
the .'7 (ul , u 2) do not overlap then Fig. 1
the area of each is at most 1. A
little reflection convinces one that this must be so. A formal proof
is given in Chapter 3. Another argument, which is perhaps more intuitive
is as follows, where we suppose that .'7 is entirely contained in a square
hl:;:;:X, Ix 2 1:;:;:X.
Let U be a large integer. There are (2U +1)2 regions .'7(ul , u 2 ) whose
centres (ul , u 2) satisfy
0)
where
Xl = IX Xl +Px2 , X 2 =rXI + bX2 (4)
and IX, p, }', b are constants, e.g. by putting
IX = aL, P= a11;aI2 ,
r = 0, b = all~ D~ .
Conversely if IX, p, r, b are any
real numbers with IXb-Pr=t=o and
{ Xl' X 2 are given by (4), then
a-ft,}'-d)
xi+ X~=all x~+ 2a l2 Xl X 2 + a22 x~,
with
(5)
expressed in the form (7) with tt z integral and ttl only real, while for
the lines of the second family the roles of UI and tt z are interchanged.
In this way the plane is subdivided into parallelograms whose vertices
are just the points of I\. Of course the subdivision into parallelograms
depends on the choice of basis, but we show that the area of each
parallelogram, namely
jocb-Prj,
is independent of the particular basis. We can do this by showing that
the number N(X) of points of A in a large square
satisfies
.V(X) 1
---*--- (X -* (0).
4X2 ICIa-Prl
Indeed a consideration along the lines of the proof of MINKOWSKI'S
convex body theorem sketched above shows that the number of points
of A in !2 (X) is roughly equal to the number of parallelograms contained
in !2 (X), which again is roughly equal to the area of !2 (X) divided by
the area joc b - Pr j of an individual parallelogram. The strictly positive
number
d(A) = jocb - Prj (8)
(9)
contains a point of every lattice A with d (A) < (l)i, and the fact that
there are forms such that equality is necessary in (2) is equivalent to
the existence of a lattice Ac with determinant d (AJ = (1)1 having no
point in £1). So our problem about all definite binary quadratic forms
is equivalent to one about the single region £1) and all lattices. Similarly
consideration of the lattices with points in
6 Prologue
±(-t, VI)
on X~ +X~ = 1, but no
points in X~ +X~< 1.
We have thus shown
"<
thaL1(D) =d(N) =(-W
provided that g has
a critical lattice. MIN- Fig. 3
KOWSKI showed that
critical lattices exist for a fairly wide set of regions fJi by, roughly speak-
ing, showing that any fJi-admissible lattice A can be gradually shrunk
and distorted until it becomes critical. In the text we give a more
general proof of the existence of critical lattices using concepts due to
MAHLER which turn out to have much wider significance.
Lattices
1.1. Introduction. In this chapter we introduce the most important
concept in the geometry of numbers, that of a lattice, and develop some
of its basic properties. The contents of this chapter, except § 2.4 and
§ 5, are fundamental for almost everything that follows.
In this book we shall be concerned only with lattices over the ring
of rational integers. A certain amount of work has been done on
lattices over complex quadratic fields, see e.g. MULLENDER (1945 a) and
K. ROGERS (1955 a). Many of the concepts should carryover practically
unaltered. Again, work on approximation to complex numbers by
integers of a complex quadratic field [e.g. MULLENDER (1945a), CASSELS,
LEDERMANN and MAHLER (1951 a), POITOU (1953 a)] and on the minima
of hermitian forms when the variables are integers in a quadratic field
[e.g. OPPENHEIM (1932a, 1936a, 1953f) and K. ROGERS (1956a)] may
be regarded as a generalization of the geometry of numbers to lattices
over complex quadratic fields. We shall not have occasion to mention
lattices over complex quadratic fields again in this book; we mention
them here only for completeness. For lattices over general algebraic
number fields see ROGERS and SWINNERTON-DYER (1958a).
1.2. Bases and sublattices. Let aI' ... , a" be linearly independent
real vectors in n-dimensional real euclidean space, so that the only set of
numbers t1, ... , tn for which tl ~ +... +
tn an = 0 is tl = t2 = ... = tn = O.
The set of all points
(1 )
with integral u 1 , .•• , Un is called the lattice with basis aI' ••• , a". We
note that, since aI' ... ,an are linearly independent, the expression of
any vector ~ in the shape (1) with real u1 , ••• , Un is unique. Hence if ~
is in A and (1) is any expression for ~ with real U 1 , ... , U II , then U 1 , ... , U II
are integers. We shall make use of these remarks frequently, often
without explicit reference.
The basis is not uniquely determined by the lattice. For let a~ be
the points
L;
a; = v;;a, (1~i, i~n), (2)
Then
aj = L: wjiaj (4)
i
with integral Wi;' It follows easily that the set of points (1) is precisely
the set of points
where U~, ... , u~ run through all integers; that is ai' ... , all and a~, ... , a~
are bases of the same lattice. We show now that every basis of a a;
lattice 1\ may be obtained from a given basis a j in this yvay. For since
a; belongs to the lattice with basis ai' ... , an there are integers vij such
that (2) holds: and since a j belongs to the lattice with basis a~, ... , a~
there are integers Wii such that (4) holds. On substituting (2) in (4)
and making use of the linear independence of the a j , we have
L:
WiiVil =
{1 0
if i = I
otherwise.
Hence
det(w ij ) det (ViI) = 1
and so each of the integers det (wii) and det (ViI) must be ± 1; that is
0) holds as required.
We denote lattices by capital sanserif Greek letters, and in particular
by 1\, M, N, r.
If ai' ... , a" and a~, ... , a~ are bases of the same lattice, so that
they are related by (2) and (3), then we have
det(a~, ... , a~) = det(vii) det(a1 , ... , an) = ± det(a1 ,···, all),
where, for example, det (a\, ... , an) denotes the determinant of the
n xn array whose i-th row is the vector a i . Hence
(2)
is called the index of /\ in M. From the last expression the index depends
only on /\ and M, not on the choice of bases. Since a 1 , •.• , a" are
linearly independent, we have D> O. On solving (1) for the b i and
using (2), we have
Db i = L wija j ,
i
where the Wij are integers. Hence
DM(I\(M, (3)
where D M is the lattice of Db, bEM.
It is often convenient to choose particular bases for 1\ and M so
that (1) takes a particularly simple shape.
I
THEOREM I. Let /\ be a sublattice 0/ M.
A. To every base bl , ... , b" 0/ M there can be found a base aI' ... , an
all\ 0/ the shape
al=vllb l
a2 = V 21 b 1 + V 22 b 2
(4)
B. Conversely, to every basis aI' ... , a" 0/ " there exists a basis
b1 , ••• , b" 0/ M such that (4) holds.
where vii' ... , Vj; are integers and v;i*O, since, as we have seen,
Db/_A We choose for a j such an element of " for which the positive
integer Ivi;1 is as small as possible (but not 0), and will show that
a1 , ..• , a" are in fact a basis for A Since a 1 , .•• , a" are in /\, by con-
struction, so is every vector
(5)
where WI' . _', w" are integers. Suppose, if possible, that c is a vector
of" not of the shape (5). Since c is in M, it certainly can be expressed
in terms of bl , . _., b", and so can be written in the shape
c= tl bl + ... + tk b k ,
°
where 1 ~ k ~ n, tk=1= and t1, ... , tk are integers. If there are several
such c, then we choose one for which the integer k is as small as pos-
sible. Now, since Vu=1= 0, we may choose an integer s such that
(6)
The vector
is in " since c and a k are; but it is not of the shape (5) since c is not.
°
Hence t k- SVkk=1= by the assumption that k was chosen as small as
possible. But then (6) contradicts the assumption that the non-zero
integer Vkk was chosen as small as possible. The contradiction shows
that there are no c in " which cannot be put in the form (5), and so
proves part A of the theorem.
Proof of B. Let aI' ... , an be some fixed basis of A Since D M
is a sublattice of " by (3), where D is the index of " in M, there exists
by Part A a basis Dbl , ... ,Db" of DM of the type
Dbl=wUal \
D b2 = W 21 a l + w22 a2 (7)
with integral wii and W ij =1=O (1~i~n). On solving (7) for aI' ... ,an
in succession we obtain a series of equations of the type (4) but where
Bases and sublattices 13
at first we know only that the Vi; are rational. But clearly b 1, ... , b"
are a basis for M and so the Vi; are in fact integers, since the a i are
in M, and since the representation of any vector a in the shape
(t1' ... , tn' real numbers)
(8)
and that
Q~ Vi; < Vi; in case A, (9)
o ~ t'i; < Vii in case B. (10)
Proof of A. To obtain (8) it is necessary only to replace a i or b;
by -ai' -bi respectively if originally Vii<O. To obtain (9) we replace
the a i by
a; = ti1 a1 + ... + t;,i--1ai-1 + ai'
where the ti ; are integers to be determined. For any choice of the ti ;
the a;are a basis for I\. We have
where
For each i we may now choose t i . i - 1, t i ,i-2' ... , til in that order so that
as was required.
Proof of B. Similar.
COR OLLAR Y 2. Let aI' "', am be linearly independent vectors of a
lattice M. Then there is a basis b 1 , ... , b" of M such that
a 1 =v11 b 1
a2=v21b1+V22b2
(12)
But now every C( M is in the same class as precisely one of the vectors
(1~i~m) (1 )
(m + 1~ i~n) (3)
for some integers Vii' Since ai (1 ~ i ~ n) and b, (1 ~ i ~ n) are bases
of the same lattice, we have
det (Vi;) = ± 1. (4)
We may expand the determinant (4) by the first 111 and last (n - m)
rows [Laplace-development] and obtain
where the v,: are the (:) determinants formed from columns of (2)
and W, is the (n - m) X (n - m) determinant formed from the remaining
(n-m) columns and the (n-m) rows,
since b 1 , ••• , b" is a basis for A. We may solve (7) for the u j ' and
indeed in a multitude of ways. For example let ii j be the cofactor of
VI; in the expansion of the determinant
(1~i~tn, 1~i~lII).
Then
2: vjl;=l{u1 ,
l~j~m
so Vi U 1 is an integer. Similarly
v, u; = integer (1~i~1n),
be (n -1) arbitrary real vectors and let e> 0 be an arbitrarily small real
number. Then for all real numbers N greater than a number No depending
only on A, e and the c;, there exists a basis ai' ... , a" of A such that
(1~i~1t-1). (1)
Here
(2)
denotes the usual euclidean distance.
To prove Theorem II we shall need a result about the distribution
of integers prime to a given integer. We prove this first, and then
Theorem II.
LEMMA 3. For each 15>0 there is a number k(b) with the following
property: Every interval of length k (b) q6, where q is a positive integer,
contains an integer prime to q.
1 § 2.4 may weli be omitted at a first reading
Bases and sublattices 17
Let
q= II p«f, (3)
l;;;'i~,J 1
where the Pi are distinct primes and the (Xi> 0 are integers. An integer
is prime to q if and only if it is not divisible by any of P1' ... , Pl'
Consider some interval
V<U~V+U (4)
of length U, where U, V are fixed integers. For j1 <j2 < ... <jS' where
s~J, let
M(j1' ···,js)
be the number of integers u in the interval (4) which are divisible by
Pi., Pi,' ... , Pi. (and perhaps also by other primes from Pl' ...• Pf)' We
show next that
(5)
s>o
N Y12 which is not 0 and prime to vll . For j> 2 we choose for VI; the
integer nearest to N Yl;' Then (8) holds for i = 1 and j =} 2 trivially
and for i = 1, j = 2 by Lemma 3, and since clearly V ll = 0 (N). The
integers Rl =Vll and 51 =V12 have the required properties.
Now suppose that 1>1, and that the Vi; with i<1 have already
*
been constructed. For j 1, 1 + 1 we take for V 1 j just the nearest
integer to Ny 1.;' On expanding R I and 51 by their last rows, we now
have
R I = ±vIIRI_l +A,
51 = ± VI,!+! 5 1 - 1 + vIlE + C,
where A, E, C are integers which have already been determined. Since
R I - 1 is prime to 51-1, we may choose the integer VII so that RI is not 0
and prime to 5 1 -- 1 , We choose for V II the integer nearest to Ny II for
which this is true, so that, by the corollary to Lemma 3,
vII -NYII=O(5Ll) = o(N(l-I) 0) ,
since 5 1 - 1 =0 (NI-l), being a sum of products of 1 -1 numbers Vi; each
of order N. Having determined VII we now take for VI,I+l the integer
nearest to Ny I,1 +! such that 51 is not 0 and prime to R I, so that
similarly
VI,!+! - NY!,I+! =O(5~) =O(N IO ).
This completes one stage of the induction. We have thus shown the
existence of integers Vi; satisfying (8).
From (7) and (8) we have
lai-Neil = o (N(n-l) 0) (1~i~1t-1).
The truth of the statement of the theorem now follows on taking <5 = eln.
1.3. Lattices under linear transformation. It is convenient here to
consider briefly the effect of a non-singular affine transformation
x-+X =(lX of n-dimensional space into itself. Let the transformation
X =(lX be given by
Xi = L (XiiXj (1~i~n), (1 )
l:;;;i:;;;n
where
X=(X1 ,···,X,,), x=(x1 ,···,xn)
are corresponding points in the transformation and (Xii are real numbers
such that
det((l) =det((Xii) =}O.
Let /\ be a lattice and denote by a./\ the set of points (lX, xEI\.
••. , b" is a basis for /\, then the general point b = U 1 b 1 + ... + uti b n
If b 1 ,
2*
20 Lattices
d (a. A) = Idet (a. b l , ... , a. b,,)i = Idet (a.) II det (bl , ... , bn)l = Idet (0.)1 d (A) .
"
I(x) = ~ Iii Xi x; (1 )
i,i=l
where
(2)
be a non-singular quadratic form of signature 1 (r, n - r); that is, there
exist independent real linear forms
Xi = ~ diixi (3)
l;:>j;:;; ..
Conversely, if di ; is any set of real numbers with det(d ji) =1=0, then (3).
(4) and (6) determine a quadratic form (1) of signature (r, n - r) and
(7) holds. We shall be concerned a great deal with the values which
I (x) takes when Xl' "', XII are integers. By (3), these are precisely the
1 Many writers define the signature of a form to be the number of positi"e
squares less the number of negative squares in (6). i.e. 2r - 11. But it is more
convenient to give explicitly the number of each kind of square than to do the
arithmetic every time.
Forms and lattices 21
values which q; (X) takes when X runs through the vectors of the
lattice A with basis
1q;(A)1 ~x.
(iii) In every lattice A of determi1tant d(A)~y'-n/2 there is a vector
.4+0 in
Iq;(A)1 ~ 1-
(iv) For every quadratic form L fii Xi Xi of signature (r, n-r) there
+
is an integer vector a 0 such that
I/(a)1 ~xldet(/;iWI".
That (i), (ii) and (iii) are equivalent follows from homogeneity, since
q;(tX) =t2q;(X) and since the set tA of all tX (XEA) is a lattice tA of
determinant Iti"
d (A); and we may choose t so that t" d (A) = 1. That
(iii) and (iv) are equivalent follows at once from the earlier discussion
and, in particular, from (8).
The foregoing argument is quite general. For example the behaviour
for integer values of the variables of any form I (~) of degree n which
can be expressed as the product of 1t real linear forms:
for all integral 11 =(111' ... , Uti)' Since <p(X) is a form, (1) is an identity
in the variables U l , ... , u". Let w be the uniquely determined homo-
geneous transformation such that
Then
b* b. = {1 if i = j (2)
I' 0 otherwise.
1 This section will not be referred to until Chapter \"III and will not be of
importance until Chapter X and XI.
24 Lattices
The lattice A* with basis bj is called the polar (or dual or reciprocal)
lattice of A. and bj is the polar basis to b;. The polar lattice A* of A
is independent of the choice of the particular basis. as we now show.
LEMMA 5. The polar lattice A* 01 A consists 01 all vectors a* such
that a*a is an integer lor all a in A. Then A is conversely the polar
lattice 01 A*. Further.
d(A) d(A*) = 1.
Suppose. first. that
a-£...
* - " ui b*j •
are in A* and A respectively. so that the ui • Vi are integers. Then
a*a = L ujvi
is an integer. Now let c be any vector such that ca is an integer for
all a in A. In particular
(1~i~n)
and so c =a* since the bi are linearly independent. This proves the
first sentence of the theorem. The second sentence follows immediately
from the first and also from (2). Finally. (2) implies that
det(bt •...• b!) det(b1 . . . . . b ll ) = 1.
and so d (A*) d (A) = 1. This concludes the proof of the lemma.
1.5.2. When Y=F0 is fixed. the points z such that yz=O lie in a
hyperplane through o.
LEMMA 6. A necessary and sulficient condition that there be 11 - 1
linearly independent points Ot •...• a,,-1 in A with ya;=O (1 ~i~n -1)
is that y = ta* tor some real t and some a* in A*.
Suppose first that yaj=o (1~i~n-1). Then by Theorem I
Corollary 2 there is a basis b j (1 ~i~n) for A such that
(Vi; =F 0)
for integers Vii' Hence ybj=O (1 ~i~n -1). Let yb .. =t. Then
clearly y =tb: where bj (1 ~i~n) is the polar basis to b i . This proves
half the lemma.
Suppose now that y = ta*. where a* EA·. If a* = 0 there is nothing
to prove. Otherwise. a* =mbt. where m is an integer and bt is primitive l .
1 That is, not of the shape u c·, c· EA· for an integer u > t.
The polar lattice 25
Then bi can be extend to a basis bj for 1\.*. Let b j be the polar basis.
Then
(2~j~ n).
Xi = L: Tiixi
i
where
det(1:) =det(Tii) =1=0. (2)
If Y is any vector, we have
YX = L: ¥;X; = L: ¥;T;jXi'
i i~ i
26 Reduction
Hence
YX=yx, (3)
where
(1~j~n). (4)
Since det ('t) =f= 0, by hypothesis, the equations (4) define Y as a function
of y. We write
Y = 't*y,
where 't* is called the transformation polar to 'to
Chapter II
Reduction
11.1. Introduction. In investigating the values taken by an algebraic
form I (x) for integer values of the variables it is often useful to sub-
stitute for I a form equivalent to it (in the sense of Chapter I, § 4) which
bears a special relation to the problem under consideration. This process
is independent of the geometrical notions introduced by MINKOWSKI
and depends only on the properties of bases of lattices developed in
Chapter I. Indeed only the lattice 1\0 of integer vectors comes into
consideration.
It is convenient to collect together in one chapter the various
applications of reduction. The later parts of the chapter involve some
moderately heavy computation. The beginner might well omit all after
the enunciation of the results in § 4.2. Indeed the next few chapters
are practically independent of Chapter II, which might well have been
deferred until later.
In § 2 we discuss the general method. In the rest of the chapter
we shall be mainly occupied in investigating
M(f) = inf [f(u) [
u~o
" integral
e:=e.=
, , .0
'
•...• 0.1.0 •...• 0 (1;;:;'j~n)
for the above basis, then / (x) is said to be reduced (in the sense of
MINKOWSKI). The above proof shows that every positive definite form
is equivalent (in the sense introduced in Chapter I. § 4) to at least one
and to at most a finite number of reduced forms.
We may make the definition of a reduced form more explicit. By
Corollary 4 to Theorem I of Chapter I (or by Lemma 2 of Chapter I).
a necessary and sufficient condition that ell ...• e j _ l and the integral
vector u = (u l , ... , u,,) be extensible to a basis for /\0 is that
g.c.d. (U;, ... , un) = 1- (2)
Hence the fOIT)1 I (x) is reduced if and only if
/(UI.···, un) ~ /(eJ
for all j and for all integers U l •...• Un satisfying (2).
11.2.2. When the form /(x) is not definite, then there is no generally
valid procedure to replace the reduction procedure for definite forms.
*
If we know (or may assume) that /(u) does not assume arbitrarily
small values for integral u 0 then it is possible to salvage something
of the reduction procedure. Let e> 0 be chosen arbitrarily small. By
hypothesis,
Ml = inf I/(u)l > o.
",,"0
*
integral
write
M; = inf I/(u)1
where the infimum is over all integral vectors u such that ('~. 000. ('i-I. U
is extensible to a basis for Aoo Then
Mj~Ml>O.
Then we have
II'(u l • 0 ••• un)! ~ (1 - e) II' (ei )I
for all sets of integers U l • •••• Un such that goc. d. (u i • ...• un) = 1. But.
of course. there is no reason to suppose that there are only a finite
number of I' with this property and equivalent to a given f.
An alternative procedure which is sometimes possible is to find some
other form g(x). related to our given I. which is definite and to reduce
g(x)o We shall do this for binary cubic forms in § 6. This method goes
back to HERMITE. who applied it to indefinite quadratic forms as follows.
Let I (x) be an indefinite quadratic form of signature (r. n - r). so
that. as before.
I(x) =X~ + ... +X~ - X~+l-'" - X!. (1)
where the Xi are linear forms in Xl' .•.• XII' Then
g (x) = X~ + ... + X~ + X~+l + ... + X: (2)
is a definite quadratic form with the same determinant. except. perhaps.
for sign. The forms Xl' .... Xn are not uniquely determined by I(x)
but we say that I(x) is reduced (in the sense of HERMITE) if the form
g (x) is reduced in the sense of MINKOWSKI for some choice of Xl' .... X".
Clearly I (x) is always equivalent to a reduced form. since we may choose
any representation (1) and then apply the transformation which reduces
g(x). Reduction more or less of this kind was first introduced by
HERMITE. and has been further discussed. amongst others. by SIEGEL
(1940a). as a tool for investigating the arithmetical properties of quad-
ratic forms. In general a form I (x) is equivalent to infinitely many
HERMITE-reduced forms. but SIEGEL shows that it is equivalent to only
finitely many if the coefficients of I (x) are all rational.
We note here that the relationship between (1) and (2) allows
estimates for the minimum of a definite form to be extended to an
30 Reduction
indefinite one, since clearly 1/(:r) I~ g (:r) for all real vectors:r. But in
general, the information so obtained is quite weak.
11.3. Definite quadratic forms. We shall be considering definite
quadratic forms from many different points of view in the course of
this book. Here we see what can be done by reduction methods alone.
The study of reduction is of great importance in the arithmetical theory
of quadratic forms, see WEYL (1940a) or VAN DER WAERDEN (1956a),
who give references to earlier literature. Here we are concerned only
with the minima of forms.
Let
I (Xl' X 2) = III X~ + 2/12 Xl X 2 + 122 X~
be a positive definite quadratic form. We wish to prove that there are
integers (u I , u 2) =l= (0, 0) such that
I (u l , u 2) ~ (4D/3)~,
where
D = III 122 -/~2'
By taking an equivalent form, if need be, we may suppose that
M(f) = inf I(U I ,U2) =/ll'
U" u%
integers not both 0
We have
I(x l , X 2) = III (Xl + ~
III
X2)2 + ~ X~.
III
Put U2 = 1 and choose for UI an integer such that
l
UI + 11~_1
III
~ ~.
2
(1 )
Then, on the one hand,
that is
I~l ~ 4D/3,
as required. That ~ here cannot be replaced by < is shown by the
form
IO(Xj, x 2 ) = xi + X 1 X 2 + x~
for which D =! and I (u 1 , u 2 ) ~ 1 for integers (u 1 , u 2 ) =l= (0,0). It is not
difficult to show by examining when equality can occur in the above
Definite quadratic forms 31
I(x) = L: Iii X; Xi
represents a value I (u) with
I/(u)1 ~ (!)(n-l)/2I D I1/n, (1 )
where u =1= 0 is integral and
D = det (Iii) .
By the remarks at the end of § 2.2 we may suppose, without loss
of generality, that I (x) is positive definite. We may now suppose, as
before, that
111~ I(u)
for all integral u =1= o. Then
lUI + 1~
hi
U 2 + ... + h u" I ~ ~- .
hi 2
Then
and so
I11 :s;:
-- 3
(~)~ (n-l) Dl/n.
M(f)/Dl
may take any value t-i~ (t)i. This is in striking contrast with the
behaviour of indefinite quadratic forms (see § 4).
For later convenience we collect what has been proved so far and
express it as a theorem.
C. The signs 01 equality are required when and only when I(a:) is
equivalent to a multiple 01
lo(a:) =x~ +x~ +x; +X2X3 +XaX1 +X1X2.
We note again that we get as good an estimate for 111/22/33 as we
do for fl.l' This will be put in a wider setting in Chapter VIII, § 2.
Since lo(u) is an integer we have 10(u)~1. Since D(/o) =i, this
shows that the equality signs are required for 10' Part A of the theorem
follows from the rest. Hence we need only prove Part B and that
equality in B occurs only for multiples of 10'
Following GAUSS (1831a) we distinguish two cases. Suppose first
that
°< (! ~ (t)~,
where (t)i was the maximum possible value. It is not difficult to verify
that definite quadratic forms in any number of variables behave simi-
larly, d. Chapter V, Lemma 6. The first difference in the behaviour
of indefinite quadratic forms is rather trivial: it is quite possible that
M(I) =0, and this may occur either because there is an integral U::f.:O
such that I(u) =0, or because there are integral U::f.:O such that I(u)
3*
36 Reduction
is arbitrarily small but not o. The second difference is deeper: the values
of M(/)/I D (1)\1 do not fill the complete interval up to the maximum
possible value.
The position for indefinite binary quadratic forms has been the
most investigated. Here a very great deal is known about the possible
values of M(I)/I D (/)1 i. The greatest value is (f)l, given by the multiples
of X~+XIX2-X~. Otherwise M(t)~(~_)lID(t)ll. A well-known theorem
of MARKOFF ("the MARKOFF chain") states that there are only de-
numerably many possible values of M(/)/ID(f) 11 greater than t. There
are certainly intervals to the left of i which contain no values of
M(I)/I D (f) I~· The author has given a proof of the Markoff chain theorem
in his Cambridge Tract [CASSELS (1956a)], to which the reader is referred
for references for the various statements made in this paragraph. Here
we shall be content with finding the two largest possible values of
M(/)/1 D(/)I!·
There is a similar state of affairs for ternary quadratics but much
less is known. The most complete information is due to VENKOV (1945 a)
who has found the eleven largest values of M(f)/1 D (/)11, but they do
not seem to follow any general pattern, except that they are all given
by forms with integral coefficients. There are two unsolved problems
about indefinite ternaries which appear completely intractable. It is
not known whether there are forms t with M(f) > 0 which are not
multiples of integral forms; and it is not known whether the set of
values of M(f)/ID(f)ll has any limit point other than o. These two
problems are closely related [CASSELS and SWINNERTON-DYER (19SS·a);
see also Chapter X, Theorem XII].
This phenomenon of "successive minima" (not to be confused with
the "successive minima" of a lattice with respect to a point set which
is discussed in Chapter VIII) occurs very widely with indefinite forms.
It takes a great many different shapes and a general theory hardly
exists *. It is not possible to predict when it occurs: for example it does
not occur in the problems discussed in § 4.5 or § 5.
It is not difficult to see how "successive minima" can occur. An
inequality of the type I/(u)1 ~ 1, where 1(;£) is an indefinite form and u
is an integer vector, is really a pair of alternatives
either I(u) ~ 1
or l(u)~-1.
I{x) = } (i)
xr + ex Xl X 2 + .B xL
where the coefficients ex
and .B are to be investi-
gated. The only such form
which satisfies the ine-
qualities
I{O.1) ;;;:;-1.
1{1.1) ;S+1 .
Fig. 4
1{2.-1) ;S +1.
is xi + Xl x 2 - x~. as the reader will easily verify. Hence any other form
with I (i. 0) = 1 and M(f) = 1 must satisfy at least one of the inequalities
I{O.1);S+1. 1(1.1);;;:;-1. 1(2. -1);;;:;-1. The form X~+XIX2-X~ is
thus in a strong sense isolated from all other forms (1) with M(f) = 1.
For example if ex and .B are plotted as cartesian coordinates for the
form I. a condition I/{u l • u2)1;S 1 excludes a strip of the plane between
two parallel lines. The three conditions
exclude three strips. What is left consists of the point (1. - 1) and a
number of infinite regions which are separated from the point by one
of the strips (see Fig. 4).
In the actual proofs. this general principle tends to be obscured. If I
is an indefinite form and M(f) = 1 there is not necessarily an integral
vector u with II (u)1 = 1. though there are integral vectors with
1;;;:; II (u) 1< 1 + e for any given e> O. and further devices must be used
to deal with this. The difficulty is that if t> 1. then the form t I (x) = I' (x)
satisfies the same choice of inequalities "I (u);S 1 or I (u);;;:; - 1" as the
original/(x). Here t might be arbitrarily close to 1. that is. the coef-
ficients of I'(x) might be arbitrarily close those of I(x). Hence to pin
down I (x) uniquely we must some-how make use of the normalization
M(f) = 1. We do this by first finding the determinant of the form in
38 Reduction
question and then using this as part of our information. The actual
proofs will make the details clearer.
We shall later deal with isolation of this type from a more sophisti-
cated point of view (Chapter X). The treatment there will also help to
show why the additional devices just mentioned are effective.
11.4.2. The problem of the minimum of indefinite binary quadratics
has already been discussed in § 4.1. All we shall actually prove here
is the following.
THEORElI! IV. Let
I (x) = III x~ + 2/12 Xl X2 + 122 X~ (1 )
be an indelinite lorm and
D =D(I) =/1l/22-/~2'
Then
(2)
where B is any given positive number in the range 0 < B < 1. Put
± g(1, 0) = (1 -11)-1,
where
Indefinite quadratic forms 39
1]2-21]~0.
(13)
On eliminating IDI between (12) and (13) we have 4IX~1], so 0~4IX~1]
by (6). On substituting in (12) and (13) and using the fact that 1] =0
or 1] can be made arbitrarily small and positive, we find that ID I = 2.
Finally on putting I D I = 2, IX ~ 0 in (12) we get 1] = 0, so IX = 0 and
± g(x) = x~ - 2X~ = 11 (x) .
Otherwise for all 8 less than some 82> 0 we must have (8) with u =
(- 3,2), that is
4(1-1])2IDI ~ (1-1]) + (-3 + 2OC)2. (14)
But now the right-hand sides of (12) and (14) increase and decrease
respectively in O~IX~l. If IX~ 1~ we use (14) and if IX;;;; l~ we use (12).
In either case we obtain I D I ~ 2.21 + 0 (1]), so I D I;;;; 2.21 since I D I is
independent of 1].
It is at first sight remarkable in these proofs that the inequalities
obtained show that 1] = o. As already mentioned, this is tied up with
the phenomenon of "isolation" which we shall discuss more fully later.
11.4.3. We consider now the "one-sided" problem for ·indefinite
binary quadratic forms. In contrast with § 4.2 there is here no set of
successive minima. Theorem V A, which we now enunciate, is a special
case of Theorem IX of Chapter XI and is due to MAHLER.
THEOREM V. A. Let
I (x) = 111 x~ + 2/12 Xl X2 + 122 x~
be an indelinite quadratic lorm and
D = 111/22 - 1~2.
Indefinite quadratic forms 41
B. For any e> 0 there are infinitely many forms, not equivalent to
multiples of each other, such that
M+ (I) = I(u)inf> 0 f(u) > (2 - e) IDli. (2)
u integral
the ratio
111+ (/k)!1 D (/k)l~
may be arbitrarily close to 2.
Another simple proof of B would be by means of continued fractions.
11.4.4. As an interpolation between the problems of § 4.2 and 4.3
one may consider the forms / (x) such that there is no integral point
u =FO in
-a</(u)<b,
where a and b are given positive numbers.
For some values of a and b one may deduce the least possible value
of D (f) from the results of § 4.2. For example I if
a = 1, b=~
10
we certainly have
111(/) ;:s 1 ,
and so by Theorem IV either
ID(/)I ;:s 2
or / is equivalent to
t (xi + Xl X 2 - x~)
for some t. In the second case it is clearly enough that t?:.~. The
- 10
corresponding determinant is (~)2. ~ < 2. Hence we have an isolated
10 4
first minimum. Note that the form with the least IDI does not take
any values in the neighbourhood of - a.
For any given values of a and b the techniques of §§ 4.2, 4.3 some-
times apply. For example, the minimum determinant when a = 5, b = 3
is ID I = 24 given by 3xi - 8 x~; this being isolated. The verification of
this statement is left to the reader. Here we shall prove only the follow-
ing theorem due essentially to SEGRE (1945a).
THEOREM VI. Let
(1)
have determinant
(2)
--+
ID-I 21 -1) c c2 (1-cx)2.
(1-1))2
Hence
ID I ;;;. c + i c2 ;;;' k + i k 2
with equality only when
T}=O, cx=!, c=k,
so
Since Ik (1, 0) > 0 but Ik (1, x 2) -+ - 00 as x 2-+ + 00, there must be some
integer v ~ 0 such that
Ik(1,v}~O>lk(1,v +1}.
If (8) were insoluble, we should have
Ik(1,v)~k, Ik(1,V+1)~-1:
that is
v(k - v) ~ 0, (v + 2) (k - v) ~ O.
This is possible only when v =k, i.e. when k is an integer.
It remains only to show that when k is an integer there is no integral
U=f=O such that -1<lk(u)<k. Since the roots {} of Ik({}' 1)=0 are
irrational, it is impossible that Ik(U) =0. Hence we must deduce a
contradiction from
(9)
If there are several solutions of (9) we choose one for which the integer
I"11 is as small as possible. Clearly
"1 =f= O.
We require the identities
I. (z) = Ik {(k + 1) Xl - x2 , - k Xl + X 2}
= Ik{Xl + X2 , kX1 + (k + 1) x 2}
= (k + 2) Xl {(k + 1) Xl - X2 } - {(k + 1) Xl - X2 }2 - X~
= (k + 2) Xl (Xl + X2) - (Xl + X 2)2 - x~.
Since I. (u) > 0, the last of these identities shows that
"1 {(k + 1)"1 - "2} > 0
"1 ("1 + "2) > O.
On writing - U for U if necessary, we thus have
But then
I.(u) = k"~ + U2(ku1 - u 2 ) ~ ku~ ~ k.
Hence our assumption (9) was false.
Indefinite quadratic forms 45
1(-1,1)~-1, 1(1,k)~k,
the form I" gives an isolated first minimum when k is an integer. The
proof of these statements is left to the reader (d. papers quoted at the
beginning of § 4.4).
11.4.5. We now consider indefinite ternary forms. As already noted
(§ 4.1) there is a set of successive minima, the first eleven having been
found by VENKOV (1945 a). There is a derivation of the first four
minima due to OPPENHEIM in DICKSON (1930a) and a neat proof of the
first minimum only by DAVENPORT (1947a). Here we shall prove only
the following result.
THEOREM VII. Let
I (a:) = L Iii Xi Xi (1)
be an indelinite ternary quadratic lorm with determinant
D(f) =det(fii) *0. (2)
Then
M(f) = inf I/(u)! ~ I!Dli, (3)
"*'o
integral
If (6) is true but (8) is false, there must be integral u such that
I(u) = - (1-1]tl , where 1] ~O may be chosen arbitrarily small. Hence
1(x) is equivalent to a form g (x) of the shape
(1 -1]) g(x) = - (Xl + rx. X2 + (J X3)Z + h(X2' x3),
where rx., (J are real numbers and the form
h(x) = hZ2 x~ + 2h23 X2Xa + h33 xi
must be positive definite. The determinant of h (x) is
h22h33 - h~3 = - (1 _1])3 D = (1 -1])31 DI.
After a transformation on the variables x 2 , x3 , we may suppose that
h(x) is reduced; and so
( 10)
by Theorem II.
We now consider the indefinite binary form
G(XI' x 2) = (1 -1]) g(XI' x 2 , 0) =- (Xl + rx.x z)2 + h22X~,
of determinant - h22 • Clearly
M(G) ~ (1 -1]) M(g) = 1 -1].
Hence, by Theorem IV, either
h ~ 221 (1 _ )2 ( t 1)
22- 100 1]
Indefinite quadratic forms 47
M+(f) = M+ (g) = 1
since 17~ 0 may be chosen arbitrarily small. Otherwise the first alter-
native, namely (11), holds, and so, by (10),
IDI2(1_17).]_.(221)2> 7
- 4 100 2
This proves the proposition.
We may now suppose that
M+(f) = 1. (12)
As before, there is a form g equivalent to t such that
(1 -17) g(x) = (Xl + (X X2 + (J XS)2 + h(X2' xs),
where 17 ~ 0 may be chosen arbitrarily small, and the form
and
0;:;:;; lUI +«Ull+PUaI ;:;:;;i·
Then in each case we have
(1- t l) Ig(u)1 < 1 - fJ.
contrary to hypothesis.
Suppose that (17) holds. There is an integer t and a real number
l' such that by choice of sign
and so
h(ua.ua) + (1- T)2 =g(U~,U2.Ua);:;:;; -1 +fJ, (19)
h(u2• us) + (1 + T)2 = g(u~'. U2• us) ~ 1 - fJ. (20)
By subtracting (19) from (20) we have
i(1-fJ);:;:;;1';:;:;;i·
This is equivalent to (18) and so proves the proposition.
COROLLARY. If (17) holds, then U2 and ua cannot have a common
factor except ± 1.
For if U2=VU~, Ua=vu~, where v> 1, none of (15), (16) or (17) would
be satisfied by h(u~, u~).
PROPOSITION 3. Either
IDI~i (21)
or, after an equivalence transformation, we may suppose that
-i-fJ;:;:;;h(1,O);:;:;;-i+fJ. (22)
h(1,-1)~!-fJ, (23)
-! - fJ;:;:;; h(1,1);:;:;; -! +fJ. (24)
h(2.-1);:;:;;-2+fJ. (25)
IIX - il;:;:;; ifJ. (26)
1.BI~fJ (27)
provided that fJ is less than some absolute constant fJo> O.
Indefinite quadratic forms 49
so
Then, by (14),
(1-rJ)3I D I =h~3-h22h33;;;; -hU h3S'
Cassels, Geometry of Numbers 4
50 Reduction
so
IDI -~ (1 -t1]LJ_ > ~
(1 _7])2 2 = 2 '
Iv' + l- oc I~ IfJ,
Iv" + l- (oc + fJli ~ IfJ·
After a substitution of Xl +v' X 2 + (v" - Vi) Xa for Xl we may suppose
indeed that
1 .!-ocl~J-l1
2 - S·I'
Hence
by (36). Hence and by (14),
(1 .- 1]) 31 D I = h~ 3 - h22 hsa = I + (- A- ,u + i V) 1] + 0 (1]2) • (38)
= I+O(1])
But D is independent of 1], so*
IDI =f.
Suppose, if possible, that 1] * O. On putting ID I = I in (38) we have
- A - ,u + iv = - ! + 0 (1]) .
For small enough 1] this contradicts (34), (35) and (36), since they give
- A - ,u + iv = - t A + i (A - 2,u + v) + i (A + 2,u +v)
~-t-i-i=-f·
Hence 1] =0, so by (13), (26), (27), (31), (32), (33), we have
g(x) = (Xl + IX2)2 - ix~ - x2 xa + x~ = 10 (x) .
Since g(x) is equivalent to I(x), this concludes the proof of Theorem VII.
1I.S. Binary cubic forms. We must first consider briefly the
algebra associated with a binary cubic form
(1 )
Such a form may always be split up into linear factors with real or
complex coefficients:
I(X I ,X2 ) =[J({}jXI +"PjX2 ), (2)
l:oi:o3
With the form is associated the discriminant
In particular, D(f') =D(/) if 1and I' are equivalent, since then (5) holds
for some integers (1., fJ, y, <5 with (1. <5 - fJ Y= ± 1-
If a, b, c, d are real, then either all the ratios 1f';I1'); are real or two
of them are conjugate complex and the third is real, since roots ~ of
an equation I(t 1) =0 with real coefficients occur in complex conjugate
pairs. This subdivides the real non-singular binary cubic forms into
two essentially distinct types. We show now that two forms in the
same type may be transformed into each other by a transformation of
the type (7) with real «, fJ, y, <5. It is enough to show that all forms I
of a given type may be transformed into an I' which is fixed for the type.
We may suppose without loss of generality that either
in our two respective cases, where the bar denotes the complex conjugate.
Clearly these two cases are characterised by D> 0 and D < 0 respectively.
There exist numbers At, A2' Aa not all 0 such that
Then
or
(16)
(17)
and
(18)
I (Xl' X 2) = a X~ + b X; X 2 + C Xl X~ + d X~ (1 )
= II ({)jX 1 + "p;x 2), (2)
1:::>;:::>3
(3)
(4)
where
Binary cubic forms 55
IS
h' (Xl' X 2) = h (oc Xl + {3 X 2 , YXl + b X 2) .
Indeed this follows at once from (6) and the expressions (7), (8) of § 5.1,
on noting that
{);tp~ - {)~ tpi = (oc b - {3 y) ({)itpk - {)k tpi) = ± ({)itpk - -Ok tpi) ,
on using (7).
From either (5) or (6) we see that the determinant of h(xl , X 2 ) IS
(8)
In particular, h (Xl' X 2) is definite when and only when D> 0, i.e. when
1is a product of three real linear forms [when the {)i' tpi are real the form
(6) is clearly positive definite, but the converse is not so clear without
using (8) J.
When the {)i' tpi are real, the form 1 was said by HERMITE to be
reduced when the definite quadratic form h is reduced in the sense of
MINKOWSKI I .
Every form with real {)i' tpi is equivalent to a reduced form. For
the transformation which reduces the h (x) in MINKOWSKI'S sense also
reduces 1(x) ; since h (x) is a covariant of 1(x), as we have seen. Further,
this reduction can be carried out in only a finite number of ways since
we saw that a definite quadratic form can be reduced by only a finite
number of transformations.
II.5.3. We may now enunciate and prove DAVENPORT'S theorem:
THEOREM VIII. Let 1(x) be a binary cubic lorm with discriminant
D>O which is reduced in the sense 01 HERMITE (§ 5.2). Then
h (Xl' XI) = A x~ + B Xl XI + C x~
be the hessian of 1(z). so that
O~B~A~C. A>o (4)
CHALK'S result is that
with equality only for the forms (2) and (3). Since 4AC-BI~3Aa.
and 4AC-BI=3DU) by (8) of § 5.2. this will be a stronger result
than Theorem VIII.
We may suppose by homogeneity that
A =7. (5)
1(z) = a x~ + b x~ XI + CXl X: + d X:
these inequalities are
lal~1, Idl~1. (6)
Ia + b + C + dl ~ 1. Ia - b + c - dl ~ 1. (7)
By taking -I for 1 we may suppose that
a~1. (8)
Binary cubic forms 57
c<O<b,
and (13) becomes
(14)
Further, (5) becomes
7=A = b2 + 3a 1c1G b2 + 31 cI. (15)
7 G b2 + 31 c1~ b2 + 6/b,
and so
(16)
58 Reduction
Similarly we have
7~~+3Icl,
c
and so
1~ - c~ 2. (17)
Clearly a sign of equality can hold in (16) or (17) only if
a=-d=1, bc=-2. (18)
From (14), (16) and (17) we now have
9aldl~7+lbcl~11
and so
...--11
a~
- 9 ' I dl~~
- 9 .
But now
11 11
a-b+c-d~--1-1+-<1
- 9 9'
and so
a-b+c-d~-1. (19)
We now consider the two possibilities for 1(1, 1). If
a+b+c+d~-1, (20)
then on adding (19) and (20) we have
a-lcl~-1, so Icl~1+a~2.
Comparison with (17) shows that IcI = 2, and, since there is equality
in (17), we must have (18); that is
a=-d=1, b=1, c=-2.
Similarly, if
a+b+c+d~+1,
then
b+d~+1, so b~2;
and we have
a = - d = 1, b = 2, c= - 1.
This concludes the proof of the theorem.
11.5.4. When the binary cubic form 1 has discriminant D(/)<O the
hessian form is indefinite, and so a reduction of the hessian does not
single out a finite number of reduced forms from amongst the forms
equivalent to I. However, if D<O then only one of the linear factors
of 1 is real, and 1 may be put in the shape
l(x I , x 2) = (-&aXl + 'f'aX2)(P x~ + QX I X 2 + R x~), (1)
Binary cubic forms 59
I/(u)I;::;;;I~ll.
II, lurther, I (x) is reduced, then
1/(1,0)1;;;; 1, 1/(0,1)1 ~ 1,
1/(1, -1)1 ;;;;1, 1/(1'-f)I~1,
i.e. when
PI1fal;;;;1, RIV'al;;;;1, (~)
l1fa - V'al (P - Q + R) ~ 1, (4~
l1fa - 2V'al (P - 2 Q + 4R) ;;;; 1, (4a)
and that ID(f)il is a convex function of (P, Q,R) for fixed {}a,"Pa'
Hence the maximum of DU) is attained at the vertices of g, where
three of the plane faces meet [since it is easily seen that ID I~ 00 as
max (I PI, IQI, IR I) ~ 00]. The proof then follows from a rather tricky
estimation of DU) at the vertices of g.
II.6. Other forms. We briefly survey here results on the reduction
of forms other than those already discussed.
II.6.2. For binary forms of degree n~ 4 there is more than one
invariant. For example, a binary quartic form 1(Xl' X 2 ) which is the
product of two pairs of complex conjugate linear forms may be reduced
to the shape
where
XI =OCXI +{JX 2 , X 2 =yxl +bx2 ,
for some real (I., {J, y, band ft =ft (/) is a real number lying in
Iftl <i·
Two forms with different ft cannot be transformed into each other by
a homogeneous linear transformation of the variables. Further, ft (/) is
an absolute invariant in the sense that ft (t I) = ft (I), where t is any num-
ber. Of course we still also have the discriminant
where
1(Xl' X2) = II (1J;Xl + "P; x 2)·
i
The problem for definite binary quartics was solved independently
by DAVIS (1951 a) and CERNY (1952a) in the sense that they found the
best possible function y (ft) of ft such that every form 1 with invariant ft
has
inf 1(u) ~ y (ft) {D U) }A.
"*,o
integral
DAVIS (1951 a) also gives some results for indefinite binary quartic
and full references to earlier work. It is no longer true, as it was for
quadratic and cubic forms, that forms 1with D (/) = 0 assume arbitrarily
small values. This case was completely elucidated by DAVENPORT
(1950a).
The methods of these authors combines reduction techniques with
other tools drawn from the geometry of numbers.
There does not seem to be any systematic work on binary forms of
degree greater than 4.
Other forms 61
II. 6. 3. The only other types of forms I (Xl' .... xm) of degree n with
m> 2. n> 2 for which the best estimate of
M(I) = inf If(u)1
"*0
integral
This is the only invariant in each type. since there are obvious real
transformations taking I into
and
Xl(X~ +X=).
respectively. The two types are distinguished by D>O and D<O
respectively. The following two results are known:
THEOREM X. Let I(xl • X2 • xa) be a lactorisable ternary cubic lorm with
D (f) > O. Then there exist integers u 4= 0 such that
DO
I I (u)1 < 9.1'
except when I is equivalent to a multiple 01 one 01 the lorms
149 = x~ + x~+ x~- x~ x 2 + 5x~ x3 -2 Xl X~ + 6 Xl X:-2 x2 x:- X: X3 - Xl X. X3•
lSI = X~+ X~+ ~+6x~xa-3 Xl X~+9Xl X:-3 X2 X:-3 Xl X2 Xa.
lor which M(f) = 1 and D (I) = 49. 81 respectively.
THEOREM XI. Let I (:I)) be a lactorisable ternary cubic lorm with
D (f) < O. Then there exist integers u 4= 0 such that
II (u)l ~ I~ Ii .
The sign 01 equality is needed when and only when 1(:1)) is equivalent to
a multiple 01 the lorm
123 = x~ + x~ + xg + 2x~xa - XIX~ + XIX: - xax: - 3X1 X2 X3 •
We note that 140.181 and 123 are all of the shape
Norm (Xl + tpXa +tpx3).
62 Reduction
where 1, tp, VI are a basis for the integers of a cubic field. We shall
discuss later the reasons why this might have been expected (Chapter X).
For 149,/81 and 123 we have VI =tp2, and tpsatisfies the respective equations:
[By Norm is meant the product of the three forms obtained from the
given one by inserting the three pairs of conjugate values for tp and VI.]
The first equation here corresponds in an obvious way tQ the form in
Theorem VIII. The third equation here corresponds to the binary form
where the vii are integers and V Z2 V33 - v2av32= ± 1. Since h(:z:) has determinant
(1 - 7])8 D(f) and is reduced, we have bounds for the coefficients. The proof now
continues by an intricate and delicate chain of computations using these bounds
and the fact that Ig(u)1 ~t for all integers u*,o.
DAVENPORT'S treatment of Theorem XI starts off with a similar reduction but
the completion of the proof requires different ideas and the detailed consideration
of an intractable 2-dimensional figure.
11.6.4. The corresponding problem for the product of n> 3 homo-
geneous forms in n variables has been much worked on. Estimates
but no precise results are known, and these estimates were obtained
by other methods. We shall consider the case of large 1t in Chapter IX,
§ 8. The best estimates for n = 4, 5 in print appear to be those of
ZILINSKAS (1941 a) and GODWIN (1950a) respectively; but GODWIN refers
to a better estimate for n = 4, presumably the Vienna dissertation of
G. BOHM (1942) also mentioned in KELLER'S encyclopedia article
[KELLER (1954a)] but unavailable to me.
There is however a striking result of CHALK on the product of. the
values taken by n linear forms when these values are positive. He
shows that if L I , ••. , Ln are n linear forms in n variables ;E = (Xl' ... , xn)
with determinant LI =1= 0, then there exist integers u =1= 0 such that
That the implied constant 1 on the right-hand side of (2) is the best
possible is shown by the simple example Li = xi' CHALK'S theorem is
indeed more general than the form given here since it refers to the
product of inhomogeneous linear forms. Consequently we do not prove
it here, but later in Chapter XI, § 4.
64 Theorems of BLICHFELDT and MINKOWSKI
Chapter III
when it contains :E and y] and has volume V> 2", then it contains an
integral point u other than the origin. In this way we have a link
between the "geometrical" properties of a set - convexity, symmetry
and volume - and an "arithmetical" property, namely the existence
of an integral point in f/. Another form of the same theorem, which
is more general only in appearance, states that if A is a lattice of
determinant d (A) and f/ is convex and symmetric about the origin,
as before, then f/ contains a point of A other than the origin, provided
that the volume Vof f/ is greater than 2"d (A). In § 2 we shall prove
MINKOWSKI'S theorem and some refinements. We shall not follow
MINKOWSKI'S own proof but deduce his theorem from one of BLICH-
FELDT, which has important applications of its own and which is
intuitively practically obvious: if a point set at has volume strictly
greater than d (A) then it contains two distinct points :El and :E2 whose
difference :El -:E 2 belongs to A
The theorems of BLICHFELDT and MINKOWSKI may be regarded as
statements about the characteristic functions of a set f/, that is the
function X(:E) which is 1 if :E E f/ but otherwise O. There are generalisa-
tions of the theorems of BLICHFELDT and MINKOWSKI to non-negative
functions ""(:E) due to SIEGEL and RADO. These we present in § 3.
We do not in fact use these theorems later.
In § 4 we use MINKOWSKI'S theorem to obtain a characterisation of
a lattice which is independent of the notion of a basis: a lattice is any
set of points A in n-dimensional space which (i) contains n linearly
independent vectors, (ii) is a group under addition, i.e. if :E and y
are in A so are :E ± y, and (iii) has only the origin in some sphere
x~ + ... + X~<1J2, where 1J> O.
In § 5 we introduce the notion of the lattice constant LI (f/) of a
set f/. This is a number with the property that every lattice A with
d(A)<LI (f/) has a point other than 0 in f/, while there are lattices
whose determinant d (A) is arbitrarily near to LI (f/) with no other
point than 0 in Y. In § 6 we discuss at length a method due to MORDELL
Introduction 65
for all vectors z and y. The length o~ a vector is not an invariant under
all unimodular transformations, unlike most of the concepts we work
with, but we shall be concerned only with the topology induced by the
metric Izl and not the metric itself. Let
(1~i,i~n) (1)
be a real transformation of determinant
(2)
Clearly
lyl2 = ~ (~>XijX;t;£ n3A2 'Lx: = n3A21 z 12,
where
A = max IOCi;l.
Since det(oci;)=t=O, we may solve (1) for the xi and obtain, say,
X,='LPiiYi'
;
(3)
Then similarly
IzI2~n3B2IyI2,
where
B = max IPiil.
Hence there exist constants cl • c2 independent of z and y such that 1
(4)
bj=(fJli"",P"i) (1~j~n),
then the points of A are just the points (3) with Y EAo. The truth of
(i), (ii) in general now follows at once from (4) and the truth of (i), (ii)
for Ao.
111.1.3. We say that a sequence of vectors J!, (r = 1,2, ... ) con-
verges to the vector J!' as limit if
lim IJ!, - J!'I = 0
in the usual sense. Clearly a necessary and sufficient condition for this
is that the co-ordinates of J!, should converge to the corresponding
co-ordinates of J!', since clearly
maxlxil ~IJ!I ~nimaxlxil
for any vector J! = (Xl' ... , X,,). An immediate consequence of Lemma 1
ci) is that a sequence of vectors u, of a lattice A can converge only if u,
is the same for all sufficiently large r, say
u,=u'
A set [I' of points is said to be compact if every sequence of points
J!,t [I' contains a subsequence Y. =J!" (rl <r2 <···) which converges to
a limit in [1':
lim y;
5-+00
= Y' E f/ .
A classical theorem of WEIERSTRASS states that a set [l'in n-dimensional
euclidean space is compact if and only if it is both bounded (i.e. con-
tained in a sphere IJ! I< R for some sufficiently large R) and closed
(i.e. if J!, E f/ (1 ~ r < (0) and J!' = lim J!, exists, then J!' E f/).
For the sake of completeness we give a proof of WEIERSTRASS'S theorem.
Suppose first that .9' is a compact set. If.9' were unbounded, we could find a
sequence of points iX,E.9' such that JiX,J- 00, and then it clearly cannot contain
a convergent subsequence. Hence a compact set .9' is bounded. If.9' were not
closed, we could find a sequence of points iX,E.9' such that lim iX, = iX' is not in .9'.
Clearly every subsequence of the original sequence tends to IX'. Hence a compact
set .9' is closed. Now let .9' be a set which is both bounded and closed. We shall
show that .9' is compact. Let iX, (1 ~ I' < 00) be a sequence of points of .9'. We
may suppose that originally all the iX, are contained in a n-dimensional cube ~o
of side 2R for some R. This cube may be dissected into 2" cubes of side R by
taking planes through the centre of 'to parallel to the faces. For definiteness we
take the cubes of side 'to to be closed, that is to include their boundary points.
At least one of the cubes of side R must contain iX, for infinitely many 1'. Let'tl
be one of these. On repeating the original process with 'tl instead of 'to we obtain
t
a cube 't2 of side R contained in 'tl which contains iX, for infinitely many r.
And so on. In this way we obtain a sequence of cubes 'ts (0 ~ S < 00)' of side
5·
68 Theorems of BLICHFELDT and MINKOWSKI
21 -. R. such that ilS+1 is contained in il.. Each ils contains ;E, for infinitely
many". The cubes '1/. define a point ;E' which is contained in all of them. We may
now find a subsequence ;E" tending to ;E' as follows: ;E" is any point of the original
sequence in ilo: if "1' ...• ". have already been fixed with
then ".+1 is anyone of the infinitely many indices" >". such that ;E, is in ils '
Finally. since
;E' = lim ;E, •
s~oo '
lim~.h
s-+oo '
exist and are in [/. For if
of the A, such that Yl. is convergent. Then pick out a subsequence C,= (zu •...• Zmt)
of the 8. such that z.,
is convergent. The sequence Zu is also convergent. being a
subsequence of the convergent sequence YU' And so on. After m repetitions of
the process one obtains the required subsequence.]
111.1.4. By volume we shall mean in this book LEBESGUE measure
unless the contrary is stated. We shall however have no need of any
of the more recondite properties of measure; the sets we shall be mainly
concerned with have a volume by any definition. for example the
interiors of cubes or ellipsoids.
111.2. BLICHFELDT'S and MINKOWSKI's theorems. We use the no-
tation and results of Chapter I. To BUCHFELDT is due the realization
BLICHFELDT'S and MINKOWSKI'S theorems 69
that the following almost intuitive result forms a basis for a great
portion of the geometry of numbers [BLICHFELDT (1914a)).
THEOREM I. Let m be a positive integer, A a laUice with determinant
d(A), and 9' a point-set 01 volume V(9'), possibly V(9') = 00. Suppose
that either
V(9') > md(A), (1 )
and 9' is compact. Then there exist m + 1 distinct points ~l' ... , ~"'+l
01 9' such that the dillerences ~i -~; are all in A
Let b l , ... , b" be any basis of A and .let f1J be the generalized paral-
lelopiped of points
y1b1 + ... +y"b"
Then f1J has volume
(3)
Every point ~ in space may be put in the shape
~=u+1', uEA, 1'Ef1J,
and this expression is unique, since the points of A are just the
y1b 1+ .. · + y"b" , where Yl' ... , y" are integers.
This parallelopiped f1J will play an important part later (Chapter VII),
where it will be called a fundamental paral1elopiped for A
For each 11, E A let al(u) be the set of points v such that
1'Ef1J. v +UE9'.
Clearly the corresponding volumes V{al(u)} satisfy
~ V {al (un = V(9'). (4)
II
Since the al(u) are all contained in f1J, there must be at least one
point 1'0 Ef1J which belongs to at least m + 1 of the al(u), say
1'oE: al(u;) (1 ~ i;;;; m + 1),
where the 11,; are distinct. Then the points
~;=1'o+u;
70 Theorems of BLICHFELDT and MINKOWSKI
lim s, = O.
For each r, the set (1 +s,)!/ of points (1+s,)z, ZE!/ clearly has
volume
Zj,<=(1+B,).9' (1;:;:;;j;:;:;;m+1)
such that
But now the u,(i, i) are in I\. Hence (d. § 1.3) u,(i, i) is independent
of r from some stage onwards:
U,(i,i) = u'(i,i)
Hence
z·, -z·, = '(..) {E /\
U t }
• 1 ,7 =l= 0 (i =l= j), '
as required.
For later reference (Chapter VII) we note that in the proof for the
first alternative we have implicitly proved the following:
COROLLARY. Let.9' be any set 01 points and let 9;. be the set 01 points
v of the fundamental parallelopiped which can be put in the shape
V=Z-U, zE.9', UE/\.
BLICHFELDT'S and MINKOWSKI'S theorems 71
Then
V(~) ~ V(9").
If no difference Xl - x 2 between distinct points of 9" belongs to A then
V(~) = V(9").
The first paragraph is clear. The second follows since then no two
9l(u) overlap.
111.2.2. From Theorem I we deduce almost at once the following
theorem which is due, at least! for m = 1 to MINKOWSKI ("MINKOWSKI'S
convex body theorem").
THEOREM II. Let 9" be a point set of volume V(9") (possibly infinite)
which is symmetric 2 about the origin and convex 2 • Let m be an integer
and let A be a lattice 0/ determinant d(A). Suppose that either
(1~i~m+1),
such that
1
z:l:; -
1
z:l:i
{EA
=+= 0 (i =+= i)
}.
We introduce an ordering of the real vectors and write
Put
Then clearly
has volume m 2" but contains only m - 1 pairs of points of the lattice /\"
of integral points other than 0 namely
± (u, 0, ... ,0) (1~u~m-1).
where the ali are real or complex numbers. Such a set is clearly sym-
metric. It is also convex, since if x, yare in f/ and
(O~).~1),
then clearly
For sets f/ of this kind one can relax the condition of compactness
in Theorem II somewhat. We enunciate the theorem for the most
important case when the ali are all real. It will be observed that the
argument might be used for a wide class of convex sets Y.
Generalisations to non-negative functions 73
These define a set !/ in the space of X of volume 2" ci ... c". Hence
if there is strict inequality in (1) the theorem follows from the first
alternative in Theorem II. Let now 8 be any number in
0<8<1.
Even if there is equality in (i), there is certainly a point X.E Mother
than 0, with co-ordinates (X lt , •..• X".), such that
IX Ie I ~ CI + 8 < CI + 1
IXi.I<Ci (2~i~n).
But now there are only a finite number of possibilities for X., by
Lemma 1 (ii). Since 8 is arbitrarily small, one of those possibilities
must therefore satisfy (4). This proves the theorem unless det(aij) =0.
But then it is readily verified that (2) defines a region of infinite volume.
and so Theorem II certainly applies.
111.3. Generalisations to non-negative functions 2• The results of
§ 2 may to some extent be generalised to non-negative functions ",(x)
1 Cj>0 follows from (1) except when det(aij) = o. But we do not exclude this.
2 The results of § 3 will not be used later.
74 Theorems of BLICHFELDT and MINKOWSKI
since now tp (~) is given by (2). But this means that there are m + 1
distinct vectors u i such that v o+ UiE Y, and this is just the conclusion
of Theorem 1.
The proof of Theorem IV follows that of Theorem 1. Let b 1 , .•. , b"
be a base of A, and ~, as before, the set of
y1b1+···+y"b" (O~Yi<1);
so that every ~ is uniquely of the shape
~=v+U, VE~, uEA.
Then
V(tp) =f tp(~) d~
=~ ftp(u+v)dv
uEII vE~
=
VE~
J {~tp(u
uEII
+v)} dv.
Since ~ has volume V(~) =d(A), the theorem now follows at once.
Generalisations to non-negative functions 75
11.3.2. SIEGEL (1935il) has given a stronger form of Theorem IV which has,
however, remained rather sterile of applications. For notational simplicity we
enunciate it only for the lattice 110 of integral vectors. The function
cp(v) = ~!p(v + U) (1 )
"E A,
is periodic by definition. Its Fourier coefficients c (p) = C (PI' "', Pn), where pE 110 ,
are given by
c (p) = J cp(t,) e- 21Ii (pv) dv, (2)
9
where (p v) denotes the scalar product
PI VI + ... + Pn vn ·
On substituting (1) in (2), we have
c(p) = J !pix) e- 21Ii (px) dx, (3)
- 00 < XI <00
(1;;> i;;> u)
Since cp(u) :2: 0 for all v, there must be some Vo such that
L IJI (vo + te) = cp(vo) ~ V(!p) + {V(!p) }-1 L 1f !p (x) e- 2ni (px) dx 12. (7)
"EA, PEA.
P400
This is SIEGEL'S inequality.
When a general lattice II is substituted for 110 on the left-hand side of (7) then
11* must be read for 110 on the right-hand side, where 11* is the polar lattice of II
defined in Chapter I, § 5.
111.3.3. We now give RADO'S generalisation of MINKOWSKI'S convex
body theorem II. [RADO (1946a), see also CASSELS (1947a).] RADO
considered very generally a homogeneous linear mapping A of n-dimen-
sional vector space into itself given by
(1 )
when X=Ax. We write det(A)=det(Aij).
THEOREM V. Let 1p(x) be a non-negative junction oj the vector x in
n-dimensional space which vanishes outside a bounded set, and suppose that
if the first non-zero coordinate of a:1 - a:2 is positive. If a:1 =f= a: 2 then
either a: 1 > a: 2 or a: 2 > a: 1 • We construct w o, ... , w" ... in turn, so that
w,>o (r> 0).
The vector Wo is given. Suppose that W o , ... , W,_1 have already been
constructed, where r;;:;;1. There is a unique permutation Zkl(O~j~r)
Generalisations to non-negative functions 77
The r vectors
Zkj - Zk. (i = 1, 2, ... , r)
are distinct from each other and from o. Hence we may choose as w,
one of them which is distinct also from the r -1 vectors WI' ... , W,-I'
Since wi> 0 (1 ;;;'i~r) we cannot have w,= -wi' Hence the w,
do what is required.
Theorem V, will be an almost immediate consequence of the following
Lemma.
LEMMA 3. Suppose that (2) holds and that det (A) =!= 0, so that a trans-
formation A-I reciprocal to A exists. Then
l: tp (A -1 U + A-I t) ;;;, tp (0) + l l: tp (u) (5)
UE/\ UE/\
*0
for every real vector t.
For fixed t let z, be the sequence of vectors Z of 1\ such that
tp(A-1 Z+A-1 t»0 arranged so that
tp (A -1 z, + A-I t) ;;:;; tp (A -1 Zs + A-I t) (r;;;' s). (6)
Let w, be the corresponding sequence defined by Lemma 2. We apply
(2) with
x = x, = A-1 Zlr + A-It
Y = Y, = A-1 Z mr + A-It,
where lr and mr are defined by (4). Then
min {tp (X r), tp (Yr)} ~ tp (A -1 Zr + A-I t) ( 7)
by (6), and since lr;;;'r, mr~r. But now, by (4) again,
A(Xr - Y,) = wr
and so, by (2) and (7)
tp(Wr );;:;; tp (A-l zr + A-It).
Similarly, on interchanging x, and Yr' we obtain
tp (- w r) ;;:;; tp (A-1 zr + A-I t) .
Hence, since tp;;:;;O, we have
l:tp (u) ;;:;;tp(wol + l: {tp(W r) +tp(-Wr }}
UE/\ r>O
=- tp (A -1 Zo + A-I t) + 2 l: tp (A U + A-I t) ,
-1
u~/\
78 Theorems of BLICHFELDT and MINKOWSKI
since every vector uEA with 1JI(A-1 U+A-1 t) >0 occurs as a z,. But
now (2) with y = x implies that 1JI (0) ~ 1JI (x) for any x, and in particular
(9)
The truth of the lemma follows now at once from (8) and (9).
Finally Theorem V follows from (5) on integrating with respect to t
over a fundamental parallelopiped f!P of A defined as in § 2.1. The
left-hand side becomes
c j =(c 1 j,···,c ni )
Characterisation of lattices 79
so that trivially
* *
Here v 0, since a 1 , ••• , a.. are linearly independent. Further,
v ± 1 since h is not in Ml by hypothesis. We may suppose that b is
chosen so that Ivi in (1) is as small as possible. Let p be a prime divisor
of v and write
Then
PhI = u 1 a 1 + ... + un an,
where not all of u 1 , ••• , un are divisible by p since hI is not in Ml (because
v was chosen minimal). Without loss of generality, p does not divide U I ,
and so
lp -mu 1 =1
contrary to hypothesis. Hence the chain of lattices M1 , ... , M" ... must
have a last, MR; and MR then coincides with A.
111.5. Lattice constants. We must now introduce a number of new
definitions relating to lattices and points sets. The new concepts will
be subjected to a searching analysis in Chapters IV and V; here we just
prove enough to show their use and to enable applications of MIN-
KOWSKI'S theorem to be made.
Let g> be any point set. If a lattice A has no points in !7 other
than 0 (if 0 is in !7), then we say that A is admissible for !7 or
!7-admissible. We call the infimum (greatest lower bound) of d (A) for
all A-admissible lattices the lattice constant of !7 and write
L1 (!7) = inf d (A) (A is !7-admissible).
If there are no !7-admissible lattices then we say that !7 is of infinite
type, and write L1 (!7) = 00; otherwise !7 is of finite type and
O~L1(!7)<oo. An !7-admissible lattice A with d(A)=L1(!7) is said
to be critical. Critical lattices playa very prominent role in Chapter V.
Of course in general there is no reason why a general set !7 shbuld have
critical lattices at all.
Our definitions do not quite correspond with those of MAHLER
(1946d, e). He is usually concerned with closed sets !7 and says that
A is !7-admissible if no interior point of !7 except 0 belongs to A, that
is if A is admissible in our sense for the set of interior points of !7.
Our usage is a compromise between MAHLER'S and that proposed by
ROGERS (1952a).
111.5.2. The definition of L1 (!7) may be stood on its head: L1 (!7) is the
greatest number L1 such that every lattice A with d (A) < L1 has a point
Lattice constants 81
has lattice constant L1 (P)) = (Wand that the critical lattices are pre-
cisely those with a base b1=(b l l , b21 ), b 2=(b12 , b22 ) such that
(b 11 Xl + b12 X2)2 + (b 21 Xl + b22 X2)2 = X~ + Xl X2 + x~ (2)
identically. The reader will have no difficulty in making the translation
for himself (d. Lemma 4 of Chapter n. We can also make a geometrical
interpretation of (2l. Put
b11 = cos{), b21 = sin{},
b12 = cos1p, b22 = sin 1p.
Then (2) is true provided that
cos {} cos 1p + sin {} sin 1p = ! ,
that is provided that
-{} - 1p = ± n/3 .
Hence the critical lattice has as basis two points at angular distance
n/3 on +
X~ + X~ = 1. A further point on X~ X~ = 1 is b l - b 2 , as is
clear from (2). It is readily verified that the six points ± bl , ± b 2 ,
± (b l - b 2 ) are the vertices of a regular hexagon inscribed in X~ + X~ = 1.
111.5.3. In this and in the next section we shall use MINKOWSKI'S
convex body Theorem II to evaluate or estimate L1 (Y) for various
sets Y. Theorem II is directly applicable when Y is symmetric and
convex, since it asserts that then
L1 (Y) ~ 2- 11 V(Y). (1 )
This applies for example to the circular discP): X~+X~<1 and gives
L1 (P)) ~ n/4 = 0.785 .... , which may be compared with the exact value
(!)t=0.866 ... obtained above.
Even if our region Y is not convex or symmetric, we may obtain
estimates for L1 (Y) below if a convex symmetric body .:T is inscribable
in it. Clearly L1 (Y) ~ L1 (.:T) if .:T is a subset of Y, since every Y -admis-
sible lattice is automatically .:T-admissible. Hence
Hence
Ll (Y) ~ n"/n!.
We shall later obtain a rather better estimate than this (Chapter IX,
§ 8). We note the translation into the theory of forms: Let
Lj(~) =L CjiX i
I;:;;.;:;; ..
be real linear forms in the n variables ~ = (XI' ... , x,,) with det (cij)
Then there exists an integral u*o such that
*o.
117 L j (u) I~ :~ Idet (cii )!·
MINKOWSKI'S convex body theorem also permits the evaluation of
Ll (Y) for sets Y which are not symmetric in o. We reproduce here,
with his kind permission, Professor MAHLER'S elegant treatment of the
simplex, hitherto unpublished l . Let Y be an open simplex in n-di-
mensional space containing o. If the faces of Yare given by the equa-
tions
(O~i~n),
where the L j (~) are linear forms, then Y is the set of points satisfying
(O~i~n).
L I1.j Lj(~) =0
O;:;;i~"
identically in ~, where the I1.j are real numbers, and without loss of
generality
11.0> o.
1 It is given, however, in his mimeographed lecture course, Boulder (Colorado),
U.S.A., 1950, together with other interesting results about non-symmetric sets.
Lattice constants 83
If, say, Ct.l ~ 0, then Y would contain the infinite ray of points x satisfying
(j=l=O,1);
which is impossible, since Y is a simplex. Hence
(O~j~n).
where
(4)
We show that
(5)
where V('if) is the volume of the parallelopiped
(1~j~n).
and then
(O~j~n) ;
(1~j~n)
is admissible for Y.
If a is in Y, we must have Uj~O (1~j~n), and minuj~-1 if
a=l=o. But then, by (4), we should have
Lo(a) = - L Ct.j Uj 2; 1;
and so a is not in Y. Hence 0 is the only point of M in Y. Since
d(M)=rnv('if), this completes the proof of (5). We note that
2~n V('~) = Idol ~l, where do is the determinant of the n forms L 1 , .•• , L".
By (3) and (4), do is the least in absolute value of the determinants of
selections of n out of the n + 1 forms L o , ... , Ln.
Estimates of ,1 (Y) for non-convex sets Y may be obtained from
Theorem I instead of Theorem II. Let fJ£ be any set such that all the
6*
84 Theorems of BLICHFELDT and MINKOWSKI
differences
~l - ~2' ~l EfJl, ~2EfJl (6)
lie in !/. Then
LI (!/) ~ V(fJl),
since by Theorem I if d(A)<V(fJl) there exist two points ~I'~2EfJl
such that ~1-~2EA; and by hypothesis ~1-~2E!/. Of course if :Y
is a convex symmetric set inscribed in !/ we could take fJl = i:Y: but
then we get just the same estimate LI (!/) ~ z-" V(:Y) as by the use of
Theorem II. However MORDELL and MULLENDER found suitable sets
fJl in the case they were treating such that V(fJl) was greater than
2-" V(:Y) for any convex symmetric inscribed :Y. The increases are
usually comparatively small and obtained at the expense of some
complication. We refer the reader to MULLENDER (1948a) and the
literature quoted there for further information.
In Chapter VI are obtained upper estimates for LI (!/) in terms of
V(!/) which are valid for all sets (Minkowski-Hlawka Theorem and
related topics).
111.6. A method of MORDELL. In this section we develop a method
of MORDELL for finding LI (!/) precisely for point sets !/ which mayor
may not be convex. The method applies primarily to star bodies.
This class of sets is defined by the properties that the origin is an inner
point and any radius vector meets the boundary either not at all or in
precisely one point: in other words, if ~ is any vector other than 0, then
either tZE g for all t~ 0 or there exists a to such that tz is an inner
point of !/, a boundary point of !/ or not in !/ according as t<to,
t = to or t> to. We now have the I rather trivial
LEMMA 4. Let!/ be a star body and suppose that a constant Llo exists
with the following two properties.
(i) every lattice A with d(A) =Llo has a point other than 0 in or on
the boundary of !/.
(ii) there exist lattices A, with d (A,) = Llo having no points other
than 0 in the interior of !/.
Then LI (!/) =Llo. If further, !/ is open l , then the critical lattices are
lust the A,.
For suppose, if possible that M is an !/-admissible lattice with
d(M)<Llo. Let 1'>1 be defined by y"d(M) =Llo. Then the lattice I'M
of points y~, ~E M has clearly no points in or on the boundary of !/,
contrary to (i). Hence LI (!/) 6L10' On the other hand (1 +e)A, has
no points in !/ for any e> 0, where A, is one of the lattices given in (ii).
1 i.e. does not contain any of its boundary points. MINKOWSKI and following
him MAHLER define a star body to be closed. We depart from their nomenclature.
A method of MORDELL 85
Hence LI (9") ;£ (1 + t: t LI 0' so .1 (9") = LI o' The truth of the last sentence
of the lemma is now obvious.
When the description of star-bodies by distance-functions is intro-
duced in the next chapter, Lemma 4 will fall into place as part of a
wider theory.
MORDELL'S method of finding .1 (9") for a given star-body 9" may
now be described. First one must make an intelligent guess Llo at LI (9"):
in particular so that (ii) of Lemma 4 is true. If Llo has been correctly
chosen, then it may be possible to verify (i) and to find all the Ac in
(ii) by the following general procedure, of which the details naturally
vary widely from case to case. We suppose for simplicity that 9" is
open. Let M be any 9"-admissible lattice with d (M) = Llo. Then if
~ (1 ;£ j;£ r) is any collection of closed convex symmetric sets each of
volume
(1;£ j;£ r),
there must be points P;=FO of M in ~ for 1 ;£j;£r. Since M is 9"-
admissible, the P; must lie in fJt;, the set of points of ~ which are not
in 9". We may now use the hypothesis that the Pi are in a lattice M
of determinant Llo to obtain further points of M. Since these cannot
lie in 9", this gives further information about the Pi' In the end it
may be possible to show that M is one of a set of lattices A" all of which
have points on the boundary of 9". Lemma 4 shows that LI (9") =.1 0 ,
Of course the power of the method depends on a suitable choice of the ~.
MORDELL'S method is at its best in dealing with 2-dimensional
regions, since for these it is easier to grasp the geometry of the figure.
Before giving some concrete examples we must therefore study the
geometry of a 2-dimensional lattice more closely.
111.6.2. Throughout § 6.2 we denote by A a 2-dimensional lattice.
We regard vectors as coordinates of points on a 2-dimensional euclidean
plane, and use the normal geometric language to discuss their relations.
By distance we mean the usual euclidean distance. For later reference
we formulate our conclusions as lemmas.
We say that a point u of a (not necessarily 2-dimensional) lattice
is primitive if it is not of the shape u =ku1 , where U1EA and k>l is
an integer.
LEMMA 5. Let U be a primitive point of the 2-dimensional lattice A
Then the points of A lie on lines IT, (r =0, ± 1, ... ) which are parallel
to 0 U and at a perpendicular distance
rd(A)/lul
from it!. Each line IT, contains infinitely many points of A and these
are spaced at a distance Iu I.
1 As before lui = (u~ + u:)l, that is the distance from 0 to u.
86 Theorems of BLICHFELDT and MINKOWSKI
det(u,v) = ±d(A),
Clearly the points with r fixed but s varying lie on a line IT, with the
required properties.
LEMMA 6. Let u, v be points of the 2-dimensional lattice A such that
0, u, v are not collinear. Then a necessary and sufficient condition that
u, v be a basis for A is that the closed 1 triangle ouv should contain no
points of A other than the vertices.
The condition is clearly necessary, by Lemma 5, so we must prove
it sufficient. If there are no points of A in the triangle ouv other than
the vertices, then the same must be true of the triangles with vertices
where
0;;:;; Jli ;;:;; 1 , 0;;:;; "i;;:;; 1 (j = 1,2).
The index I of p, q in 1\ is
and that the only critical lattices of :% are those with the following
bases:
Al basis (1.1) and (1. -1)
AI basis (t. -t) and (-t.f)
As basis (t. t) and (t. t)·
It is readily verified that these lattices are :%-admissible and have
determinant 2. Hence by Lemma 4. it is enough to show that any
Xz :%-admissible lattice A with
r----- -----, d(A) =2 must be one of AI'
I h 0 :
J---I------i.' A2 • As·
/
',0
/
/
0' From now on we suppose
i-- that
I
I
I
I
d(A) = 2: A is :%-admissible.
0
I 0
XI The convex symmetric oc-
I C
I tagon
I
IL __
,,
~: IXII <to
,, /
/
/
IX21 <to
IXII + IXII < i
I I
IL _____ _ _ _ _ _ JI
has area
~ > 2I d(A).
Fig. 5
and
det(a,b)<1·2+1·1 =t,
so
det(a, b) = 2 or 4,
since det (a, b) is an integral multiple of d (1\). Suppose first, if possible,
that det (a, b) = 4, so that the index of a, b in 1\ is 2. For any integer
k> 1 the points k-1a, k-1b clearly lie in .YI" and so are not in the .YI"-
admissible lattice 1\: that is a and b are primitive points of A We
show now that l (b - a) is in A Since a is primitive there is a basis
a,d where, say, det(a, d) =d(/\) =2. Than b=ua+vd for some
integers u, v; and indeed v = 2 since det (a, b) = 4 = 2 det (a, d). Then
u is odd since b is primitive, so {(b - a) is in 1\ as asserted. But
} (b - a) is clearly in .:it, so we have a contradiction. Hence we can only
have
det(a, b) =2 =d(I\). (4)
This gives the estimate
(5)
since otherwise we should have the contradiction
Similarly
det(a, c) = - 2 = - d(/\) (6)
and
(7)
Since a, b is a basis for 1\ we have
c=sa+rb
for some integers r, s. On substituting this in (6) and using (4) we
obtain r = - 1 and so
b +c =sa (8)
i.e.
(9)
But
-~ < b1 + C1 < 3, 1~ a1 < I
by (1), (2), (3); so there are only the two possibilities
s =1 or s = 2.
First case s = 1. From (1), (2), (3) and (9) we have
b1 < 0, c2 < 0. (10)
From (4), (5), (6) we have
det(c, b) = 2
90 Theorems of BLICHFELDT and MINKOWSKI
that is
c1 b2 - c2 bI = 2.
But Cl~.i, b2~1 by (2) and (3); and O>bl~ -l, O>C2~ -l by (5),
(7) and (10). Hence (8) can hold only if
C1 = b2 = I, C2 = b1 = -l,
which gives the lattice 1\2'
Second case s =.2. By (1), (2), (3) and (9) we now have
bl~O, C2~O. (11)
We now consider the lattice-point
(db d2 ) =d= (b - a) = l' (b - c) .
By (2), (3) and (11) we have
o~ 2d1 = bI - Cl~ - 2,
O~2d2=b2-C2~2.
of area 2113= 4d (1\) (see Fig. 6). The only portion of fl not contained
in !!) is the curvilinear triangle ~ cut off by the arc of the circle between
a = (1,0) and b =(l, - VI)
and the image of ~ in the origin. We may
suppose without loss of generality that q is in ~.
Cleariy both p and q are primitive, since, if either were of the shape
ku with u"A and integer k>1, then u would be in!!). Further p=j=:q,
since P2= - VI but \ q2\ < I T·
We now apply Lemma 8. From what
"\--------
\
\
\
\
\
\
\
\ (10)=0
Fig. 6
\/(u)I~I~ll.
The sign 01 equality is needed when and only when I is equivalent to a
multiple 01
92 Theorems of BLICHFELDT and MINKOWSKI
except when 1\ has a basis a = (aI' a2L b = (bl , b2) such that identically
(3)
In stating the equivalence of Theorem VII and Theorem VII A we
have tacitly applied Lemma 4 to the star body f/. From now on we
shall be concerned only with Theorem VII A. We use capital letters
to denote points and coordinates, except that 0 is still the origin.
Further, 1\ is a lattice with d(/\) given by (1) which has no point other
than 0 in the set f/' defined by (2). The set f/' is shown in Fig. 7.
First, since Llo < 1, there is certainly a point P =F 0 of 1\ in the square
(4)
Since P does not lie in f/', either P or - P must lie in the first quadrant
and we may suppose without loss of generality that.
O~1l<1, O~Pz<1. (5)
From Fig. 7 (or from elementary algebra) we must have
ll+Pz~1. (6)
Suppose, if possible, that there were two such points, P and P'. Then
their difference P"=P-P' satisfies (4). Hence on interchanging P
and pi if need be, we may suppose that P" is in the first quadrant:
of course it may coincide with P or P'. Hence
p=p,+p".
A method of MORDELL 93
o2L+P
o
2fl-P
B
\
\
Fig. 7
The linear factors ·of both sides must coincide, and so, after inter-
changing AI, BI and A 2, B2 if need be we have
A~+A~=1
(9- V69)A~ + (9+ V69)A~ = o.
This determines At Ag uniquely, and so AI' A 2, B I , B2 since they are
all real.
Hence there are only two lattices of the type specified in the theorem,
namely those with base
A = (AI' A 2), B = (BI' B 2),
and
respectively.
The approximate values are
Al : 1.014, A2 : - 0·347
BI-:- - 0.017, B 2 : 1.0005.
All we shall in fact use are the inequalities
(8)
IX~-XIX~-x~1 <1.
Further, -r- l A is a lattice of determinant
(9)
(d. Chapter I, § 3).
The region -r-l!l' is shown in Fig. 8. The line Xl = 1 touches 10 (x) =
°
xi - Xl X~ - X~ = 1 at X 2 = and meets it again at x 2 =-1. The line
X2
-- --
o
X,
--
-:1-(lJI- --
°
lies in -r-l!l', except for a small region fJi in Xl < 0, X 2 < and the image
- ~ of fJi in the origin.
Suppose first that (1, 0)E-r-11\. Since d (-r- l A) =1, there are points
of -r- l A on the line x 2 = 1 spaced unit distance apart, by Lemma 5.
Since none of these can lie in -r-l!l' the only possibility is that -r-1A = Ao ,
the lattice of points with integral co-ordinates. But then A = -r Ao ,
which is one of the exceptional lattices permitted by the theorem.
Similarly, if (0, 1) E-r-1A, then A = -r Ao. Hence from now on we may
assume that
(10)
By Lemma 7, either there is a point q=l=o of A in the square !l,
or (1, 0) EA, or (0, 1) EI\. But the second and third alternatives have
96 Theorems of BLICHFELDT and MINKOWSKI
already been disposed of, and so, since q cannot lie in ",-1.9', we may
suppose that q is in Bt. Further, q must be primitive, since if q =kqI,
with integral k>1 and q I E",-lA, then qi would lie in IXII<j, IX21<t
and so certainly 1 in ",-1.9', contrary to the hypothesis that ",-1 A is
",-l.9'-admissible. Hence q is unique by Lemma 8.
We require another point of ",-IA. The tangent to lo(~) =1 at
(0, -1) is
Q (say) =-rq
lies in -r fJl.
+
Similarly -rf/ is bounded by Xi X~ = 1 and the tangents at
-r(O, -1)=B and at -r(1,O) =A. We now show that -rf/lies in
(13)
Indeed, since B 2> 1, the tangent to X~ + X; = 1 at B has negative
gradient and so meets Xi +X~ = 1 again at a point in (13). Since-rff
lies below this tangent, its points satisfy X 2 < 1. Similarly, since AI> 1,
the points of -rff satisfy X 1 <1. They clearly satisfy X 1 >O, X2>O.
But now we saw earlier that there is only one point, P, of /\. in (13).
Since -rp is in -rf/ we must have
-rp =P.
To sum up the results of our translation: there is precisely one point
Q E /\. in -r fJl. This point Q together with the unique point P of /\. in
(13) form a basis for I\. The point 2 Q - P lies in Xi X~~ -1- +
Let £' be the mirror image of -r fJl in XI + X 2 = O. By symmetry
there is precisely one point L, say, of /\. in £': this point together with
- P forms a basis for /\., and the point 2L P lies in Xi X~~ 1- + +
Cassels, Geometry of Numbers 7
98 Theorems of BLICHFELDT and MINKOWSKI
But now every point of the triangle oLQ is in one of the regions
Y, T 31 and.Yf'. By hypothesis there is no point of 1\ in Y, and we
proved that Q, L are the only points of 1\ in T 31,.Yf' respectively.
Hence Q, L forms a basis of 1\ by Lemma 6.
We have three bases P, Q: Q, Land L, -P for 1\ and must study
their relations. Now
det (P, Q) = det (Q, L) = det (L, - P) = d (1\),
since the determinants are ± d (1\) and are clearly positive. Write
P=uQ+vL.
Then
det (P, Q) = v det (L, Q),
det (P, L) = u det (P, Q) .
Hence
P=Q-L.
We have now reached a contradiction, since
2Q -P=2L+P,
and this point has been shown to lie both in X~ + X~;S; 1 and in
X~ + X~;;;; -1. The contradiction shows that there are no Y-admissible
lattices with d (1\) = Llo except those mentioned in the enunciation of
the theorem.
We have shown rather more. Let the line joining Band - A + B
(which forms part of the boundary of T3I) meet Xl +X2 =O in the
point (- c, c). Then it is clear that our argument shows that there is
a point of every lattice 1\ with d (1\) ;;;; Llo in the bounded region
IX~ + X~ I < 1 , max {i XII, IX21} ;;;; c,
except when 1\ is one of the two critical lattices. That is, IX~ + X~ I< 1
is boundedly reducible and indeed fully reducible in the sense of
Chapter V, § 7.
111.7. Representation of integers by quadratic forms 1. In this
section we digress to present a number of results in the arithmetic
theory of quadratic forms which can be proved very simply by the
methods of the geometry of numbers. The principle tool is the following
lemma.
LEMMA 9. Let n, m, kl' ... , k m be positive integers and ajj (1;;;; i;;;; m,
The set /\ 0/ points u with integral co-ordinates
1 :;;'j:;;' n) be integers.
m=PI"'Pg ,
with distinct primes PI' "', pg • We now show that to every prime P
there exist integers ap ' bp such that
a! + b! + 1 == 0 (P) . (1)
Indeed when Pis odd the numbers
(0;;:;; a < iP)' (2)
and
- 1 - b2 (3)
are each a set of i (P + 1) integers which are incongruent modulo p.
Since there are only P classes modulo P, there must be some integer c
which is congruent to an element of each set (2) and (3), that is
a~ == c == - 1- b!, so a~ + b~ + 1 == O. If P= 2, then a2 = 1, b2 = 0 will do.
We now consider (d. DAVENPORT 1947b) the lattice of integral
U = (u 1 , ••• , u 4 ) which satisfy the 2g congruences
U1 == ap u 3 + bp u 4 (P) } (4)
u 2 == bp u 3 - ap u 4 (P)
for P= PI' ... , pg • By Lemma 9, these form a lattice 1\ of determinant
d(/\) ~ pi ... p~ = m 2•
Hence there is a lattice-point other than 0 in the set
x~ + x~ + x~ + x: < 2 m
of volume
in2(2m)2> 24m2~ 24 d(I\).
If u is this point, then
o< u~ + u~ + u~ + u: < 2 m
and, by (1) and (4),
u~ + u~ + u~ + ui == (a! + b! + 1) u~ + (a~ + b~ + 1) u~ == 0 (P)
for P= PI' ... , Pg; that is m divides u~ +... +u~.This proves the result.
111.7.4. A famous theorem of LEGENDRE states that a ternary
quadratic form t (Xl' X 2 , x3 ) with rational coefficients represents 0 if
obviously necessary congruence conditions are satisfied. Following
DAVENPORT and MARSHALL HALL (1948a) and MORDELL (1951 a) we
verify this in a particular case, to which indeed the general case may
be reduced by simple arguments.
Representation of integers by quadratic forms 101
Let
1(:£) = al X~ + a2X~ + a3xt
where aI' a2, a3 are square-free integers no two of which have a common
factor, so al a2 a3 is square-free. We show that there exist integers u*o
such that I (u) = 0 provided that the following two .conditions are
satisfied
(i) there are integers AI' A 2, A3 such that
al + A~a2 == 0 (a3), a2 + A~a3 == 0 (a l ) , a3 + A~al == 0 (a 2)
and
(ii) there are integers VI' V 2 , V3 not all even such that
al VI2 + a2 V22 + aaV32 == 0 (22+A) ,
u2 == A 3 uI (Pi·
Then
al u~ + a2u~ + a3u~ == al u~ + a2u~ == (a l + a2A~) u~ = 0 (P) .
(II",) Suppose A. =0, so aI' a2 , aa are all odd. Now
v2 == 0 or 1 (22)
for any integer v. In condition (ii) precisely one of VI' V 2 , Va must be
{'ven, say V 3 • Then
o == al v~ + a2 v~ + aa v= == al + a2 (22) .
We impose the two congruences
(2), }
(2) .
Then
Chapter IV
Distance-Functions
IV.t. Introduction. In this chapter we introduce a number of
concepts which are useful tools in all that follows.
IV.t.2. A distance-function F(;lJ) of variable vector;lJ is any function
which is
(i) non-negative, i.e. F(;lJ) ~ 0,
(ii) continuous,
and
(iii) has the homogeneity-property that
F(t;lJ) = tF(;lJ) (t ~ 0, real).
The set [/ defined by
[/: F(;lJ) <1 (1 )
turns out to be a star-body in the sense already introduced in the last
chapter: that is, the origin 0 is an inner point of [/ and a radius vector
t;lJo (0 ~ t < (0)
either lies entirely in [/ [which happens when F(;lJo) =0] or there is a
°
real number to = {F(;lJo)} -1> such that t;lJo is an interior point of, a
boundary point of or outside of [/ according as t<to, t=to or t>to.
In § 2 we examine this relationship and show that conversely every
star-body [/ determines a distance function F(;lJ) such that the set (1)
is the set of interior points of [/. Since many, though not all, of the
point-sets of interest in the geometry of numbers are star-bodies, the
concept of distance-function plays an important rOle.
Most of the problems considered in Chapter II relate to star-bodies;
and then it is easy to write down the corresponding distance functions.
For example if 1(;lJ) is a positive definite or semi-definite 1 form, the set
1(;lJ) <1
corresponds to the distance-function
F(;lJ) = {I (;lJ) }1t"
where r is the degree of 1(;lJ). Again, if 1(;lJ) is an indefinite form of
degree r andk>O is a number, then the set
- 1 < 1(;lJ) <k
1 By semi-definite we mean that f (;E) Z 0 for all a: but f (;E) = 0 for some ;E * o.
104 Distance-functions
F(3:) _ {k--
1/'1/(3:)1 1/' if 1(3:) ~ 0,
- . 1/(3:)1 11' if 1(3:) ~ O.
The reader will readily verify that both the functions just defined
are in fact distance-functions. One advantage of introducing distance-
functions is that some of the ideas of Chapter II can be carried over
to all star-bodies. A simple example of a 2-dimensional set which is
not a star-body is
0<X1 X 2 < 'I.
Clearly star-bodies!/' which are symmetric, i.e. have the property
that - 3:E!/' when 3:E!/' correspond to distance-functions which are
symmetric in the sense that
F( - 3:) = F(3:) .
K. MAHLER (1950a) and C. A. ROGERS (1952a) have investigated a
wider class of sets which ROGERS calls star-sets and which include the
closed star-bodies as a sub-class. A star-set is a closed set such that
t3: E!/' whenever 0 ~ t ~ 1 and 3: E!/'. They are important in connection
with certain problems ("bounded reducibility" d. Chapter V, § 7) and
we shall mention them again; but we refer the reader to the original
memoirs for the details.
IV.l.3. Convex sets:f( are important as MINKOWSKI'S convex-body
theorem shows. It turns out that the convex sets which have the origin
o as an interior point are precisely the star-bodies whose distance-
function satisfies the inequality
F(3: + y) ~ F(3:) + F(y) .
This we prove in § 3. We call such distance-functions convex.
In § 4 we show that an n-dimensional convex set :f( has a tac-
(hyper) plane 1 at every point a on the boundary of :f(; that is a
(hyper) plane
which passes through a and is such that :f( lies entirely on one side
of or in n; say
(all 3: E:f() .
Clearly if there is a tangent plane to:f( at a, then it is the only tac-
plane. But tac-planes exist even when tangent planes do not, and they do
1 We use the words tac-plane and plane for tac-hyperplane and hyperplane.
'When n = 2 the corresponding thing is called a tac-line. The term supportplane
(German: Stiitzebene) is sometimes used.
General distance-functions 105
~: maxlx;1 ~ 1
and the generalized octahedron
£1): 2:IY;I~1.
It is easy to see that a plane 2: Y;x;= 1 for fixed y can contain a point
of the interior of ~ only if y is not in £1); and vice versa. We discuss
polar sets in § 4.
There is a rich theory of convex sets but we do not prove more than
is relevant to the geometry of numbers. For the rest the reader is
referred to the report of BONNESEN and FENCHEL (1934a) or EGGLE-
STON'S tract (1958a).
IV.2. General distance-functions. We set up now the relationship
between distance functions and star-bodies sketched in § 1.2.
THEOREM I. A. II F(x) is any distance lunction then the set
Y: F(x) <1
is an open star-body. The boundary 01 Y is the set 01 points x with
F(x) = 1 and points with F(x) > 1 are exterior to Y (that is, have a neigh-
bourhood which does not meet Y).
B. Conversely any star-body .r determines a unique distance-Iunction
F(x). II Y is the set 01 interior points 01 .r then Y is related to F(x)
in the way described in A.
1 We say that two point-sets meet if they have a point or points in common.
2 Strictly speaking the set .%'"** obtained from .%'"* coincides with.%'" except
possibly on the boundary. The distance-functions of.%'" and.%'"** are thus the same.
106 Distance-functions
We note first that two distinct star-bodies 9";. and 5; determine the
same distance-function F(x) if they have the same set of interior points,
but a distance-function defines precisely one open star-body, namely
F(x) < 1 and one closed star-body, namely F(x) ~ 1. Distinct distance-
functions 1;, ~ always determine distinct star-bodies. For then 1; (xo) =1=
~ (xo) for some x o' say Fl (xo) < F; (xo); and then there is a t such that
Ix - xol < r;
of Xo which lies in Y; so Xo is an interior point of Y. Here we have used
the standard notation
Ixl = (x~ + ... + x!)~.
Similarly, if F(xo) > 1, then there is a neighbourhood of Xo which does
not meet Y. Finally, if F(xo) = 1, then every neighbourhood of Xo
contains points txo both with t>1 and t<1, for which F(tx o) > 1,
F(txo) < 1 respectively: and so Xo is a boundary point of Y.
It remains to prove B. If.'T is any star-body, we define a function
F(x) as follows:
(cx.) F(x) =0 if tXE.'T for all t>o. In particular F(o) =0.
(p) If tx is not in .r for all t> 0 then, by the definition of a star-
body, there is a to=to(x»O such that tx is interior to or exterior to Y
according as t < to or t> to; and tox is on the boundary of ,i/. vVe put
(8)
lor all x.
We record for later reference two results, the first of which we have
already proved.
LEMMA 1. For every distance-function F(x) there is a constant C such
that
F(x) ~ Clxl
lor all x.
108 Distance-functions
then
(2)
This is true for r = 2 by the definition of convexity, and it is true for
r>2 by induction, since we may suppose that t1=F1, and then
t1or1+ ... + t,or, = t1or1+ (1 - t1) y,
Convex sets 109
where
y=--:r2
t2 + ... + --:r,EoA,
t, .go
1-tl 1-tl
are in f . If a = (aI' ... , a,,) be any other point of f , we shall show that
max Iail ~ 11-,,+1 (n!) V(.~).
1::;;1~"
If, say, a1=FO, then the whole of the simplex with vertices 0, a, rJe2'
... ,11e" is contained in f and has volume
(n!)-l· 11"-ll ~I·
Since this can be at most V(f), the result follows.
Finally we prove
THEOREM II. A convex body f of which 0 is an interior point is a
star-body. The corresponding distance function F(:r) satisfies the inequality
F(:r + y) ~ F(:r) + F(y) (3)
for all :r and y.
Conversely if F(:r) is a distance function for which (3) holds, then the
star-body
Ft:r) < 1 (4)
is convex.
110 Distance-functions
Hence
F(x + y) :£ F(x) + F(y)
since F(x), F(y) are the infima of S-I, t- 1 over s, t respectively which
satisfy (7).
The function F(x) is continuous at 0 by the same argument as was
used for distance functions. Since 0 is an interior point there is a neigh-
bourhood
Consider the distance l:rl - :r2 1 when :r1 ,:r2 run through the points
of ~, %2 respectively. Since Jt'i and Jt; are closed, this distance
attains its infinum at some points :rjEJtf (j = 1, 2); and :r~ =F:r~ since
Jt'i and ·""2 have no points in common. We show that the hyperplane 1T
which bisects perpendicularly the line-segment :r~:r~ will do what was
required. After a suitable rotation of the co-ordinate system and a
possible change of origin we may suppose that
:r~ = (- 'f}, 0, ... ,0), :r~ = ('f}, 0, ... ,0)
F*(y
1
+ Y2) = sup
..,
:Jl(Yl + Y.)
F(:Jl)
::;; sup :JlYl
-.., F(:Jl)
= F*(Yl)
+ sup :JlY.
.., F(:Jl)
+ F*(Y2)'
I (6)
But now (3). (4), (5) and (6) show that F*(y) is the distance-function
of a convex set, by Theorem II and its Corollary. This convex set is
bounded because of (5) and Lemma 2.
It remains only to prove (2); and here we need the convexity of
F(al), which we have not yet seriously used. If al =0, then (2) is trivial,
so let alo=FO be fixed. From (1) we have
F(al) F*(y) ~ a:y (7)
for all al and y: and so certainly
since one need clearly only consider the x with xy = 1 in (1), by homo-
geneity, if y*o. Further,
xoYo> 1, (12)
since xi! is on the other side of 1T from the origin, which is a point of (9).
From (11) and (12) we have
The required result (2) now follows from (8) and (13), since B is arbi-
trarily small.
This concludes the proof of the theorem. The reader will be able
to verify readily that the sets F(x) < 1 and F*(y) < 1 are related to
each other in the way described in § 1.3.
We have at once the
COROLLARY 1
F(x) F*(y) ~ xy
lor all x, y For any Yo* °there is an x o*°such that
F(xo) F*(yo) = xoYo; (14)
and vice versa.
We have already noted the first inequality, which is an immediate
consequence of the definition. By symmetry it is enough to show the
existence of x o, given Yo. The set fJI of points x with F(x) = 1 is
bounded; and it is closed since F(x) is continuous. Hence the continuous
function xYo attains its upper bound, say at xo. But we have already
seen that the upper bound is F*(yo), so (14) must hold.
We also shall need later
COROLLARY 2. Let ~, Jt; be convex sets with non-zero volume having
the origin as inner point and with respective polars ~* and :Yt;*. If .fl
contains .)("2 then :Yt;* contains ~*.
Let the corresponding distance functions be 1\ (x), F; (x), Fl*(X) ,
F2*(X). Then F;(x)~1\(x) by Theorem I Corollary. The definition (1) of
the polar distance-function then gives immediately F2*(Y) ~Fl*(Y) for ally.
The following corollary links polar distance-functions with the polar
lattices and transformations introduced in Chapter I, § V.
COROLLARY 3. Let F(x), F*(y) be a pair of mutually polar convex
distance-functions. Let 't be a homogeneous linear transformation and 't*
its polar transformation. Then F('tx) and F*('t*Y) are mutually polar.
For by the definition of 't* we have 'tx't*Y =xy for all x, y. The
truth of the corollary now follows from (1) and (2).
Convex sets 115
where Xl' ... , x,. are fixed linearly independent vectors and tl , "', tn
run through all real numbers in
(2)
Convex sets 117
0)
We first prove the following refinement! of a result of MAHLER
(1939 b).
THEOREM V. Let f be any closed symmetric convex set with volume
V(f) such that 0< V(f) < 00. Then there exist points ±X1 , ... , ±XnEf
such thatf is contained in the parallelopiped'C with faces ±TIJ(1 ~ J ~ n),
where TIJ is the hyperplane through the points xJ±Xj (j=t-j). Further,
the generalized octahedron £& with vertices ± Xi (1 ~ j ~ n) is contained
info
The last sentence is in any case trivial by convexity. We take for
Xl' ... , Xn points of fsuch that the volume of £& is a maximum. Such
a choice is possible since f is closed and bounded. If f were not
contained in 'C, there would be a point y on the opposite side of the origin
from one of the faces ± TIJ' say on the opposite side of TI". Then the
generalized octahedron with vertices ±XI , ... , ±X,,_l' ± Y would have
greater volume than £&, contrary to construction.
COROLLARY 1.
V(f) ~ V('C) ~ n! V(f) ,
V(f) ~ V(£&) ~ (n !tl V(f).
For the left-hand inequalities are trivial, and the right-hand ones
follow from them and V('C)~n! V(£&).
COROLLARY 2. Let f , !l' be any two closed symmetric convex sets of
finite non-zero volume. Then there is a homogeneous linear transforma-
tion ~ of the variables such that
n-l~ ff ( f (n~ ff
and
(n!tl V('t'") ~ V(~ ff) ~ (n!) V(f).
Let Xl' ... , Xn be the points of the theorem for f and let Yl' ... , Yn
be the corresponding points for ff. We determine ~ by the equations
(1~j~n).
Then the 'C, £& of the theorem are the same for f and ~ ff. The stated
results are now trivial, since n-l'C(£&.
1 Suggested by Professor C. A. ROGERS, who disclaims originality. The same
method proves a corresponding result for non-symmetric bodies in which the
inscribed and circumscribed bodies are both simplexes (d. MAHLER 1950a). There
are also results about inscribed and circumscribed ellipsoids (J OH:-I 1948 a).
118 Distance-functions
~ ~ V(f) V(f*) ~ 4n
(n!)2 - -
and
(/)2~ J(f) J(f*) ~ 1,
where (/) is the lattice constant 0/ the octahedron L Ixi I~ 1.
The first pair of inequality is MAHLER'S (1939a, b) and the proof
of the second pair is practically identical. When n =2 MAHLER (1948a)
has determined the best possible inequalities namely
i~J(f) J(f*) ~1,
equality on the left-hand side being necessary when f is a square and
on the right when f is a circle. For related inequalities and references
to later work see BAM BAH (1954c and 1955a).
We now prove the theorem for the lattice constants. The proof for
the volumes is similar. Let -r be any homogeneous linear transformation
and -r* its polar transformation, so
det (-r) det (-r*) = 1. (1 )
The bodies -rf, -r*f* are mutually polar by Theorem III, Corollary 3.
Since
IXil~1 (1~f~n).
We saw already in § 1.3 that fo* is the generalized octahedron
Distance-functions and lattices 119
Hence
A (fo) A (JtO*) = A (JtO*) = f/> (2)
by the definition of f/>.
Now consider a general f , which we may suppose without loss of
generality to be closed. Let ~ and !l) be the parallelopiped and octa-
hedron given by Theorem V so that
(3)
The polar of the parallelopiped ~ is an octahedron ~* which is inscribed
in f* by Theorem III, Corollary 2. Similarly the polar of the octa-
hedron !l) is a parallelopiped !l)* and
~*)f*)~*. (4)
We now show that
(5)
where f/> is given in the enunciation. By Theorem V we have
(6)
any t>O:
tF(A) = F(tA) < 1.
Hence F(A) =0 on letting t-+ 00.
We note also the rather trivial
LEMMA 7. Suppose that the distance-function F(;£) vanishes only for
;£ =0. Then every lattice A contains a point a=t=o such that F(A) =F(a).
In particular, F(A) > O.
For by Lemma 2 there is then a number c> 0 such that
F(;£) ~ cl;£l.
Hence
F(;£) ~ F(A) +1 (8)
implies that
1;£1 ~ c-1 {F(A) + 1}. (9)
But now by Lemma 1 of Chapter III there are only a finite number
of points of A for which 1;£ 1is less than a given bound, and so there
are only a finite number of points ;£ of A satisfying (9). If we take
a =t= 0 to be one of those points for which F(a) is least, then a enjoys
the properties required.
Chapter V
and
(1 )
by l.
We require a measure of the size of the coefficients of the matrix
of a transformation 'to We write
(5)
sInce
, Ix,1 ;;:;; Ixl ;;:;; n~ max
max , IXil
for all x.
124 MAHLER'S compactness theorem
We shall also need to use the fact that if ~ is near to the identical
transformation l, then ~-l exists and is also near to l. This statement
is made more precise in the following lemma.
LEMMA 1. Let ~ = l + a be a homogeneous linear transformation with
as required.
It remains to show that ~ is nonsingular; and for this it is con-
venient to use another characterization of II~II. Put
~(x)=n-lLlxil, ~(x)=maxlxil· (9)
i 1
The reader will have no difficulty in verifying that (2), (3) and Lemma 1 continue
to hold when IIIIF is substituted for 1111. Since we have used lorl to denote the
Linear transformations 125
size of the vector:l:, it might have been more tidy to use lI'tIlF., where 1<;,(:1:) = 1:1:1,
to measure the size of't. We have chosen 1I't1l because of its simpler expression
in terms of the Y'j. The choice of 1111 instead of some IIIIF is, for all essential
purposes, irrelevant, since it follows from Lemma 2, Corollary of Chapter IV that
where c; and c~ are numbers depending on the particular function F, but not on 'to
We shall also need later two lemmas relating to distance functions
and linear transformations.
LEMMA 2. Let F(z) be a distance function such that F(z) =0 only for
;l! = 0, and let 't be a linear transformation. Then there is a number C1
depending only on F and 't such that
F('t z) ~ C1 F(;l!)
for all z.
For
F(;l!) ~ czF('tz).
LEMMA 3. Let F(z) be a distance function such that F(;l!) =0 only for
;l!=0. Then to every 8 in 0<8<1 there is an 1]=1](8»0, depending
only on F and 8, such that
1- £::;:
-
F('t:l:l ::;: 1
F(:I:) -
+£ (13)
M ='t'/\
for some non-singular homogeneous transformation 't', and indeed in
infinitely many ways. For if aI' ... , an, b l , ... , b n are bases for /\ and M
respectively, there is a uniquely defined homogeneous linear trans-
formation 't' such that
(1~i~n);
and then
M ='t'/\.
We say that a sequence of lattices /\, (1~r<oo) tends to the lattice
I\' if there exist homogeneous linear transformations 't', such that
1\ = 't',/\' (2)
and
(3)
so
by (3) of § 2.
LEMMA 4. A necessary and sul/icient condition that the se.quence 01
lattices /\, (1;:;;; r < 00) tends to I\' is that there exist bases
b~, ... , b~,
and
b~, ... , b~
01/\" I\' respectively, such that
bj -+ bi (4)
The last limit is meant, of course, in the sense of the ordinary con-
vergence of vectors: Ibi - bi 1-+ O.
The proof of Lemma 4 is almost trivial. Suppose first that /\,-+1\'
and let 't, be the transformation satisfying (2) and (3). Choose any
basis bi for I\' and put
bj='t,bj (1;:;;;j;:;;;n; 1;:;;;r<00). (5)
Then by (5) of § 2 and (3), we have
Ibi-bil =1('t,-L)bil;:;;;nill't,-Llllbil-+o (r-+oo).
Suppose conversely that the bases are given satisfying (4). We may
define't, uniquely by (5). Then clearly 11't,-LII-+o.
The following criterion is rather less trivial.
THEOREM 1. A necessary and sul/icient condit£on that /\,-+1\' is that
the lollowing two conditions be both satislied:
(i) il a' E1\', there are points a' E/\, lor r = 1, 2, ... such that
a' -+a' (r-+ 00). (6)
(ii) il c is not in 1\', there is a number 'Y}>O and an integer ro>O,
both depending on c, such that
la'-cl>'Y} (7)
lor all a'E/\, with r;;:;;ro.
128 MAHLER'S compactness theorem
It is quite straightforward that (i) and (ii) are satisfied when I\,--+/\'.
In (i) we have only to put
a' = 't,a' ,
Now
II p,II--+ 0 (r --+ 00)
by Lemma 1 and since IIT,- LII--+O. Hence by (5) of § 2 and (11), we
have
Ia' - a' I ~ n& II p, III a' I ~ nt II p, II {I c I + 1'}} < 1'}
for all r greater than some roo From this and (9) and (10) we have
la' - cl ~ 21'} =1'}1'
This is in contradiction to (8). Hence statement (ii) of the theorem is true.
We must now show that if (i) and (ii) of the theorem are true then
I\,--+/\'. We require a lemma of some independent interest.
LEMMA 5. Let C1 , ... , c n be linearly independent points 0/ a lattice 1\
but not a basis. Then 1\ contains a point
d = {}l c1 + ... + {}ncn,
Convergence of lattices 129
and
d = ia
will do what is required. A slight refinement of the argument. which
is left to the reader. shows that the i in (14) may be replaced by! but
by no larger number.
We now revert to the proof of Theorem I. Suppose that /\, and I\'
satisfy (i) and (ii). Let b~ • .... b~ be any basis for 1\'. By (i) there exist
sequences of points
bj_bj (1~j~n. bjE/\,). (15)
We show that bj (1 ~j~n) is actually a basis for /\, except. possibly.
for a finite number of r. For if bL .... b~ is not a basis for /\'. let
(16)
be a point of /\ with
(17)
which is given by Lemma 5. Since the fJi , are bounded. they contain
a convergent subsequence by a classical theorem of WEIERSTRASS (d.
§ 1.2 of Chapter III). say
(18)
where
r 1 <r2 < .. · <r,< .. ·
is an increasing sequence of integers. Then
d' (say) = LfJ;b; = limd".
i '-+00
Cassels, Geometry of Numbers 9
130 MAHLER'S compactness theorem
by (15), (16) and (18). Hence d'eA' by (ii) of the enunciation of the
theorem. This is a contradiction since
J.::;;
4-.
max If)'1 ::;;.l
I - 2'
1
by (14) and (15), and since bj (1 ~i~n) was defined to be a basis for A'.
The contradiction shows that bi is a basis for A, except for a finite
number of r. If the bi are changed for these exceptional r so that bi
(1 ~ i ~ r) is a basis for A, for all r this does not affect the limits (15).
Hence the criterion is certainly sufficient by Lemma 4.
V.3.2. In Chapter X we shall need the notion of a neighbourhood
of a lattice, and we shall mention it again in passing briefly in § 9 of
this chapter.
A set £ of lattices A is said to be a neighbourhood of the lattice M
if it contains all lattices
A =-rM (1 )
with
since then
N =IIA
where
and then
as in § 3.1.
Clearly the sequence A, (1 ~r<oo) of lattices tends to M if and
only if every neighbourhood of M contains all but a finite number of
the A,.
Although we nowhere use it, we note that it is in fact possible to
introduce explicitly a metric into the space of all lattices. Let A and M
be two lattices and let
Convergence of lattices 131
where the infima are over all non-singular a and 't such that
/\=aM M='t/\.
Put
O(M,/\) = 0(/\, M) = max {log (1 +,u),log(1 +v)}.
Then we have the triangle inequality
0(/\, N) ~ 0(/\, M) + O(M, N);
since if
/\ = (l + PI) M, M= (l + P2) N;
then
/\ = (l + Pa) N,
where
II Pall = II PI + P2 + PI P211 ~ II PIli + II P211 + II PIli II P211 ;
and so
log (1 + II P31 \) ~ log (1 + II PI I\) + log (1 + II P211) .
The neighbourhood defined above is the one associated with this
metric, since if
/\=aM
with
lI a -lll< 1J<1;
then
where
and so
0(/\, M) <_11_.
1-11
V.3.3. The continuity of the distance-function F(z) of the vector z
is reflected as a semi-continuity of the function
F(/\) = inf F(a) (1 )
aEII
*0
of the lattice /\ considered in § 4 of Chapter IV. For certain later
applications it is useful to allow the distance-function F and the lattice /\
to vary simultaneously.
THEOREM II. Let /\, (1~r<oo) be a sequence of lattices tending to
the lattice 1\'. Let .F;(z) (1~r<oo) be a sequence of distance functions
which converge uniformly to the distance-function F' (z) in the unit sphere
Izl<1. Then
F'(I\') ~ lim sup F, (/\,). (2)
'--"00
9·
132 MAHLER'S compactness theorem
The proof is very simple. Since F,. (tx) = tF,. (x) for t> 0, the con-
vergence of F,.(x) to F'(x) is uniform in any bounded set of points;
in particular, since the distance-function F'(x) is continuous by defini-
tion, if a, is any sequence of points converging to a point a', we have
lim F,.(a,) =F'(a').
' ..... 00
since F,. (a,) ~ F,. (/\,). The result (2) now follows from the definition (1).
The sign of equality need not hold in (2) even when F,. =F' for alI r,
but we defer giving an example until § 10.5. However, much more
than Theorem II is true if F'(x) = 0 only for x = 0, i.e. if the set F' (x) < 1
is bounded (Lemma 2 of Chapter IV).
COROLLARY. Suppose that the hypotheses 01 Theorem I I hold and that
the only point x such that F'(x) = 0 is x = o. Then
lim }~(/\,)
' ..... 00
for all r ~ r0 and all x with Ix I = 1. Hence for all x whatsoever and
r~ro, we have
1F,.(x) -F'(x)1 <celxl ~eF'(x);
so
F,(x)
1 - e < F'(:E) < 1 + e. (5)
for all r greater than some rl , by Lemma 3. Hence by (6) and (5) with
"C'r~for ~ we have
(1 - e)2< F,(~,a:) < (1 + e)2
F'(a:)
for all r> inax (ro' r1). But now A, is just the set of "C',~ with ~EN,
and SOl
(1 - e)2~ F,(I\,) ~ (1 + e)2.
- F'(i\') -
{F(M'I)}" = 1] d (M'I) .
After Theorem VI we shall be able to replace the second < in (1)
by ~.
Suppose that 1] satisfies (1). Then there exists a lattice N, such that
{F(N)}" > 1] d (N) . (2)
Let b 1 , ''', b" be any basis for N; and for 0 < e < 1 let N. be the lattice
with basis
Then
d(N.) = ed(N)
and
Hence
(e ~ 0).
and so b~, ... , b~ are linearly independent. Hence there exists a lattice
M' with basis b~, ... , b~ and, by Lemma 4,
(t~oo).
Compactness for lattices 135
I
beginning but with nRj2 instead of R.
It remains to prove Lemma 8. By Theorem I of Chapter I there is
a basis Cli •.. , c n of A such that
al=vllcl •
a2 = Vn c1 + V 22 C 2 '
(3)
a" = Vn1 c l + ... + v"ncn'
where the Vii are integers and Vii=F o. We shall take bi of the shape
(4)
136 MAHLER'S compactness theorem
(6)
where
and
(i <i).
Then by the convexity symmetry and homogeneity of F(x) we have
Ui = e = (0.:-::0, 1,~)
j (1~i~j).
1 The reader may prefer, instead of studying the proof here, to turn to §§ 1, 2
of Chapter VIII, which are independent of the intervening matter.
2 I now prefer the proof given by CHABAUTY (1950a) to the version given here.
138 MAHLER'S compactness theorem
Ix;! ~f (1~i~i) }
(5)
IXil ~ 1p U+1~i~ n).
Let ct be one of the finite number of points of Nt other than 0 in (5)
for which
max Ix·11
j+l~i~n
(6)
is a mInImum. Since the c t are in the bounded set (5), they contain a
convergent subsequence, say
(7)
where
Write
(8)
so that clearly
IAil ~ f (1~i~j) }
(9)
IAil ~ 1p(i + 1 ~ i ~ n).
Suppose first, if possible, that AiH = ... = An = 0, so that ai+I is linearly
dependent on a l , ••• , a i . We are assuming statement @3i to be already
established. Hence by (7) we could apply @3;' with Yi=A, (1 ~i~i)
and it would follow from (9) that Al = ... =Ai=O, and so
lim ct, = o.
S--+ co
This contradicts hypothesis (ii) of the theorem, since ct'E Nt, = /\, for
some rand ct,=t= o. Hence the vectors aI' ... , ai +1 are linearly inde-
pendent. We put r. = Nt" and will show that the statement @:ii+I
now holds for N~+! = r..
The statement @:i;+l is trivially true. So far as ai-II is concerned,
@:if+I follows from (7); and so far as the remaining ai (1 ~ i ~i) are con-
cerned, @:if+! follows from @:ij since r.is a subsequence of Nt.
It remains to prove @:i;~l' Suppose, if possible, that there is an
increasing sequence of integers
(10)
and vectors
dS",!O[
>,.
1 The only property of !- we use is t < ! < 1.
Compactness for lattices 139
such that
lim d'm = d (say) }
t-+oo (11 )
0lal + ... + OJ+1 a j+l'
°
=
where 1 , ... , OJ+1 are not all integers. By ®f+1' which we have already
proved, we may add integer multiples of aI' ... , a j +1 to the right-hand
side of (11), after appropriate modification of the sequence d S'". Hence
we may suppose in the first place that
IOJ +1 I ~ i (12)
and in the second place, by (3). that
Idil~i<! (1~i~j), (13)
where, as usual, d=(d l , ... , dOl)' From (8) and (12) we have
. max Id;I=IOj+1l.max
J+l~.~n J+l~.~n
IA;I)
(14)
~i. max IA;I
J+lS.~n
(15)
We now show that this in contradiction with the definition of the vectors
ct as the vectors x of Nt in (5) other than 0 for which (6) is as small
as possible. Since ct· --+ a j +1' we have
lim max ICit I = max IA;I, (16)
r---+oo j+l~i~n ' j+l~i~n
where
By (13) and (15) the vector d Sm certainly lies in the region defined by
(5) when m is large enough. Further, dSmE NT, where T =tsm' But
now, by (14) and (16), the function (6) is certainly greater for c T than
it is for d Sm when m is large enough, which contradicts the choice of CT.
°
The contradiction shows that if (11) holds then 1 " " , OJ +1 are all
integers; that is the statement ®;~l holds.
This ends the deduction of ®j -1 1 from ®j' and so concludes the proof
of Theorem IV.
We note a form which is often useful in applications and which does
not depend on the use of the special distance-function Ix I.
COROLLARY. Let F(x) be any distance function and let A, be any
intinite sequence at lattices satisfying the two conditions
(i) d(A,)~K tor all r, where K is independent at r.
(ii) F(A,)~x>o for all r, where x is independent at r and, as usual,
F(A) = inf F(a).
aE/\
For by Lemma 1 of Chapter IV there is a C> 0 such that F(3J) ~ C 13J I'
and so
IA,I ~ C-IF(A,) ~ C-I" > o.
V.4.3. An almost immediate consequence (d. MAHLER 1949a) of
Theorem IV is
THEOREM V. Let !I' be any open set. Let~,~, ... , ~, ... be a
sequence at open s1-tbsets at !I' such that
(i) ~ is contained in 9; it r < t,
(ii) the origin is an inner point at .9i,
(iii) every point 3J ot !l'is in ~ tor some r.
Then
LI(!I') = lim LI(~).
,-+00
LI(~) ~ LI(!I') (1 )
for all r. Suppose that
lim inf LI(~) < LI(!I'). (2)
'-+00
by Theorem I. Hence 3J =p' satisfies (3) for all r greater than some ri .
For r> max (R, r1 ) this means that p, is in !I' contrary to our assumption.
The contradiction arose from the assumption that (2) is true. Hence
the theorem is true by (1).
Critical lattices 141
and
F(~) = Jx~ + x~ + x~ - x: - x:J~.
The problem whether these functions are of infinite type or not is equi-
valent to the problem whether all indefinite quadratic forms in 5 vari-
ables represent arbitrarily small values (including 0) or not for integer
values of the variables (d. § 3 of Chapter I). A classical theorem of
MEYER says that if the coefficients of the form are rational then it
represents O. Recently DAVENPORT and more recently B. J. BIRCH have
developed an attack on this problem but it appears to work only for
1 When the !I' and ~ are star-bodies, say, with distance-functions F(a:) and
Fr (a:) , the hypotheses of Theorem V imply that, for each x, Fr (a:) tends monotonely
to F(a:). Since Fr(a:) and F(a:) are continuous, this convergence must be uniform;
and so Theorem II applies.
142 MAHLER'S compactness theorem
where r5 (F) is defined in (1) and L1 (Y) is the lattice constant of the region
defined by (4).
The proof of Theorem VI is now quite simple. By the definition of
L1(Y), there exists a sequence of lattices A, such that
F(A,) ~ 1, d (A,) ~ L1(Y). (5)
We may now apply Theorem IV Corollary 1, its conditions (i) and (ii)
being satisfied by (5). Hence there exists a convergent subsequence,
and so, after a change of notation, we may suppose that
A,~N
If F(/\,) > 1 there would exist a real number ·0< 1 such that
(5)
144 MAHLER'S compactness theorem
In particular
(6)
Now consider the distance-function
(7)
so that
401
Po (:Il) ~ 1\ (:Il) ~ 400 Po (:Il) . (8)
r
From (8) and (2) or (5) we have
we have
for every positive integer t and certain integers u~), u~). Hence
11 (say) = (all w', a21 vi) E Ac·
But now, since wtp = -1, we have
F.I (.,1)
N -
- Ia11 aU Ij [1 + 100{i auiIauwI aui
+ au w 1}2
1~ IaII aU Ii (t ~ 00)
=kl.
Hence
Bounded star-bodies 145
(1~i~n).
where
(7)
and 'fIl • ••• , 'fI .. - l are small real numbers. As in the proof of the theo-
rem, if max 1'fI;1 is small enough. the only points of A~ which can lie
in or on the boundary of !7 are ±al ..... ±a,,-l (which are unchanged
I30unded star-bodies 147
where "t, ... , a" are any n-dimensional vectors. For our present purposes
any estimate, however crude, would suffice, but, since we shall later
need a more precise estimate, we prove it here.
LEMMA 9 (HADAMARD). Let aI' ... , an be n-dimensional vectors. Then
On the other hand. on regarding the c1 • •••• c .. in det(c1 ••••• c.. } first as
rows and then as columns and multiplying the two determinants to-
gether. we have 1
{det (c1 • •••• C ..)}2 = det {cjci } = II Ic,12. (5)
by (1). The required inequality now follows from (3). (4) and (5).
V.6.3. We may now show that. in principle. the evaluation of LI(.9')
for a bounded n-dimensional star-body .9' may be reduced to a finite
set of ordinary minimal problems. Except for convex bodies, for which
see § 7. this is hardly in practice a fruitful approach. though it might
well be adaptable to machine computation.
We may suppose without loss of generality that .9' is defined by
.9': F(a:} < 1, {1}
where F(a:} is a distance-function. By Lemmas 1 and 2 of Chapter IV.
there are numbers c>O and C such that
(2)
In particular. a lattice A admissible for .9' has no points in the sphere
1a:1 < C-l.
and so has
(3)
by MINKOWSKI'S convex body Theorem II of Chapter III, where is v..
the volume of the unit sphere Ia: I< 1.
Now let A be a critical lattice, so that there are (at least) n linearly
independent points fit, ... , a.. of A on the boundary F(a:} = 1 of .9'.
Then by (2) we have
(1~i~n), (4)
and so by HADAMARD'S Lemma 9 we have
Idet(a1 ,···, a..}! ~ c-". (5)
Hence in the language of Chapter I the index I of ai' ... , a .. in A is
1= Idet(fIt . .... a..H ~ (~)"v.:-l =I (6)
d(A) - c " o·
so that
g-l = j (k +2 - D) .
Hence the points p and q above do lie on the portion of the boundary
of f/ given by Xl X 2= -1. The point
r =p - q ={j(-k + D) t, j(k + D)t- = (r I} I , r 2)
lies on
Further,
For later use we note that the whole of the line-segment joining
p, r must lie in Y except the end points, since a line can meet a hyper-
bola X I X 2 = -1 or X l x 2 =k in at most two points. Hence the whole
of the closed parallelogram with vertices at 0, p, rand - q must lie
in ,,/ except for p, rand - q.
We are now in a position to prove Lemma 10. Let M be a critical
lattice. Suppose, if possible, that there is no point of M on the portion
integer r> 1; and so there are points of M on the line (i) and at a distance
Iq I apart. Hence there must be a point of M other than 0 and - q
in the closed parallelogram with vertices at 0, - q, p and". But we
have already seen that the only points of this parallelogram which are
not in f/ are the vertices p, " and - q. Hence either p or " is in M;
and in both cases then M coincides with A.
There remains the possibility that q = (- 1, 1). If the definition of
p and" is extended in the obvious way to t = 1, the situation remains
the same, except that now the whole line-segment joining p and" is
part of the boundary Xl +x2 =D of f/. Hence we may deduce only
that M has a basis consisting of (- 1, 1) and some point on Xl + x 2 = D.
For this type of proof compare OLLERENSHAW (1945b).
For later use we note that we have also proved the
COROLLARY 1. The only critical lattices lor
- 1< Xl X 2 < k, IXl + x 1-;;;, D
2
are those 01 type (ii), where now t is allowed also to take the value 1.
For the other lattices of type (i) have a point on -1<XI X2 <D,
IXI +x 2 1 =D. Here our usage differs from that of MAHLER (1946a),
since he calls a lattice admissible for a set f/ if it has no points other
than 0 in the interior of f/. Thus MAHLER calls the lattice of type (i)
admissible (and so critical) for the set of the corollary.
Lemma 10 may be regarded as a more precise version of Theorem IV
of Chapter II. To make the connection more clear we prove
COROLLARY 2. Ilk is an integer, the critical lattices 01 type (ii) are
admissible lor
-1<XIXa<k.
further but refer the reader to the papers quoted. The following example
illustrates the connection with the existence of infinitely many lattice
points in sets.
LEMMA 12. Let k be a positive integer and 1\ a lattice with
d (1\) ~ (k 2 + 4k)~.
Then there are infinitely many points of 1\ in
~: - 1~ Xl X 2 ~ k. (3)
There are infinitely many points of 1\ in
and the aii and 17 are real numbers to be determined later. Let
±P1' ... , ±PN be the only points of A on the boundary of:£ and let
±p~, ... , ±P;' be the points of A' which correspond to them in an
obvious way. Let 1T1, ... , 1TN be tac-planes to:£ at PI' ... ,PN (Theo-
rem IV of Chapter IV). If there is more than one tac-plane, we choose
one arbitrarily. We then impose on A' the condition that PI
lies in 1TJ
for 1 ~ J ~ N. By (1), and since PJ lies on 1TJ, this imposes a condition
of the type
~ a··tW =0 (1~J~N), (2)
L. .. " "
1;:>;;:>
1;:>;;:> ..
where the numbers tlf> depend only on the point PJ and the choice of
tac-plane 1TJ. We also impose the conditions
(i=l=i)· (3)
The total number of linear conditions (2) and (3) imposed on the n Z
numbers ai i is !n(n-1) +N. Hence if N<!n(n+1), there exists a
set of real numbers aii not all 0 satisfying (2) and (3). We select any
one such solution and keep it fixed in what follows.
Since the points PJ lie on tac-planes to the open set:£, they do not
lie in:£. When 1171 is small enough, there are no further points of A'
in :£ other than 0, by the argument of § 6.1. Hence A' is admissible
for:£. Since A is critical, we must then have
d(A') =ldet(b~, ... ,b~)I~ldet(b1, ... ,b..)I =d(A) =.1(:£);
)
that is
1 + all17 ~217
1~ det (
. a21 17 .1:- ~2~17 :::~
a.. 1 17 anZ17 1 + a.. n17
= 1 + A 1 17 + A2172 + ... + An17n (say).
Since this must be true for all sufficiently small values of 1171. it follows
that
and
Az= - L aii aii + L ajiaii ~ o.
i<; i<;
Convex bodies 157
o ~ 2A2 - A~ =- ~ a~j.
1;;;.;;;"
1:Si.j;;;"
where VI • •..• Vn are rational numbers. The sets of numbers 'v such that
(2) is in A clearly form a lattice M of determinant
(3 )
Let F be the distance function associated with.%. so that
F(a j ) =1 (1~j~n).
For the cEA given by (2) and (3) we thus have by the convexity and
symmetry of.%. that
(4)
1 We do not use the rest of § 3,2 later but do refer to it at the end of § 8.5.
Convex bodies 159
Then vn =1, since otherwise tUtEA andF{tal ) <F(al ) =1. If v22 =l=1,
then either ta 2 or t(a l +a 2) is in A; and again we have a contradiction.
Hence
VU =V 22 =1; so vaa=4.
If V3l were even, we should have either i aa or t(a 2 +aa) in A; so Val
is odd. Similarly, V32 is odd. Hence there is a point
c = t(u1 Ut + u 2 a 2 + aa) E A,
where U 1 , U 2 are odd. By adding integer multiples of Ut and a2 to e,
we may suppose that Ut = ± 1, u2 = ± 1. But then
F(c) < i{F(a 1) +F(a2 ) +F(aa)} =! < 1.
Hence I =l= 4.
Finally, when 1=2 it follows, just as for 1=4, that the only pos-
sibility is vn=v22=1, v21 =O and vas=2. Further, the argument that
Val' Va2 are both odd continues to hold. Hence t(Ut+a2 +aa)EA.
V.S.3. When:f{' is a bounded symmetrical strictly convex 2-dimen-
sional set, the lower bound 3 for the number of pairs of points ±a of
a critical lattice on the boundary given by Theorem VIII coincides with
the upper bound give by Theorem IX. We have indeed
160 MAHLER'S compactness theorem
p=!c-!".. q=-!c-!",
the only possibility is that c is a boundary point.
We now show that f contains the whole parallelogram ~ of points
;x!=Ap+,uq
with
max{1 AI,I,uI} < 1.
Indeed
;x! = ec + U",
where
lei +Iul =!IA-,ul +!IA+,u1 =max{IAI,I,uI}·
But now the area V(~) of ~ is
V(~) = 41 det(p, q)1 = 4d (M).
- V(9') :::: det (a, b) + det (b, c) + det (c, a) = 4 det (u, v)
on putting b = a + 2 tt, C = a + 2 v. This proves (3). Then (1) follows
from (2) and (3) since M is 9'-admissible.
Now let A be any critical lattice for 9'. Then d (A) = i: V(,'/). If
A did not have a point on a particular side of 9' there would be a sym-
metric convex set larger than 9' which contained no point of 9' except
0; which would contradict MINKOWSKI'S convex body theorem. Hence,
by Theorem XI, A has precisely 6 points ±p, ± q, ±1' on the boundary
of 9'; one on each side. If, say, the points ±P are not the mid-points
of their sides, then by rotating slightly the sides about ±p, leaving
the other pairs of sides fixed, it would be possible to find a convex
symmetric set or of volume V(Y» V(9') containing no points of A
except 0; again contradicting MINKOWSKI'S convex body theorem.
Hence ±p, ± q, ±1' are the mid-points of their sides, and A = M.
It would, of course, be possible directly to compute the determinants
of all lattices having points p, q, l' with p +q +1' =0 on the boundary
of 9' and to show that M gives a minimum.
V.8.5. MINKOWSKI (1904a) has extended the argument of Theorem XI to
3 dimensions and proved the following.
THEOREM XII. To lind the lal/ice constaflt LI(f) 01 an open symmetrical convu
set f in 3 dimensions it sulfices to consider the minimum 0/ the determinants 0/
lattices generated by three points aI' a 2 , a 3 on the boundary 0/ 1\ alld satisfying one
0/ the followinlf three condition.s:
(A) the points a 1- a 2, a 2- aa' aa- a 1are on the boundary 0/ f and - a 1+ ",+a 3,
a 1-a 2+a3 , "l+aa-aa are outside f.
(B) the points a 1+a2, a 2+03, a 3+o1 al·e on the boundary 01 f and a 1+a,+"8
is ott/side f.
(C) the points ~ + a 2, "2+"s' "s+ a 1 and a1+ a.+ a 3 are on the boundary 0/ f.
We refer the reader to the original paper for the proof. Alternatively the
reader may construct a proof by combining the ideas of the proof of Theorem XI
with those at the end of § 8.2. The corresponding result in 4-dimensional space,
which is fairly complicated, has been found by K. H. WOLFF (1954a), who states
that some of the auxiliary results are due to E. BRUNNGRABER (1944a).
Spheres 163
MINKOWSKI (1904 a) used Theorem XII to find the lattice constant of the
octahedron
namely 19/108. The lattice constants of further convex 3-dimensional bodies have
been determined by CHALK (1950a) and WHITWORTH (1948a and 1951a). In all
cases a considerable amount of rather tedious detail is necessary.
the first half of (3) follows. The second half of (3) is a trivial conse-
quence of the definition of r...
COROLLARY. 1/ n = 3 the index is 0 or 1.
For P}" has volume 4:r:/3, and so
Fa~ 71:16> t,
by MINKOWSKI'S convex body Theorem II of Chapter III.
THEOREM XIV.
Fa=ri.
A critical lattice lor P}s has a basis m l , m 2 , ma such that
som'= ±m. Hence there must beat least one point ~ml +u 2m 2+uarna
with U l U 2 Ua=0 on the boundary of !iJa other than ±~, ±m 2 , ±rna.
We may suppose without loss of generality that it is
m,=ml -m2·
Then neither m l +m 2 nor ~+m2±rna can occur as boundary points,
since they would give index 2 with rna and m,. Hence at least two
of the remaining possibilities
~±ma, m 2 ±ma, ~-m2±ma
Write
I(ul , U 2 , u3} = IUl~ + u 2 m 2 + uam al2,
where ~,U2' ua are variables, so I(u} is a quadratic form. Then
1(1,0,0) = 1(0,1, O} = 1(0, 0,1)
= 1(1, 0, -1) = 1(0,1, -1} = 1(1, -1,0) = 1.
Applications to diophantine approximation 165
Hence
IM=~+~+~+~~+~~+~~
with determinant D (f) = 1. and so
{det(~. m 2 • m a)}2=1.
as required.
V.9.2. Let f/ be a star-body and 1\ an f/-admissible lattice. We say that
1\ is extreme for f/ if there is a neighbourhood 2 of I\, in the sense of § 3.2, in which
every f/-admissible lattice M satisfies
Clearly a critical lattice is extreme; but an extreme lattice need not be critical.
Some of the results proved already extend to extreme lattices, notably SWINNERTON-
DYER'S Theorem VIII.
The extreme lattices of n-dimensional spheres have been exhaustively studied.
For example there are six distinct types of extreme lattice for the 6-dimensional
sphere as was shown by BARNES (1957b). There is a general theorem of VORONOI
(1907 a) which helps to characterise the extreme lattices of an n-dimensional sphere
(they are "perfect" and "eutactic"). BARNES (1957a) has given an extremely
elegant proof of VORONOI'S characterisation. Unfortunately we cannot discuss
these points further here. so we refer the reader to the two papers by BARNES
where there are further references to the copious literature.
l"ol~Q; (2)
since "o{}j-"I (1~i~n) together with "0 form n+1 linear forms in
"0 ..... ",. with determinant 1. Were "0=0. we should have 1"11 < Q-l/".
so "1=0 (1~i~n). Hence "0=1=0. and on replacing "0. " " " I f by
1 Not used later in book.
166 MAHLER'S compactness theorem
(2')
Further, (1) may be written
(1')
which shows that the u;luo are good rational approximations to the {};,
all with the same denominator uo.
We may look at (1) and (2') from another point of view. On elimi-
nating Q we have
(1~j~n),
and then we may put
(O~j~ n),
where r is any positive integer: and then the left-hand side of (3) is 0.
Otherwise we may suppose that {}1 is irrational. Suppose that R integral
solutions u}'l (O~j~n, 1~r~R) have already been found with u~»O.
Since {}l is irrational, we may choose Q so large that
(1~ r~ R).
For this value of Q the solution of (1) and (2') gives a solution of (3)
which is clearly not identical with any of the earlier one~.
V.I0.2. For different purposes one may be interested in different
properties of the approximations u;/uo to the 0;. For example, instead of
be made for infinitely many sets of integers uo> 0 and U1 , ••• , Un? We
write
D ((j) : 1}1, ... , 1}n) = lim inf U o (j)" (u o1}1 - u1 , ... , U o1)n - un) (4)
110-+ 00
tlO! "II ''OJ UrI integers
and
D((j)) = sup D((j) :1}1, ... ,1},.l; (5)
lih··"lJ,.
Then. by the definition of D(tJ». there are real numbers {}l' .... {},. and
an integer 00 such that
(12)
x Iu·I I -~--~c
1 1
-c -
/(n+1)
x
ma l:;;;i;:;on
c /(n+1)
,,-
1/(n+l)
,
by (15). If O<uo~ Uo, then the second term of the outer maximum
in (18) is still ~ c 1/(n+l), by (14). If U o ~ [fo, the first term of the outer
maximum in (18) is ~cl/(n+1) by (12). Hence in any case,
F;. (x) ~ c1/(n+l)
Clearly
(24)
and
p,n+l (M,) = 1\n +l (Ml ) ~ c, (25)
by (17), (20) and (23). Consequently, by (21'), we have the weaker
assertion
F;.n+1 (M,) ~ c> 0 (1~r<oo). (26)
By (24), (26) and Theorem IV Corollary, there exists a convergent
subsequence of the M" say
M,,-+ N.
By (24) we have
deN) =1. (27)
Since (22) holds uniformly in any bounded set, we have
p+l (N) ~ lim sup P;;+ 1(M,,) ~ C, (28)
'--+00
170 MAHLER'S compactness theorem
Since Cwas any positive number smaller than D (C/» , this proves
o(F)~D(c/», the first part of Theorem XV.
The second part of Theorem XV requires quite different techniques
and uses the basis constructed in Theorem II of Chapter 1. By the
Corollary to Theorem VI, there is a lattice 1\ with
d(/\) = 1 (29)
and
p+l (1\) = 0 (F) . (30)
We denote the (n + i)-dimensional vector (xo, ... , x,,) in which xi = 1
but the remaining co-ordinates are 0 by
i n-i
ei=(~,1,~) (O~j~ n).
By Theorem II of Chapter I, with e =} and n + 1 for n, there exists,
for all sufficiently large numbers N, a basis ao' aI' ... , a" of 1\ such that
lai-Neil <NA (1~j~n). (31)
Then
aj =N ~ tj.e. (1~j~n), (32)
0::;;.::;;"
where
It.·11 -11 -~ N-! (1~j~n) (33)
and
Itjil ~ N-i (1~j~n, o ~ i ~ n, i =l= ;) . (34)
Since ao, aI' ... , a" are linearly independent, there are feal numbers
.1.0, AI, ... , An such that
eo = .1.0 a o + Al a1 + ... + An an ,
where we may suppose that
Ao~ 0,
(36)
for the 01 , ••• ,0.. we have just constructed; provided that N is greater
than some No which may depend on <5' and the function tP. After the
first part of Theorem XV, this will complete the proof of the theorem.
If <5 (F) = 0 there is nothing to prove. Otherwise we may suppose without
loss of generality that
o< <5' < <5 (F) . (37')
To prove (37) we may clearly confine attention to integers uo, "', u .. ,
if any, for which
uo>O, uotP"(Y1, .. ·,y..)~<5(F), (38)
where we have put
(1~i~n). (39)
So far we have not used the fact that tP(:r) = 0 only for :r = o.
By Lemma 2 of Chapter IV, this implies that
where uo• u l • u 2 are integers. The corresponding function F(xo. Xl' XI)
is given by
F3(Xo• Xl' X 2) = IXOXIXsl:
d(F) = 1/7.
which we shall prove in Chapter X. But it follows from work of CASSELS
and SWINNERTON-DvER (1955a) and from DAVENPORT'S results about
the successive minima of F, that at least
Applications to diophantine approximation 173
and indeed in general such {}l' ... , {}" do not exist!. When n = 1, however,
a (}l does exist, as is easy to show. Here, of course, the only possibility
for the distance function l/J(Xl) of one variable is
l/J(xl) = { k Xl if Xt~ 0
-tXt if Xt~ 0,
where k and t are positive constants. As in the proof of the second
part of Theorem XV, we consider a lattice 1\ with
d (1\) = 1 , p2 (1\) = «5 (F) .
Let
a= (a o • at). b = (b o , bt )
be a basis for 1\, where without loss of generality
bt > 0 aobt - atbo = d(/\) = 1. (1 )
Put
{} = {}t = iltlbt· (2)
After Theorem XV it is enough to show that
lim inf Uo l/J(uo{}
"0-+ 00
+u l) ~ «5 (F).
Chapter VI
where 9' is any bounded set, not necessarily a star-body and not
necessarily convex, which possesses a volume V(9') in JORDAN'S sense.
1 This is rather more restrictive than the sense of LEBESGUE, but if the volume
is defined in the sense of JORDAN it is also defined in that of LEBESGUE and equal
to it. Let X (xl be the characteristic function of .9', that is X (xl = 1 if xE 5 and
X (xl = 0 otherwise. Then .9' has a volume in the sense of JORDAN if X (xl is integrable
in the sense of RIEMANN, and the volume is equal to the integral of X (xl over all
space.
Z That is points aE A which are not of the form a = Rb, where bE A and
k> 1 is an integer.
3 For a particularly simple exposition of SIEGEL'S averaging process, see
MACBEATH and ROGERS (1958al.
t76 The theorem of MINKOWSKI-HLAWKA
HLAWKA'S theorems follow at once from (4) and (5). If .1 1 > V(9') , then,
from the definition of the average, there must certainly by (4) be at
least one lattice, say M, such that N9'(M) ~ rol(N9'(I\)) < 1. Since
A
N9'(M) is an integer, we must have N9'(M) =0, so M is 9'-admissible.
Similarly, if 9' is a symmetric star-body and Ll1> V(9')/2C(n), then
there must be some lattice N for which P9' (N) < 2. Since 9' is sym-
metric, points of N, other than the origin, occur in pairs, ±a, so
P9'(N) =0. Hence 9' contains no primitive points of N and, being a
star-body, can contain no points of N at all other than o.
The constant C(n) occurs in (5), roughly speaking, because the
probability that a point of a lattice 1\ chosen at random should be
primitive is {C(n)}-l. More precisely, the ratio of the number of primitive
points of 1\ to the total number of points of 1\ in a large sphere Iall < R
tends to {C(n)}-l as R~oo.
When 9' is convex, improvements of the Minkowski-Hlawka theo-
rem were obtained fairly soon after the original proof [see e.g. MAHLER
(1947b), DAVENPORT and ROGERS (1947a) and LEKKERKERKER (1957a)].
However, even so, the smallest value of
Q = 1.3173 ....
It is not known whether this is the smallest possible value for a 2-dimen-
sional set.
For a long time no improvement was obtained on the Minkowski-
Hlawka theorem for general sets or for star-bodies. However, almost
simultaneously, improvements were made by ROGERS (1955 a, 1955 band
1956a) and SCHMIDT (1956a and 1956b). ROGERS'S work depends on
elaborate estimates of the average
(7)
Introduction 177
for positive integers k, where we have used the same notation as in (4).
In a later paper ROGERS (1958a), using ideas of SCHMIDT combined
with his own, shows that there is an absolute constant C such that
V(.9') 1 4
Q(9') = - - ~-nlog- - 210gn - C (8)
Ll(9') - 2 3
for all symmetric sets 1, provided that the dimension n is greater than
some absolute constant no. We shall not discuss ROGERS'S work further
but refer the reader to the original memoires. SCHMIDT, on the other
hand, uses an elegant device which is more effective than ROGERS'S
method for small dimensions but much less effective when the dimension
is large. We shall discuss it more in detail in § 4.
The work just described can be generalized in several directions.
In the first place, instead of operating with the number Ny (A) defined
above, one may consider more generally
2:.1(a), (9)
DEli
*0
where 1(~) is some function defined at all points of space and which
may be subjected to certain conditions (e.g. that it be non-negative or
Riemann-integrable). If f(~) is the characteristic function of 9', then
the sum (9) is just Ny(A). Again, one may confine the sum in (9) to
primitive points of A, when there is an analogue of Py(A). In fact
most of the work so far described has dealt with generalisations of this
kind. Again, it was shown by MACBEATH and ROGERS (1955a) that the
Minkowski-Hlawka theorem extends to more general sets of points than
lattices. It is enough for A to be any set of points such that the ratio
of the number of points of the set A in the sphere I~I < R to the volume
of the sphere should tend to a finite non-zero limit d as R -+ 00. Indeed
(4) continues to hold with a modified definition of the mean !In and with
Lll =d-1•
Finally, we observe that MAHLER'S Theorem V Corollary of Chap-
ter III often permits the results of this chapter to be extended to un-
bounded sets 9' on taking .9, to be the set of points of 9' in the sphere
1~I<r.
VI.l.2. In this book we shall not consider any of these generaliza-
tions in detail. In § 3 we shall prove the Minkowski-Hlawka Theorem
in its original formulation, that is, the existence of a lattice A admissible
for a symmetric star-body 9' with finite volume V(9') and with deter-
minant arbitrarily near to V(9'). We shall use an averaging argument,
but the type of average will be chosen to facilitate the proof, not for
1 Professor ROGERS tells me that Dr. SCHMIDT has obtained an improvement
of (8) which is in course of publication in Acta Mathematica.
Cassels, Geometry of Numbers 12
178 The theorem of MINKOWSKI-HLAWKA
1 Other averaging processes have been used. For a particularly brief proof of
Theorem II using one of them. see CASSELS (1953 a). It has been shown by ROGERS
(1955a) that many of the averaging processes that can be used to prove the
Minkowski-Hlawka Theorem are essentially equivalent to SIEGEL·S.
Sublattices of prime index 179
We have
with
C2U2 +"'+C"U,,=O (P)
then does what is required.
VI.2.2. A refinement of the argument gives a rather more special
result than Lemma 1 in which now the k, must be non-negative.
LEMMA 2. Let p be a prime-number and /\ an n-dimensional lattice.
Let a o, ... , a R be any R + 1 points all\ which are not 0/ the shape Pb,
The Minkowski-Hlawka theorem t8t
bE" and let kl' ... , kR be non-negative real numbers. Then there is a
lattice M 01 index p in " such that
ao~M
and
(1)
Finally, (1) follows from (3), (7) and (9), when p is chosen large enough.
As in § 1 we have the
COROLLARY. Let !/' be a set with Jordan-volume V(!/,) and let
.1 1 > V(!/,). Then there is a lattice M with d(M) =.11 which is admissible
for !/'.
The Minkowski-Hlawka theorem 183
where the star (*) indicates that only primitive points are to be summed
over.
We only indicate briefly the modification required to the proof of
Theorem 1. In any case Theorem II is embraced in the generalization
of Theorem I to point sets /\ other than lattices due to MACBEATH and
ROGERS (1955 a), which was discussed in § 1. The exposition still follows
ROGERS (1947b and 1951 b).
In the first place, it is trivial that a point of M in the cube (2) of
§ 3.1 is a primitive point of M if and only if it is primitive as a point
of I\. Hence it is enough to show that
L I(a) = L L* I(ra).
aEA ,=10EA
0*0
L IJ,.~) = {C(n)}-l.
'~l
As in § 1 we have the
COROLLARY (The "Minkowski-Hlawka Theorem"). Let 9' be a bounded
symmetric star-body with volume V(9') and let 2C(n) Ll 1 > V(9'). Then
there is a lattice M with d(M) =Lll which is admissible lor 9'.
VI.4. SCHMIDT's theorems. We are now in a position to illustrate
SCHMIDT'S method of improving the corollaries to the last two theorems.
We first give a simple example
LEMMA 3. Let 9' be a symmetric star-body in n-dimensions with
Jordan-volume V(9') and let Lll be any number such that
By Theorem II with Ll1/2 for Lll and this e, there is a lattice A with
determinant
such that
L I (a) <6.
"E II. primitive
d(M) = 2d(l\) = L1 1 ,
the lattice M does what is required.
VI.4.2. When n = 2, the result of Lemma 3 is no stronger than
Theorem II Corollary.
By further elaboration, SCHMIDT (1956a) improved Lemma 3 some-
what but for values of n at all large Lemma 3 is weaker than the follow-
ing Theorem III which applies to all Jordan-measurable bounded sets
not merely symmetric star-bodies. To obtain results about symmetric
sets, Theorem III should not be applied to Y directly but, say, to the
"half-set" g;, of points
Then
V(g;) = l V(Y) ,
and a lattice Mis g;-admissible if and only if it is Y-admissible. There
is thus an additional factor 2 for symmetric sets.
THEOREM I II. Let Y be any hounded n-dimensional]ordan-measurable
set 01 volume V(Y\ and let Lfl be any number such that
(1 )
Then there is a lattice M 01 determinant Lfl having no points, except pos-
sibly 0, in Y.
Let g (z) be the characteristic function of S and put
j(z) = g(z) + 2g(2Z) + 3g(3z) + 6g(6z). (2)
Then
JI(z) dz = (1 + 2·2-" + 3 . r" + 6·6-") Jg(z) dz
= (1 + 21 -") (1 + 31 -") V(Y).
By Theorem I there is thus a lattice 1\ of determinant
(the - 4 being the contribution of ao, which is definitely lost). All the
points of X2 are, of course, in r. By (5) and (6) we have
2 N; + 3 Nz + 8N; + 4 ~ 11.
Hence N; =0 and Nt' +Nz~i, so Nt' +Nz~3. But now 'by Lemma 1,
Corollary 1 there is a sublattice M of r of index 3 which contains
+
none of these N..' N 2 points. Then M does what is required.
We may thus suppose now that
Na=O.
We now apply Lemma 1, Corollary 2 with P= 3 to the points a, with
k,=1 if a,EXl and k,=3 if a,EX 2 • Since there are at most
11 < (33 -1)/(3 -1)
1r (2)
be positive integers. Then there is a lattice of index at most
(3)
1 For all m;;;; 107 and all sufficiently large m, according to the review in Mathe-
matical Reviews!
188 The theorem of MINKOWSKI-HLAWKA
±i,a, (4)
We show first that Theorem V implies Theorem IV. Suppose that
1\ in Theorem IV contains fewer than m pairs of points of f/. Since f/
is a star-body. the points of 1\ in f/ can be put in the shape (4). where
the number of pairs is
J<m.
Hence by Theorem V there is a lattice M of index ;;;; m in 1\ which con-
tains none of these points. i.e. M is f/-admissible. Since
Then there is a prime number p>X which divides one 01 the numbers
~ b1 + ... + Un b,I'
1).(] +
does all that is required.
We now consider the case whenR>1 and use induction onJ. Without
loss of generality
fl = 1;:i;,&R
max f,· (5)
Unbounded star-bodies 189
[:]+[~]<[X~Y],
that is
(7)
where for any real number x we denote by [x] in this proof the integer
such that [x] ~ x < [x] + 1. By Lemma 2, there is a lattice r of index p
which does not contain ~ and such that
that is
" .: : ; [ii + ...P + iR ].
L.." 1,- (8)
o,Er
± i~ (p~) (9)
(1 )
is of finite type but infinite volume, since admissible lattices are given
by the norm-forms of totally real algebraic fields of degree n (see
Chapter X). In general, in more than 2 dimensions it is very difficult
to decide whether a given star-body is of finite type or not. Two 3-
dimensional examples are discussed in CASSELS and SWINNERTON-DYER
(19SSa), for which a decision on this point would have interesting reper-
cussions. In 2 dimensions however there do exist general criteria which
we shall now discuss.
VI.6.3. From now on we put 1
n =2.
In an obvious sense, the body ~ defined in (1) of § 6.2 has two pairs
of asymptotic arms, the asymptotes being the Xl and X 2 axis. It is
possible to inscribe in !/ arbitrarily narrow parallelograms with one
pair of sides parallel to an asymptote and area 1, for example
THEOREM VIII. Let PI' "', PK be any real numbers. Then there exists
a real number oc such that
then we have
192 The theorem of MINKOWSKI-HLAWKA
We apply Theorem VIII where the Pl' ... ,PK are the {}i' fIJi in some
order, so K = 2J. Let IX be the number given by Theorem VIII, so that
IU{(IX + {}i) u + v}1 ~ '1 > 0 } (7)
IU{(IX + fIJi) U + v}1 ~ 7J > 0
or integers u =1= 0, v, where
7J = {8(2] +1)2}-I.
Let /\ be the lattice of points
(Xl' X 2) = R(IXU + V, u), (8)
for all (Xl' X2)E/\ other than 0, on distinguishing the two cases u=l=O
and u =0, v=l=O in (8). We may choose R so large that the right-hand
sides of (9), (10) and (11) are all not less than 1. Then for all (Xl' X2)E/\
except 0, we have, by (2),
F(x}, x 2) ~ min 1/;(x}, x 2)! ~ 1;
O;fi,;;fi,J
I: + (a. I
+Pk) > ~ "-2,,,-4
for all a.EJ"'H' by (8); then (11) follows, by (10). Thus J"'+l has all
the required properties.
Cassels, Geometry of Numbers 13
194 The quotient space
Chapter VII
This establishes (i). The proof of (ii) is similar and may be left to the
reader.
It remains to prove (iii). Let t)EBijA and let yoEt), so that the general
element of t) is yo+a, aEA By Lemma 2 of Chapter IV, there is a
constant c>o such that F(x)~clxl for all x; and so
F(Yo+a) ~ cIYo+al;S cllal-IYoil·
In particular, if F(a+Yo)~F(yo), we have
1~ 1= inf 1xl·
orE~
(2)
y,-+y'. (3)
Suppose, first, that the y" y' exist such that (3) holds. Then
Let y' be any element of ~' and put y,=1/' +zr. Then the Yr clearly
have all the properties required.
VII.2.4. Let
(1 )
be any basis for I\. Then every point x of space can be put uniquely
in the shape
(2)
for some real numbers ~l' ... , ; n; and x EA if and only if ~l' ••• , ~n are
integers. Hence to every vector x there is a unique aEA such that
(3)
where
(4)
LEMMA 3. The quotient space fJll/\ is compact. That is, any sequence
I), (1 ~r<oo) 0/ elements 0/ fJlj/\ contains a convergent subsequence:
y,,~y',
m(S)
c + b, where CE C, bE O.
This section is devoted to proving
THEOREM I. Let C and 0 be non-empty sets in 1;1//\ with measures
m(C) and m(O) respectively.
(i) II m(C) +m(O»d(/\), then C +0 is the whole space 1;I/A.
(ii) II m(C) +m{O)~d(A), then m{C + 0) ~ m(C) + m(O).
This theorem is due to MACBEATH (1953a). It was discovered inde-
pendently by KNESER (1955 a), who first recognized its importance for
the geometry of numbers. Theorem I is, in fact, now only part of a
much wider theory, for which see KNESER (1956a) and the literature
cited there. It falls into the same circle of ideas as the so-called "rJ. +P
hypothesis" about the densities of sequences of integers which was first
proved by MANN. As all this is rather aside from the main theme of
the book we do not discuss it further. It is convenient to prove Theo-
rem I here but the application to the geometry of numbers will not be
made until Chapter XI.
Part (i) of Theorem I is easy. Suppose that there is a point !" of
1;1//\ which does not belong to C + O. Then none of the points
!"-C, CEC (1 )
can belong to D. We may denote the set (1) by !"- C. Clearly
m(!"-C)=m(C). (2)
which becomes clear on noting that points of C f"\ 0 occur in two sets
on each side of (4), but points of C v 0 other than those of C f"\ 0 occur
1 The results of § 3 will not be needed until Chapter XI.
The sum theorem 199
C1=(C+!)("\D, D1={(C+;r;)vD}-!.
Write
m(C1) =y1d(A). m(Dl) = CJ1d(A) ,
so that
and
Yl = yCJ
by (4) applied to C +! and D and by Lemma 4 respectively. Further,
200 The quotient space
C+D)C,+D" (7)
and
y, + b, = l' + b, (8)
1', = 1',-1 b,-I' (9)
But now, by the argument used when l' =0, it is certainly true that
m (C, +- D,) ~ m (D,) = b, d (/\). (10)
It follows from (6), (8) with 1 - 1 for 1 and (9), that
Hence
1', --+ ° (1--+00). (11)
But
Jtp(t) +~) X(t)) d~ = X(t)) m(C).
dl/A
the truth of Lemma 4 now follows from the continuity of m{( C +~) "D}
and the connectedness of 9l/A.
Chapter VIII
Successive minima
VII!.t. Introduction. For some purposes one requires to know not
merely that a lattice A has a point in a set .9', but that it has a number
of linearly independent points in .9'.
Let F(a;) be an n-dimensional distance function and A a lattice. If
for some integer k in 1 ~ k ~ n and some number A the star-body
(1 )
(2)
of A, then so does 1-'.9' for any I-'>A, since the points (2) are also in
1-'.9'. We define the k-th successive minimum Ak =Ak(F, A) of the dis-
tance function F with respect to the lattice l A to be the lower bound
of the numbers A such that A.9' contains k linearly independent lattice
points. Clearly
The numbers AI' ... ' A" defined above certainly exist, since if
aI' ... , a" are any n linearly independent points of A, then, trivially,
which is equivalent to (9) when n =2. The inequality (10) was ap-
parently discovered by CHALK and ROGERS (1949a) and CHABAUTY
(1949a) independently. It has been shown by WOODS (1956a and
1958b) that (9) continues to hold for n = 3 when F is symmetric and
convex and for n = 2 when F is convex but not symmetric: the proof
is distinctly intricate and we do not discuss it here. For general n
and symmetric convex F, RANKIN (1953a) indicates that the constant
2*(,,-1) can be replaced by a rather smaller one.
For symmetric convex functions F and any n, there is a result going
back to MINKOWSKI (1907a) which may be regarded as a substitute
for the unproved conjecture that (9) holds. In our notation, MIN-
KOWSKI'S convex body Theorem II of Chapter III states that
(11)
(12)
The proof of (12) remains difficult. Simpler proofs than the original
have been given by DAVENPORT (1939c) and WEYL (1942a). We follow
WEYL in § 4, since the ideas introduced will be needed in Chapter XI.
For symmetric convex F there is also an inequality
2"
AI' .. A" Vj...~ In. d (A) , (13)
the almost trivial proof of which is also given in § 4. From (12) and
(13) it follows that the product Al ... An is determined by VF and d (A),
except for a factor which is bounded in terms of n.
In general, it is hopeless to expect more information about successive
minima than can be deduced from the formulae for the product AI'" A".
For example, let AI' "', A" be any numbers such that
Al~A2~"'~A,,; A1 •• ·A,,=1.
Then the lattice A of points
(Ut, ... , Un' integers)
has d (A) = 1 and has successive minima AI' ... , An with respect to the
distance function
max 1x·11 '
F(:e) = 1;:;;,;:;;"
as is easily verified.
204 Successive minima
VIII. 1.2. For later purposes we shall often need the following two
simple lemmas.
LEMMA 1. Let AI' ... , An be the successive minima 01 a lattice A with
respect to a distance lunction F associated with a bounded star-body
F(:ll) < 1. Then there exist n linearly independent points ~, ... , an EA
such that
(1~i~n).
fl2> All"
For a general distance function F(z) there is, of course, no reason why Al
I
should not be o. Indeed, if F(z) = IXl ••• X" Il/n, we have Al = ... = A" = 0 for the
lattice 11.0 of points with integer coordinates.
Spheres 205
(2)
of A in F(~) <Ps, and, since
of A such that
(t ~ s).
By Theorem I of Chapter I there is a basis bl , " ' , b .. of A such that,
for each i = 1, ... , k s - l , the vector aj is linearly dependent on bl , ... , b j
only. This basis clearly has all the properties required.
VIII.2. Spheres. We first prove the results for spheres. since they
are simplest and the treatment forms the model for what follows.
THEOREM 1. Let
(1)
and let AI' ... , A.. be the successive minima 01 a lattice A with respect to
Eo. Then
d(A) ~ AI'" A.. ~ d(Eo) d(A). (2)
and so
IL ui bj12 = L (L Ui tj
• J~'
Y (3)
We now show that
L .1.;-2 (L Ujtjit~ 1 (4)
i i~i
for all sets of integers tt =1= o. For let Ul , ... , Un be integers, and suppose
that
(j> J). (5)
Then ulbl+ .. ·+unb n is not dependent on bl, ... ,b j - l ; and so
ILUjbiI2~A}. (5')
Further, (5) implies that all the summands in (3) and (4) with i?- ]
are O. Hence, and since Ai~Aj if j~J, the left-hand side of (4) is
L A;-2 (L uitiiy~ L AJ 2(L ujtii)2 = AJ 21L ui bil2~ 1,
i;;:'j i~i i~j i~i i
by the definition of d (Po). The right-hand side of (2) follows now from
(6). (7) and (8). This concludes the proof of Theorem I.
VIII.2.2. As was remarked in Chapter V. the theory of successive
minima shows that the hypotheses of Theorem III and IV of Chapter V
are equivalent. This we do now.
LEMMA 3. The lollowing two statements A and B about a set 2 01
n-dimensional lattices 1\ are equivalent. where x. K. ,10' ,11 are supposed
to depend on 2 but not on I\.
(A) there exist ,11 < 00 and x> 0 such that d (1\) ~ ,11' and 11\1 ~ x > 0
lor all I\E 2.
(B) there exist ,10> 0 and K < 00 such that d (1\) ~ ,10> 0 and the sphere
Ix I~ K contains n linearly independent points 01 1\. lor all 1\ E2.
If AI •...• An are the successive minima of Po (x) =Ixl with respect
to 1\. then clearly (A) and (B) are equivalent to
Thus there is at least one k such that (1) holds with rJ =rJk'
We shall require only the more specialized
COROLLARY. Let fll' ... , fl .. be any numbers such that
Then
d (I\') = II (fl mj) -1 d (/\) . (2)
a = u 1 b~ + ... + U J b J' UJ =F 0,
where u1 , ••. , U J are integers. Then
where
(1~j<]),
are integers, since u j and mJjm j are integers. By Lemma 2, since vJ=F 0,
we have
F(flmJa) ~ AJ'
Hence
F(a)~~ ~1.
JlmJ
This proves (3).
Finally,
F" (I\') < (4)
d (I\') = b (F) ,
by the definition of b(F). The required inequality (1) now follows from
(2), (3), (4) and the inequality
II(/~j) ~2l(n-ll
1
of Lemma 4, Corollary.
A rather more detailed argument shows that the sign of equality
in (1) cannot hold if F(x) < 1 is a bounded star-body. Then it is possible
to ensure that there are not n linearly independent points a of I\' with
F(a) = 1, so I\' cannot be critical, and there is inequality in (4). See
ROGERS (1949a).
Cassels. Geometry of Numbers 14
210 Successive minima
where
Finally, let Y be the set of points which belong neither to C(j' nor
to C(j". Clearly Y is open, and if any point x is in Y, then rx is in Y
for O~ Irl ~ 1: so Y has some of the attributes of a star-body. We shall
later modify Y slightly to obtain a set .c;; which actually is a star-
body.
There certainly exist Y-admissible lattices I\, i.e. lattices having
only the origin 0 in Y. For example the lattice 1\2 of points
This point is not in Y, so must be in C(j' or C(j" and hence has the shape
b12lb 22 = rational.
We wish to show that B~2 is prime to Bn: and must distinguish two
cases for the prime divisors p of Bw If P does not divide Bl2 , then it
divides v. If p divides Bl2 then it does not divide B u , since B u , B l2 , Bn
have no non-trivial common divisor; and p does not divide v. In both
cases p does not divide B~2' Hence, on replacing b 2 by b 2 +vb1 , we may
suppose that Bl2 and B22 have no common non-trivial divisor.
Now b 2 is in the 9'-admissible lattice 1\, so is in re' or re". Hence
I~I ~1,
since Bn and Bn have no common factor. Similarly b i is in re' or f(j",
and so
Hence
either I Bnl = 1, I~I ~ 21 ,
or IBnl ~ 2, I~I ~ 1.
In either case,
d(/\) = IBllBn~21 ~ IBn~21 ~ 2.
This concludes the proof of (1).
We denote, as usual, by fl9' the set of points
fl9': fl~, zE9';
and by Ao the lattice of points ("I' "2) with integer "I' "2' Clearly if
fl;;;;;' 2- 1 there are no points of 1\0 except
0 in ,It 9'; if r l <fl ;;;;;'1, there
are only the further points (± 1, 0) of 1\0 in fl9'; while if fl> 1, the
points (± 1, 0) and (0, ± 1) are in fl9'. If 9' were a star-body F(z) < 1,
14*
212 Successive minima
these statements would imply that the successive minima of 1\0 were
Al = 2-', A2 = 1. Hence
).IA2 = 2' (L1(9'))-ld(l\o);
which is the case of equality in Theorem II if (.1 (9'))-1 is written for
b(F), the two being equal for star-bodies.
It remains now to modify 9' so as to obtain a bounded star-body,
in such a way that its successive minima with respect to 1\0 remain
2- 1 and 1, and so that its lattice constant is arbitrarily close to 2. We
do this by replacing the lines in rr' and rr" by narrow wedges.
Let £>0 be arbitrarily small. For any vector Y=(Yl' Y2)'*0. let
1r, (y) be the set of points z for which
ir,(y) : XIY1+X2Y2-£-llxIY2-X2YI!;;;;Y~+Yi. (2)
Then ir, (y) is an infinite wedge having a vertex at Y, see Fig. 9. Its
precise shape is not important. The two sides of the wedge make the
small angle ±arc tan £ with the outward radius vector from 0 to y .
Now let rr; be the set of points in ir, (2 6, 0) and 1r, (- 26, 0) and
U2'*
let rr;' be the set of points in ir, (u 1 , u 2 ) for any pair of integers with
0. Finally let y. be the set of points in
Since .9, is contained in f/, and since the points (±2i , 0), (0, ±1)
are evidently still boundary points of .9" at least when 8 is small enough,
it follows that the two minima A1 and A2 of Ao with respect to F.(~)
are r1 and 1 respectively.
Further,
.1(.9,) ~ .1(9') = 2.
Indeed
lim .1(.9,) = .1(f/) = 2
....... 0+
by Theorem V of Chapter V. Hence there exist e such that
21 t5 (F.) d (Ao) = 2* (.1 (.9,) )-1
is arbitrarily close to A1 A2 • This shows that for n = 2 the constant
26(,,-1) = 21 in Theorem II cannot be improved.
(5)
Let b 1 , " ' , b n be the basis for 1\ given by Lemma 2. Let ± c be
the points on the boundary of F(x) < 1 at which the tac-plane is parallel
to the plane IT through b 1 , .•• , b n - 1 (Theorem IV of Chapter IV). Then
every point in space can be uniquely put in the shape
x=y+sc, YElT. (6)
We put
fl = An - 1/A n ,
and define I\' to be the lattice of all points
Y+flsc, y+sc,:/\. (7)
Then (4) dearly holds. If s =1= 0 in (7), the point Y + s c is not linearly
dependent on b 1 , ... , b n - 1 ; and so
F(y + sc) ~ An'
Hence
(s =1= 0) (8)
by Lemma 5. On the other hand, the points of I\' with s = 0 are just
the points of 1\ which are linearly dependent on b 1 , ... , b n - 1 . Hence
(8) implies (5).
COROLLARY 1.
).~-1 An ~ b (F) d (1\).
For in the proof of the Theorem put
fl = AI/An
instead of A,.-l/A,.. Then (8) becomes
F(y +flsc) ~flJ.,. = Al (s =1= 0); (8')
so
F(a');;;; Al
for all a' EI\' except o. That is,
F(I\') ;;;; AI'
Further,
d (1\') = fl d (1\) = t-t.) d (1\) . (4')
Convex sets 215
But
P (1\') ~ 1J (F) d (1\')
by the definition of 1J (F); and then the corollary follows from (4')
and (8').
COROLLARY 2.
1 1
AI'" A,,~ 22(n-l)-,.- 1J(F) d(/\).
We only sketch the proof. By varying fl in the proof of the theorem,
we may obtain a lattice I\' with successive minima where Ai,
(1~j<n),
and
d (1\') ~ A~:1 d (/\) .
Then
Al ... A" < A; ... A~
d(i\) = d(R)'
Hence it is enough to prove the corollary when An - 1 =An • But it is
easy to see that if two of the numbers 'fJj in Lemma 4 are equal, then
the right-hand side of (1) of § 3.1 may be replaced by ~ (n -1) - : .
When this improvement is inserted in the proof of Theorem II, it gives
the corollary.
VIII.4.2. Before treating MINKowSKI'S estimates for the product of
the successive minima of a bounded symmetric convex body in terms
of the volume we must first prove a result, which we shall also use
later, relating to convex bodies and the quotient space !Jill\. We
shall use the concepts and notation of Chapter VII. As was done
there, we denote the points of fJl by small bold letters and those of
!JIj/\ by small gothic letters.
THEOREM IV. Let F(;r) be a convex symmetric distance-function
associated with a bounded convex set
Y: F(;r) <1 (1 )
of volume
Vp= V(Y). (2)
Let /\ be a lattice with successive minima AI, ... , An with respect to F.
For real t>O denote by Sit) the set of t.)E!JII/\ which have at least one
representative y in t Y (i.e. F(y) < i). Then ihe measure m {S (i)} of S(i)
satisfies ihe inequality
= tn VF if i ~ VI 1
m{S(i)} 1 ~(-Pl) .. ·aAJ)tn-JVF if !AJ~i~-~AJ+1J
~ (-}A}) ... (iAn) Vp if t ~ iAn.
216 Successive minima
(1)
F(a j ) =Aj
lie in F(;r) < 1. Hence Vp. ~ V' where V'is the volume of the set of (2)
subject to (3). But clearly
= -I det(a I ,· .. , a,.) 1=
I 2 n 2n I
V -d(A), (4)
n! n!
Polar convex bodies 219
where I is the index of a 1 • •••• a" in /\. This proves the left-hand side
of (1). since I~ 1.
COROLLARY. The index I of a1 • .••• a" is at most n!.
This follows from (4) and the right-hand side of (1). (Compare the
proof of Theorem X of Chapter V.)
VIII.5. Polar convex bodies. Let /\* and F* be the respective polars
of the lattice /\ (Chapter I. § 5) and the symmetric convex distance-
function F (Chapter IV. § 3). MAHLER (1939b) has shown that the
successive minima of /\* with respect to F* are determined by the suc-
cessive minima of /\ with respect to F apart from factors which have
bounds depending only on the dimension n. Thus relationship will be
exploited in Chapter XI in the discussion of inhomogeneous problems
and is of importance in other contexts. The theorem is. of course.
closely related to Theorem VI of Chapter IV dealing with the lattice
constants of mutually polar convex bodies.
THEOREM VI. Let AI' ...• An be the successive minima of a lattice /\
with respect to the symmetric convex distance-function F and let At I A!
••••
be the successive minima of the polar lattice /\ * with respect to the distance-
function F* polar to F. Then
(1~j~n). (1 )
We attack first the left-hand inequality. By Lemma 1 there exist
linearly independent vectors ai' af of /\ and /\* respectively such that
F(ai ) = Ai' F*(aj) = Ai. (2)
By Theorem III of Chapter IV we have
F(~) F*(~*) ~ ~~*
(scalar product) for any two vectors ~ and ~*. On applying this
~=±a,. ~*=±a' for any pair of indices i,j we have
since F(~) and F*(~) are symmetric. But a,af is an integer by Lemma 5
of Chapter 1. and so
either A, Ai ~ 1 or a,af = O. (4)
Since this is true for any I, this gives the left-hand inequality of (1).
We now prove the right-hand inequality in the enunciation. Let
a j (1 ~i~n) be as above. Then (d. Chapter I, § 5) there are n primitive
vectors bf of A* such that
(i =1= i) . (5)
(1~i~n). (6)
Hence the b! are linearly independent.
By Theorem III, Corollary 1 of Chapter IV, there are vectors ;):;
such that
F(;):;) F*(bf) = ;):ib;*. (7)
(1~i~n). (8)
The next stage is to show that for fixed J the determinant D J formed
from;):J and the a j (i=l=]) has absolute value at least d(A). For fixed J,
there is a basis ct, ... , c! for A* with
(9)
;):J = ± c" + L t, c i
l~i:o,,-1
DJ = Idet (ai' ... , a.1-1';):J' a HI' ... , a,,)1 = Idet (Vji};,~.fll det (cl , ... , c,,)I.
,*"
Polar convex bodies 221
The first factor here is a non-zero integer since the a i are linearly in-
dependent; the second factor is just d (/\). Thus
(11)
as required.
The points
with
ItJI F(xJ) +i*,J
L Itil F(a < 1j)
lie all in the set F(x) < 1 of volume Vp. This set of points has volume
and finally
(13)
by (7), (8). The inequality (13) holds for each integer] and for the
independent vectors by of /\*.
Now AI;:;;; A2;:;;;"';:;;; An and so, for each integer], there are the n+ 1 - ]
linearly independent vectors b*=br (J~j~n) of /\* such that F(b*)
~n!Ajl. By the definition of A:+ 1 -J it follows that
A:+1-J;:;;; n!Ajl.
This is the required inequality and so concludes the proof of the theorem.
The applications of the theorem are usually only qualitative so the
magnitude of the factor n! on the right-hand side is usually irrelevant.
MAHLER (1939b) showed that the weaker inequality
AJA!+l-J;:;;; (n!)2
can be deduced very simply from the left-hand inequalities, Theorem V
and Theorem VI of Chapter IV. We have
~A1· .. An~2"d(/\),
VpoAt ... A!;:;;; 2"d(/\*) ,
222 Successive minima
and so
VF VF* n AjA:+1_;
;
~ 22K d (A) d (A*).
Now
d(A)d(A*) =1
by Lemma 5 of Chapter I, and
22 "
VF~·~ (n!)1
JI
;
A;A:+1_;~AJA:t-l-J
and
(3)
The existence of a basis b j satisfying (2) follows at once from Lemma 8
of Chapter V on defining a j there to be the linearly independent points
with F(a i ) =A;. But now on multiplying (1) by F(b J) and using Theo-
rem V, we have
2"d(A)F(b J)F*(bj) ~ n! ~ II F(b j ) ~ (-i)"-I(n!)2VF
l~j:;;"
n Aj~2(n!)2d(A).
l;:;;j;:;; ..
Introduction 223
Chapter IX
Packings
IX.1. Introduction. If Y is any n-dimensional set and y a point,
we denote by Y + y the set of points
(1 )
By a packing of Y in some other set :T we shall mean a collection of
sets
(2)
each of which is contained in :T, and no two of which have points in
common. If:T is the whole space 9t we speak simply of a packing
of Y. If the Yr in (2) run through the points of a lattice A then we
say that the packing is a lattice packing. In this chapter we examine the
consequences of these ideas for the geometry of numbers. This chapter may
be regarded as a sequel of Chapter III but we shall also require some of
the general properties of convex bodies discussed in Chapter IV. We
shall find that the general theory of packings is relevant even to strictly
lattice-theoretic problems.
There is an admirable account of the theory of packing in FEJES
T 6TH (1953 a) and a conspectus of the more important results in
BAMBAH and ROGERS (1952a).
IX.l.2. The three following theorems show the relevance of packings
to the theory of Chapter III. We give the simple proofs here
THEOREM I. A necessary and suflicient condition that the lattice A
give a packing 01 the set Y is that no diflerence ~1 - ~2 01 two distinct
points 01 Y belong to A
Suppose, first, that ~1-~2=aEA Then the sets Y=Y+o and
Y +a both contain the point ~1 =~2+a, and so overlap. Conversely,
suppose that the sets Y +a1 and Y +a2 have the point y in common
where "t, a 2 are in A Then the two points y-a1=~1' y-a2=~2 are
in Y, and their difference a 2 -"t is in A
BLICHFELDT'S Theorem I of Chapter III shows that
V(Y) ~ d(A)
whenever A packs Y. The following theorem shows when the sign of
equality can occur. To avoid irrelevant topological considerations we
confine attention to rather special sets Y.
THEOREM II. Let Y be a bounded open star-body and A a lattice with
V(Y) = d (A). (1 )
224 Packings
(A) 1/ A packs !/', then every point in space either belongs to precisely
one set!/' +a, aEA and is not a boundary point 0/ any other!/' +a,
or is a boundary point 0/ at least two such sets !/' + a.
(B) 1/ every point 0/ space either belongs to or is a boundary point 0/
at least one set!/' +a, then A packs !/'.
By hypothesis, there is an R such that !/' is contained in
We now prove (A). Suppose, first, that A packs!/' and that there
is some point y which is not in or on the boundary of any!/' +a. aEA
We may choose e in the range 0<£<1, so small, that the sphere 9;.
of points a: with
9;.: Ia:-yl<e (2)
is clearly admissible for tt' and has the same determinant as I\.
There is an interesting unsolved problem in this connection. Let
~ and Jt; be convex symmetric bodies in n l and n 2 dimensions respec-
tively and let tt' be the (nl + n 2)-dimensional "topological product" of
~ and Jt;; that is the set of points
Can it ever happen that there is strict inequality here? The cylinder
is, of course, the case ~ = 2, n z= 1. WOODS (1958a) has shown that there
is equality in (3) for n1=3. n z=1 when ~ is a 3-dimensional sphere.
IX.l.6. In §§ 7, 8 are given applications by BLICHFELDT techniques
based on packing considerations, or at least BLICHFELDT'S Theorem I
of Chapter III, to the estimation of the lattice constants of the sets
x~+ ... +x:<1
and
IX1 ... X,.1 <1
respectively. The relationship of BLICHFELDT'S results to later work
will be discussed there.
IX.2. Sets with V(9') = 2" ,1(9'). We prove here the following result
of MINKOWSKI (1896a).
THEOREM V. Let 9' be an open symmetric n-dimensional convex set
which admits a lattice A with d(A) =2-"V(9'). Then 9' is defined by
m ~ 2" -1 inequalities 1 01 the shape
1~/;iXil < 1. (1 )
1
T1(X1-1) + L,T;X;= 0,
;<?;2
and so at least one equation
L,S;X;= o.
;<?;2
There is an equation of this type for each 1 for which z is on the boundary of .r(b/).
If xi' ... , X" are chosen so as not to satisfy any of these conditions, and arbitrarily
since TTk cannot pass through the inner point 0 of .9""(0). The correspond-
ing tac-plane - TTk through - lCk is obtained by changing the sign of
the fk; in (9). Hence every point of the open set .9""(0) satisfies the
inequalities
IIfk,x;1 <i· I
(10)
V(9') = 4d(A)
is that either
(i) 9' is a parallelogram and 1\ is generated by a mid-point 01 one
side and a point on one 01 the other pair 01 sides or
(ii) 9' is a hexagon and 1\ is the lattice generated by the mid-points
01 any two non-opposite sides. Then A contains the mid-points 01 all the
sides.
That 9' is a parallelogram or hexagon follows from Theorem V,
since 2"- 1 = 3 for n = 2. The lattices A are critical by MINKOWSKI'S
convex body theorem. The critical lattices of parallelograms and
hexagons have already been determined in Lemma 7 of Chapter III and
Lemma 13 of Chapter V respectively.
IX.3. VORONOi's results. We already saw in § 1 that if g(x) is a
positive definite quadratic form and A a lattice, then the set of points
such that
~(x) < inf g(x + a)
"Ell
*0
232 Packings
g (Xl. X 2) = a X~ + 2 h Xl X 2 + b X~
is reduced. in the sense that
(1)
(2)
is
Since (-u1 • -u2 ) occurs as well as (ul • U 2). we thus have the infinitely
many conditions
(3)
where
(4)
In particular.
21X1 1<a )
21X21 <b (5)
2IXI+Xal<a+2h+b=c (say).
where
O<a;£b;£c;£a+b. (1')
The set j(> defined by (5) is a proper hexagon unless h = O. when it
degenerates into a parallelogram. The area V(j(» of j(> is reasily com-
puted from (4) and (5) to be
V(j(» = 4 = 4d (Ao) .
But f/' is a subset of j(> and V(f/') =4. by Theorem II. Hence f/' =j(>.
since both are open. This implies that the infinitely many inequalities
(3) all follow from (5). which the reader may verify directly with little
trouble.
Further. every non-degenerate convex symmetric hexagon j(> with
its critical lattice may be generated in this way. as we now show. The
VORONOl's results 233
and
'iJ! =lliXl +l2i x 2
is the scalar product. The three 2-dimensional vectors are linearly
dependent and, by multiplying them by suitable factors, we may suppose
'i
without loss of generality that
'1 +'2 + fa = 0,
and, on re-indexing, that
kl -;;;' k 2 -;;;' ka · (7)
On taking Xi = 'iJ! (i = 1,2), the inequalities (6) become
(8)
Further,
of (4). On comparing with the earlier part of this section, we see that
the unique critical lattice of .Tf must be given by integral values of
xi, x~. We may thus suppose, without loss of generality, that (xi, x~)
was in fact the original co-ordinate system (Xl, X2), and then we have
the situation already discussed.
IX.3.2. From the results of § 3.1 we deduce the so-called "hexagon-
lemma" of DIRICHLET 1 which illustrates the connection between homo-
geneous and inhomogeneous problems that will be discussed in more
detail in Chapter XI.
I For an alternative derivation of the lemma and a partial generalization to
n dimensions, see MORDELL (1956a).
234 Packings
Then to every real point Xo = (Xl 0' x 20 ) there is a point u = (ul , u 2 ) with
integer co-ordinates, such that
4(ab-h2)g(xo+u)~abc, c=a+2h+b. (3)
The sign 01 equality is required when and only when
2(ab - h2) (xo + v) = ± {bra + h), - a(b + h)}, (4)
where v has integral co-ordinates.
For by the results of § 3.1 and by Theorem II there is certainly a
point xo+u with integral u in the closed hexagon
where
(5)
But the positive definite quadratic form g (x) can reach its maximum
in £ only at the vertices l of £. It is now readily verified that the
vertices are of the shape (4) and that the value of g (x) at all the vertices
is given by the right-hand side of (3). The calculations are facilitated
by the identity
g(X2' - Xl) = (ab - h2 ) g(x),
where Xl' X 2 are given by (5).
Finally, the ~ in (3) cannot be replaced by < if Xo is any vertex
of £, since
g(x) = inf g(x + u)
"EA,
for the points x of £. This last remark also shows that it was sufficient
to compute g(x) at anyone vertex a'l (say) since, from the nature of a
critical lattice, all the other vertices are of the shape ±xI + w, where W
has integral co-ordinates.
IX.3.3. Theorem VII gives yet another proof of the result that a
definite ternary quadratic form I(x) represents an number a~ (2D)! for
integral values of the variahles not all 0, where D is the determinant
of I(x) (§ 3.4 of Chapter II). We may suppose, without loss of generality,
1 Perhaps the easiest way to see this is to make a homogeneous linear trans-
formation Y='tz so that g(z) = iyi 2 , when it is obvious.
Preparatory lemmas 235
that I(~) is reduced in MINKOWSKI'S sense (d. Chapter II, § 2.1). We have
I(~) =ax~ + bx= + c x: + 2hx1 X 2 + 2g xlxa + 21 x 2 xa }
= a (Xl + (xXa)2 + 2h(Xl + (Xxa}lX2 + PXa) + b(X2 +PXa)2 + y x: (1)
for some (x, p, y. We may suppose that h~ O. Then
(2)
and
(3)
for all integers u1 , u 2 , by the condition of the reduction. But now, by
Theorem VII, we may choose u 1 , u 2 so that
I (u1 , u2 , 1)::;;
-
ab(a + 2h+ b)
4 (a b _ h2)
+ y. (4)
Hence from (1), (3), (4) we have
4D = 4(ab - h2)y ~ 4b(ab - hI) - ab(a + 2h + b)
Now
12h + }al ~ -i a ,
by (2); and so
4D ~ 3ab 2 - a2 b ~ 2ab 2 ~ 2aa,
by a further application of (2). This is the required result. Further,
using the knowledge of the cases of equality in Theorem VII, it is easily
verified that 2D=a3 can occur only for forms equivalent to multiples
of the critical form
Xl2 + X2 + Xa -
2 2 Xl Xa - Xa Xa - Xa Xl'
IX.4. Preparatory lemmas. In §§ 5, 6 we shall need three lemmas,
each of independent interest, which it is convenient to prove first. We
use the word polygon to mean indifferently a 2-dimensional set bounded
by a finite number of line-segments or the boundary of such a set.
Which is meant will be clear from the context. We shall say that a
convex polygon is circumscribed to a convex set :Yt" if it contains :Yt"
and if every side of the polygon is a tac-line l of :Yt". The first lemma is
an analogue of Theorem XI of Chapter V due to REINHARDT (1934a),
and found independently by MAHLER (1947c).
LEMMA 1. Let:Yt" be a convex symmetric open 2-dimensional set. Then
p+q+r=o.
Let yto be the hexagon formed by tac-lines at ±p, ±q, ±r to f,
taking the corresponding tac-line - TI at p to the tac-Iine TI taken at p,
if that is not unique, etc. Then yto is a symmetric hexagon circum-
scribed to f. The lattice M is admissible for yto by Theorem XI of
Chapter V, and so
L1(f) = d(M) ~ L1(yto) = t V(ytoL
by Lemma 13 of Chapter V. This concludes the proof of Lemma 1.
IX.4.2. The following lemma due to DOWKER (1944a) relates the
areas of circumscribed polygons to a convex set f, which need not be
symmetric. We sketch the proof, for which see also FEJES TOTH (1953 a).
LEMMA 2. Suppose that there exists a circumscribed (n +
1)-gon 91',,+1
and a circumscribed (n -1)-gon &'''-1 to a convex set f. Then there exists
a circumscribed m-gon with m ~ n and area
Similarly let ~jand b j be defined with respect to &'"H' where 1 ~j~ n+1.
Preparatory lemmas 237
we have
az '...,, \
\ 1
1
, \ 1
min{V(&,~), V(&,~')}
IJ~
"
'u \\ 1
1
~.... \ I
"J.v
:;;; HV(&',,-l) + V(&'''+1)}' 'f-.,
, ........ I
1
1
of .9'2n be tac-lines at
where
Let
(5)
where the bar denotes the image in the origin. Then, by symmetry,
the ~i are the sides of the circumscribed polygon .9'2n' which is the
Preparatory lemmas 239
Then
by (5).
Let &;n, .9';~ have sides
a1, "', an, /3"+1' ... , /32n and /31' ... , /3n, a,,+1' ... , a 2n ,
so .9';n and &;~ are symmetric, by (5). Precisely as in the second case
of the proof of Lemma 2 we have
the As' Then the whole plane is dissected by the As into a number rp
of connected pieces (the "faces") one of which contains all points
outside a large circle 1a:1
=R. Then EULER'S formula is
LEMMA 4.
rp+N=S+2.
This may be readily verified by induction on S.
IX.S. FEJES TOTH's theorem. In this section we prove a result
due to FEJES TOTH (1950a), see also FEJES TOTH (1953a). He proves
something more general and also gives interesting related results but
we give here only what is needed to treat the lattice constants of
cylinders.
THEOREM VIII. Let.Yf be a convex open polygon with at most 6 sides.
Let .:f be any convex open set and suppose that the sets
~ = .:f + a:, (1 ~ r ~ R)
are packed it~ .?It', i.c. the ~ are subsets 01 .?IF and no two have points in
common. Then
R U(6) ~ V(.Yf) ,
where U(6) is the lower bound 01 the areas 01 m-gons circumscribed to .:f
with m~6.
The notation U(6) is in conformity with that of Lemma 2, Corollary.
FEJES TOTH'S own version of his proof is very compact, and we have
found it desirable to expand it.
IX.S.2. The stages in the proof of Theorem VIII are enunciated for
convenience as propositions.
PROPOSITION 11. Let.Yf be a convex open 2-dimensional polygon and
let.Yt", (1 ~r~R) be open convex sets packed in.Yf. Then there are open
convex polygons fl, (1 ~ r~ R) such that fl, contains .Yt", and
( i) the fl, are packed in .Yf,
(ii) il 0' is a side 01 fl, then either,
(iiI) 0' is part 01 the boundary ol.Yf,
or
(ii2) there is a subsegment 0" 01 0' containing more than a single point
which is part 01 the boundary 01 a fls' (s=4=r), and
(iii) il 0' is a side 01 .Yf then some subsegment 0" 01 .Yf consisting 01
more than a single point is part 01 the boundary 01 some fl,.
Note that the .Yt", are not required to be similar to each other. We
shall give two proofs of proposition 1. The first is by transfinite induc-
1 Mr. H. L. DAVIES has pointed out that this Proposition is false as it stands by
giving a counter example. The proof of Theorem VIII can, however, be salvaged.
FEJES T6TH'S theorem 241
(O~t~1),
implies {~/l} ={~'}. By ZORN'S Lemma, since (I), (II), (III) are satis-
fied, to any packing {~} there is at least one maximal packing {~/l}
such that
{~} -< {~/l}.
But it is easy to see that in a maximal packing {~/l} the sets~:' must
be polygons fl, which satisfy the conditions (i), (ii) and (iii) of Proposi-
tion 1. Since this will be clear from the constructive proof which we
give later, we do not give the detailed argument here. This concludes
the first proof of Proposition 1.
We now sketch a second, constructive, proof of Proposition 1. The
fundamental process is this. If ~ is any open convex bounded set and
Bountlory of JI!
Fig. 12. r,' consists of Jr, together with the shaded region
P is any point not in~, then the open convex cover of ~ and p is the
least convex set which contains ~ and has p as a boundary point:
that is, ~ is the set of
tp+(1-t)q, qE~,
the open convex cover of p and Jt';. contains no points of any f, (r=l= 1).
It is possible that p = a. We then get a new packing {f'} on replacing
Jt';. by the portion of open convex cover of Jt';. and p which is in 1 Jf.
If the open convex cover of p and Jt';. does not meet any f, (r +-1)
for all q to the right of a, then f{ is to be the set of points in :K which
are in the convex cover of Jt';. and any point q to the right of a on ex.
Similarly one may consider points to the left of a alongex~
We may repeat the process on the new sets {f'} and will indicate
how after a finite number of steps. it must come to an end with polygons
fl, have the properties (i), (ii), (iii) of Proposition 1. We denote the
sets at the i-th stage by {fi}, so {fi- 1} < {fi}. Suppose first that
there is a pair of indices r, s such that f/ and f,1 have a boundary
point a in common. Then f,/+1, f,1+1 are obtained from f/, f,1 by
taking ex to be a common tac-line (Chapter IV, Lemma 6) to f/, f,1
at a and by applying the above process both to f/ and f,1 and both
to right and left along ex. Once this has been done for a pair of indices
r, s at the i-th stage we do not do it again for the same pair of indices at
a later stage. If there is no pair r, s of indices for which :f/, f,1 have
a common boundary point and which have not already been treated,
then there may be a body f,/ with a boundary point a on the boundary
of Jf. If so, we take ex to be the side of Jf on which a lies (both sides
in tum if a is a vertex of Jf) and apply the process. Again, once this
has been done for f,/ and a side of Jf we do not do it again for the
same r and the same side of Jf. Neither of the first two steps may be
allowable. Suppose that one of the f,/ is not a polygon. Then a is
taken to be any point on the boundary of f,! which is not in a line-
segment forming part of the boundary of f,s nor on the boundary of
.Jt1 (s =l= r). Finally, if all the :7("/ are polygons and the first two stages
are impossible, then a is taken to be any vertex of a f,! at which at
least one of the two sides is not also a tac-line to some f,1 (s=l=r).
It is clear that the steps outlined above will come to an end. And
the final set of .Yt[is clearly a set of polygons fl, having the properties
(i), (ii). (iii) of the enunciation.
IX.S.3. The next stage is an ,application of EULER'S formula (Lem-
ma 4) to the configuration of Proposition 1-
PROPOSITION 2. Let fl, 01 Proposition 1 have q, sides (1 ~r~R).
Then 2
where
aEA, maxlail <X. (4)
I
la3-YI<~-·
These L(y) cylinders give rise to a packing in the square
1xii < X +R (j = 1, 2)
of L(y) sets similar and similarly situated to tf. Hence
L(y) U'(6) < 4(X + R)2 (6)
by Theorem VIII, where U'(6) is infimum of the areas of circumscribed
m-gons to t f with m ~ 6.
But clearly
X+R
J L(y)dy =N
-X-R
from the definition of L(y). Hence
U'(6) N < 8(X + R)3, (7)
by (6).
Since Rand U'(6) are independent of X, the comparisGn of (1) and
(7) as X ~ 00 gives
d (A) ~ U'(6).
But
U'(6) = 4L1(tf) = L1(f)
by Lemma 3, Corollary. This completes the proof of (1), and so of the
theorem.
IX.7. Packing of spheres. The unit sphere
~,,: 1:1:1 <1
in n dimensions has volume
v.. = V(~,,) (1)
Packing of spheres 247
(3)
We shall need to know the asymptotic behaviour of the volume v".
From STIRLING'S formula l we have
lim n
"-+00
V:'" = 2ne, (4)
where
and
lim inf ny;l~ ine. (7)
"-+00
Of course the factor 2C (n) in (5) has no effect in (6) and might as well
have been replaced by 1. Indeed none of the improvements of the
Minkowski-Hlawka Theorem discussed in Chapter VI affect the constant
on the right-hand side of (6). On the other hand, BLICHFELDT (1929a)
has improved (7) to
lim inf ny;I~ ne, (8)
which appears to be the best asymptotic form to date 2. The argument
is a purely packing one and makes no use of the fact that only lattice
packings are relevant to (8). BLICHFELDT'S results have been improved
by RANKIN (1947a), and yet further, by a more perspicuous argument,
1 See any reputable text book on analysis, for example WHITTAKER and
WATSON (1902a) Chapter XII.
2 The improvement in (8) announced by CHABAUTY (1952a) is not correct,
see the review by RANKIN in Maths. Reviews 14, 541.
248 Packings
Suppose now that there is some function rp (~) of the vector variable
Z such that
(i) rp(z) =0 if Izl ~ e for some e
and
for all z,
since all points with rp(z-zr) =1=0 lie in Y(e). The comparison of (3)
and (4) gives
(5)
Of course the characteristic function of /7, which is 1 on /7 and 0
elsewhere, has the properties (i) and (ii). With this as the function rp,
the inequality (5) is rather weaker than (2). because we have replaced
V(Y) by V{Y(e)}: though of course this can be avoided by a refinement
of the argument. BLlCHFELDT observed that there are sometimes
Packing of spheres 249
R ~ 1:Il-:Il,12~~I:Il,-:IlsI2~2R(R-1),
l::i!,::i!R ,<s
by (2). But this is just the same as (8).
From this we have almost immediately
THEOREM X. Let:ll, (1 ~r~R) be points in the n-dimensional sphere
(9)
and let
Then
R ~ 2-,,/2 (1 + ;) (X + 26)". (10)
where V" is the volume of the unit sphere. The result now follows from
(5), since .9'"(e) is now the sphere
and let
The set .;v,. plays an important part in algebraic number theory (see
Chapter X), but the only precise values of v" known are
v: = 51, v~ =7
given by Chapter II, Theorem IV and Chapter X, Theorem V respec-
tively. Here we shall be concerned with estimates for v" when n is large
rather as in § 7. For information about what is known for n =4 or 5
see Chapter II, § 6.4.
In Chapter III, § 5.3 we already gave MINKOWSKI'S estimate
J(.#.:) ~ n"
, , - n!
In § 8.2 and 8.3 we give two lemmas and then in § 8.4 BLICHFELDT'S
proof of (1).
IX.8.2. The following Lemma of SCHUR (1918a) also occurs in the
theory of the "transfinite diameter" in analysis.
LEMMA 6. Let ~l' ..• ,~,. be real numbers. Then
The polynomial
The difterential equation (7) determines 1(17) completely in terms of, say,
1(0) and /,(0). Hence we may determine the symmetric functions ~ 17~
and II (1]i-1]i)2 in terms of 1(0) and /,(0). Since ~17~ =1 and the coef-
ficient of 17 m in I (17) is 1, this determines I (17) completely, and so also
II (1]i _1];)2 = {}. (8)
= II
i
1p(rt.,)
= (-1)/J n f{J(/J;)
;
= (_1)IJ R(1p, f{J).
If
w(17) = II
l~k;:oK
(17 - Yk)
where
on using the rules of operation (9.) and (10). But I",('YJ) =1 and
1",-I('YJ) ='YJ+y for some number I' (in fact 1'=0); so
(14)
by (9 2). The required value (2) for {} follows now from (11), (13) and (14).
IX.8.l. We also require an estimate of the number {}", occurring in
the last lemma.
LEMMA 7. II
G", = 1 . 22 ••••• m"',
then
lim sup {m- 2 10g G", -llog m};:;;; - 1.
"'-+00
Put
g(x) =xlogx (x>o).
Then
g'(x) = log x + 1
increases with x; and so
g(x + t) + g(x - t) ~ 2g(x) (1)
Then
lim inf v,. ~ (2n)1 et . (1 )
Now let
by Lemma 6, where {)". is the number defined there. On the other hand,
where
(i=f:i).
Hence
II ~~1. (5)
l;;;k;;;"
1=
<.0."
"V".
(!!'_ e2)1"".(".-1). (6)
n
Chapter X
Automorphs
x.t. Introduction. A homogeneous linear transformation w is said
to be an automorph of a point set .9 if .9 is just the set of points w:I:,
:l:E.9. The automorphs of a set .9 evidently form a group. Many of
the point sets of interest in the geometry of numbers, or which occur
naturally in problems arising in other branches of number-theory, have
a rich group of automorphs which is reflected in the set of .9-admissible
lattices. Already in the work in which he introduced the notion of
limit of a sequence of lattices, MAHLER (1946d, e) laid the foundations
for future work and indicated some fundamental theorems. Since then
much has been done but some challenging and natural questions remain
unanswered.
MAHLER (1946d, e) considers star-bodies with groups of automor-
phisms having special properties which he calls automorphic star-bodies.
In this account we prefer in each case to state the properties of the
group of automorphs which are required to hold.
We shall say that a homogeneous linear transformation w is an
automorph of a lattice A if wA =A, that is if A is precisely the set of
wa, aEA This is really a special case of the definition at the beginning
of the chapter since A is a point set. Since
F(w A) = F(A)
for a lattice A if w is an automorph of the distance-function F(:I:), since
F(A) = inf F(a) ,
oEII
*,0
by defimtion.
Introduction 257
\det(w)\ < 1-
If there is a lattice 1\ with F(I\) =1= 0, then there is a critical lattice M
for F(x) < 1, by Theorem VI of Chapter V. But then
F(wM) =F(M) = 1,
and
d(wM) = \det(w)1 diM) <.d(M),
in contradiction to the definition of a critical lattice.
For example, Theorem I shows that
\x~ x2 \ <1
is of infinite type since it has the automorphs xC"~ix, x2--+4x of de-
terminant 2. This was our example of a star-body of infinite type in
§ 5 of Chapter V.
THEOREM II. Let F(x) be a distance-function. Suppose that every
point Xo with F(xo) = 1 is ot the shape
(1 )
Cassels, Grometry of Nurnbf"rs 17
258 Automorphs
by (2). From (3) and (4) we have F(M) = 1. This concludes the proof.
Introduction 259
As an example of Theorem III one may take for f/, 9" respectively
the sets
and
9": IXlxsxal < 1, Ixsl < e, IXal < e,
where e is any fixed positive number. Then the automorphism w, may
be taken to be
XI=r-2e2xI' X 2 =re-I x s , Xa=re-Ixa,
where X = W,:I:. In this example one may deduce that a lattice A
with d(A) <LI(f/) has infinitely many points in f/.. For A must have
a point in 9" for every e>O. If A has no point a=!=o with a2 =aa=0,
this implies that A has infinitely many points in f/; and on the other
hand, if a = (al , 0, 0) is in A, then all the points ma (m = 1, 2, ... ) are
in A, so there are still infinitely many points of A in f/. Indeed the
argument shows that for any e> 0 there are infinitely many points
of A in :r. This sort of argument was already used for Lemma 12 of
Chapter V about the existence of infinitely many points in -1 <Xl x 2<k.
There we could prove rather more since this set was shown to be bound-
edly reducible. In § 7 we shall make a systematic study of when there
are infinitely many points of a lattice in a star-body following DAVEN-
PORT and ROGERS (1950a).
X.l.3. The point sets with a large group of automorphisms with
which we shall be concerned will be mainly constructed simply from an
algebraic form ffJ (:1:). For example ffJ (:I:) may be Xl XI' Xl X2Xa' Xl (X: + X:)
or X~ + X~ - xt and the set f/ may be defined by
IffJ (:1:)1 < 1 (1 )
or
o ~ ffJ(:I:) < 1 {2}
or
0< ffJ(:I:) < 1 (3)
or
- k < cp(:I:) < 1, (4)
where k and 1 are positive numbers. Of course (2) and (3) are not star-
bodies. Apart from sets especially constructed from sets of the type
(1)-{4) to act as counter-examples, other sets with large groups of
automorphisms have proved intractable. For example the lattice
constant of
IXII max(xt x~) < 1
is not known, though it would be of some interest in the theory of
simultaneous approximation and the problem has had considerable
Introduction 261
attention [see DAVENPORT (1952a) and CASSELS (1955a) and the refer-
ences given there J.
We shall make continual use in this chapter of the results of Chap-
ter I, § 4 about the relationship of lattices to forms.
A particular kind of lattice plays a special rl'>le in connection with
sets of the type (1)-(4), where IP(:I:) is an algebraic form. It is useful
to introduce some new terminology. If IP(a) is an integer for all aEA
we say that IP is integral on A. If, further, IP (a) = 0 for aE A only when
a =0, we will say that IP is non-null on A (the trivial zero at 0 being
disregarded). Finally, if there is some number t=l=O such that tIP is
integral on A we say that IP is proportional to integral on A. Then IP
is integral on IWI'" A, where m is the degree of IP'
In many, if not all, cases where the form IP has infinitely many
automorphs and the critical lattices Ac for one of the sets (1)-(4) are
known, it turns out that IP is proportional to integral on Ac' Indeed
in some cases IP is proportional to integral on every known admissible
lattice, and it is suspected, but not proved, that no other admissible
lattices exist. In other cases, there certainly do exist admissible lattices
on which IP is not proportional to integral, but the critical lattices are
not amongst them.
Before discussing the general· properties of a lattice A on which a
form l IP is proportional to integral and illustrating it with concrete
examples, it is convenient to prove a simple lemma.
LEMMA 1. Let l' > 0 and m> 0 be integers and let
y(u l , ... , u,)
where the gl' are polynomials to be determined. For any fixed values
of Ul , "'J U,-lJ the equations (6) determine uniquely the values that
must be taken by gl' (u1J . . . J U,_l) in (5); and then there are uniquely
1 We recollect that the word "form" implies homogeneity.
262 Automorphs
COROLLARY. II the y (ul , ... , uri are rational, so are the coellicients in f.
This follows at once from the proof.
Now let rp be a form which is integral on the lattice /\ with basis
bl , ... , b n • Put
I(u) = I(u l , ... , un) = rp (f
U i bi )· (8)
Then the function I (u) given by (8) is the product of three real linear
forms:
(9)
where /311' /312' /313 are numbers in a totally real cubic field srI and /3i k
is the conjugate of /31k in the conjugate field sri.
On the other hand, there are certainly lattices 1\ which are admis-
sible for
where i2 = - 1, there is a connection with the cubic fields that are not
totally real, similar to that of Xl X 2 Xa with totally real fields: it is clas-
sical, and will be proved in § 4.4 that if Xl (X~ + xi) is proportional to
integral and non-zero on 1\, then 1\ arises from a cubic field. But there
certainly are other admissible lattices for
(11)
Xl
X2=
= tIl Xl
=TX
t22 X2+ t 2a Xa
I
of the special type
Xa= tS2X2+taaXa,
where
for all z. Hence if A is admissible for Icp (z) I< 1, so is t ~ A for some
number t; and in general cp (z) will not be proportional to integral on
~ A if it is on A.
This does not exhaust the admissible lattices for Xl (X: + X:) < 1. One way to
show this is to use the arithmetic-geometric mean inequality in the shape
Hence any lattice admissible for IXI x2 xal < t is also admissible for IXI (x= + x:lI < I;
for example 2-! M has this property if Xl X 2 xa is integral and non-null on M (i.e.
when M arises from a totally real cubic field); and it is easy to see that Xl (X: + x~)
cannot be proportional to integral on M. [In fact the xl-eo-ordinates of M for
a lattice on which Xl X, xa or Xl (X: + x~) is non-null and proportional to integral
determine the relevant cubic field completely and it cannot be both totally real
and not totally reaL] More generally, one can construct admissible lattices by the
methods of Chapter VI, Theorem VIII, compare CASSELS (1955 b) for a closely
related problem.
It is an interesting problem to decide for any given form cp (z) if
there exist admissible lattices for a set Icp (z) 1< 1 on which cp (z) is not
proportional to integral. CASSELS and SWINNERTON-DYER (1955 a) have
considered the special cases cp (z) = Xl X 2 Xa and x~ + x~ - x~, but they only
transform the problems into another one. For another line of attack,
see ROGERS (1953b). It is reasonable to think that essentially new ideas
will be required even to cope with Xl X 2 Xa or x~ + x~ - x~.
X.l.4. An important part in the theory is played by so-called isola-
tion theorems. Their importance was first apparently recognised by
DAVENPORT and ROGERS (1950a) though there are foreshadowings in
MAHLER (1946e) and indeed in REMAK (1925a). A new type of isolation
theorem is proved and exploited in CASSELS and SWINNERTON-DYER
(1955a).
The phenomenon of isolation takes various forms all of which state,
roughly speaking, that lattices in the neighbourhood of a given lattice
M, with certain exceptions, are much worse behaved than M itself.
Thus one result we shall prove is that if Xl X 2 Xa is integral and non-null
on a lattice M, then to every e> 0 there is a neighbourhood ~ of M
in the sense of § 3.2 of Chapter V, depending on e, such that
Then /\ contains the point 't'wao' Although I't'ao-aol is small when 't'
is near the identity, it does not follow that I't'wao- waol is uniformly
small for all w, since in general w may be chosen so that wao is arbi-
trarily large. By suitable choice of w in Q M one may then show the
existence of a point 't'wa o in /\ ='t' M having the properties desired in
the problem in question, unless the transformation 't' satisfies certain
conditions. Sometimes one must start not with one point ao, but with
several, aI' ... , aT' so as to eliminate 't' of different kinds. This general
attack will be clearer from the examples in § 5. Isolation theorems may
be used to discuss the existence of infinitely many lattice points in
regions, as will be shown, following DAVENPORT and ROGERS (1950a), in § 7.
X.1.S. Before going on to the main subject matter of the chapter
we shall discuss in § 2 certain special forms and their groups of auto-
morphs. In § 3 we shall then discuss a method of MORDELL which shows
lOne of MINKOWSKI'S first applications of the geometry of numbers was in
fact to the theory of units in algebraic fields.
266 Automorphs
and i2 = - 1. If the XI are all real, then z; is real for 1 ~ j ~ rand z,H
and z,+ s+ k are conjugate complex numbers for 1 ~ k ~ s; and conversely,
if the ZI (1 ~ 1~ n) are of this shape then the XI are real. Let now w
be a real automorph of <p(a:). In the obvious way it gives rise to an
w
automorph of 1f' (z). Let Z = wz. Then
(4)
identically in Zl' ... , Z,,' where the ZI are linear forms in Zl' ... , z". The
only possibility is that Z L =).1 ZI where L = L (I) is a permutation of
1, ... , nand ).1' ... , )." are real or complex numbers. For our purposes,
it is enough to consider the automorphs
(1~l~n), (5)
where
(6)
by (4). But the transformation w transforms the real point a: into the
real point X = wa:. Hence Zl' ... , Z, are real and Z,H' Z'+s+k are
conjugate complex, and so
).i = real (1~j~r) }
(7)
).,+ k' ).,+s+ k conjugate complex (1 ~ k ~ s).
Special forms 267
Conversely, if the numbers Al satisfy (6) and (7), then (5) defines a real
automorph CI) of tp(~).
We shall also need the transformation CI)* polar to CI), that is the
transformation such that identically
when
x = CI) ~, y = CI)* y.
Now
L XIYI = LZIWI,
I I
where ZI is given by (3) and
(1~i ~ r)
where W = w*
w. In particular, the transformation CI)* is also an auto-
morph of tp (~).
X.2.2. We shall require also to know something of the automorphs
of the form
tp (~) = x~ + ... + x~ - ~+l - .•. - x:, (1 )
was shown that if b is any point of the polar lattice /\*, then there
are n - 1 linearly independent points of /I. on the plane
lTb: ;rb=Q
°
is certainly a point other than 0 of /\ in Y. If bn=f=O, for example, one
could project Y b on to the hyperplane Xn = and use Lemma 6 Corollary
of Chapter I. For this procedure to be effective, the vector bE /\* must
be chosen so as to give a good (n -i)-dimensional problem in lTb; and
so in general we have replaced one n-dimensional problem by another,
rather vaguer, one for the polar lattice, together with an (n -1)-
dimensional problem.
In this shape the technique has been applied by MULLENDER (1950a)
and DAVENPORT (1952a) to the enigmatic 3-dimensional starbody
Write
'P(x) = 'P" s(X) = X~ + ... + X~ - X~+1 - ••• - x~+s, (3)
and
(4)
over all lattices A, with the natural convention that if I'P I (A) = for °
all A, then 4, S= 00; as most probably happens when r > 0, S > 0,
r+s~5 (see appendix A).
We show first that
{I'PI (A)}'+S-l ~ C- 2 {1'P1 (A*)} d 2 (A) , (6)
where A* is the polar lattice of A and
where b l , b 2 , b3 occur only if r>O, S>O and both r>O, S>O, respectively.
Consider first b = b l , where
'P (b l ) = t2 • (10)
by (10) and (11). This proves (8) in the case b = b l since the left-hand
side of (12) is not less than {Irp I(I\)},+S-l. The proof of (8) in the second
case, when b = b 2 in (9), is similar except that the roles of rand s are
interchanged.
It remains to consider the case
b = b a = (t, 0, ... , 0, t) ,
where e> 0 is arbitrarily small, since this set has infinite (n - 1)-
dimensional volume. Hence (8) holds also when b = ba ; and so generally.
This concludes the"proof of (6).
We may also apply (6) to the lattice 1\* with its determinant
d (1\ *) = d- l (I\) and its polar lattice 1\:
is one of the two critical lattices. I f-&~ , -&;, -&; is a permutation of-&I' {}2' -&3'
then M(-&1' -&2' -&3) = M(-&~, -&;, -&;) if and only if the permutation is an
even one.
The geometrical purport of the lemma becomes clearer if new co-ordinates
YI' Y2 are introduced by the equations
1 V]
xI=YI' x2=-2YI+2Y2'
so
In YJ' Y2 co-ordinates, the region .'T has three asymptotes at an angle of 2n!3
and is carried into itself by either a rotation through 2n/3 round the origin or by a
reflection in an asymptote. Thc two critical lattices given by the lemma are then
each invariant under a rotation through 2n/3 and each is carried into the other
by a reflection in an asymptote. The reader may find it instructive to draw a
figure of the critical lattices each with 6 pairs of points on the boundary. For a
treatmcnt of sets .'T' which have similar symmetry and convexity properties to
.'T by the geometrical methods of Chapter III see BAMBAH (1951 a).
In what follows we do not introduce YI' Y2 as above but we do maintain the
essential cyclic symmetry between XI' x 2 and - XI - x 2 •
We note that the roots of (3) are
Ll
0'1 = 2cos~
2n Ll
0'2 = 2cos~
4n e"3 = 2cos~
6n
(5)
7 ' 7 ' 7 '
(k = 1, 2). (1O)
Then (7) becomes
II
1~j::;:;3
(a j1 u 1 +a j2 u2) = IT
l::;:;j::;:;3
(u 1 +8 j u 2 )· (11)
Cassels, Geometry of Numbers 18
274 Automorphs
Hence
(12)
where {fl' D2 • Da is some permutation of 8 1 , 8 2 , 8 a. From (10) and (12)
we have
lajl = D;+l - O;H }
(j=1.2.3). (13)
l a;2 = D; (Of+l - Of H)
where ° 4 =°1 • OS=02 and l is some number. By (11) we have
IT afl = 1.
f
and so in fact
la = (Dl - ( 2 ) (0 2 - Oa) (Oa - ( 1) (14)
=±7. (15)
where the value ± 7 may either be checked directly from (5) or from
the fact that the square of the right-hand side of (14) is the discriminant
°°
of the cubic f0 (u l • u 2 ) by definition (§ 5.1 of Chapter II). We note that
1 , 2 , Oa determine a l and a 2 absolutely uniquely. by (14).
we have
° °°
must correspond to a cyclic permutation of 1 , 2 , Oa. Hence there are
°
at most two distinct lattices M (°1 , 2 • Oa), for the permutations 1 • D2 • Da
of 8 1 , 8 2 , 8 a.
lt remains to show that M (°1 , '19 2 , Oa) is distinct from M (81 .82 , 8 a)
if D1 • O2 • Oa is an odd permutation of 8 1 , 8 2 , 8 a. We may suppose now.
without loss of generality. that
°=8 °=8
1 2, 2 1, Oa=8a·
From (4). (6). (13) and (15). a point b of M (81 .82 , 8 a) has
7ibj = P(8 j ) (j = 1. 2. 3). (16)
A method of Mordell 275
in some order. All the critical lattices A 01 .A'i which have a point a lor
which
(4)
1 Alternatively, (17) means that p(ef - 2) = p(e1 ); and so the polynomial
e.
+ t 2 _ 2t - 1. One may now put t = and obtain
P(t 2 _ 2) - P(t) is divisible by t 3
p(e.) = p(ea).
1S*
276 Automorphs
;i (D~ - D:)} ,
(7)
{O~. ~ (D: +D~).
where i 2 = -1 and {)l is the real, and D2 , Da are the complex roots 0/
and, as before,
I<pI (A) = aEA
inf l<p(a)i.
~o
where A* is the polar lattice of A. The proof follows closely the pattern
of the proof of Theorem IV. It is enough to show that
(14)
where b is any primitive point of A*.
Suppose, first, that !p (b) = 0. Then, after applying a suitable auto-
morph of .Ai furnished by § 2.1, we may suppose without loss of
generality 1 that
b=(1,O,O) (15)
or
b=(1,1,O). (16)
In the first case, (15), the plane
bz = 0,
which must contain two linearly independent elements of A is iust the
plane x1 =0, and all points on it satisfy !p(:':) =0. Hence (14) certainly
holds in this case. In the second case, (16), there are two linearly inde-
pendent points of A on
(17)
For these points
(18)
and the 2-dimensional set IX~X31 <8 is of infinite type for any 8>0.
Hence there are certainly points aEA other than 0 with 1!p(a)I<8.
This proves (14) in the case b is given by (16).
There remains the case when !p(b) =1=0 and so, after the application
of a suitable automorph, we may suppose that
b = (t, t, t) , t> 0, (19)
and so
(20)
that is, the polar lattices /\: and N: are identical at least on the plane
Xl + X2 + Xa = o.
Let now b = (bl , b2 , ba) be any point of /\: (and so of N:) with
(26)
Then the lattices /\~, N~ consisting of the points of /\t and of Nl re-
spectively in the plane
(27)
must both be critical, in the obvious sense, for the 2-dimensional section
of IXl X2 Xai < 1 by the hyperplane (27). By Lemma 3, there are only
two critical lattices and these have only the origin in common. Hence
/\~ and /\~ must be identical, since (1. 1, 1) belong to both lattices, by
(27). Thus /\t and /\1 coincide on any hyperplane (27) such that the
point b satisfies (26).
But now N:has a basis b:. bt,
b: (say) such that b b: =b:,
satisfies (26), for we have only to choose a suitable basis b: for the b:,
section of Nt by Xl + X 2 + Xa = 0 and extend it to a basis for N:. Let
bl , b 2 , ba be the polar basis for Nl . Then. on putting b = b:, b: in
(25) in turn, we see that /\t contains all points a of Nl such that either
b: a = 0 or b: a = 0;
that is all points of Nl of the shape either
U2 b2 + Ua ba or VI b l + Va b a,
where ul • U2• VI. V3 are any integers. Hence /\t must contain each point
U1 b 1 + u 2 b 2 + uaba = (u 2 b 2 + u 3 b 3 ) + (ul b l + Oba)
of N1 • Since d(Nl } =d(/\t}; we then have /\t=N l , as required.
This completes the proof of Theorem V. A. That of Theorem V. B
is similar except that Theorems VII and VII A of Chapter III are used
instead of Lemma 3. The details may be left to the reader.
X.4. Existence of automorphs. In this section we prove the exist-
ence of common automorphs of a lattice /\ and a form ffJ (~) which is
integral and non-null on I\, and make deductions about the possible
such /\ in a special case.
We shall require a quantitative form of MAHLER'S compactness
criterion, Theorem IV of Chapter 5.
LEMMA 4. There is a number
(1)
280 Automorphs
depending only on the integer n>O and the numbers ,11>0, ">0, 8>0
with the following property: amongst any No lattices 1\; (1 ;£i;£N) in
n-dimensional space such that
d (1\;) ~ ,11 (2)
and
11\;1 ~ ", (3)
there is at least one pair, say 1\1' 1\2' such that
1\2 = ~1\1 (4)
and the linear transformation ~ satisfies
(5)
where l is the iaentity transformation.
We recollect that
11\1 = aEA
inf lal, (6)
*0
and that the symbol 110'11 for a linear transformation X = 0':£ with
X;=L(I;"X" is 110'11 =nmax l(lf"l.
It would be possible to modify the proof of Theorem IV given in
Chapter V but it is simpler to follow the alternative proof sketched in
§ 2.2 of Chapter VIII. We suppose we have No lattices I\f, where No
will be determined later. By Lemma 3 of Chapter VIII there is a
,10> 0 and a K depending only on ,11 and ", such that any I\i satisfying
(2) and (3) has
d(I\;) ~ Llo >0 (7)
and has n linearly independent points in the sphere
1:£1 ~K.
By Lemma 8 of Chapter V, there is then a basis
of I\i with
(8)
Let '1/>0 be arbitrarily small, to be chosen later. Then, by (8), if No
is greater than an N1 depending only on n, '1/, Llo, K, that is on n, '1/,
Ll 1, ", there are two A; say 1\1 and 1\2' such that
Ibn - bi2 1<'1/ (1;£ i~ n). (9)
Since the b o are linearly independent, we have
for some numbers a;i' But now on solving for the aii from (7), (8)
and (9), we have
\aij\~aOrJ (1~i~n, 1~i~n),
where a o is a number depending only on ,1oK and n; a crude estimate
being
ao = n! ,10 1 (n K)"-1
obtained by estimating the elements of the matrix reciprocal to the
matrix with columns b n (1 ~i~n). Hence
\Ia\l<e
+
if rJ chosen to satisfy n a orJ < e. Hence 't' = l a has 't' Al = A2 and
I\'t'-ll\<e. Since A1 ='t'-IA 2 we have also 1\'t'-I- l l\<e, because (9) is
symmetric in AI' A2. This concludes the proof.
X.4.2. We shall also require the following rather trivial lemma which
says, roughly, that a form rp (z) cannot be integral on too many essen-
tially distinct lattices.
LEMMA 5. Let rp(z) be a lorm integral on a lattice A. Then there is
an rJ> 0 depending only on rp (z) and A with the lollowing property: II
rp (z) is integral on 't' A and
for the integers (3), since both sides of (4) are integers. By Lemma 1,
it follows that (4) holds for all real numbers Uj. Since every z is of
the shape LUjb j with real Uj' we have rp('t'z) =rp(z) for all z, as required.
COROLLARY. Suppose, Iurther, that rp(z) is non-null on A and that
d(A) ~,11
lor some ,11' Then rJ may be chosen depending only on rp and ,1., but not
otherwise on A.
For then
\rp(a)\ ~1 (aEA, a=Fo);
282 Automorphs
and so
IAI~c>o
for some c depending only on cp. Hence, as in the proof of Lemma 4,
there is a basis b1 , ... , b" of A with
(1;£j;£n)
for a K depending only on .d 1 and c, i.e. on .d 1 and cp. Hence all the
u,
points 1.: b, subject to (3) lie in a sphere
(5)
Then (2) holds for small enough 1J depending only on cp and K, since
cp(x) is uniformly continuous in (5). Hence the corollary follows.
XA.3. We are now in a position to prove the main theorem on the
existence of automorphs.
THEOREM VI. Let the form cp (x) be integral and non-null on the
lattice A and let a be any automorph 0/ cp (x). Suppose e> 0 is given
arbitrarily small. Then there is an automorph 't 0/ cp (x) with
I/'t-lll<e, (1)
such that
(2)
is an automorph 01 A for certain integers u, v with
o~u<v. (3)
It is not excluded, of course, that w may be the identical trans-
formation.
We have
Icpl (A) = inf\ cp(a)1 ~ 1, (4)
aE/\
,*0
by hypothesis, and so
(5)
for all integers u. Hence
laUA\ ~ c> 0 (6)
for all u and some constant c> O. Further,
d (aft A) = d (A) (7)
for all u since det (a) = ± 1 by Theorem I. By (6) and (7) we may apply
Lemma 3 to the aU A (1;£ u;£ N), if N is some large enough number,
to obtain two lattices a" I\. and a" I\. such that
aU A= 'tav A (u < v) (8)
Existence of automorphs 283
and
(9)
We may suppose, by choosing a smaller number instead of the ori-
ginal e if necessary, that e<1], where 1] is the number in Lemma 5,
Corollary with L11=d(I\). We may then apply Lemma 5, Corollary with
a V 1\ instead of 1\ and deduce from (8), (9) that't is an automorph for
<p(x). Hence w defined in (2) has all the properties required.
Theorem VI becomes false if the condition that <p(x) be non-null
on 1\ is omitted, as is shown by the 2-dimensional example where 1\ = 1\0
is the lattice of integral vectors, <p (x) = Xl X 2 , and a is the automorph
x1 -?2x1 , X 2 -?!X 2 . But in more dimensions it is sometimes possible to
use the idea behind Theorem VI to construct automorphs of 1\ even
when <p (x) may be null on 1\, for example, by restricting attention to
automorphs leaving fixed an element or elements of 1\ or of the polar
lattice 1\*.
X.4.4. Theorem VI takes a particularly simple shape when
<p (x) = { II
l~i~'
Xi}{ n
I ;i;k;i;s
(X~+k + X~+S+k)}' (1)
+
where n = r 2s, which is substantially equivalent to, but rather
stronger than, DIRICHLET'S theorem on the existence of units in an
algebraic number field. We write as usual
(1~j~r)
Suppose that <p (x) is integral on 1\ and that to> 0 is given arbitrarily
small. Then there are numbers Wi compatible 'with <p (x) and an integer
m> 0 such that
(1~j~n), (3)
284 Automorphs
and such that the translormation (a) given in the appropriate complex co-
ordinates by
(1~i~n)
is an automorph 01 A
The automorphs of rp (a:) were discussed in § 2.1. From what is said
there it is clear that if Z ='U is an automorph of rp given in the ap-
propriate complex co-ordinates and if
II,; -lll < n, (4)
where n is the dimension, then T must be of the shape
(1~i~n); (5)
that is, there can be no permutation of the forms on the right-hand side:
indeed, if '; is written as Zi= L T,IIZ,<> the inequality (4) implies
II
(1~i~n),
to prove it here.
LEMMA 6. Let rp (a:} given by (1) be integral on A and let the automorph
Z =(a)Z 01 A be given in the appropriate complex co-ordinates by
(1~i~n).
Then the Wi are algebraic units, that is they satisly an equation 01 the type
I(w;) = 0,
where
(6)
lor some m and c1 , ••• , cm - 1 are rational integers.
Let b l •.•.• b" be a basis for 1\. so that
where m is the matrix with elements mik and w is the diagonal matrix
with elements WI' .•• , Wn on the diagonal. Hence
w =B-1 mB,
and Wt, ... , Wn all satisfy the equation I (wi) = 0, where
I(t) =det(tl-m),
which is of the form (6).
The two following corollaries are immediate
COROLLARY 1. WI'"'' Wn satisly the same equation 01 type (6) with
m=n.
COROLLARY 2. II Wj is rational, then Wj= ±1.
Although we do not need it later it is interesting to note that Theorem VII
and Lemma 6 rapidly gives a complete characterisation of the lattices A on which
q:>(:I:) is proportional to integral and non-null, at least when r>O. We only sketch
the proof, for details see BACHMANN (1923a) Kap.12.
LEMMA 7. All the lattices A on which q:>(:I:) is proportional to integral may be
obtained in the following way. Let srI' ... , sr"
be a set of conjugate algebraic fields
of degree n over the field of rational numbers, wI/ere srI' ... , sr,
are real and sr,+k,
sr,+s+k are conjugate complex (1 ~k~s). Let Yll' ... , YIn be linearly itldependent
elements of srIover the rationals and iet Ylk (t ~l~n) be the conjugate of Ylk in sri'
Let M be the lattice with basis
(t:::;;k:::;;n)
in the appropriate complex co-ordinates. Then a necessary and sufficietlt condition
tllat q:> (:1:) be proportional to integral and non-null on a lattice A is that A be of the
shape
wllere t is real, "" is an automorph 0/ q:>(:I:), and M is of the type fust described.
When r > 0, the proof is shorter than the enunciation. By applying Theorem VI I
with
Since WI' ...• w" all satisfy the same equation of degree 11. they must all by (10)
be precisely of degree n and so conjugates. Let b l •...• b n be a basis for /\ and
use the notation (7). (8). Then it follows from (9) that
(Pil' ...• {Jj II)
is an eigenvector belonging to Wj of the matrix tn. But clearly m has a set of con-
jugate eigenvectors
(Yil' ...• Yin)
in the fields sr j generated by Wj; and if these are identified with those of the enuncia-
tion it is easy to see that the lattice M has the required properties.
When y= O. the position is more difficult since it may be impossible to achieve
that the Wj are all of degree n. though it is possible to make them all of degree
tn. Let a = (IX\ •...• IX,,) and b = (Pl' ...• {In) be two linearly independent vectors
of /\ in the appropriate complex co-ordinate system. Then 91 (ua + v b) is a poly-
nomial in the variables u and v with coefficients proportional to integers. and it
vanishes for integers u and v only when u = v = O. Hence lXI/PI is an algebraic
number. Similarly. if c= (YI' ...• Yn)E/\ is linearly independent from a and b.
then the ratios IXI/YI' PI/YI are of degree n as is also (PIX I + qPI)/:'l for any integers
p and q. It is not then difficult to deduce that lXI/PI is in a field of degree n depending
only on /\ and not on the choice of a and b; and the rl)st follows with some little
trouble. We do not go into details as we do not use the result.
X.S. Isolation theorems. As was stated in § 1 there is a wide
variety of isolation theorems, and it hardly seems worth while to for-
mulate theorems of great generality. We shall instead consider only
three concrete cases.
We shall need the following simple Lemma which is really a simple
case of KRONECKER'S Theorem and belongs of right in Chapter XI.
LEMMA 8. Let a., p, 1', 6 be real numbers with a.6 - PI' =t= o. Suppose
thata.IPisirrational. Then to every number e>O there is an rJ =rJ (a.,p, 1',6, e)
with the following property:
For any numbers A, I-' there are integers m, n such that
Ima.+np-AI<e, Imy+n6-I-'I~rJ·
By MINKOWSKI'S linear forms Theorem there are integers (m, n) =t= (0,0)
such that Ima.+nPI is arbitrarily small; and ma.+np=t=O since a.IP is
irrational. Hence there are integers (ml' nl ) and (m2' n 2) such that
o<lmla.+nJPI<e, O<lm 2 a.+n 2 PI<e,
and
Put
(j = 1, 2),
so that
(j = 1, 2),
Let e, f1 be the solution of
eXl + f1 X z = A,
Isolation theorems 287
(6')
Hence
£0;' =O(T1h, (7)
where the constant implied by the 0 symbol may depend on aI' a2, and
where we assume TJl in (2) chosen so that, say, ITn -11 < t. Put
c='t'a,,,, (8)
where a", is given by (4). Then, in the first place, it follows from (6)
and (7) that
so
Icll <1
if TJl is chosen small enough. Secondly, it follows from (6) or (6') that
(9)
But now, by (7),
£02'" C2 = Tn al £02 2 '" + Tn a2 = Tn a2 + 0 (Tn). (10)
Put TJO=tW22. Then since a2<O<al , we have from (7), (9) and (10),
that
provided that Tl2' T2l are small enough and that Tn, Tn are near
enough to 1, which may be achieved by taking TJl small enough in (2).
This concludes the proof when T12 >O. The proof when T12 <O is
completely similar, except that b is used instead of a.
COROLLARY. Under the hypotheses 01 the theorem except (3), there is
an TJ2 such that il
and 't' is not an automorph 0/ Xl .%2, then 't' A contains a point c with
al a2(1 - TJo) < c) c2 < bl b2 (1 - TJo) •
Isolation theorems 289
Put
(4)
so
(5)
and
PI> 0, pz< 0, Pa< 0,
ql< 0, qz> 0, qa< O.
Hence
PI qa - Paql = P2Qa - PaQ2 =F O. (6)
We now show that
(7)
is irrational. If not, there would be an automorph A =.s"~o with
integers (u, v) =F (0,0) for which, in an obvious notation, At = )'2' But
Cassels, Geometry of Numbers 19
290 Automorphs
this being one of twelve possible cases i • Now 1\ certainly does contain
some point a with
~>0>a2'
We shall pick one such point and keep it fixed in all that follows, so
that numbers depending only on a and 1\ will be said to depend only
on 1\, etc.
By Lemma 9 with an £ > 0 to be chosen later and
1 For the maximum in (9') may correspond to anyone of the six pairs (i, i)
with i*i; and the maximal Tij may be either positive or negative.
Isolation theorems 291
We put
Note that we have used the full force neither. of Lemma 9 nor of
the inequalities (13).
The proofs of the following two corollaries maybe left to the reader.
COROLLARY1. Theorem IX remains valid i/lcl c2 c3 1<8l is replaced
by 0<1 c1 c2 cal <81'
COROLLARY 2. To every 82 >0 there is an '1/2>0 depending only on
A, 82 such that, i/
II-r - LII < '1/2
and one 0/ 1'12,1'13,1'21,1'23 is not 0, then there is a cE-rA with
Zi = ~ Ti"z", (i)
where "
1'12 = Tn. Tn = Tal. Tn = Tn T23 = T32' Tn = Taa (2)
and the bar (-) denotes the complex conjugate.
THEOREM X. Let
!p(:r) = Xt (x; +~)
be proportional to integral and non-null on A and let
A = 1!p1 (A) = oEA
inf I!p(a)!. (4)
,*0
Then there are numbers '1/1>0, '1/2>0 with the following properties:
Suppose that -r is a homogeneous trans/ormation in the appropriate
complex co-ordinates given by (1) and (2) such that
1I-r-LII<'1/l' (5)
Isolation theorems 293
Then
(i) II T12=TlS=FO, there lS a C=(Yl,Y2'YS)=F0, in complex co-
ordinates, in TA such that
(6)
(ii) II TSI=T21=FO, there is a C=(YI'Y2,YS)=FO in TA such that
(7)
By Theorem VII there is an automorph Z =wz in complex co-
ordinates of the shape
°
contrary to hypothesis (d. proof of Lemma 9). Hence X is irrational.
Thus by Lemma 8 with e = 1, there is a number 'Y/s> with the follow-
°
ing property: To every pair of numbers e> and 'P there are integers
u and v such that
lux+V-'PI<l (9)
and
-1 Tau
'Y/s < - <'Y/s· (10)
e
We now prove (i). Since IP (:1:) is proportional to integral on A,
there is an aEA of the shape
a=(tX),tX 2 ,0Cs). tX2=Ce(O), IXs=Ce(-O), (11)
where
tX1>o, C>O, A=tX1 C2
and A is defined by (4). Put
T12 = -ae('P). T13= -ae(-'P), (12)
where a> 0. Then a is small when liT - LII is small. We now choose
integers u and v to satisfy
lux+v-('P+O)i<l (13)
[d. (9)] and (10) with
(14)
294 Automorphs
so that
-2 2aC Tau
'YJa < Tn ell
<1- (15)
Since Tn is near 1, there are two constants 1'/, 'YJ", depending only on 1\
(and a), such that
0<1'/ a-I < T" <1'/, a-i. (16)
We shall show that the point
C ='t"w-ua = (YI'Y2,Ya) (17)
then
and with
is of the shape
A=twNI'
where t~ 1, w is an automorph 01 .Ai, and NI is delined in Theorem V.
B. There is an 1J2> 0 such that every lattice A admissible lor
.A'; : IXl (x~ + x~) I < 1
and with
is 01 the shape
A ='tw N2 ,
where N2 is delined in Theorem VB, w is an automorph 01 .A'; and 't
is a transformation Xi= l: TikXk with T12=T)3=T21=T31=O.
k
We first prove A by reductio ad absurdum. Suppose, if possible,
that 1JI does not exist. Then there exists an infinite sequence of admis-
sible lattices M, (1;;;;r<oo), none of the shape twN 1 , and such that
d(M,) -+ 7.
296 Automorphs
Now
such that
Then
(4;)
for some numbers t, and some automorphs 00, of .Ai. This clearly
contradicts the hypotheses of the theorem.
As stated in § 1, it is unknown whether such a D or such a A exists.
X.7. An infinity of solutions. We now prove some results of DA-
VENPORT and ROGERS (1950a) about the existence of infinitely many
points of a lattice in certain point-sets with groups of automorphisms.
They prove more than we do here; the reader is referred to their inter-
esting memoire for the details.
The following trivial lemma gives almost all we need for the first
type of result.
LEMMA 10. Let n be some group of homogeneous linear transforma-
tions 00. Suppose that for every :1:*0 and every number r tnere is an ooEn
such that
100:1:1> r.
Then for every pair of numbers c, C with
O<c<C<oo (1 )
and every number r there is a finite set of elements 001, ... , 00", of n such
that
m~x 100;:1:1 >r (2)
1;:;i,S;".
for aU:I: in
*
THEOREM XIII. Let the boundedly reducible 1 star-body !/ have a group
n of automorphisms 00 such that to every z 0 and every r there is an
OOEn such that IOO:l:I>r. Then to.every integer k>O there is a bounded
set .9;. contained in !/ such that every lattice A with d (A) < LI (!/) has at
least k points in .9;. other than o.
That .9;, exists is equivalent to the statement that !/ is boundedly
reducible. We suppose .9;. has been found and deduce the existence of
.9;.+1' We may suppose without loss of generality that .9;. is the set
of points of !/ in some sphere
1:1:1 ~ C = Ck •
Further, there is a positive number Ck < C such that the entire sphere
(4)
1 For definition, see Chapter V, § 7.2.
An infinity of solutions 299
IIP(a)1 ~ ---;-
(23)
(1 -1)0) d(l\.) , I~I < e. (2)
We will choose 1)0 later in the course of the proof. Suppose that (ii)
is false for some particular I\. and e. For integers r = 1, 2, ... , let 1\., be
the set of points (r2 Xl' r-l X 2 , r-l Xa), (Xl> X 2 , Xa) E I\. Then there IS a
convergent subsequence
(3 )
and M is admissible for
IXl (x~ + x~)1 <~(1 -1)0) d(I\.).
(23)
(4)
for some integer ai' Since
x2,i+l> x 2,i > X 2, i-I'
we must have
ai > O.
Then we must have the - sign in (4), since
out that the behaviour of any particular basis, say :£J' :£1+1' of A is
influenced very strongly by the value of aj for j near to ] but only
very weakly by a;. for i remote from]. In many problems it is possible
to study the behaviour of only a few a j at a time. Hence the name
"local methods".
It would be interesting if local methods could be successfully extended
to problems in more than 2 dimensions, for example to problems relating
to Xl max (x:, x:), Xl (x~ + x~), x~ + x~ - x~ or Xl X 2 %3 • The difficulty is
not to find the analogues of the xi but to devise techniques to cope
with their interrelations. Continued fractions have however been
generalized to 2-dimensional lattices over a complex quadratic field,
i. e. substantially to certain special 4-dimensional lattices, see POITOU
(1953a) and the references there given.
Chapter XI
Inhomogeneous problems
XLt. Introduction. As previously, we say that points:£1 and:£2 are
congruent modulo A, written
:£1 = :£2 (A) ,
where A is a lattice, to mean that x 1 - :£2E A The set of points :£ con-
gruent to a given point :£0 modulo A is called a grid1 @: A will be called
the lattice of the grid and we shall call
d(@) = d(A)
the determinant of the grid. The characteristic inhomogeneous problem
of the geometry of numbers is to find conditions under which a grid
has a point in a given set 9'.
There is a wide variety of different problems. Thus one may be
concerned with all grids of given determinant d (@) or one may have
information about the lattice A Many of the fundamental techniques
for inhomogeneous problems are natural extension of those for lattices
[compactness theorems and so on; for bodies with automorphs see
SWINNERTON-DYER (1954a)J. For some specialized problems some
extremely powerful and delicate techniques have been developed which
would take too much space to discuss properly. Hence this last chapter
will have more the character ofa report and less that of a detailed
exposition.
1 Other terms are inhomogeneous lattice or non-homogeneous lattice.
304 Inhomogeneous problems
b 1 = (4 R. o. o..... 0)
b2 = (0. 'Y}. O. O•...• 0)
b 3 = (0. o. 'Y}, O•...• 0) (1 )
Every point :r1 of space is congruent modulo " to precisely one point
of the parallelopiped
f!J: {Ylbl+ ... +Ynb,,}
(-j ~ Yj <j).
1 So m (~o) = F(~o) in the notation of Chapter VII § 2.2, where ~o is the element
of the quotient space to which ~o belongs.
Introduction 305
and
It (/\) = sup m (:ro,/\)' (3)
or,
Clearly
,u (t /\) = Itill (/\) (4)
for any t =F O.
The infimum in (2) need not be attained, though it clearly is attained
when the set F(~) < 1 is bounded. The function m (~o) need not be
continuous, but it is semi-continuous:
and then
F(~ + a) < m(:ro) + 8
for all ~ in a neighbourhood of ~o, by the continuity of F(;J!); so
m (~) < m (~o) + 8 in this neighbourhood. Again, when F(~) < 1 is
bounded, the function m (~) is readily seen to be continuous. The reader
will be able to supply the proofs of the positive statements just made
on the lines of the proof of the semi-continuity of the function F(/\)
in Chapter V, § 3.3. Examples to show that the infimum in (2) need
not be attained and that m (~) need not be continuous are provided in
2 dimensions for certain lattices /\ when F(~) = IXl X21l. This case has
implications in the theory of algebraic numbers and has been extensively
investigated both because of this and because of its intrinsic interest;
see BARNES and SWINNERTON-DYER (1952a, band 1954a) and BARNES
(1954a), where there are extensive references to earlier work. There is
some work on similar lines for IX1 X2X3 1l (n=3), but it has not been
carried so far, see DAVENPORT (1947c), CLARKE (1951 a) and SAMET
(1954a, b).
From the definition (2) it follows that m (~) may be regarded as
defined on the quotient space al//\ (compare Chapter VII). Since this
is compact, it follows from (5) that the supremum in (3) is always
attained; that is, there is an ~l such that
Of course the infimum in (2) need not then be attained for ~l=~O'
With unbounded sets F(~) < 1 there may be again a phenomenon of
successive minima; that is, it may happen that
where possibly b(F) = O. If the set F(:r) < 1 has finite volume ~o', we
now show that
(8)
Let A be some lattice and e> 0 be arbitrarily small. There is a point
:rl congruent to any given point :ro and satisfying
F(:rl) < fl (A) + e. (9)
Hence the set (9) must have volume at least d (A). Since the volume
of the set of points :rl satisfying (9) is
{fleA) + e}" VF ,
the required result (8) follows.
We shall show in § 3 that if the body F(:r) < 1 is bounded, the
infimum in (7) is attained; that is there is a lattice M such that
{p (M)}" = b (F) d (M) .
We shall treat the estimation of b (F) for convex distance-functions
F in § 2 where the relevant literature will also be discussed.
When VF = 00 it is, of course, still possible that b (F) > O. In par-
ticular, DAVENPORT (1951 a) showed this to be the case for the 2-dimen-
sional distance-function
(10)
His estimate,
was improved to
b(F) ~ 451'2
More recently I, she has obtained an even smaller upper bound for
b(F).
The problem of determining b (F) for F given by (10) is closely related
to the problem of determining the real quadratic numberfields with a
Euclidean algorithm. DAVENPORT extended his work to number-fields
of two other types corresponding to
P=XI(X~+X~) and p= (x~+x~)(x~+x:).
These results were proved by the author [CASSELS (1952a)] much more
simply and with a better estimate of b(F).
HLAWKA (1954c) has generalized these results to any distance-
function F(;x:) in n variables which may be put in the shape
(not a star-body!) and quote other work about sets defined in term
of Xl'" X".
1 I am grateful to Miss PITMAN for allowing me to refer to this unpublished work.
now published. Acta Arithmetica 6 (1960). 37-46.
2 The first case has been settled by E. S. BARNES [J. Austral. Math. Soc.
2 (1961/62) 9-10]. who shows that b(F) =0.
20·
308 Inhomogeneous problems
or, more generally connecting fl (1\) and the successive minima of F(3!}
with respect to I\. When F(3!) is convex, there are further relations
with the corresponding quantities for the polar distance-function F*(3!}
and the polar lattice 1\*. These relations go under the general name of
transference theorems l (Obertragungssatze). Thus DIRICHLET'S hexagon
Theorem VII of Chapter IX may be regarded as a very precise trans-
ference theorem for Ix~ + xW.
We shall discuss transference theorems
for convex functions F(3!) in § 3. Much interesting work has been done
on transference theorems for the non-convex F(3!) defined by
{F(3!)}" = II (X~+k + x~+s+k),
Ixl ... x,1 l;i;k;i;s
where n=r+2s, but here we can only refer the reader to the paper
of DAVENPORT and SWINNERTON-DYER (1955a), where references are
given to earlier work. There is a striking related result in SWINNERTON-
DYER (1954a).
1 Presumably because information is transferred from one problem to another.
Convex sets 309
~ be the portion of f/' lying between the line joining C and d and the
Jine joining C - a and d - a. and taken closed; i.e. including the points
of f/' on those lines. Then clearly ~ is convex and every point of the
plane is congruent modulo A to a point of ~. After a finite number
of steps (since f/' is bounded) we obtain a closed convex set .r ( f/'
such that every point is congruent modulo A to a point of .r but no
two sets.r and.r +a. aEA have inner points in common. Then every
boundary point of .r is also a boundary point of .r + a for some a ==1= 0
in A. Since.r and .r +a are convex. this common boundary is either
a point or a line-segment. Since.r is bounded. only a finite number
of a come into consideration. and so .r is a convex polygon. We must
now show that it is symmetric about some point. Let the vertices of
.r be cl ..... c.... where the line segment cici+l is the common boundary
of.r and.r +ai • aiEl\. Then the line-segment (ci-a j ) (Cj+l- aj) is the
common boundary of .r and .r - a;. Hence m is even. m = 2l. and
Hence
t(C; + Ci+/) = t (C;+l + ci+l+ /)
1;£ ~bW);£ i
For details in 2 dimensions see FEJES T6TH (1950a and 1953a) and
BAM BAH and ROGERS (1952a).
Not much is known about b(F) in more than 2 dimensions. When
F(x) < 1 is the unit 3-dimensional sphere, the precise value has been
found by BAM BAH (1954 b), and other proofs have been given by BARNES
(1956b) and FEW (1956a); but all proofs are fairly complicated. The
4-dimensional sphere has been considered by BAMBAH (1954a), who
obtains an estimate for b (F) and gives a conjecture for the correct
value. Estimates for b(F) above and below and also for the corres-
ponding number for non lattice coverings have been obtained for
n-dimensional spheres, see BAMBAH and DAVENPORT (1952a), DAVEN-
PORT (1952b) and WATSON (1956a) for the lattice case, and ERDOS
and ROGERS (1953 a) and ROGERS (1957 a) for the non-lattice case, the
last treating general convex sets. Very recently ROGERS (1959a) has
obtained much stronger results by more powerful methods.
XI.2.2.1 ROGERS (1950b) has given an elegant proof of the following
result relating b (F) to the function
c5(F) = sup {F(A)}"
A d(A)
introduced in § 4 of Chapter IV.
THEOREM III.
b (F) ~ r" 3,,-1 c5 (F)
for all symmetric convex n-dimensional distance-functions which vanish
only at the origin.
ROGERS (1950b) also proved a similar result for non-lattice packings
and coverings, and indeed with the smaller constant 2-1 instead of
2-"3,,-1. Before proving Theorem III we note the following
COROLLARY.
VFb(F) ~ 3"-1,
where VF is the volume of F(x) < 1.
For VF c5 (F) ~ 2" by MINKOWSKI'S convex body theorem.
ROGERS proves Theorem III by considering a critical lattice M for F,
that is
F(M) = 1, d(M) = {c5(F)}-1. (1 )
1 When n is at all large, the results of this section are superseded by ROGERS
(1959 a ).
I
312 Inhomogeneous problems
m (:r1) = sup
al,
'In (:ro)
= fl(M) (2)
= fl (say).
Then
m (3 :r1) -;£ fl '
and so, since F(:r) < 1 is bounded, there is an aE M such that
F(3:r1 - a) = m (3:r1) ~ fl.
Then
(3)
and so 1a is not in M.
Let A be the lattice of points
b + ; a, bEM, r = integer,
so
d(A) = td(M).
Hence
{F(A)}" -;£ 15 (F) d (A) = 115 (F) d (M)
by the definition of 15 (F) ; that is, there exists a point b + ...':-a+o of A
such that 3
{F(b + ; a)}"-;£; 6(F)d(M). (4)
I
F(b) ~ F(M) = 1,
and (1) and (4) are in contradiction. Hence r= ±1, and
L-;;;;
d(M)
r"3,,-115(F)
.
(6)
Since the left-hand side of (6) is at most b (F), by the definition of b (F)
as an infimum (§ 1.3), the theorem follows.
Transference theorems for convex sets 313
discussed in § 1 and the successive minima AI' ... , All of F with respect
to 1\ which were discussed in Chapter VIII.
We first prove the inequality
(2)
Let b1 , ••• , b" be any basis for I\. Then by the definition of fl and the
fact that F(x) < 1 is bounded, there are vectors cjEI\ such that
F(t b j - c j ) ~ fl.
where
IUj-~jl ~t,
and U 1 , ... , u" are integers. Then, clearly,
F(xo - a) = F {L (~j - ui ) aj}
I
~ L I~i -
j
ujl F(a j )
~t LF(a j )
=t LA j •
1 For suppose that ~ rj d j = 0, where the 1'; are integers which, without loss
j
of generality, may be supposed to have no common factor. Then ~ I'j b j = 2~ I'j ('j'
j
Since the b j are a basis, all the I'j must be even. A contradiction!
314 Inhomogeneous problems
On the other hand. the maximum of Al + ... + I." for given Al and pro-
duct At ... ).,. is clearly attained when Al = 1.2= ... = 1... - 1. Hence. by (2)
and (3).
(6)
Both the inequalities (5) and (6) may be improved. The problem
of obtaining an estimate above for p in terms of Al is an old one which
has been attacked by many methods. The latest result due to KNESER
(1955a) and BIRCH (1956a) will be proved as Theorem V. The inequality
(5) has attracted much less attention. We sketch a proof of an improve-
ment due to BIRCH (1956b). as Theorem IV. BIRCH actually proves
something slightly stronger than Theorem IV and gives examples to
show that it cannot be much further improved.
THEOREM IV.
It was long conjectured that (9) was valid for all Q, but the following
example, due to KNESER and BIRCH (see BIRCH 1956a), shows that in
fact the weaker inequality (8) cannot be improved for 1 ~ Q< 2. Let
where O~ e< 1 is fixed and u l , ... , u" runs through all integers (note
the change of sign in the last co-ordinate). Then
for any 1x~0, t2~0. This follows at once from the "Sum Theorem",
Theorem I of Chapter VII. Indeed, S (Ix +t 2) contains the sum S (Ix) + S(t a),
where addition of sets is as defined in § 3 of Chapter VII, since
F(Yl+YZ)<Ix+ts if F(Yl) <Ix and F(Ya)<ta.
We also remark that p, is the lower bound of the numbers t such
that m{S(t)}=d(A). Clearly m{S(t)}=d(A) if every point of Bl is
congruent modulo A to a point ~ with F(~)<t. Conversely, suppose
that m{S(to)}=d(A). Let e>O be arbitrarily small. Then m {S(e)}>O
by (10). and so every point of Bl/A belongs to S(to)+S(e) (S(to+e) by
the first part of the "Sum Theorem" I of Chapter VII.
We now prove (8) very simply. By (10) we have
QAl~ nAn'
Then 2/-l;S Al Q by (2), which proves (9) in this case. Otherwise, by (11),
so that
F(x) ~ cFa(x) , .c> 0
for some c and all x, since F(x) < 1 is bounded. Hence clearly
/-l(O)(A) ;S c-l/-l (A) ,
where the superfix (0) indicates that the quantity is relative to Fa. In
particular, if /-l (A) is bounded above for some set 2 of lattices A, then
so is ",(0) (A); and hence AiO) is bounded below a strictly positive number
by Theorem IV [or by the weaker inequality (5) of § 3.1]; that is
inf lal
IAI = aE/\
,*,0
is bounded below.
318 Inhomogeneous problems
0)
The precise values of the constants in (3) are immaterial: what
matters is that the ratio K(xo)/m (xo) lies between constants. Theorem VI
goes back in essence to KHINTCHINE (1948a). KRONECKER'S Theorem
follows from it in a few lines [compare Chapter V, § 8 of the author's
tract (CASSELS 1957a), where a less general form of Theorem VI is
given].
Transference theorems for convex sets 319
so
= L II~iIlF(bj)
j
I
such that Ia,l = Ai' Let c l , c2 , c 3 be a set of mutually orthogonal
vectors such that
a1 = cl
a2 = V2l c1 + c 2 0)
a 3 = V 31 c1 + V32 C 2 + c3
(1~i~3)· (4)
(1~i~3)·
Hence
I~~I = ~~ Ic112 + ... +~: Ic312 ~ i (A~ + ~ + A:)
by (4). This establishes (2).
We now construct a 4-dimensionallattice M as follows. There is a
point ~o such that
fl = fl (A) = inf 1~ I· (5)
z=z.(A)
e= {
d(l\)
).1).2).3
}i •
But now. since Izl < 1 is bounded. there is certainly an Z1 such that
int Izl ~ p.;
IZll = z_z,
and the corollary follows.
XI.4. Product of n linear forms. Let
~ (z) = Ixl •.. x"I I ,,,. (1 )
As in § 1.4 we put
f-ll (A) = sup int ~ (z) • l) = sup (PI (I\))" (2)
z, z ..z,(A) 1 A d(l\) •
where the bii are integers and bi;=I=O. bi;==O for i>i. For any real
numbers (XlO •.•.• x"o) we can thus choose integers u l • •..• u". in order.
so that
Iu;b jj + ... +u"b; .. +xiol ~jlbiil·
For zl=Utbl +··· +u"b,,+zo. we then have
as required.
The conjecture (3) has been proved only for n = 2. 3. 4. A great
many proofs of (3) for n = 2 for have been given; we shall present one in
§ 4.2 due to SAWYER. This has the advantage that it gives naturally a
result for the .. asymmetric" distance function I
1
1\ ,(z) = { k IXI x2 1i if Xl x2 ~ O} .
, llx1 X 2 1 if X1X2~0
10f course Fk,/(a:) < 1 is symmetric about 0; but it is not symmetric in the
four quadrants.
Product of n linear forms 323
where k and l are positive numbers. These arise quite naturally even in
originally symmetric problems; indeed the result we shall prove was
first obtained by DAVENPORT (1948a) as a tool in his work on the "sym-
metric" problem for indefinite ternary quadratic forms. Further results
about F;. I have been obtained, notably by BLANEY (1950a), BARNES
and SWI~NERTON-DYER (19S4a) and, as an adjunct to anotherinvestiga-
tion, by BARNES (1956a). We refer the reader to these papers for further
details.
When n = 3, MINKOWSKI'S conjecture (3) was proved by REMAK
(1923 a, b) and a simplified proof was given by DAVENPORT (1939a).
We give DAVENPORT'S proof in § 4.3, having already paved the way in
§ 3.4. A proof for n = 3 using different ideas has been given by BIRCH
and SWINNERTON-DYER (1956a).
When n =4 a proof of (3) has been given by DYSON (1948a) following
the same general line as REMAK'S proof. It is an extremely powerful
piece of work and requires tools from topology as well as from number-
theory proper.
For n>4 only estimates for '1)1 are known. It was shown by TSCHE-
BOTAREW (1934a) that
21*
324 Inhomogeneous problems
and such that no two lattices A, A' of the set are of the type A' = t CJ) A,
where t is real and CJ) an automorph of 1\(~); so if MINKOWSKI'S con-
jecture is true then the first minimum is certainly not isolated. ROGERS
(1954c) has investigated the least number ,u~ (A) such that for every
e> 0 and every ~o there are infinitely many solutions of
1\ (~) < ,u~ (1\) + e
and obtained general conditions for 1\ under which ,u~ (A) =,udl\).
CHALK (1947a, b) has obtained the complete answer for what may
be regarded as an extreme asymmetric version of MINKOWSKI'S problem.
He shows namely that for any lattice 1\ and any point ~o there is an
= =
~1 (XlI' .•. , X" 1) ~o (1\) such that
(1~i~n), (4)
X11 ••• X"l ~ d (1\). (5)
That ~ in (5) cannot always be replaced by < is shown by the simple
example when 1\ =1\0 is the lattice of integer vectors and ~o=o. The
case n =2 was obtained by DAVENPORT and HEILBRONN (1947a). When
n=2, BLANEY (1957a) has given an interesting strengthened form:
namely that for every ~o there is an ~1 = (xu, x2l ) = ~o (1\) such that
(i = 1,2)
and
i (126* - 11) d(/\) ~ XII X21 ~ d (1\),
where the ~ on the left cannot be replaced by < for a certain lattice
A The proof is a classic example of the local methods discussed in
general terms in § 8 of Chapter X. COLE (1952a) has shown that to
=
every ~o there is an ~1 ~o such that
(1~i~n-1)
and
X11 ••• x"_1,1Ix"11 ~ id (1\).
CHALK (1947b) discusses when for given ~o there are infinitely many
~1 = ~o (1\) satisfying (4) and (5). The principle behind the proof of
CHALK'S theorem is similar to TSCHEBOTAREFF'S, and we prove it in
§ 4.4. The idea has been put in a much more general form by MACBEATH
(1952a) and C. A. ROGERS (1954b), but we do not go into that here.
XI.4.2. The proof of MINKOWSKI'S conjecture in 2-dimensions may
be made to depend on the following lemma due to DELAUNAY (1947a).
He used it as a tool to investigate ,Ill (1\) (in the notation of § 4.1) for
individual 2-dimensionallattices 1\; and the so-called "algorithm of the
Product of n linear forms 325
triangle Z2 Zs z" such that the only points ofin the closed triangle
@
Z2 Zs t are the vertices. Lemma 2 now follows from Proposition 2 on
putting
YI = zS, Y2 = Z2' Ys= t.
Since we have now disposed of all three cases, this concludes the
proof of Lemma 2.
COROLLARY. II zi = (xli' X 2i ) then
1IIxlix2il ~z-8{d(A)}'.
i
For the area of the divided cell is d(A). It is also the sum of the
areas of the four triangles $f with vertices 0, zi' iri +1 (1 ~ i ~ 4; irs = irl ).
But now the area of $f is
but there are three linearly independent points 0/ A on the boundary 0/ tf.
We call the ellipsoid tf free if 0 is the only point of A in it. We shall
assume that a free ellipsoid cannot have three linearly independent
points of A on the boundary, for some particular lattice A, and will
ultimately deduce a contradiction.
We note first that
P1P2P3~(~r{d(A)}-2>0 (2)
for some choice of the three ± signs, since the ±OJ lie on a plane through
the origin and so are points of a2-dimensionallattice on the boundary
of an ellipse which contains no point of the lattice (Theorem XI of
Chapter V).
Thirdly, under our hypothesis, if there are two pairs of points
± fit and ± 02 of A on the boundary of a free ellipsoid, they cannot
both lie in the same co-ordinate plane, say, x1 =0. For then we should
have
p2a~1 + P3a~1 = 1, 01 = (0, an, as1)
p2a~2 + Paa~2 = 1, 02 = (0, au, as2)·
If P1 is decreased but Pa, Pa kept constant, the points 01' Os remain on
the boundary and the volume of the ellipsoid increases. Ultimately
there must come a third point on the boundary for some value of P1'
°
since it is impossible to decrease P1 to without a point of A entering
328 Inhomogeneous problems
the ellipsoid. by (2). Hence for some PI the ellipsoid is free but there
are points a,.. az. as on the boundary. where as is not on x1 =0. This
contradicts the hypothesis whose absurdity we wished to prove.
Fourthly we show (on our hypothesis) that if there is a free
ellipsoid (1) with the points ±a1 • ±azEA on the boundary. then there
is one with ±a,.. ±az and ± (a,. +a2) on the boundary. For put
as=a,.+a2 • a4 =a,.-aZ ' and write
(1~i~4).
Then
(f=1.2) }
(f=3.4).
There are numbers ql. qz. q3 not all 0 such that
(i = 1.2). (4)
and after a change of sign. if need be. we may suppose without loss of
generality thatl
qla~4 + qza~4 + q3a~4 ~ O. (5)
We now consider the ellipsoids
(PI + t ql) ~ + (PI + t qz) ~ + (P3 + t q3) x~ = 1
where
t~o.
We distinguish three cases. Suppose, first, that an =1= 0, a 21 =l=O, a31 =1=O.
Then, by (6),
(j=1,2,3) (n-+oo),
in contradiction to (2). Suppose now that precisely one of an, a 21 , aal
vanishes, say, an=U, a21 =1=O, a31 =1=O. Then by the third remark above
we have a12 =1= 0, and so
(j = 2, 3),
again in contradiction to (2). Finally, suppose that two of an, a 21 , aal
vanish, say, a n =a 21 =O=l=aai' Then a l2 =1= 0=1= a22 , and so
P1(n) <= a:- 2
12
(j = 1,2), p~n) -+ 0,
(7)
Let PI' P2' Pa be the numbers given by Lemma 3, so that A has no
point in Pl x~ + P2 x~ + Pa x~ < 1, but three linearly independent points
on the boundary. Hence the three successive minima of A with respect
to the distance-function
F(x) = (PI xi + P2 x~ + Pa x~)~ (8)
are
(9)
We may now apply Lemma 1 Corollary to the lattice M of points
(Pi Xl' PR x 2 , pg Xa), (Xl' x 2 , Xa) E A,
with determinant
d (M) = (PI P2 Pa)! d (A)
and with successive minima with respect to Ixl given by (9). Hence to
any Xo there is a congruent Xl such that
since by (5) and (6) every factor on the right-hand side is ~ 1, and one
at least is ~ 1. Hence choosing for ~3 that one of ~± for which II x;
is least, we have
"i3>0 (1~i~n); II
;
"i3 ~ (1)6 II "j2"
i
1 Following MACBEATH (1952a), but in our special cases the argument can be
simplified.
Product of n linear forms 331
by (12). Hence, on taking for z,. that one of z± for which II Ixi I is least,
we have
where
52 = max {I 1 - t21, 11- it21}< 1.
As in the proof of IX we reach an ZI satisfying (7) after a finite
number of steps, the number of steps being bounded by a number
depending only on n, II IXiOI and E. This concludes the proof.
i
Appendix
In this appendix we list the lattice constants of some sets connected
with quadratic forms and give further references and some additional
comments. We write
~"s(z) = x~ + ... + x~ - x~+1 - ... - x~+s,
and denote by r"s the lattice constant of the set
I~"sl < 1
in n-dimensional space, where
n=r+5.
Results about definite forms are usually given in terms of 1'.. where
1': = r,;:o·.The first 8 values are known:
I'~ = 1, I'~ = t, I'~ = 2,
I'~ = 8, 1': = ~:-, I'~ = 64,
The value of 1'1 is trivial; the values of 1'2,1'3'1', have been found
in this book (Chapter II, Theorems II,. III and Chapter X, Theorem IV,
Corollary). For references and a list of the corresponding critical forms
see CHAUNDY (1946a), who gives proofs that 1':=28, l'~g=2J.°/3; but
CHAUNDY'S proofs contain a lacuna. Presumably his line of argument
would lead to incorrect results by n = 12; see COXETER and TODD (1953 a)
for a special form in 12 variables.
For indefinite forms we have
n2 _5
.lI,I-4"
1;~1 = I;.~2 = !
I~~2 =!
To3,1 =.lI,3
2 nll
= 4"7
Appendix 333
B 2,1
2 1
=4;,
O~fIJ"s<1.
Then
1 For later work on this problem, mainly due to DAVENPORT and BIRCH, see
RIDOUT (1958a).
References
BACHMANN, P. (1923 a): Die Arithmetik der quadratischen Formen II. Leipzig and
Berlin.
BAMBAH, R. P. (1951a): On the geometry of numbers of non-convex star-regions
with hexagonal symmetry. Phil. Trans. Roy. Soc. Lond. 243, 431-462.
(1954a): Lattice coverings with four-dimensional spheres. Proc. Cambridge
Phil. Soc. 50, 203-208 (1954).
(1954 b) : On lattice coverings by spheres. Proc. Nat. Inst. Sci. India 20, 25 - 52.
(1954c): On polar reciprocal convex domains (and addendum). Proc. Nat.
Inst. Sci. India 20,119-120,324-325.
(1955a): Polar Reciprocal Convex bodies. Proc. Cambridge Phil. Soc. 51.
377-378.
(1955b): Divided cells. Res. Bull. Panjab Univ. 81, 173-174.
(1958a): Some transference theorems in the geometry of numbers. Mh. Math.
62, 243-249.
'-. and H. DAVENPORT (1952a): The covering of n-dimensional space by spheres.
J. Lond. Math. Soc. 27, 224-229·
-, and C. A. ROGERS (1952a): Covering the plane with convex sets. J. Lond.
Math. Soc. 27, 304-314.
-, and K. ROGERS (1955a): An inhomogeneous minimum for non-convex star
regions with hexagonal symmetry. Canad. J. Math. 7, 337-346.
BARNES, E. S. (1950a): Non-homogeneous binary quadratic forms. Quart. J. Math.
Oxford (2) 1, 199-210.
(1951 a): The minimum of the product of two values of a quadratic form 1.
Proc. Lond. Math. Soc. (3) 1, 257-283.
(1954 a): The inhomogeneous minima of binary quadratic forms IV. Acta
Math. 92, 235-264.
(1955a): The non-negative values of quadratic forms. Proc. Lond. Math. Soc.
(3) 5,185-196.
(1956a): The inhomogeneous minimum of a ternary quadratic form. Acta
Math. 96, 67-97.
(1956b): The coverings of space ey spheres. Canad. J. Math. 8, 293-304.
(1956c): On linear inhomogeneous Diophantine approximation. J. Lond. Math.
Soc. 31, 73-79.
(1957a): On a theorem of Vorono!. Proc. Cambridge Phil. Soc. 53, 537-539.
(1957b): The complete enumeration of extreme senary forms. Phil. Trans.
Roy. Soc. Lond. 249, 461 - 506.
-, and A. OPPENHEIM (1955a): The non-negative values of a ternary quadratic
form. J. Lond. Math. Soc. 30, 429-439.
-, and H. P. F. SWINNERTON-DvER (1952a, b): The inhomogeneous minima of
binary quadratic forms I, II. Acta Math. 87, 259-323; 88, 279-316.
- - (1954 a): The inhomogeneous minima of binary quadratic forms III. Acta
Math. 92, 199-234.
BIRCH, B. J. (1956a): A transference theorem of the geometry of numbers. J.
Lond. Math. Soc. 31, 248-251.
References 335
CHALK, ]. H. H. (1947a, b): On the positive values of linear forms I, II. Quart.
]. Math. Oxford 18, 215-227; 19, 67-80 (1948).
- (1949a): Reduced binary cubic forms. ]. Lond. Math. Soc. 24, 280- 284.
- (1950a): On the frustrum of a sphere. Ann. of Math. (2) 52, 199-216.
-, and C: A. ROGERS (1948a): The critical determinant of a convex cylinder.
]. Lond. Math. Soc. 23,178-187.
- (1949a): The successive minima of a convex cylinder. ]. Lond. Math. Soc.
24, 284-291-
- (1951): On the product of three homogeneous linear forms. Proc. Cambridge
Phil. Soc. 47, 251-259.
CHAUNDY, T. W. (1946a): The arithmetic minima of positive quadratic forms.
Quart. ]. Math; Oxford 17 (67),166-192.
CLARKE, L. E. (1951a): On the product of three non-homogeneous forms. Proc.
Cambridge Phil. Soc. 47, 260-265.
- (1958 a): The critical lattices of a star-shaped octagon. Acta Math. 99,
1-32.
COHN, H. (1953 a, b): Stable lattices I, II. Canad.]. Math. 5, 261 - 270; 6,265 - 273
(1954).
COLE, A. ]. (1952a): On the product of n linear forms. Quart.]. Math. Oxford (2)
3, 56-62.
CORPUT, ]. G. VAN DER (1936a): Verallgemeinerung einer Mordellschen Beweis-
methode in der Geometrie der Zahlen. Acta Arithmetica 2, 145- 146.
COXETER, H. S. M., and]. A. TODD (1953 a): An extreme duodenary form. Canad.
]. Math. 5, 384- 392.
DAVENPORT, H. (1938a): On the product of three homogeneous linear forms I.
Proc. Lond. Math. Soc. (2) 44, 412-431-
(1939a): A simple proof of Remak's Theorem on the product of 3 linear forms.
]. Lond. Math. Soc. 14, 47-51-
(1939 b): On the product of three homogeneous linear forms III. Proc. Lond.
Math. Soc. 45, 98-125.
(1939c): MinkowSki's inequality for the minima associated with a convex body.
Quart.]. Math. Oxford 10,119-121-
(1941 a): Note on the product of three homogeneous linear forms. ]. Lond.
Math. Soc. 16, 98-101-
(1941 b): On a conjecture of Mordell concerning binary cubic forms. Proc.
Cambridge Phil. Soc. 37, 325-330.
(1943 a): On the product of three homogeneous linear forms. IV. Proc. Cam-
bridge Phil. Soc. 39, 1-21-
(1945 a, b): The reduction of a binary cubic form I, II. ]. Lond. Math. Soc.
20,14-22,139-157.
(1946a): On a theorem of Tschebotareff. ]. Lond. Math. Soc. 21, 28- 34 and
Corrigendum 24, 316 (1949).
(1947a): On a theorem of Markoff. ]. Lond. Math. Soc. 22, 96-99.
(1947b): The geometry of numbers. Math. Gazette 31,206-207.
(1947c): On the product of three non-homogeneous linear forms. Proc. Cam-
bridge Phil. Soc. 43, 137- 152.
(1948a): Non-homogeneous ternary quadratic forms. Acta Math. 80, 65 - 95.
(1949a): On indefinite ternary quadratic forms. Proc. Lond. Math. Soc. (2)
51,145-160.
(1950a): Note on a binary quartic form. Quart. J. Math. Oxford (2) 1, 253-261.
(1951 a): Indefinite quadratic forms and Euclid's algorithm in real quadratic
fields. Proc. Lond. Math. Soc. (2) 53, 65-82.
References 337
HALL, M. (1947a): On the sum and product of continued fractions. Ann. Math.
(2) 48, 966- 993.
HARDY, G. H., and E. M. WRIGHT (1938a): An introduction to the theory of
numbers. Oxford.
HLAWKA, E. (1944a): Zur Geometrie der Zahlen. Math. Z. 49, 285-312.
(1949a): Ausfiillung und Oberdeckung konvexer Karper durch konvexe Karper.
Mh. Math. Phys. 53, 81-131.
(1952a): Zur Theorie des Figurengitters. Math. Ann. 125, 183-207.
(1954a): Grundbegriffe der Geometrie der Zahlen. ]ber. DMV 57,37-55.
(1954b): Zur Theorie der Oberdeckung durch konvexe Karper. Mh. Math.
Phys. 58, 287-291.
(1954c): Inhomogene Minima von Sternkarpern. Mh. Math. Phys. 58, 292- 305.
] OHN, F. (1948 a): Extremum problems with inequalities as subsidiary conditions.
Studies and essays presented to R. COURANT, p. 187-204. New York.
KELLER, O. H. (1954a): Geometrie der Zahlen. Enzyklopadie der Math. Wissen-
schaften, Bd. 12 , H. II, Teil iii.
KHINTCHINE, A. Y A (1948a): A quantitative formulation of Kronecker's Theory of
approximation [in Russian). Izv. Akad. Nauk SSSR. (Ser. Mat.) 12,113-122.
KLEIN, F. (1895 a): Ober eine geometrische Auffassung der gewahnlichen Ketten-
bruchentwicklung. Nachr. Ges. Wiss. Gattingen 1895, 357-359.
- (1896a): Ausgewahlte Kapitel der Zahlentheorie. (Ausgearbeitet von A. SOM-
MERFELD und PH. FURTWANGLER.) Leipzig.
KNESER, M. (1955a): Ein Satz tiber abelsche Gruppen mit Anwendungen auf die
Geometrie der Zahlen. Math. Z. 61, 429-434.
- (1956a): Summenmengen in lokalkompakten abelschen Gruppen. Math. Z. 66,
88-110.
LEKKERKERKER, C. G. (1956a): On the Minkowski-Hlawka theorem. Proc. Kon.
Ned. Acad. Wet. Amsterdam A 59 (= Indag. Math. 18),426-434.
- (1957 a): On the volume of compound convex bodies. Proc. Kon. Ned. Acad.
Wet. A 60 (= Indag. Math. 19), 284-289.
MACBEATH, A. M. (1951 a): The finite volume theorem for non-homogeneous lattices.
Proc. Cambridge Phil. Soc. 47, 627-628.
(1952a): A theorem on non-homogeneous lattices. Ann. of Math. (2) 56,
269-293.
(1953a): On measure of sum sets. II. The sum-theorem for the torus. Proc.
Cambridge Phil. Soc. 49, 40-43.
-, and C. A. ROGERS (1955a and 1958b): A modified form of Siegel's mean-value
theorem I, II. Proc. Cambridge Phil. Soc. 51,565-576 and 54,322-325 (1958).
- - (1958 a): Siegel's mean value theorem in the geometry of numbers, Proc.
Cambridge Phil. Soc. 54, 139-151.
MAHLER, K. (1938a): On Minkowski's theory of reduction of positive definite
quadratic forms. Quart.]. Math. Oxford 9, 259-263.
(1939a): Ein Obertragungsprinzip fiir lineare Ungleichungen. Casopis pro pest.
mat. a fys. 68, 85-92.
(1939 b): Ein Obertragungsprinzip fiir konvexe Karper. Casopis pro pest. mat.
a fys. 68, 93-102.
(1946a, b, c): Lattice points in two-dimensional star domains I, II, III. Proc.
Lond. Math. Soc. (2) 49,128-157,158-167,168-183.
(1946d, e): On lattice points in n-dimensional star-bodies. I. Existence theorems.
Proc. Roy. Soc. Lond. A 187, 151-187. - II. Reducibility theorems. Proc.
Kon. Ned. Akad. Wet. 49, 331- 343, 444-454, 524- 532, 622-631.
(1946f): The theorem of Minkowski-Hlawka. Duke Math. J. 13, 611-621.
References 339
RtDEI, L. (1955a): Neuer Beweis des Haj6sschen Satzes tiber die endlichen abel-
schen Gruppen. Acta Math. Hungarica 6, 27-40.
- (1959a): Neuer Beweis eines Satzes von DELaNE tiber ebene Punktgitter.
J. Land. Math. Soc. 34, 205-207.
REINHARDT, K. (1934a): Ober die dichteste gitterfOrmige Lagerung kongruenter
Bereiche in der Ebene usw. Abh. Math. Sem. Hansische Univ. 10, 216-230.
REMAK, R. (1923a, b): Verallgemeinerung eines Minkowskischen Satzes I, II.
Math. Z. 17,1-34; 18, 173-200 (1924).
(1925 a): Ober die geometrische Darstellung der indefiniten binaren quadrati-
schen Formen. Jber. DMV 33, 228-245.
(1938a): Ober die Minkowskische Reduktion. Comp. Muth. 5, 368- 391,
RIDOUT, D. (1958a): Indefinite quadratic forms. Mathematika 5,122-124.
RIESZ, M. (1936 a): Modules teciproques. Comptes Rendus, Congr. Intern. des
Math., Oslo 2, 36-37.
ROGERS, C. A. (1947a): A note on irreducible star-bodies. Proc. Kon. Ned. Akad.
Wet. 50, 868-872 (= Indag. Math. 9, 379-383).
(1947b): Existence theorems in the geometry of numbers. Ann. of Math. 48,
994-1002.
(1947c): A note on a problem of Mahler. Proc. Roy. Soc. Lond. A 191,503- 517.
(1949a): The product of the minima and the determinant of a set. Proc. Kon.
Ned. Akad. Wet. 52, 256-263 (= Indag. Math. 11, 71-78).
(1949b): On the critical determinant of a certain non-convex cylinder. Quart.
J. Math. Oxford 20, 45-47.
(1950a): The product of n real homogeneous linear forms. Acta Math. 82,
185-208.
(1950b): A note on coverings and packings. J. Land. Math. Soc. 25, 327-331.
(1951 a): The closest packing of convex two-dimensional domains. Acta Math.
86, 309- 321-
(1951 b): The number of lattice points in a star-body. J. Land. Math. Soc.
26,307-310.
(1952a): The reduction of star-sets. Phil. Trans. Roy. Soc. Land. A 245,59-93.
(1952 b): Indefinite quadratic forms in n variables. J. Land. Math. Soc. 27,
314-319·
(1953a): Almost periodic critical lattices. Arch. der Math. 4, 267-274.
(1954a): The Minkowski-Hlawka Theorem. Mathematika 1, 111-124.
(1954b): A note on the theorem of Macbeath. J. Land. Math. Soc. 29,133-143.
(1954c): The product of n non-homogeneous linear forms. Proc. Land. Math.
Soc. (3) 4, 50 - 83.
(1955a): Mean values over the space of lattices. Acta Math. 94, 249-287.
(1955b): The moments of the number of points of a lattice in a bounded set.
Phil. Trans. Roy. Soc. Land. A 248,225-251-
(1956a): The number of lattice points in a set. Proc. Land. Math. Soc. (3)
6, 305- 320.
(19 57 a): A note on coverings. Mathematika 4, 1 - 6.
(1958a): Lattice coverings of space: the Minkowski-Hlawka theorem. Proc.
Land. Math. Soc. (3) 8, 447-465.
(1958b): Lattice coverings of space with convex bodies. J. Land. Math. Soc.
33,208-212.
(1958c): The packing of equal spheres. Proc. Lond. Math. Soc. (3) 8, 609-620.
(1959a): Lattice coverings of space. Mathematika 6, 33-39.
ROGERS, K. (1953 a): The minima of some inhomogeneous functions of two variables.
J. Land. Math. Soc. 28, 394-402.
342 References
Index
Commonly used symbols are listed first, followed by words and phrases in
alphabetical order
Izl viii LI (.9') 80
1I't1i 123 det('t) 123
11,11 318 e(x) = exp(2niX) 293
== (a ==b(k» 99£.n. F(A) 119
(Yl == YI(A» 194, 303 F(~) 195
{x} 207 L 124
[~ 164, 247 M(f) 26
Fm ..269 M+(j) 41
d(A) 10 m(S) 197
D(f) (for quadratic forms) 35 m(zo) 304
- (for cubic forms) 51 ptA) 305
g',$ 269 at, at/A 194
g .. 163, 246 V(.9') viii
b (F), il (F) 307 V(",) 74
d(F) 120
344 Index
Springer