Sparse Geometry
Sparse Geometry
1. Introduction
Let n ≥ 2 be an integer. For each x ∈ Rn , we write
n
!1/2
X
kxk = x2i , |x| = max |xi |
1≤i≤n
i=1
for the usual Euclidean norm and sup-norm on Rn , respectively. We also define the
0-norm on Rn :
n
X
kxk0 := x0i ,
i=1
where we use the convention that 00 = 0. The 0-norm counts the number of nonzero
coordinates of a vector, which we refer to as the sparsity level of this vector; if
sparsity level of some vector is no larger than m, we say that this vector is m-sparse.
Sparsity has been actively investigated in the context of compressed sensing, which
is a signal recovery paradigm based on the idea that most signals are sparse and can
therefore be reconstructed from a small number of linear measurements [6]. More
recently, the sparsity phenomenon has also been studied in discrete mathematics
and discrete geometry, in particular in the context of lattices [7], [2], [1]. In this
paper, we want to take a first stab at a systematic approach to what we see as a
“sparse analogue” of the classical geometry of numbers.
where f 1 , . . . , f k are integer vectors with relatively prime coordinates. Notice that
this decomposition is unique only if k = 1, for example
√ √ √ √
1, 2, −2 3 = 1 · (1, 0, 0) + 2 · (0, 1, 0) − 2 3 · (0, 0, 1)
√ ! √ !
2 √ 2 √
= 1 · (1, 0, 0) + − 3 · (0, 1, 1) + + 3 · (0, 1, −1) .
2 2
Then define
|α1 | if k = 1,
ν(x) :=
0 if k > 1.
For our basis matrix A, we define
n
Y
ν(A) := ν(ai ),
i=1
and for the lattice L = AZn , we let ν(L) = ν(A). This definition does not depend
on the choice of the basis matrix A. Clearly, there are many lattices for which
ν(L) = 0, for example ν(Λ1 ) = 0, where Λ1 is as in (1). In fact, it is not difficult
to show that ν(L) > 0 if and only if d(L) = n (see Lemma 3.1 below).
Second, let hLi be the additive abelian group generated by the entries of vectors
of L, and suppose that hLi has rank k ≥ 1. Fix a basis α = (α1 , . . . , αk ) for
hLi over Z, then every x ∈ L has a representation of the form (2) with vectors
f 1 , . . . , f k ∈ Zn . Define a map Φα : L → Rnk by
k
! f1
X ..
(3) Φα αi f i = . .
i=1 fk
The map Φα is additive, and hence extends to a map R ⊗ L = Rn → Rnk . We
can then pull back the sup-norm | · | on Rnk to Rn under Φα by defining |x|Φα :=
|Φα (x)|, this way obtaining a norm | · |Φα on Rn , which can then be compared to
the sup-norm on Rn . Specifically, we can define
|x|Φα n
(4) µ(α) := sup : x ∈ R \ {0} .
|x|
We can now state our first result.
Theorem 1.1. Let A ∈ GLn (R) and let L = AZn . Fix a basis α for hLi as above
and let µ(α) be as given in (4). Let 1 ≤ k < n and suppose that there exists a
subset I ⊂ [n] of n − k distinct indices such that dI (A) < n. Let ` = n − dI (A).
4 LENNY FUKSHANSKY, PAVEL GUERZHOY, AND STEFAN KÜHNLEIN
so the usual successive minima are λi (L) := λi (L, n). Then (5) is an upper bound
on the product of these k-sparse successive minima. We prove Theorem 1.1 in
Section 2. Our main tool here is the celebrated Siegel’s lemma, which is known
to be sharp with respect to the exponent (we state it below as Theorem 2.3). We
comment on the quality of the bound (5) at the end of Section 2. Unfortunately,
the upper bound of (5) depends on the choice of the basis for L and for hLi. We
can alleviate this dependence for lattices L with rational dimension d(L) = n. We
need some more notation.
Let us say that a lattice is rectangular if it has an orthogonal basis. Follow-
ing [11], we will say that a lattice is virtually rectangular if it contains a rectangular
sublattice of finite index. Two lattices L, L0 ⊂ Rn are called isometric if there
exists a real orthogonal matrix U such that L0 = U L; on the other hand, L, L0 are
called similar if there exists a real orthogonal matrix U and a positive real number
β such that L0 = βU L. In other words, similarity as a linear map is a composition
of an isometry and a dilation. It is easy to notice that the virtually rectangular
property is preserved under isometry (and under similarity), however the sparsity
levels, rational dimension and the irrationality measure ν are not necessarily pre-
served under isometry (they are preserved under dilation). In Section 3 we give the
following characterization of virtually rectangular lattices, using the invariants we
have just introduced.
Theorem 1.2. Let L ⊂ Rn be a lattice of full rank. The following three statements
are equivalent:
(1) d(L) = n,
(2) ν(L) > 0,
(3) s1 (L) = · · · = sn (L) = 1.
Further, a full-rank lattice L0 ⊂ Rn is virtually rectangular if and only if it is
isometric to some lattice L satisfying the three equivalent conditions above.
In Section 3 we also prove the following effective result and show it to be optimal
(Example 3.1).
Theorem 1.3. Let A ∈ GLn (R) be such that the lattice L = AZn satisfies the
equivalent conditions of Theorem 1.2. Then L contains a rectangular sublattice M
with a basis of 1-sparse vectors so that
n−1
det(L)
(6) [L : M ] = .
ν(L)
ON SPARSE GEOMETRY OF NUMBERS 5
In the 2-dimensional case our results imply a certain property of elliptic curves
over C. An elliptic curve E can be realized as a complex torus C/Λ for a planar
lattice Λ, called the period lattice of this curve. Given two elliptic curves, E and E 0
a morphism φ : E → E 0 between them such that φ(0) = 0 and φ(E) 6= {0} is called
an isogeny. It is a remarkable fact that an isogeny is always surjective and has a
finite kernel, the order of which is called the degree of the isogeny, denoted δ(E/E 0 ).
If an isogeny E → E 0 exists, then there also exists the dual isogeny E 0 → E of the
same degree such that their composition is simply the multiplication-by-δ(E/E 0 )
map, and hence the curves are called isogenous: this is an equivalence relation. An
injective isogeny is called an isomorphism, and the set of isomorphism classes of
elliptic curves over C is parameterized by
(7) D := {τ = a+bi ∈ C : −1/2 < a ≤ 1/2, b ≥ 0, |τ | ≥ 1}\{eiθ : π/2 < θ < 2π/3}
in the following way. For each τ = a + bi ∈ D we can define a lattice
(8) Γτ = Z + Zτ,
1 a
which can be thought of as Z2 in R2 . Then every elliptic curve is isomor-
0 b
phic to an elliptic curve Eτ with period lattice Γτ for some τ ∈ D. In fact, it is
more natural to identify the set of isomorphic classes of elliptic curves with the
quotient space of the upper half-plane under the action of SL2 (Z) by linear frac-
tional transformations, where D is a fundamental domain for this action. There is
a unique bijective holomorphic map j : D → C taking e2πi/3 to 0 and i to 1728,
called the Klein j-function, which is modular and gives the j-invariant j(τ ) for
each isomorphism class Eτ of elliptic curves. In [8] the relevant properties of the
j-invariant are outlined, and in particular it is noted that for τ ∈ D, j(τ ) ∈ R if
and only if τ belongs to the set
n √ o
1/2 + it : t ∈ R, t ≥ 3/2 ∪ eiθ : θ ∈ [π/3, π/2] ∪ {it : t ∈ R, t ≥ 1} ,
(9)
and j maps the first of these three subsets bijectively onto the interval (−∞, 0], the
second onto [0, 1], and the third onto [1, ∞) (see also Proposition on p. 160 of [10]
for an earlier appearance of this observation). With this notation, we can state the
following result which we prove in Section 4.
Theorem 1.4. Let τ = a + bi ∈ D and let Eτ be the corresponding elliptic curve
with the period lattice Γτ as above. The following statements are equivalent:
(1) Either a ∈ Q or there exists some t ∈ R such that a − bt, a + b/t ∈ Q,
(2) Γτ is virtually rectangular,
(3) Eτ is isogenous to an elliptic curve E 0 with real j-invariant ≥ 1,
(4) Eτ is isogenous to an elliptic curve E 0 with real j-invariant in [0, 1].
If these equivalent conditions hold with a ∈ Q, then there exists such an isogeny
E 0 → Eτ with δ(E 0 /Eτ ) = the denominator of a. If the conditions hold with a ∈
/Q
and t is any real number satisfying (1), then there exists such an isogeny E 0 → Eτ
6 LENNY FUKSHANSKY, PAVEL GUERZHOY, AND STEFAN KÜHNLEIN
with
|b|vw(t2 + 1)
(10) δ(E 0 /Eτ ) = ,
|t|
where v, w > 0 are denominators of the rational numbers a − bt and a + b/t, respec-
tively.
Our proof of this theorem uses Theorem 1.2. In particular, condition (1) of Theo-
rem 1.4 is equivalent to condition (1) of Theorem 1.2 in this 2-dimensional situation.
Further, (10) is just a reformulation of (6) in this case, since δ(E 0 /Eτ ) is precisely
the index of the rectangular period lattice of E 0 as a sublattice in the virtually
rectangular period lattice Γτ of Eτ . In the case when a = p/q ∈ Q, d(Γτ ) = 2.
We will refer to elliptic curves satisfying the conditions of Theorem 1.4 as virtu-
ally rectangular. The class of their period lattices includes all Γτ so that j(τ ) ∈ R
(see (9)). Further, this class includes all arithmetic planar lattices (see Lemma 2.5
of [11]), which are Γτ for τ ∈ D being a quadratic irrationality (see [8]): these are
τ = a+bi with a, b2 ∈ Q, which correspond precisely to elliptic curves with complex
multiplication (CM). We will discuss this situation in more details in Section 4, in
particular proving that CM elliptic curves are the only ones whose period lattice
contains non-parallel rectangular sublattices (Proposition 4.2 and Corollary 4.3):
in the CM case, there are infinitely many t satisfying condition (1) of Theorem 1.4
(each corresponding to a different rectangular sublattice), whereas for all other vir-
tually rectangular elliptic curves such t is essentially unique. Finally, in Section 5
we will show that virtually rectangular lattices in the plane have intrinsic geometric
meaning in terms of the corresponding points on the modular curve: they corre-
spond precisely to the points that lie on geodesics closed at infinity (Theorem 5.1).
Now assume the result for all 1 ≤ n − k < m ≤ n − 1, and let us prove it for
n − k = m. Let
I 0 = {i1 , . . . , im−1 }, so I = I 0 ∪ {im },
Tm−1
then dI (A) = d(I 0 ) + dm . Let V 0 = j=1 Vj and V = V 0 ∩ Vm . By induction
hypothesis,
dimQ V 0 ≥ n − d(I 0 ).
Since d(I 0 ) < dI (A) < n, this implies that dimQ V 0 > 0, and so V 0 6= {0}. Now, by
a well-known identity in linear algebra,
dimQ V = dimQ V 0 + dimQ Vm − dimQ spanQ {V 0 , Vm }
≥ (n − d(I 0 )) + (n − dm ) − dimQ spanQ {V 0 , Vm }
≥ n − dI (A) = `,
since spanQ {V 0 , Vm } ⊆ Qn , and so dimQ spanQ {V 0 , Vm } ≤ n.
T
n−k
This implies that dimQ j=1 Vj = ` > 0, and so there exist ` nonzero linearly
Tn−k
independent vectors y 1 , . . . , y ` ∈ j=1 Vj . These vectors are in Qn and satisfy
the equation AI y i = 0. Multiplying y 1 , . . . , y ` by the least common denominator
of their coordinates, we obtain linearly independent vectors x1 , . . . , x` ∈ Zn such
that AI xi = 0. This means that the vectors Ax1 , . . . , Ax` ∈ L have at least
n − k coordinates equal to 0. Since x1 , . . . , x` are linearly independent and A is
a nonsingular matrix, we must have Ax1 , . . . , Ax` linearly independent, and so
s` (L) ≤ k.
Remark 2.1. Notice that
dI (A) ≥ dimQ AI := dimQ {ai1 1 , . . . , aik n }.
The converse of Lemma 2.1 is not true: if s1 (L) = k, there may not exist any
I ⊂ {1, . . . , n} of cardinality n − k so that dI (A) < n. Indeed, consider the example
√ √
√1 √3 2√3
(11) A = √5 √3 2√ 3
2 3 5
and let L = AZ3 . Then s1 (L) = 1, since
0 0
A 2 = √ 0 √ ∈ L.
−1 2 3− 5
On the other hand, d(a1 ) = d(a2 ) = 2 and d(a3 ) = 3. Thus for any I of cardinality
3 − 1 = 2, dI (A) ≥ 4 > n = 3. Further, even dimQ AI here is at least 3. On the
other hand, notice for comparison purposes that if a lattice L = AZn is virtually
rectangular, then dimQ (A> A) ≤ n (see [11]).
For each row vector ai of A let di = d(ai ). Then there exist Q-linearly indepen-
dent real numbers αi1 , . . . , αidi such that
di
X
(12) ai = αij f ij ,
j=1
where f ij are integer vectors with relatively prime coefficients for all 1 ≤ i ≤ n,
Pn
1 ≤ j ≤ di . Let d = i=1 di and let F (A) be the d × n matrix with rows f ij .
8 LENNY FUKSHANSKY, PAVEL GUERZHOY, AND STEFAN KÜHNLEIN
Our next result relies heavily on the use of Siegel’s lemma (Theorem 2 of [4])
and its adaptation to sup-norm using Fisher’s inequality (equation (1.8) of [3]). We
state it here for the reader’s convenience.
Theorem 2.3 (Siegel’s lemma with matrix sup-norm). Let B be an m × n inte-
ger matrix of rank m < n. Then there exist n − m linearly independent vectors
z 1 , . . . , z n−m ∈ Zn such that Bz i = 0 for every 1 ≤ i ≤ n − m and
n−m
Y √ m
|z i | ≤ n|B| .
i=1
Lemma 2.4. Let A ∈ GLn (R), L = AZn and let α be a fixed basis for hLi. Let
1 ≤ k < n and suppose that there exists a subset I ⊂ {1, . . . , n} of n − k distinct
indices
1 ≤ i1 < i2 < · · · < in−k
Pn−k
such that dI (A) := j=1 dij < n. Let ` = n − dI (A). Then there exist ` linearly
independent vectors x1 , . . . , x` ∈ L with kxi k0 ≤ k and
`
Y
|xi | ≤ nn−dI (A)/2 |A|n µ(α)dI (A) .
i=1
Proof. Let L = AZn , where A is a basis matrix with rows a1 , . . . , an . We will prove
that (1) is equivalent to (2) and that (1) is equivalent to (3). First assume that
d(L) = n, then d(ai ) = 1 for each row ai of the basis matrix A. This means that
each ai = αi z i , where αi ∈ R \ {0} and z i ∈ Zn is a vector with relatively prime
coordinates, and so ν(ai ) = |αi |. Then
n
Y
ν(L) = ν(A) = |αi | > 0,
i=1
and so (1) implies (2). Further, this means that for any subset I ⊂ [n] of cardinality
n − 1, the linear system AI y = 0 has a nontrivial integer solution. Since such sets I
are of the form I = [n]\{i}, 1 ≤ i ≤ n, we can index corresponding integer solutions
by y i . Then each vector Ay i has only the i-th coordinate nonzero, and hence all
of such vectors are linearly independent (they are multiples of the standard basis
vectors). Therefore s1 (L) = · · · = sn (L) = 1, and so (1) implies (3).
Next assume ν(L) > 0, and let A be a basis matrix for L. Then
n
Y
ν(L) = ν(A) = ν(ai ) > 0,
i=1
which means that each ai is of the form ai = αi z i for some αi ∈ R \ {0} and
z i ∈ Zn a primitive vector. Hence d(L) = n, and so (2) implies (1).
Finally, suppose s1 (L) = · · · = sn (L) = 1. Then there exist linearly independent
vectors a1 e1 , . . . , an en ∈ L, where e1 , . . . , en are the standard basis vectors. Hence
there exists U ∈ GLn (Q) such that AU is a nonsingular diagonal matrix, which
implies d(A) = d(AU ) = n. Thus (3) implies (1).
Lemma 3.2. A lattice L is virtually rectangular if and only if it is isometric to
some lattice L0 with s1 (L0 ) = · · · = sn (L0 ) = 1.
Proof. Suppose that L contains a rectangular sublattice M , and let B be an or-
thogonal basis matrix for M . Then there exists a real orthogonal matrix U such
that U B is a diagonal matrix. Let M 0 = U M = U BZn be a sublattice of the lattice
L0 = U L. Since U B is diagonal, M 0 has a basis consisting of scalar multiples of the
standard basis vectors, and thus all successive sparsity levels of L0 are equal to 1.
Conversely, assume L is isometric to some L0 with successive sparsity levels
equal to 1, say L = U L0 for some orthogonal matrix U . Then L0 contains n linearly
independent vectors x1 , . . . , xn with kxi k0 = 1. These vectors must therefore be
constant multiples of standard basis vectors. Let M 0 = spanZ {x1 , . . . , xn }, then
M = U M 0 is a rectangular sublattice of L.
Then Theorem 1.2 follows by combining Lemmas 3.1 and 3.2. Next we prove
Theorem 1.3.
Proof of Theorem 1.3. Since d(L) = n, we must have d(ai ) = 1 for each row vector
ai of A. Then there must exist nonzero real numbers α1 , . . . , αn and primitive
integer row vectors f 1 , . . . , f n so that ai = αi f i for each 1 ≤ i ≤ n. Hence the
matrix F (A) with row vectors f i is n × n and A = AF (A), where A is the diagonal
matrix with diagonal entries α1 , . . . , αn . Let adj(F (A)) be the adjugate of F (A),
then adj(F (A)) is an integer matrix in GLn (Q), det(adj(F (A))) = det(F (A))n−1
and
F (A) adj(F (A)) = det(F (A))In ,
ON SPARSE GEOMETRY OF NUMBERS 11
j(E) and j(E 0 ) are algebraically dependent over Q. Then the upper bound on the
inequality (10) on Theorem 1.4 gives a bound on the degree of the field extension
[Q(j(E), j(E 0 )) : Q(j(E))].
at ∞ are never closed in Y . However, one may possibly argue that, for example,
a semicircle which meets the real line at two rational points is really closed at ∞
because both of its ends map to ∞ under π.
Theorem 5.1. A point on Y corresponds to a virtually rectangular lattice if and
only if this point belongs to a geodesic that is closed at ∞.
Proof. Let p ∈ Y , and assume that the corresponding lattice is virtually rectangu-
lar. Thus p = π(τ ) with τ = a + bi ∈ H, and the lattice Γτ is virtually rectangular.
Then the lattice Γτ has two orthogonal vectors, say, Aτ + B and Cτ + D where
A, B, C, D are integers and AD − BC 6= 0. The orthogonality condition can be
written as
(23) (a2 + b2 )AC + a(AD + BC) + BD = 0.
If A = 0, then BC 6= 0, and τ belongs to the vertical line x = −D/C. If C = 0,
then AD 6= 0, and τ belongs to the vertical line x = −B/A. Finally, if AC 6= 0
then (23) becomes an equation of the semicircle
2
(AD − BC)2
AD + BC
a+ + b2 =
2AC 4A2 C 2
satisfied by (a, b) with a rational radius of |(AD − BC)/2AC| and the center on
real line at x = −(AD + BC)/2AC ∈ Q. Thus in either case π(τ ) belongs to a
geodesic that is closed at ∞.
Conversely, assume that π(τ ) belongs to a geodesic which is closed at ∞. If this
geodesic is the image under π of a vertical line in H with a rational x-coordinate,
then τ = a + bi with a ∈ Q, and the lattice Γτ is virtually rectangular. Otherwise,
τ belongs to a semicircle which meets the real line at rational points, say α and β.
Pick σ ∈ SL2 (Z) such that σ(α) = ∞. Then σ(β) ∈ Q, and σ takes the semicircle
to the vertical line with a rational x-coordinate of σ(β) (Möbius transformations
preserve the Poincaré metric, therefore take geodesics to geodesics). Thus the
lattice h1, σ(τ )iZ is virtually rectangular, and this lattice is similar to Γτ . We thus
conclude that Γτ is virtually rectangular.
References
[1] I. Aliev, G. Averkov, J. De Loera and T. Oertel, Optimizing sparsity over lattices and semi-
groups. Lecture Notes in Computer Science, 2020.
[2] I. Aliev, J. De Loera, T. Oertel and C. O’Neill, Sparse solutions of linear diophantine equa-
tions. SIAM Journal on Applied Algebra and Geometry, 1(1), 239–253, 2017.
[3] R. Baker and D. Masser, Siegel’s lemma is sharp for almost all linear systems. Bull. Lond.
Math. Soc., 51(5), 853–867, 2019.
[4] E. Bombieri and J. D. Vaaler, On Siegel’s lemma. Invent. Math., 73(1), 11–32, 1983.
[5] H. Cohen and F. Strömberg, Modular Forms. A Classical Approach. Graduate Studies in
Mathematics, 179. American Mathematical Society, Providence, RI, 2017.
[6] Y. C. Eldar and G. Kutyniok, Compressed sensing: theory and applications. Cambridge
University Press, 2012.
[7] L. Fukshansky, D. Needell and B. Sudakov. An algebraic perspective on integer sparse recov-
ery. Appl. Math. Comput., 340, 31–42, 2019.
18 LENNY FUKSHANSKY, PAVEL GUERZHOY, AND STEFAN KÜHNLEIN
[8] L. Fukshansky, P. Guerzhoy and F. Luca. On arithmetic lattices in the plane. Proc. Amer.
Math. Soc., 145(4), 1453–1465, 2017.
[9] P. M. Gruber and C. G. Lekkerkerker, Geometry of Numbers. North-Holland Publishing Co.,
1987.
[10] M. Koecher and A. Krieg, Elliptische Funktionen und Modulformen. Springer-Verlag, Berlin,
1998.
[11] S. Kühnlein, Well-rounded sublattices. Int. J. Number Theory, 8(5), 1133–1144, 2012.
[12] T. Miyake, Modular Forms Modular forms, Translated from the 1976 Japanese original by
Yoshitaka Maeda. Springer Monographs in Mathematics. Springer-Verlag, Berlin, 2006.
[13] W. M. Schmidt, Diophantine approximations and Diophantine equations. Lecture Notes in
Mathematics, 1467. Springer-Verlag, Berlin, 1991.
[14] J. H. Silverman, The arithmetic of elliptic curves. Graduate Texts in Mathematics, 106.
Springer-Verlag, New York, 1986.
Institut für Algebra und Geometrie, Fakultät für Mathematik, KIT, FRG-76128
Karlsruhe
Email address: [email protected]