Pure FT
Pure FT
847
848 L. AUSLANDER AND R. TOLIMIERI
TABLE OF CONTENTS
MATHEMATICAL INTRODUCTION
the number
02iriab/n
v^
where ab denotes the product of the 2 numbers a and b. Thus
V2
1 1 1
F(3) = 1 -2m/3 ,2^2/3
V3 1 e2m2/Z e2m/3
1 1
e2iri/n e2iri2/n . , , e2m{n-\)/n
The main problem involving the finite Fourier transform is the following.
Given a complex valued function /(a), 0 < a < n, we want to compute the
function Y(b), 0 < b < n, given by
no) ƒ(<>)
= F(n)
y(«-i)J [/(n-i);
We call y(6) the finite Fourier transform of the function f(a) and we will
abbreviate this by Y = F(ri)f and sometimes denote F(n)f by f .
However, on books on Harmonic Analysis the finite Fourier transform is
defined in the following, apparently, different fashion. Let Z/n denote the
group of integers mod n. Let C denote the complex numbers and Cx(ri)
denote the multiplicative group of complex numbers e?mk/n9 0 < k < n, or
what is called the group of n roots of unity. Let Z/n denote the set of
homomorphisms of Z/n into Cx(w). We make Z/n into a group by defining
(â + b)(a) = â(a) • b(a), a G Z/n, â, b GZ/n,
X
where multiplication is in C (AI). The group Z/n is called the character group
of Z/n and is well known to be isomorphic to Z/n. We introduce the
notation b(a) = (a,^è), a G Z/n, b G Z/n. Let ƒ be a complex valued func-
tion on Z/n or Z/n. We definej/) 2 = 2 ö G < 7 f(à)f(a) where G is Z/n or
Z/n. We define L2(Z/n) or L2(Z/n) as the set of complex valued functions
on Z/n or Z/n with the above norm. We define the finite Fourier transform
F(n):L\Z/n)->L\Z?n)
by
(F(n)f )(b) = 1
(1)
V7t Û£Z//I
850 L. AUSLANDER AND R. TOLIMIERI
1. The Legendre symbol, quadratic reciprocity, and the trace of the finite
Fourier transform. One of the simplest invariants of a linear transformation is
its trace. In this section we will define the Legendre symbol (p/q) and show
that if Tr(F(n)) denotes the trace of the finite Fourier transform then
Tr(F( M ))
Œ)-q)\p) Tr(F(p))Tr(F(q))
where p and q are odd primes. Of course the celebrated result of Gauss on
quadratic reciprocity states that
(E\(l\= f - \ ) [ ( p -0/2][(<7-l)/2]^
This shows one of the connections of the finite Fourier transform with
classical mathematics. In order to carry out this program, we will have to
introduce a representation p of the group Z/nx of units (elements with
multiplicative inverse) in the ring Z/n on L2(Z/n).
We will now start this section with a discussion of the Legendre symbol.
Let p be an odd prime. Then Z/p is a field having p elements and Z/px
consists of the nonzero elements of Z/p and is a cyclic group of order/? — 1.
Let
S={£2|£EZ//?*}.
0<£<P teRp
852 L. AUSLANDER AND R. TOLIMIERI
where the bar denotes the complex conjugate. The resulting Hermitian inner
product space is denoted by L2(Z/n) and the n functions fa, a E Z/n, define
an orthonormal basis of L 2 (Z//î).
We will now describe the unitary representation p of Z/nx on L2(Z/ri).
Since Z/nx acting on Z/n by multiplication produces a group of automor-
phisms of the additive group Z/n, we may define for each a E Z/nx a linear
transformation p(a) of L2(Z/n) by setting
p(a)(f)s = f(as\ s E Z/n J E L 2 (Z//z).
l
Because p(ö)£ = fa~ a> where /,, a E Z/n, is the orthonormal basis defined
above, it follows that p(a) is a unitary operator on L2(Z/ri)9 a 6 Z / « X , If
U(ri) denotes the group of unitary operators on L2(Z/n) it is easily verified
COMPUTING WITH THE FINITE FOURIER TRANSFORM 853
that
p:Z/nx~*U(n)
is a group monomorphism.
We will now review briefly, and then extend, the material on the finite
Fourier transform that we gave in the Introduction.
Let C denote the field of complex numbers and let C,x denote the
multiplicative group of complex numbers of absolute value 1. A character on
Z/n is a group homomorphism X: Z/n^-±Cx. The set of all characters on
Z/n is denoted by Z/n. For À,, X2 e Z/n, we define
(Xj + X2)a = \x(a)\2(a), a E Z/n.
Then Z/n is a group, isomorphic to Z/n. For 0 < a < n, let Xa: Z/n -> C,x
be defined by
Aja) « e-2*ia*/*9 a e z / w .
Clearly Xa is a well defined character on Z/n and it is easily verified that Z/n
consists of the n characters Xa, 0 < a < n. Notice Xa E L 2 (Z/n) and
a,/} E Z//î- Hence the Aa, 0 < a < n, are an orthogonal basis of L2{Z/ri).
We can now define the finite Fourier transform F(n) of Z/n as the linear
mapping of L\Z/n) defined for ƒ E L\Z/n) by
thus
Hn?fa=f~« and F(n) 4 = ƒ
where ƒ is the identity mapping. Also
given by
1
m(e2*ifir/n)9 0< fry <n
V7l
Although the linear transformation F(ji) is represented by different matrices
relative to different bases we will denote the above matrix also by F(ri).
Since p(a) E U(n), a E Z/nx, and F(n) E U(n) we can form
F(n)p(a)F(n)~l. Because p(a)Xa = Xaa we have immediately that
F(n)p(a)F(n)-1 = p(ayl - pia-1).
We can now relate the above results to Lemma 1.1.1. and the results of
Gauss on quadratic reciprocity and the value of quadratic Gauss sums.
If Tr( ) denotes the trace of the linear transformation in the bracket we
have immediately that
Tr(F(rt))= 2 eW/n,
a£Z/«
Tr(p(a)F(n)) = 2 ^W/*
oez/n
Now let n and m be relatively prime positive integers and let y = am + fin,
0<a<n,0</3<m.
The Chinese remainder theorem implies that the set of all such 7 is a
complete residue system mod nm. Now let fa E L 2 (Z/«), 0 < a < n, and let
ffi E L2(Z/n), 0 < p < m, be the basis of L\Z/ri) and L\Z/m) as defined
above. Let x(fa>fp) = /«m+/to> where/Y, 0 < y < nm, is a basis of L2(Z/nm)
as above. Extend x t o a bilinear mapping of L2(Z/n) X L2(Z/m)-*
L2(Z/nm). Then x induces a linear map x*- L2(Z/n) ® L2(Z/m)~*
L2(Z/nm). It is easily verified that x* is an isomorphism. A straightforward
computation then shows that the tensor product of the linear operators
p(m)F(n) and p(n)F(m) satisfies
p(m)F(n) ® p(n)F(m) = F(nm). (2)
The result of Lemma 1.1.1 can now be stated as
p an odd prime. Since the trace of a tensor product is the product of the
traces of the factors, we have from equations (2) and (3) that
[^){^fTx{F(p))Tx{F{q)) = Tv(F(pq))
or
TtjFjpq)) (
m= Tr(F(p))TT(F(q))
and we have verified the result we stated at the beginning of this section.
The formula
^}
primes, (5)
COMPUTING WITH THE FINITE FOURIER TRANSFORM 855
is Gauss' celebrated formula for quadratic reciprocity. Formula (5) has many
elementary proofs; see for instance, Hardy and Wright [12]. But Gauss also
established the following fundamental result.
THEOREM 1.1.2. Let F(ri) denote the finite Fourier transform on L2(Z/n) and
let Tr(F(n)) denote the trace of F(n). Then
i +1 ifn = 0mod4,
1 ifn = 1 mod 4, _
Tr{F{n)) =
(6)
w
0 j f / i s 2 mod 4,
i ifn = 3 mod 4.
Although formula (5) has elementary proofs Theorem 1.1.2 has, to our
knowledge no elementary proof and seems to be much deeper than quadratic
reciprocity. In [13] there is an interesting discussion of the many proofs of
quadratic reciprocity.
In the next section we will relate the problem of computing Tr(F(/i)),
n > 0, to the multiplicity problem for F(n). However, before doing this, we
pause to show what insights elementary considerations can give us about
Theorem 1.1.2.
We begin by showing that Tr(F(2r)) = 0, r odd, is easily verified. For
Tr(F(2r))= 2 e 2 "* 7 * + 2 e2^+r)2/2r.
0<£<r 0<£<r
27rir/2 27rl( +r)2/2r
Because r is odd, e = - 1 and e * = -.\e2"*2/2r and we have
established that Tr(F(2r)) = 0, r odd.
Let/? be an odd prime. We will now show that Tr(F(4/?)) = 1 + / implies
[i up = 3 mod 4.
By (2)
Tr(p(4)F(p))Tr(p(p)F{4)) = Tr(F(4/>)) - 1 + /.
Lemma 1.1.1 gives
Tr(p(4)F(p)) - (4/p)Tt(F(p)) = Tr(F(/>))
as it is easy to verify that (4/p) = 1. Thus
1+i
Tr( F(p)) =
Tr(p(p)F(4)) •
But we can easily write out the four terms of the sum Tr(p(/?)F(4)) to prove
that
Tr(P(p)F(4)) = f l + l if
^lmod4>
I 1 — * if p = 3 mod 4.
This shows that
Tr(F(^)) = ( 1 ÎJ» =
= 1J mod 4,
11 if n = 33 mod 4.
856 L. AUSLANDER AND R. TOLIMIERI
2. Equivalence of the trace and eigenvalue problems for the finite Fouriei
transform. This section relates Theorem 1.2.2 or the computation of Tr(F(n)),
n > 0, and the multiplicity problem as defined in the Introduction for F(n).
Since the trace of a linear transformation is the sum of is eigenvalues, it is
clear that a solution of the multiplicity problem for F(ri) implies Theorem
1.1.2 or Gauss' result in quadratic Gauss sums. What is very surprising is that
knowing Tr(F(n)) for all n enables us to solve the multiplicity problem for
F(n). The proof rests on a simple observation that is at the heart of Schur's
proof of the computation of Tr(F(n)) (see [6, p. 351]). Schur observed that
f1 0 • • • 0]
0 0 • • 0 1
[ 0 1 0 - • 0j
and hence the characteristic polynomial of F\n) is given by
(,-l)< n + 1 ) / 2 (f + l) ( "- , ) / 2 , «odd,
K)
( , _ i)<«+2>/2(, + i)<-2>/2, nev en.
Since the eigenvalues of F2(n) are the square of the eigenvalues of F(n),
formula (7) yields both that the possible eigenvalues of F(ri) are ± 1, ±i and
the following result on multiplicity: If F(n) has mx eigenvalues 1, m2 eigen-
values - 1 , m3 eigenvalues i and m4 eigenvalues -i' then
n+1
2 , " odd,
mx + m2 = —-z—, m3 + m4
n +2
mx + m2 = —-—, m3 + m4 ——, n even. (8)
and
Tr(F(n)) = (mx - m2) + i(m3 - m4). (9)
Now let Tr(F(n)) = a + i/3 then (8) and (9) combine to yield for odd n
mx — m2= a, m3 — m4 = fi9
n+ 1 , n- 1
mx + m2 = —-—, m3 + m4 = —-—
and a similar set of equations for even n. Clearly, we can solve for mx, w2, m3,
m4 in terms of n, a, (i and hence once we have evaluated Tr(F(n)) we have a
complete solution for the eigenvalue problem. This shows that the trace
problem for F(ri) is equivalent to solving the eigenvalue problem.
Theorem 1.1.2 can now be stated in the following equivalent form.
THEOREM 1.1.2'. Let F(n) denote the finite Fourier transform on TL/n and let
mpj — 1, 2, 3, 4, denote the multiplicities of the eigenvalues 1, - 1 , i, -i of F{n\
respectively. The value of mJyj = 1, 2, 3, 4, as a function of n is given by the
following table.
COMPUTING WITH THE FINITE FOURIER TRANSFORM 857
n mx = 1 m2 = -1 m3 = i m4 = -i
Am m+ \ m m m— i
4m + 1 m+ 1 m m m \ '
4m + 2 m+ \ m+\ m m
4m + 3 m+ 1 m 4- 1 m+ 1 m
3. The algebra of the finite Fourier transform. Theorem 1.3.1 stated below
was first discovered and proven in [4] using nilpotent harmonic analysis and
its proof is independent of Theorem 1.1.2'. However the equivalence of
Theorems 1.1.2' and 1.3.1 is easily established and requires no nilpotent
harmonic analysis.
Let C[XX9 X2, X3] be the polynomial algebra in three inde terminants over
the complex numbers C and let C[XX, X29 X3] be the subalgebra generated by
9Lx"C[Xl9Xi9Xi]/(Xi + Xi)9
9t2 = C[Xl9 Xl Xl}/ (X% + X*X\ + Xl)9
3t3 = C | X Xl Xl}/ {Xl + XfXl),
where ( ) denotes the principal ideal in C[XV X29 X3] of the polynomial in the
bracket. Let Yx Y2b 733c, a, b, c e Z, a, b9 c > 0, denote the image in 9la,
a — 1, 2, 3, of the monomial Xx X2b X3C. Then it is easily established that
Y* ?? *33c> c = 0, 1, a9 b > 0,
is a vector space basis of the algebra 2Ia, a = 1, 2, 3. It is not difficult, using
elementary methods, to prove that
y6 i y6 y 6 i Ay 4 Ay 2 _i_ A y 6 y6 i y4 v2
^ 3 "*" A 2> A
2 "*" l 2 "^ 2> A
2 "*" A l A 2
are each irreducible in C[Xl9 X29 X\\ Hence each of the algebras 3ta, a =
1, 2, 3, has no divisors of zero.
THEOREM 1.3.1. Let 3ïa, a = 1,2, 3, be as defined above and let ^%a -* 9ta,
a = 1, 2, 3, be the linear transformation of 9ta such that
<5{Yxa Ylh Ylc) = ( - l)bieYf Ylb Ylc9 c = 0, 1; a, b > 0.
Lef F(«) 6e //*e sector subspace of 9la spanned by Yx Y2b Y\c where a + 2b +
3c = n and let ^(n) = ^ | V(ri), Then dim V(n) = n and ^(n) is equivalent to
the finite Fourier transform F{ri). Further
Since
2 dim Vx(4m) = dim V(4m)
we have dim V(4m) = Am and we have proven our result for n = 4m. The
other cases are proven in a similar way and the proof is omitted.
Thus Theorem 1.1.2' and F.4.1 are equivalent.
4. Direct solutions of trace and eigenvalue problems. Gauss' original proof of
Theorem 1.1.2 can be found in H. Rademacher [17]. Gauss' proof is algebraic
and very different from the proofs of his theorem that one finds in the usual
texts on elementary number theory. We will not summarize Gauss' proof, but
we do seriously suggest that all readers examine Rademacher's account of this
remarkable achievement.
We know two different analytic proofs of Theorem 1.1.2 and these are the
proofs most often found in elementary texts. Schur's proof was the first proof
that stressed the role of the finite Fourier transform and its eigenvalues.
Accordingly, we will begin with a brief discussion of Schur's proof. We will
follow it with McClellan and Park's proof of Theorem 1.1.2'. We will then
discuss the two analytic proofs of Theorem 1.1.2.
We will present Schur's proof of Theorem 1.1.2 only when n is an odd
prime p as this makes the discussion simpler, but exhibits all the most
interesting aspects of the method of proof. For those who want the whole
story, this can be found in [6].
As in §1.2, let ml9 m2, m3, m4 denote the multiplicity for F(p) of the
eigenvalues 1, - 1 , i, — /. In §1.2 we showed that
p+ 1 p- 1
m
\ + m 2 = 2 > m3 + m4 = —y—
and
Tr(F(p)) = ml - m2 + i(m3 - m4).
Schur's method of proof is simply to obtain enough relations amongst the m's
to enable them to be computed.
We begin with a result that shows that it is the sign that is the difficult part
of Theorem 1.1.2.
LEMMA 1.4.1. Let F(p) denote the finite Fourier transform on L2(Z/p),p an
odd prime. Then
iS p mo
TK^))-f;! f ^\ ^;
[ ± i if p = 3 mod 4.
PROOF. A multiplicative character X of Z/px is a homomorphism of the
multiplicative group Z/px into Cf. We extend X to a function on Z/p by
defining A(0) = 0. Now let A be a nontrivial multiplicative character of
Z/p x ; i.e., there exists £ e Z/p x such that A(£) =£ 1. View X as an element of
L\Z/p). Since the characters
K(0 = e***'*, a E Z/p,
determine an orthogonal basis of L2(Z/p) and <Xa, Xa) = p, we can write
860 L. AUSLANDER AND R. TOLIMIERI
P lez/p
An argument similar to that given in Theorem 1.1.1 gives
Tr(F(^))=^
V~p
2iez/p\P)
(fW*.
Thus, Tr(F(p)) = V 7 # i where
aGZ/p o6Z//)
E tfA(c*)= S *A«(*)
aeZ/j) aeZ/>
H
/?ez//> /*ez//>
Hence ac_£ = A(c)a£ for all /?. Hence |aj| = |ac|, where | | denotes the
absolute value, c ^ 0 mod/?. Thus
(\\y=p(p-l)\al\2 = p-l
and
|«,|-l/Vjp.
Since Tr(F(p)) = V^p a j when A = (/), we have
|Tr(F(/>))|=l.
It is an elementary fact that (-l/p) = ( - l ) ^ - 0 / 2 . By Theorem 1.1.1
Tr(F(/>))-= ( - l / ^ T r ^ ) )
where the bar denotes the complex conjugate. Hence
Tt(F(p))2(-l/p) = 1
or
Tr(F(P))={ll ^l™**;
{ ±i ifp = 3 mod4.
COMPUTING WITH THE FINITE FOURIER TRANSFORM 861
0<s,r</>
- n î?r+5 n 2/sin-^—-v.
0<s,r<p 0<s,r<p P
r = 2
Because S ^ ^ o + * P((P ~ 0/2) we have II r j r + ' = 1. Hence
2
— - = 2m2 + m3 — m4 mod 4.
From which we have
mx — m2 = mod 4 if/? = 1 mod 4,
m3 — m4 = mod 4 if/? = 3 mod 4.
This combines with our previous results to
[i if/? = 3 mod 4.
1.4.2. MCCLELLAN AND PARK'S PROOF OF THEOREM 1.1.2'. The proof of
Theorem 1.1.2' by McClellan and Park is interesting for three reasons. First, it
862 L. AUSLANDER AND R. TOLIMIERI
|>('J • • • *„(OJ
is nonsingular. If 8, i = 1 , . . . , w + 1, are all nonzero and alternate in sign,
then
\ <J>i('i)
*#i('«+i) 8 »+ij
is a nonsingular matrix.
We will now list certain elementary facts about the finite Fourier trans-
form. Proofs can be found in [16].
DEFINITION. A function ƒ on Z / « is called even if
ƒ(<*) = ƒ ( - « ) , aGZ/ii.
A function ƒ on TL/n is called odd if
ƒ ( « ) = -ƒ(-<*), aEZ/n.
2
1. For ƒ G L (Z/n), F\n)(f)(a) = /(-a).
2. Let [x] denote the greatest integer less than x. Then L2(Z/ri) has a
v = [n/2] + 1 dimensional subspace of even functions and an n — v dimen-
sional subspace of odd functions.
3. If ƒ is an eigenvector of F(ri), then ƒ is either an even or odd function.
4. Even eigenvectors have eigenvalues ± 1. Odd eigenvectors have eigenval-
ues ±i.
5. If ƒ is an even function, then F(ri)(f) + ƒ (F(ri)(f) — ƒ) is an eigenvector
of F(n) with eigenvalue 1 (-1). If ƒ is an odd function, then iF(ri)(f) -
COMPUTING WITH THE FINITE FOURIER TRANSFORM 863
We need to study
m
^éai(F(N)gi + gi)
i= 0
or
m
1=0
Since e~27rik/N = e2^N~k)/N the coefficients of fm, ...,f2m y i e l d m + l equa-
tions in m + 1 unknowns that can be written as
+
2 •
By reassociating the terms in the sum, we obtain
A(0)
"S ! / t ( 1 ) = I + " 2 e2**V" + * = VÏÏ Tr(F(n)).
2 2
A: = 0 * £= 1
Let 0 = f0 4- • • * +ƒ,_!. Then 0 is C' on [0, 1] and so, by Theorem F.l, we
have
4 = C e2™^1 dy.
«' — oo
One verifies that the above improper integral converges and that In = Vn tv
From all this we otain
Trcn«))-(i + r-)/0 + r')
which is another form of Theorem LI.2.
COMPUTING WITH THE FINITE FOURIER TRANSFORM 865
n— - c o
and/(0 satisfies the functional equation
*>-(ÎP£).
Clearly ƒ(/) diverges along the line Re(7) = 0. To find the relation between
Gaussian sums and the theta constant f{i), we examine/(0 in a neighborhood
of its line of divergence.
Let S(py q) = 2?^o e"***'*, (p> q) = 1. Set t = e + irip/q where e > 0.
Then
Yq r = 0 V 7 r- 0
Choosing # odd and/? = 2 yields Theorem 1.1.2 for w odd.
5. The finite Heisenberg groups and the finite Fourier transform. In this
section we will begin to discuss the role that nilpotent harmonic analysis plays
in the theory of the finite Fourier transform. The reason for the importance of
nilpotent harmonic analysis is that certain finite nilpotent groups have the
finite Fourier transform built into their structure. The first indications of this
are given in this section and will be looked at again in Chapter II.
866 L. AUSLANDER AND R. TOLIMIERI
where bar denotes complex conjugate. Let U(m) denote the group of linear
transformation of Cm such that for U G U(m) and c, d G Cm
<c,d> = <£/(c), t/(d)>.
U(m) is then called the group of unitary transformations of Cm. A unitary
representation p of a group G is a homomorphism of G into f/(m). A unitary
representation p is called irreducible if the only subspace of Cm invariant
under p(g), g G G, is Cm or 0. Also, p is called faithful if p is a monomor-
phism.
Let p t and p2 be unitary representations of G on Cm. We will say that px is
unitarily equivalent to p2 if there exists a unitary matrix U such that
U~lpx(g)U=p2(g), allgGG.
1/ is then called an intertwining operator for px and p2.
Let / be the identity matrix in U(m) and let x be a character on an abelian
group A. By x^ we mean the unitary representation of A defined by (xl)(à)
= x(à)I,a ^A-
Let G be a group and let z(G) denote the center of G.
THEOREM R.l. Let px and p2 be unitary representations of G and assume, in
addition, that px is irreducible. Let U be an intertwining operator for px and p2.
Then p2 is irreducible and U is unique up to multiplication by an element of C*.
THEOREM R.2. Let p be an irreducible representation of G and let z(G) denote
the center of G. Then z restricted to z(G), p/z(G) = x * I> where x *s a
character of z(G).
We will now state two results that are specific for the groups N(Z/n).
COMPUTING WITH THE FINITE FOURIER TRANSFORM 867
• •. '. '. 0|
0 1
1 0 OJ
2vic/
Let A = (a, 0, c) and let x(0, 0, c) = e ". Inducing x from A to NÇL/ri)
gives the following irreducible unitary representation of NÇL/ri) =
{(a, b, c)\a, b,c G Z/n)
Pl(z(N(Z/n))) = x • I,
Pl(a, 0, 0) = D„(a),
p,(0, b, 0) = (Sn)b.
2mc/n
Let B = (0, b, c) and let x(0, 0, c) = e . Inducing X from B to NÇL/A)
gives the following irreducible unitary representations of NÇL/ri)
p2(z(N(Z/n))) = x • /,
p2(a, 0, 0) = (Sn)a,
p 2 (0, b, 0) = Dn(b).
It is an elementary exercise to verify that p, and p 2 are irreducible unitary
representations of NÇL/ri) and if
1
F(n) (e 2iriab/n ), 0 < a,b < n,
\Tn
868 L. AUSLANDER AND R. TOLIMIERI
that
F(n)p2F(n)~l = p,
or F(n) intertwines px and p2. By the uniqueness of intertwining operators,
F(n) is essentially determined by the representations px and p2.
At this point the appearance of F(n) as an intertwining operator for unitary
representations of N(Z/ri) is totally unexplained. In §11.3 we will see another
way of looking at p, and p2 that better explains the finite Fourier transform's
role as an intertwining operator for px and p2.
6. A proof of Theorem 133, nil-theta functions and theta functions. In this
section we will outline a proof of Theorem 1.3.3 that uses harmonic analysis
on the real Heisenberg group and is completely independent of Theorems
1.1.2 or 1.1.2'. This proof shows the deep relation between the finite Fourier
transforms, F(n), n > 0, and the algebra of theta functions with periods 1 and
V^T . A complete exposition of this material can be found in Chapter II of
[1].
Let R denote the reals and Z c R, denote the integers. Let N = JV(R) be
the R-Heisenberg group and let T = N(Z) be the Z-Heisenberg group. Then
T c N and T \ N is a compact manifold. If (x,y, z) G N, x,y, z G R, then
the 3-form dx /\dy f\dz induces a probability measure on T \ N. We form
the Hubert space L 2 (r\N) and define a unitary representation U of N on
L\T \ N) as follows: For g G N, f G L 2 G (r \ N) define
(U(g)f)(Th)=f(Thg), hGN.
It will be convenient to consider functions T \ N as functions on Af such that
f(yh) = f(h), y G T, h G N. In general, if ƒ is a function on JV and yGiVwe
set
Zr{f){h)=f{rXh)
and call £ the left action. For each m G Z, let
H{m) = {ƒ G L\T \N)\f(x,y, z + /) = e2™%x,y, z)}.
One verifies that U(g)H(m) = H(m), g G N, and that
L2(T\N)= 2 @H{m)
mez
where the sum is the orthogonal sum.
We now want to better understand the spaces H(m). To do this we
introduce the automorphism
D-.N-+N
m
given by Nm(x9y, z) = (mx, >>, mz). By letting ƒ -» ƒ ° Dm we may use Dm to
induce a linear mapping of L2(T \ N). By a slight abuse of notation we will
denote this linear mapping also by Dm. Then
Dm{H{\)) c H{m).
One verifies that
m-l
H(m) = 2 eWM(/).(fl(l))).
y-o
COMPUTING WITH THE FINITE FOURIER TRANSFORM 869
0(m) = 2 ®®(mJ)
where @(mj) = 0(m) n #(m,;) = t{0J/mfi){Dm{&{\))) and 0(1) has basis
<p(x,y,z) = e2™ 2 e—^ + 'te™*.
/eZ
The relationship between the algebra 0 and J and the space â = 2 „ > 0 ©
L2(Z/n) and the finite Fourier transform extend to S by F = 2 „ > 0 © ^(«)
is given by the following theorem.
THEOREM 1.6.1. There exists a unitary operator V: 0-> â satisfying
(1) F(0(«)) - L 2 ( Z / H ) ,
( 2 ) F = F/K -1 .
It follows that 2- can be given the algebra structure of 0 and since
J(fifi) = JUxVUùJvh G @> we have
HSigi) = HsùHgiX Si> & G S.
It is also proved in [4] that 0 is isomorphic to
of 0(m). In [1] we show that <pmJ, 0 < j < m, is the required basis.
We can also verify this result using the ideas of §1.5. First, we define a
representation Ul of the Z/m-Heisenberg group N(Z/m) on @(w) as follows.
For ƒ E 0(m) and a, b, c E Z let
Ux(m)(a9 0, 0)/ = L{a/m^oyf9
^ ( m X a ^ O ) / ^ L(0,Vm,o>/,
f / ^ m X O ^ ^ ) / - e2"i(c/m)f.
It is not hard to see that with respect to the basis <pm7, defined above, that
Ux(m) is a unitary representation of N(Z/m) on 0(m) and that the matrix
Ux(m) = pj, pj as defined in the previous section. Also one verifies that
J~xUx{m)J = p2.
Since J(m) intertwines pj and p2 it follows that /(m) = cF(m) where |c| = 1.
One then verifies that c = 1 and we have our assertion.
where the given Fourier coefficients A(k) are complex and Wis the principal
iVth root of unity
W = el7Ti/N. (2)
2
A straightforward calculation using (1) would require N operations where
'operation' means, as it will throughout this note, a complex multiplication
followed by a complex addition.
The algorithm described here iterates on the array of given complex
Fourier amplitudes and yields the result in less than 2N log2 N operations
without requiring more data storage than is required for the given array A. To
derive the algorithm, suppose N is composite, i.e., N = rxrv Then let the
indices in (1) be expressed
J = J\r\ + Jo> Jo = 0, 1, . . . , rx-l, jx = 0, 1, . . . , r 2 -l,
k = kxr2 + k0, k0 = 0, 1, . . ., r 2 ~l, kx = 0, 1, . . ., /y-1. (3)
Then, one can write
since
The inner sum, over kx, depends only ony 0 and k0 and can be defined as a
new array,
There are N elements in the array Al9 each requiring rx operations, giving a
total of Nrx operations to obtain Av Similarly, it takes Nr2 operations to
calculate X from Av Therefore, this two-step algorithm, given by (6) and (7),
requires a total of
T=N(rl + r2)
operations."
Let us now formalize the steps of the Cooley-Tukey algorithm. We let ^ ( )
denote the complex functions whose domain is the set in the parens and
872 L. AUSLANDER AND R. TOLIMIERI
1 x F(r2)
Ç&fa xZfrJ^VIr^iZIrJ)
F(rxr2) M
1 x F(rt)
D'
^ ( Z / r j r j ) *• '$(Z/r2 x Z/r,) « f (Z/rj.^Z/r!))
where the "hat" denotes the dual group or the group of characters and where
we must still define the various mappings of the above diagram.
DEFINITIONS. THE MAPPING » . Let f(x9 y) E ^(X X Y). Then for each
fixed x0 E X, f(x0,y) E ^(Y) and so f(x,y) determines an element of
^(X, ^(Y)). It is obvious that this correspondence is 1-1.
THE MAPPING C. Between sets Z/rxr2 and Z/r, X Z/r 2 define the follow-
ing homeomorphism C*. (Notice: C* is not a group homomorphism.) For
0 < k < rxr2 let k = k2 + kxr29 where 0 < k2 < r2, 0 < kx < rx. Define
C*(/c) = (kX9 /c2) E Z/rx X Z/r 2 .
For ƒ E $(Z/rxr2) define C(/) = ƒ o (C*£ E ^(Z/r^X Z/r^.
THE MAPPING Z>. Between the sets Z/*rxr2 and Z/r 2 X zj}x define the
homeomorphism Z>* as follows: For 0 < k < rxr2 let k = k2rx + kv where
0 < k2 < r2, 0 < kx < rv Define
D*(k) = (k2, kx) E Z/r2 X z7>i.
For ƒ E €(Z/rxr2) define /)(ƒ) = ƒ o (Z)*)"1 E ^(z7r 2 X Zjrx).
THE MAPPINGS 1 X F(r2) AND 1 X F(r2). An element of ^(Z/rv <$(Z/r2))
determines an rptuple of elements of ^(Z/r 2 ). The mapping 1 X F{r^)
denotes applying the Fourier transform F(r2) to each of these r r tuple of
elements of ^(Z/r2y Define 1 X F(rx) similarly.
THE MAPPING M. For 0 < a < rx and 0 < b < r2 define
Define the action of Z/n on L2(Z/ri) as follows: For a,b E Z/n define
(T(a)f)(b)=f(b + a).
matrix representation
[0 1 0 ••• 0]
0 1 * . 0 '
I0 1 !
[ 1 0 ••• OJ
Since (T(\)F(n)fa)(â) = <a, î><a, â}/Vn , we have
{F(n)-ln\)F(n))fa = (a,\>.
Thus, using the isomorphism y defined above, we have F(nylT(l)F(n)(fa) has
the matrix representation
f e2mO/n 0 I
I 0 e2iri(n~l)/n\
This proves that GF(ri) has the matrix representation p2 of NÇL/n). This
argument shows the deep relation between the group NÇL/n) and the finite
Fourier transform F{ri). It also proves that
F(n)p2F(nyl = px
as it is easy to see that px(N(Z/n)) is the same as the matrix group generated
by T(â), âeZ/n, and F{ri)T{a)F(riy\ a G Z/n.
We now proceed to the task of characterizing the mappings C, D and M
discussed in the previous section.
Consider the Z/n-Heisenberg group, NÇL/n), n = rxr2, rx > 1 and r2 > 1
where
N(Z/n) = {(a, b, c)\a, b, c G Z/n).
Let T(r2, r t ) and r(r„ r2) contained in NÇL/n) be defined by
r(r 2 , r t ) - {(aV2, 6'^, 0)|n' G Z / r „ V G Z/r 2 },
r ( r „ r2) = {{b'rv a'r» 0)|a' G Z / r p 6' G Z/r 2 }.
An elementary computation shows that T(r2, rx) and r(r 1 , r2) are subgroups
of NÇL/n). Let T be a subgroup of NÇL/n). Consider the homogeneous space
T \ NÇL/ri) and give this finite set the measure where each point has measure
one. Form L2(T \ NÇL/ri)). Since NÇL/n) acts on T \ NÇL/ri) by
R(g)(Tn) - Tng, n, g G N(Z/n)9
R(g), g G N(Z/ri), defines a unitary representation R of NÇL/ri) on L2(T \
N(Z/n)) by
(H(g)(F))(rn) = FÇTng), F G L2(T \ N(Z/n)).
It will often be convenient to view functions on T \ NÇL/ri) as functions F
on NÇL/n) such that
*Xw)-*•(*)> yGr, g Giv(z/ w ).
COMPUTING WITH THE FINITE FOURIER TRANSFORM 875
W is the analogue of the Weil-Brezin map as defined in [19] and [7]. We will
now verify that W(f) G 3F(x, T(r 2 , rx)). It is clear that W(f) G ^(x). Hence it
remains to verify that
W{f){(ar2, brx, 0)(x,y, t)) = W(f)(x,y, t).
Now (ar2, brv 0)(x,y, i) = (ar2 + x, brx + y, t + ar2y). Thus the left side
876 L. AUSLANDER AND R. TOLIMIERI
above equals
e2m(t + ar2y)/nS£ /(j>2 + ar2 + X) e2™jr& + ^ "
*(R)- scr, z)
1 xF(Z)
9 (J x Z)
F(R) M
ff(Z x T)
1 xF(T)
>
D~ i
ff(RV •ff(Z,T)
Recall that as groups R is isomorphic to R, Z is isomorphic to T and T is
isomorphic to Z.
Define C : R -* [0, 1) X Z, where we identify T with [0, 1), as follows: If
JC G R, x = y + n, 0 < y < 1, n G Z, let
For the time being, let us proceed formally and interchange integration and
878 L. AUSLANDER AND R. TOLIMIERI
summation to obtain
where x = n + y, s = m + £.
We will now see how all this works rigorously.
Consider L2(R) with Haar measure dx and T X Z with Haar measure such
that the measure of T X 0 is one. It is easily seen that C: L2(R) -» L2(T X Z)
is a unitary operator; i.e., a norm preserving surjection. Assume that F(T):
L2(T)~>L2(Z) and F(Z): L 2 (Z)-* L2(T) are unitary operators. Since M
consists of multiplying each value of a function by a number of absolute
value 1, it is trivial to verify that M is a unitary operator. This shows that
D~l • 1 X F(T) • M- 1 X F(Z) • C: L2(R) -> L2(R)
is a unitary operator and shows that F(R) is a unitary operator.
We are left with the task of interpreting the interchange of integration and
summation used in deriving the integral formula for F(R). We will close this
chapter with a discussion of this process.
We may view
G(y, 9 - M - l X F(Z) • C(f) G L2(T X T).
As such, G(y, £) has a Fourier expansion
ƒ
We next note that if gm and g G L2(T) with lim^^^ gw = g in L2(T), then the
mth Fourier coefficient of gn converges to the mth Fourier coefficient of g.
This shows that if ƒ G L2(R), then
are more efficient than Cooley-Tukey. At the theoretical level, he has also
succeeded in [22] in defining the concept of essential multiplications or
divisions, or more briefly, essential m/d. This concept is important because
experience has shown that algorithms that minimize the essential m/d, or
minimal algorithms, possess interesting algebraic structures. We begin with a
formal definition of an algorithm that will permit us to define the concept of
essential m/d.
Let G be a field, called the field of constants, and let F = G{xx, — , xn) be
the purely transcendental field extension of G obtained by adjoining the
indeterminants xx, .. . , xn to G. We use Ü = {co^i = 1, 2, 3, 4} to denote the
field operations of addition, subtraction, multiplication and division, respec-
tively. For fx, f2 e F we will use wt{fl9 f2) to denote the result of applying the
binary operation co, to fx and f2 with the convention that
S , = ax + a6, ^2 = a
l *~ a6> $3 = ö
4 + a
3' ^4 = «4 " a
3'
a a
5 5 = a2 ~ a5, ^6 ^ 2 + 5> «S7 = S*! + 5 3 , Sg = 5 7 + S 6 ,
S9 = Ss + a0, *^10 =
*^1 "~ *^3' *^11 ~ *^3 "" *^5> *^12 ^ *^5 "~ S\,
^ 1 3 = = S2 "H o 4 , *$14 =
^ 1 3 "*" ^6» ^IS =
^ 2 "" ^ 4 ' ^16 =
^ 4 "" ^6>
s„- =
»$6 "~ *^2>
Let u = 2TT//7.
^ 4 == ^ 3 5 ' ^5 =
^34' ^6 =
^32-
where
P-I
Af= S <**kak, Kj<p-l. (3)
k— 1
Let {mv . . . , rrip^x} be the elements of (Z/p)x ordered so that mi • ny =
m^, where k = i-j mod/?. Then
=
2 «S 2 «"^ (here A: m> = ( m / )
and
2^«W = (2««W)(2>W)W').
But, multiplication on the right side is the same as multiplying the expressions
as polynomials in m and reducing modulo (mp~l-l). This directly relates the
finite Fourier transform to the group algebra C(Z//? — 1).
We may also present the above discussion in matrix language as follows:
Consider the matrix equation
a
rA\ ' \ 1
= («*) •
A* *p-l\
A* = fia.
The first row of the square matrix on the right is w 1 , . . . , cop~l. Thus there
is a permutation of columns of this square matrix so that the first row of the
resulting matrix is coa, . . ., o)aP . This permutation can be achieved by
multiplying the matrix (o)iJ) on the right by a matrix P. Noting that P~l = P',
where the superscript t denotes the transpose, we have
A* = tiPP'a.
Then the first column of fi is (co1, . . ., cop~ly. Forming
P'A* = P'tiPP'a
COMPUTING WITH THE FINITE FOURIER TRANSFORM 883
or <ov
«"-».
P'£2P = (0 (0
aP-
CO" <*T CO
Thus, if we let
Yi y\
= p<4*
P'J* = P'a = = y
1
P~\ yP-x
we have
y = P'ÏÏPy
which is called the cyclic convolution of (Yv . . . , Yp_x) and (yv ••Jt-l)-
It is now a straightforward computation to verify that
y, + y2w + + y p . 1 « /, "" 2 = (coal + (oa« + + <o«'~V>-2)
Ui + ^ - i « + ' • • +72^~2) m o d ^ - 1 - 1).
The above discussion shows that the m/d number of F(p), can be bounded
above by the m/d number of the right side of the above equation.
Let us now formalize the problem to which the above discussion has led us.
Let
A{x)y = (Ly)
We call A(x)y a system of bilinear forms. We wish to study the m/d number
of A(x)y.
Notice that the columns (rows) of all possible matrices A(x) form a vector
space C (R) over G. We define the G-column (G-row) rank of A(x) as the
dimension of the vector subspace of C (R) spanned by the columns (rows) of
A(x).
In this section F = G(x9y) and B = G \J {xl9.,., xn9 yl9... 9ym).
THEOREM III.3.1. Let A(x)y be a system of bilinear forms. If A{x) has G-row
rank s then this system of bilinear forms has m/d number greater than or equal
to s.
This theorem is actually much weaker than what is known to be true. Since
it is not harder to prove the more general result, we will prove it.
THEOREM 111.3.1', Let fv . . , , ƒ , E G(x9y) and let a be an N-step minimal
algorithm for computing f\, . • . , ƒ / . Then the m/d number offl9 . . . , ƒ / equals
the dimension of the vector space Wa defined as follows:
Let W* c G(x9y) be the vector subspace spanned by Oa(l)9 . . . , Oa(N) and
let L be the subspace of elements of the form 2 géxê + 2 hjyj + k9g9h9k EL G.
Define Wa = W* 0 L/L.
PROOF. Assume that ft = Oa(N). (A relabelling can always achieve this.)
Then let a! be the N — 1 step algorithm consisting of the first N — 1 steps of
a. Then a' is a minimal algorithm for f l 9 . . . , f„l9 aa(N), ba(N).
We will now prove Theorem III.3.1' by induction on N. Since a 1-step
algorithm that has no essential m/d computes an element of L, we have
proven the theorem for iV = 1.
By induction the theorem is true for a' and f{9 • . . 9ft^\9 aa(N)9 ba(N). Now
if the iV step of a is not an essential m/d step, a and a' have the same
number of essential m/d steps. But, clearly, W£ = W* and the theorem is
true.
If the N step of a is an essential m/d step Wa ^ W^ or else a is not
minimal. This again proves our result.
Theorem 111.3.1' implies Theorem III.3.1 once we observe that the row
rank of A(x) is the dimension of the vector space V spanned by 2 ci^x^j in
G(x9y). But V c W* and V n L = 0. This proves dim Wa > dim V.
Before going on to Theorems III.3.2 and III.3.3 let us pause to give an
application of Theorem III.3.1 to the problem posed in §111.2. Consider the
system of bilinear forms that are the coefficients of the product of the two
COMPUTING WITH THE FINITE FOURIER TRANSFORM 885
polynomials
This may be written in the form A(x)y, where A(x) has the special form
(assume a > b)
x0 0 0
•Xi Xr\ U
A(x)~
0
0 xa
where A(x) has 6 columns and a + b + \ rows and
PV
ƒ =
yb
It is easy to compute that the row rank of our special A(x) is a + b + 1.
Thus by Theorem III.3.1 the m/d number of A(x)y is greater than or equal to
a + b + 1. Later we will see that it is actually equal to a + 6 + 1.
THEOREM III.3.2. Le* ^(*).y be a system of bilinear forms. If A(x) has G
column rank s9 then the m/d number of A(x)y is greater than or equal to s.
Again it is no harder to prove a slight generalization of this and we will do
so. This is Theorem 1 of [22].
Consider G(xv . . . , * „ ) = G(x). Let <j> be a t X m matrix over G(x). We
shall use <j>l9..., <j>m to denote the columns of $. (Note. A(x) is a t X m
matrix over F.)
THEOREM III.3.2'. Let a be an algorithm computing <py over (G(x,y),B)
where B = G(x) U {yx, . . . ,ym}- If there are s vectors in {<J>l5.. . , <j>m] such
that no nontrivial linear combination of them with coefficients in G lies in the t
dimensional vector space G\ then a has at least s m/d steps of the following
form: w3(a(k), b(k)) with a(k) and b(k) & G or co4(a(A:), b(k)) with b(k) g G.
To relate Theorem III.3.2 and Theorem III.3.2', we must first show that if
A(x)y satisfies the hypothesis of Theorem HI.3.2 then A(x)y satisfies the
hypothesis of Theorem IH.3.2'. But a linear combination over G of the
columns of A(x) is in G' if and only if it is the 0 vector. This shows that our
first requirement is satisfied.
To see that the conclusion of Theorem III.3.2' implies the conclusion of
Theorem IH.3.2 merely note that the s steps guaranteed by Theorem HI.3.2'
are essential m/d for a.
886 L. AUSLANDER AND R. TOLIMIERI
2 hjyp some hj £ G.
Hence we must have at least one essential m/d in the algorithm.
Suppose the assertion holds for a = N. Assume $ is such that at least
N + 1 of the vectors {<j>v . . . , <ƒ>„} have no linear combination over G which
is in G'. Let a be a minimal algorithm computing <py and let k be the first
integer such that an essential m/d step of a occurs at step k. Then, either
<>«(*)-(2&*+/)-(2 V , + / )
or
S
O (k) = &y'+/
for gi9 ht G G, and ƒ,ƒ' G G{x). Furthermore, we may assume the labelling
has been done so that one of the ht ^ 0, for otherwise the k step of a would
not be an essential m/d step.
Clearly s(yn) = - ƒ - 2 /y>„ s the identity on yx u • • • \Jyn-x U G(x).
G(x) is a basis of substitution. However s(a) may not be an algorithm. Since
there are only a finite number of divisions in any algorithm the substitution
s(a) can fail to be an algorithm only when s* applied to some finite set
{rv . . . , rm) c G(x)[.y] is zero. Choose g G G so that
s*(rj) + g * 0, j G 1, . . . , M.
fL.L',1
1=0
fXoO — 0]
X& Xo
Xa Xb
(5)-*
0
0 ... ox,
890 L. AUSLANDER AND R. TOLIMIERI
By our presentation theorem, we have that there exist m^ . >. 9ma+b bilinear
forms where
m( « ( 2 aixi + 2 *jty)(2 a'iXi + 2 bfo)
and an (a + b + 1) X (a + b + 1) matrix U over G such that
m0
Xy = tf I !
m. la + b
The essence of this theorem is to prove that each
m=
' {4ox>a')[hkyjaJ)
V +
(£ ' *M
or
OT
, = «*«y*- (0)
As we discussed in §111.3, all the rows of X are linearly independent over
G. Hence the set of bilinear forms Xy span an a + b + 1 dimensional space.
Hence the set of bilinear forms Um span an a + b + 1 dimensional subspace
of the space of bilinear forms. This implies that U is nonsingular and so we
may let W = U ~l and write
WXy = m. (1)
Let (H>O, . . . , wj+j,) be the ith row of fF. Then substitution in (1) yields
^o
(2)
^
This implies that the bilinear form on the right side of (2) can be computed in
1 essential multiplication. By Theorem IH.3.2 of §111.3 the column rank on
the right side must be 1, or all the forms S^.Q wj+kxp & = 0 , . . . , 6, are all
(7-multiples of one. This implies that the matrix
w:
w; M>a+\
(3)
wk wat + b
has rank 1.
We claim this can happen only under two circumstances: either Wç —
0 , . . . , wj+fc-i = 0, w^+b =£ 0; or there exists a, (which may be zero) such
COMPUTING WITH THE FINITE FOURIER TRANSFORM 891
that
wj = «X, j = 0,. . . , a + b where a{ = (a,)7
(when 0° = 1).
We may verify this assertion as follows: We have two cases to consider.
Case 1. M>O = 0. Then since (3) has rank 1,
WQW2 — (w[) = 0 or wj = 0 (4)
and
l
wJwj - (wrf = 0 or w 2 = 0. (5)
We may proceed by induction to verify that w^ = 0, 0 < & < a + &— 1.
Since rank of (3) is 1, wla+b =£ 0.
Case 2. H>0 T^ 0. Let w[ = Z^JWQ and w2 = ^2w^. By (4)
^2 "~ ^1
and by (5)
1 ^ 3 "~~ A' i Or K"\ ~~* /C i •
w=
w.a + b
1
-a
x
+b
<*a + b a+b
or
„a + b
Wn
a.
W =
W,* + *>
„a+b
1 «a + * ^a + b
REMARK. We see that the Vandermonde matrix enters into every bilinear
minimal algorithm a.
We will now prepare ourselves to prove the following result. Let P = zn +
892 L. AUSLANDER AND R. TOLIMIERI
Cp =
o 1 «.-
The minimal polynomial of CP is P itself. This means that P(CP) is the zero
matrix and any other polynomial with this property is divisible by P. Now let
v G V,v¥=0. Consider the set § of polynomial Q such that
vQ(Cp) - 0.
Clearly S is an ideal, $ D P. Since CP is nonsingular z £ 5 and 5 ^ G[z].
Let (P) be the ideal generated by P. If B and P are relatively prime then the
ideal generated by B and P is G[x]. Since 5 ^ G[z], 5 c (P). Hence if Q is
not divisible by P, i>ö(C/>) 7^ 0 any v G V* and so Ô(CP) is nonsingular.
LEMMA III.4.2. Let VP = {v G F| /Aere ex/ste a polynomial Q of degree < n
and vQ(Cp) = 0}. Then dim Fp < /z.
PROOF. Let W = {t> <E FluP'-^Cp) = 0). Claim W = F P . Clearly WKc
VP. If t>ö(Q>) = 0, then by the above discussion
Q = P'g', Ö' relatively prime to P,
where / > r > 0 and Ô'(Q>) is nonsingular. But then vPr(CP) = 0 which
implies vPl~\CP) = 0.
LEMMA III.4.3. Let CP be the companion matrix to P(z). If
t =
'n-\
then the coefficients of z S"«o 4 Z ' m ° d P(z) are
CP(t).
This is essentially the definition of CP.
LEMMA III.4.4. Let R(z) = S?!»1 xizi and S(z) = 2"~ô ^ ^ ^ fp be the
system of bilinear forms that are the coefficients of R(z) • S(z) mod P(z). Let
TP = >4(x)y as in $1113. Then
A{x) = (X,CpX9...,C^lX)
where
COMPUTING WITH THE FINITE FOURIER TRANSFORM 893
X=
m
L, L)
Let V be an n-dimensional G-vector space, let F* be its dual space and let
w E V*. Then wA(x) =£ 0 because its first coefficient is
Hence wUm is not zero and so wU is not zero. Since w was arbitrary, this
shows that the rank of U is n. By reordering columns, if necessary, we may
assume that we have a nonsingular n X n matrix W since
WU=(I\U')
where / is the n X n identity matrix.
Let VP be as in Lemma III.4.2. Then, because W is nonsingular, there
exists a row of W, say the first, denoted by w, which is not in VP. Then
wA(x)y is a bilinear form and
wA(x)y = (1 0 . . . 0 u\ . . . uxr_n)m.
Thus the bilinear form on the left, above, can be computed using r — n + 1
multiplications. We now claim that the n columns of wA(x) are independent.
894 L. AUSLANDER AND R. TOLIMIERI
w
[%aiCH=0-
But w £ VP and the above contradicts Lemma III.4.2.
l k
COROLLARY III.4.6. Let P = P[\ . . . , P k where Pt are distinct irreducible
polynomials over G. Then TP can be computed in In — k multiplications and no
divisions.
This follows easily from the discussion at the beginning of this section.
Before coming to the final result to be proven in this paper, we would like
to remind the reader of the relation between the Chinese Remainder Theorem
and the rational form theorem for a matrix. __
Again let P = Pv . . . , Pk where Pt = P/1 and the Pt are distinct irreducible
polynomials in G[z]. Let CP be the companion matrix to P acting on V*. By
the Chinese Remainder Theorem
< ? [ * ] / * - 2 .®G(z)/P,
where equality denotes isomorphic. The rational canonical form theorem says
that there exists subspaces V* of V* such that
(a) V* = 2 0 Vf9
( b ) C P ( F ? ) - J?,
(c) Cp\Vf = CP,
where CP is the companion matrix to Pr
THEOREM III.4.7. Let P = Pv . . . , Pk where Pt = Pli and Pt are distinct
irreducible polynomials in G[z]. Further let the degree of Pé = nt and n =
2 ? . , nt. Let R(z) = 2 ^ x.z1 and S(z) = ^nr}ytz\ If T = RS then fP
cannot be computed with less than 2n — k essential multiplications and no
divisions.
We have already seen in Corollary III.4.6 that the m/d number is bounded
above by In - k. In the proof of the theorem we will need the following
purely technical lemma. We will state and prove this lemma below, but the
reader may prefer to skip directly to the proof of Theorem III.4.7 and return
to the lemma later.
LEMMA III.4.8. Let P = Pl9 . .., Pk be as in Theorem III.4.7 and let VP9
i = 1, . . . , k, be as in Lemma III.4.2. Let W be a nonsingular n X n matrix
and let Wl be the first nx columns of W, let W2 be the next n2 columns of W,
etc. Then each Wl has a row wl(j{i))J a function of i, such that w\j{î)) ^ VP.
Further, there exists a 1 X n matrix /? with nonzero entries only at the j(i),
COMPUTING WITH THE FINITE FOURIER TRANSFORM 895
2 V 0 ' ( 0 ) « VP(
i= i
Ax{x') 0 y1
A{x)~ y=
0 k
M* ). y\
Let A(x)y = Urn, by Theorem III.3.3. Then U is an n X t matrix over G
where / is the number of essential multiplications. Since all rows of A(x) are
linearly independent, the rank of U is n. Therefore, there exists an n X n
nonsingular matrix W such that
WU = (ƒ | U'), where I is the n X n identity matrix.
Applying Lemma III.4.8 to W and letting /? be as in Lemma III.4.8, we
consider the bilinear form
/3WA(x)y = p(I\U')m.
We claim that at least n multiplications are needed to compute this bilinear
form. We prove this by showing that the column rank of the left side above is
n. Let y = PW and consider every nontrivial linear combination of the
column of fiWA(x) = yA(x). Substituting CJPx* for they element oiAfa*) we
obtain this linear combination as
2Y/(2^)X'.
This vanishes only if ^ 2 ctyC^ = 0 for i = 1,. . . , k. But y, £ VP and so
cty = 0 for all i andy. Thus by Theorem III.3.2 it requires at least n essential
multiplications to compute yA(x)y. But /?(/\U') has at most k + t — n
nonzero coefficients. Hence
k + t ~ n> n or t > In — k
and our theorem is proven.
896 L. AUSLANDER AND R. TOLIMIERI
and so compute
R • S mod z6 - 1 = 2 Ôi(* mod />,.)($ mod /» ) mod z6 - 1.
COMPUTING WITH THE FINITE FOURIER TRANSFORM 897