Pierre Bremaud - Mathematical Principles of Signal Processing
Pierre Bremaud - Mathematical Principles of Signal Processing
Signal Processing
Pierre Bremaud
Mathematical Principles of
Signal Processing
Fourier and Wavelet Analysis
Springer
Pierre Bremaud
Ecole Polytechnique Federale de Lausanne
Switzerland
and
INRIAJEcole Normale Superieure
France
[email protected]
987 6 5 4 3 2 1
SPIN 10845428
ToMarion
Contents
Preface
xi
A Fourier Analysis in LI
Introduction
7
7
16
23
31
46
B Signal Processing
49
Introduction
51
BI Filtering
B 11 Impulse Response and Frequency Response
Bl2 Band-Pass Signals . . . . . . . . . . . . . .
55
55
68
23
26
31
39
43
viii
Contents
B2 Sampling
B21 Reconstruction and Aliasing .
B22 Another Approach to Sampling
B23 Intersymbol Interference .
B2-4 The Dirac Formalism . . . . .
75
75
82
84
88
95
95
100
109
B4 Subband Coding
B41 Band Splitting with Perfect Reconstruction .
B42 FIR Subband Filters
References. . . . . . . . . . .
115
115
120
126
Fourier Analysis in L 2
127
Introduction
129
Cl Hilbert Spaces
C11 Basic Definitions.
C12 Continuity Properties
C13 Projection Theorem .
133
145
145
155
155
159
161
Wavelet Analysis
133
136
139
150
161
163
166
167
Introduction
169
175
175
185
178
185
187
Contents
ix
195
195
202
211
04 Construction of an MRA
D41 MRA from an Orthonormal System .
D42 MRA from a Riesz Basis ..
D43 Spline Wavelets . . . . . . . . . . .
217
229
217
220
223
229
231
234
237
Appendix
239
241
Glossary of Symbols
263
Index
267
261
Preface
Fourier theory is one of the most useful tools in many applied sciences, particularly, in physics, economics, and electrical engineering. Fourier analysis is a
well-established discipline with a long history of successful applications, and the
recent advent of wavelets is the proof that it is still very alive. This book is an
introduction to Fourier and wavelet theory illustrated by applications in communications. It gives the mathematical principles of signal processing in such a way
that physicists and electrical engineers can recognize the familiar concepts of their
trade.
The material given in this textbook establishes on firm mathematical ground the
field of signal analysis. It is usually scattered in books with different goals, levels,
and styles, and one of the purposes of this textbook is to make these prerequisites
available in a single volume and presented in a unified manner.
Because Fourier analysis covers a large part of analysis and finds applications
in many different domains, the choice of topics is very important if one wants
to devise a text that is both of reasonable size and of meaningful content. The
coloration of this book is given by its potential domain of applications-signal
processing. In particular, I have included topics that are usually absent from the
table of contents of mathematics texts, for instance, the z-transform and the discrete
Fourier transform among others.
The interplay between Fourier series and Fourier transforms is at the heart of
signal processing, for instance in the sampling theory at large (including multiresolution analysis). In the classical Fourier theory, the formula at the intersection of the
Fourier transform and the Fourier series is the Poisson formula. In mathematically
oriented texts, it appears as a corollary or as an exercise and in most cases receives
little attention, whereas in engineering texts, it appears under its avatar, the formula
xii
Preface
giving the Fourier transfonn of the Dirac combo For obscure reasons, it is believed
that the Poisson sum fonnula, which belongs to classic analysis, is too difficult,
and students are gratified with a result of distributions theory that requires from
them a higher degree of mathematical sophistication. Surprisingly, in the applied
literature, whereas distribution theory is implicitly assumed to be innate, the basic
properties of the Lebesgue integral, such as the dorninated convergence and the
Fubini theorem, are never stated precisely and seldom used, although these tools
are easy to understand and would certainly answer many of the questions that alert
students are bound to ask. In order to correct this unfortunate tradition, which has
a demoralizing effect on good students, I have insisted on the fact that the c1assical
Poisson fonnula is all that is needed in signal processing to justify the Dirac
symbolism, and I have devoted some time and space to introduce the Lebesgue
integral in a concise appendix, giving the precise statements of the indispensable
tools.
The contents are organized in four chapters. Part A contains the Fourier theory
in LI up to the c1assical results on pointwise convergence and the Poisson sum
fonnula. Part B is devoted to the mathematical foundations of signal processing.
Part C gives the Fourier theory in L 2 . Finally, Part D is concemed with the timefrequency issue, inc1uding the Gabor transfonn, wavelets, and multiresolution
analysis. The mathematical prerequisites consist of a working knowledge of the
Lebesgue integral, and they are reviewed in the appendix.
Although the book is oriented toward the applications of Fourier analysis, the
mathematical treatment is rigorous, and I have aimed at maintaining a balance
between practical relevance and mathematical content.
Acknowledgments
Michael Cole translated and typed this book from a French manuscript, and Claudio Favi did the figures. Jean-Christophe Pesquet and Martin Vetterli encouraged
me with stimulating discussions and provided the illustrations of wavelet analysis. They also checked and corrected parts of the manuscript, together with Guy
Demoment and Emre Telatar. Sebastien Allam and Jean-Fran~ois Giovanelli were
always there when TEX tried to take advantage of my incompetence. To all of them,
I wish to express my gratitude, as well as to Tom von Foerster, who showed infinite
patience with my prornises to deliver the manuscript on time.
Gif sur Yvette, France
May 2,2001
Pierre Bremaud
Part
Fourier Analysis in L1
Introduction
ae
a2e
-=K-,
at
a2x
where e(x, t) is the temperature at time t and at loeation x of an infinite rod, and
is the heat eonduetanee. The initial temperature distribution at time 0 is given:
e(x,O) = f(x).
= L cne int .
neZ
Fourier claimed that his solution was general beeause he was eonvineed that alI21Tperiodie funetions ean be expressed as a trigonometrie series with the eoefficients
Cn
= cn(f) = 21T
2lT
lThe definitive form of his work was published in Theorie Analytique de la Chaleur,
Finnin Didot ed., Paris, 1822.
1 .
1 .
"',
true for - l ( < x < +l(. But the mathematicians of that time were skeptical about
Fourier's general conjecture. Nevertheless, when the propagation of heat in solids
was set as the topic for the 1811 annual prize of the French Academy of Sciences,
they surmounted their doubts and attributed the prize to Fourier's memoir, with the
explicit mention, however, that it lacked rigor. Fourier's results that were in any case
true for an initial temperature distribution that is a finite trigonometric sum, and be
it only for this, Fourier fully deserved the prize, because his proof uses the general
tricks (for instance, the differentiation rule and the convolution-multiplication
rule) that constitute the powerful toolkit of Fourier analysis.
Nevertheless, the mathematical problem that Fourier raised was still pending,
and it took a few years before Peter Gustav Dirichlet2 could prove rigorously,
in 1829, the validity of Fourier's development for a large class of periodic functions. Since then, perhaps the main guideline of research in analysis has been the
consolidation of Fourier's ingenious intuition.
The classical era of Fourier series and Fourier transforms is the time when the
mathematicians addressed the basic question, namely, what are the functions adrnitting a representation as a Fourier series? In 1873 Paul Dubois-Reymond exhibited
a continuous periodic function whose Fourier series diverges at O. For almost one
century the threat of painful negative results had been looming above the theory.
Of course, there were important positive results: Ulisse Dini3 showed in 1880 that
if the function is locally Lipschitz, for instance differentiable, the Fourier series
represents the function. In 1881, Carnille Jordan4 proved that this is also true for
functions of locally bounded variation. Finally, in 1904 Leopold Fejeii showed
that one could reconstruct any continuous periodic function from its Fourier coefficients. These results are reassuring, and for the purpose of applications to signal
processing, they are sufficient.
However, for a pure mathematician, the itch persisted. There were more and
more examples of periodic continuous functions with a Fourier series that diverges
at at least one point. On the other hand, Fejer had proven that if convergence is
taken in the Cesaro sense, the Fourier series of such continuous periodic function
converges to the function at all points.
2Sur la convergence des series trigonometriques qui servent arepresenter une fonetion
arbitraire entre des limites donnees, J. reine und angewan. Math., 4,157-169.
3 Serie di Fourier e altre rappresentazioni analitiche delle funzioni di une variabile reale,
Pisa, Nistri, vi + 329 p.
4Sur la serie de Fourier, CRAS Paris, 92, 228-230; See also Cours d'Analyse de l'lfcole
Polytechnique, I, 2nd ed., 1893, p. 99.
5Untersuchungen ber Fouriersehe Reihen, Math. Ann., 51-69.
Introduction
L J(n) = L
nEZ
jen),
nEZ
J(t)e-2irrvt dt
is its Fourier transform, where ~ is the set of real numbers. This striking formula
found very nice applications in the theory of series and is one of the theoretical
results founding signal analysis. The Poisson sum formula is the culrninating result
of Part A, which is devoted to the classical Fourier theory.
305-306.
8Convergence and growth of partial sums of Fourier series, Acta Math., 116, 135-157.
9S ur la distribution de la chaleur dans les corps solides, J. Ecole Polytechnique, 1geme
Cahier, XII, 1-144, 145-162.
Al
Fourier Transforms of Stable Signals
A 11
Fourier Transform in LI
This first chapter gives the definition and elementary properties of the Fourier
transform of integrable functions, which constitute the specific calculus mentioned
in the introduction. Besides linearity, the toolbox of this calculus contains the
differentiation rule and the convolution-multiplication rule. The general problem
of recovering a function from its Fourier transform then receives a partial answer
that will be completed by the results on pointwise convergence of Chapter A3.
We first introduce the notation: N, Z, Q, ~, C are the sets of, respectively,
integers, relative integers, rationals, real numbers, complex numbers; N+ and ~+
are the sets of positive integers and nonnegative real numbers.
In signal theory, functions from ~ to C are called (complex) signals. We shall
use both terminologies (function, or signal), depending on whether the context is
theoretical or applied.
We denote by L~(~) (and sometimes, for short, LI) the set offunctions f(t) 10
from ~ into C such that
If(t)1 dt <
00.
In analysis, such functions are called integrable. In systems theory, they are called
stable signals.
IOWe shall often use this kind of loose notation, where a phrase such as "the function
f(t)" means "the function f : lR ~ c." We shall also use the notation "I" or "fO" with
a mute argument. For instance, "f( - a)" is the function t --+ f(t - a).
Al.
1o
1
lA(t) =
1-+
if tE A,
ift
A.
The function I(t) is called locally integrable if for any closed bounded interval
[a, b] C IR, the function I(t)l[a,bj(t) is integrable. We shall then write
I
or, for short, I E Lloe'
The set of functions
L~ loe(lR)
I/(t)1 2 <
00
I/(t)1 2 dt.
The function I(t) is called locally square-integrable if for any closed bounded
interval [a, b] C IR, the function I(t)l[a,bj(t) is square-integrable. We shall then
write
L~,loe(lR)
Fourier Transform
We can now give the basic definition.
DEFINITION AI.I. Let s(t) be a stable complex signal. The Fourier transform (FT)
ofs(t) is thefunctionfrom:IR into C:
s(v) =
(1)
(Note that the argument of the exponential in the integrand is -2i7rvt.) The
mapping from the function to its Fourier transform will be denoted by
s(t) ~ s(v)
or
s(t - to)
~ e- 2ill'vtos(v)
e2i1rvot s(t)
s(v - vo)
s(at)
Fr
--+
-IsA{V}
lai
a
AISI (t)
+ A2S2(t)
~ AISI(V) + A2 S2(V)
~
s*(t)
s(-v)*
EXERCISE
EXERCISE
+ Vo )).
(2)
AI.4. Show that the FT of a real signal is Hermitian even, that is,
s(- v) = s(v)*.
Show that the FT of an odd (resp., even; resp., real and even) signal is odd (resp.,
even; resp., real and even).
EXERCISE
s(v)
Hs(v
-vo
+ vo) + s(v -
vo)}
+vo
10
Al.
sinc (x)
sin(Jrx)
= --Jrx
~ Tsinc (vT).
(3)
T
-T/2
+T/2
Tsinc(vT) = recT(v)
reCT(t)
(4)
In order to compute the corresponding Fourier integral, we use contour integration
in the complex plane. First, we observe that it is enough to compute the Fr s( v)
for v 2: 0, since this Fr is even (see Exercise A1.4). Take a 2: v (eventually, a
will tend to 00).
Consider the rectangular contour Y in the complex plane (see Fig. A1.3),
Y
= Yl + Y2 + Y3 + Y4,
v
')'4
-a
+a
A 11 Fourier Transform in L I
We denote by
Wehave
-Yi
11
e- rrz2
+ /z + h + 14,
dz = 11
where li is the integral of e- rr Z2 along Yi. Since the latter integrand is a holomorphic
function, by Cauchy's theorem (see, for instance, Theorem 2.5.2, p. 83, of [Al],
or Theorem 2.2, p. 101, of [A6]),
e- rrz2
= 0,
dz
and therefore,
+ /z + h + 14 =
O.
a~oo
= a-+oo
!im 14 = o.
then
/z
i dt
e-rr(a+itf
l/zl .::::
=
e-rr(a-t)(a+t)
e- rra
21o
errat
e-rr(a2-t2)e-2irrat
dt .::::
e-rra(a-t)
1
dt = -(1
_
dt
e- rra 2 ),
Jra
a-HXl
i dt.
+ h) =
that is,
(5)
JYl
lim
a--+oo
j+a e- rrt2
dt
= { e- rrt2 dt = 1.
-a
JJ!I!.
+ t; -a .:::: t
.:::: +a},
12
Al.
wehave
= l+ a e-n(iv+tf dt
-a
-)'3
Therefore,
The Fr of the Gaussian pulse can be obtained by other means (see Exercise
A 1.16). However, in other cases, contour integration is often necessary.
Using contour integration in the complex plane, we show that, for a > 0,
= e-at IlR+(t)
s(t)
Fr
s(v)
A
00
.
e- 2znvt
e- at dt
1
2mv + a
= .
=
1
2inv + a
1
1
00
1
.
a+2mv
(6)
.
e-2znvt-a\2inv
+ a) dt
e- Z dz.
(The reader is refered to Fig. A1.4 for the definition ofthe lines y, YJ. Y2, and Y3.)
Therefore, it suffices to show that
By Cauchy's theorem,
e-Z dz
Yl
e- z dz
e- Zdz
Y2
= 1.
+
e- Zdz
)'3
= 0.
A 11 Fourier Transform in LI
13
2i7f1/
I{
Convolution-Multiplication Rule
THEOREM
AI.I. Let h(t) and x(t) be two stable signals. Then the right-hand side
oJ
y(t)
=1
(7)
h(t - s)x(s) ds
is defined Jor almost all t and defines almost everywhere a stable signal whose FT
is y(v) = h(v)x(v).
Proof"
00.
00.
The integral IIR h(t - s )x(s) ds is therefore weIl defined for almost aIl t. Also,
1,y(t)' dt
111
h(t - s)x(s)
dsl dt
ds <
00.
14
Al.
=L
Lh(t - s)e-2irrv(t-s)x(s)e-2irrvs ds dt
= h(v)x(v).
* x)(t).
Fr
* x)(t) --+
h(v)x(v).
A
(8)
EXAMPLE AI.L
The convolution of the rectangular pulse reeT (t) with itself is the
triangular pulse of base [- T, + T] and height T,
TriT(t) = (T - Itl)1[-T,+T](t).
~ (Tsine (vT)f
(9)
funetion
c(t)
=L
x(s
+ t)x*(s) ds
Ix(v)1 2
T2
-T
&
I
I
I
+T
C'>~
I
~C'>
I
. 1
j
I
-~ -~ -~
TriT(t)
15
a > 0, is
fM(t)
(/*3
= f * f * f,
t n- 1
= (n -
I)!
e- at I t>o(t).
-
= tne-atlt~o(t).
Riemann-Lebesgue Lemma
Ivl-+oo
= O.
(10)
Proof' The Fr of a rectangular pulse s(t) satisfies Is(v)1 ::; K/lvl [see Eq. (3)].
Hence every signal s(t) that is a finite linear combination of indicator functions
of intervals satisfies the same property. Such finite combinations are dense in
Lb(l~) (Theorem 28 of the appendix), and therefore there exists a sequence sn(t)
of integrable functions such that
n-+oo
JIR
=0
and
Kn
we deduce that
::; -K n +
lvi
IR
16
Al.
THEOREM
g: [a,b]
f-*
lim l
....... 00
b
a
uniformly in x.
Proof
r:rr
=l
=-
COS(AU) Ib
hex - u)g(u) - A
a
+l
[hex - u)g(u)]
COS(AU)
du.
A
Since h E Cl and is periodic, h and h' are uniformly bounded. The same is true of
g, g' (g is in Cl). Therefore,
lim /(A)
....... 00
=0
uniformly in x .
Now,
11
fex - U)g(U)Sin(AU)dUI
:S I/(A)I
+l
:S I/(A)I
:S I/(A)I
max Ig(u)IE:.
a:'Ou:'Ob
AI 2
Inversion Formula
EXERCISE
Despite the fact that the FT of an integrable signal is uniformly bounded and
uniformly continuous, it is not necessarily integrable. For instance, the FT of the
rectangular pulse is the cardinal sine, a non-integrable function. When its FT is
integrable, a signal admits a Fourier decomposition.
AI 2 Inversion Fonnula
17
THEOREM At.4. Let set) be an integrable complex signal with the Fourier
transform s(v). Under the additional condition
Is(v)1 dv <
(11)
00,
s(v)e+2iJrvt dv
(12)
holdsfor almost all t.lf set) is, in addition to the above assumptions, continuous,
equality in (12) holds for all t.
At.ll. Check that the above result is true for the signal
(a E lR, a > 0, a E C).
Proof' We now proceed to the proof of the inversion formula. (lt is rather technical and can be skipped in a first reading.) Let set) be a stable signal and consider
the Gaussian density function
with the Fr
We first show that the inversion formula is true for the convolution (s
Indeed,
(s
* h u )(t).
2u 2
(13)
= (
J~
s(u)hu(u) ( { e
J~ ~;;r
(t)e-2iJrvt dt) du
J~
= {( { s(u)hu(u)e
J~ J~
,U
;;r
= JR
{ s(u)hu(u)e Zc;2' -'<-(t)du
I
(12
= (s
* h u )(t).
18
Al.
Thus, we have
(s
* hu)(t) =
(14)
s(v)hu(v)e2i1rvt dv,
* h u )(t).
Since for all v E IR, limu-l-o v t hu(v) = 1, it follows from Lebesgue's dominated convergence theorem that when u ..I- 0 the right-hand side of (14) tends
to
s(v)e2i1rvt dv
for all t E IR. If we can prove that when u ..I- 0 the function on the left-hand side of
(14) converges in L~(IR) to the function s(t), then, for almost all t E IR, we have
the announced equality (Theorem 25 of the appendix).
To prove convergence in L~(IR), we observe that
L*
I(s
hu)(t) - s(t)1 dt =
L*
Is
LIL
(s(t - u) - S(t))hu(U)dul dt
f(u) =
iJR Is(t
- u)-
f(u)hu(u) du.
Now, If(u)1 is bounded (by 2 iJR Is(t)1 dt). Therefore, iflimu-l-o f(u)
dominated convergence,
lim [ f(u)hu(u) = lim [ f(uu)ht(u)du =
uwk
(15)
uwk
= 0, then, by
o.
(16)
where
d(s( - u), sn(' - u))
L
=L
ISn(t - u) - sn(t)1 dt
+ d(s(), snO),
19
Suppose that, in addition, set) is continuous. The right-hand side of (12) defines
a continuous function because s(v) is integrable. The everywhere equality in (12)
follows from the fact that two continuous functions that are almost everywhere
equal are necessarily everywhere equal (Theorem 8).
= 0, which is integrable,
EXERCISE AI.12. Give the FT of set) = 1/ A(a 2 + t 2). Deduce from this the value
of the integral
f ~dU,
t > 0.
J.R.t+u
EXERCISE Al.13. Deduce from the Fourier inversion formula that
l(t)=
L(Si~(t) Y
dt = Jr.
Exercise 1.14 is very important. It shows that for signals that cannot be called
pathological, the version of the Fourier inversion theorem that we have in this
chapter is not applicable, and therefore we shall need finer resuIts, which are given
in Chapter A3.
EXERCISE AI.14. Let set) be a stable right-continuous signal, with a limit from
the left at all times. Show that if s(t) is discontinuous at some time to, its FT cannot
be integrable.
Regularization Lemma
In the course of the proof of Theorem A1.4, we have used a special case of the
regularization lemma below, which is very useful in many circumstances.
lim
a'\-O
LEMMA
L
+a
-a
ha(u) du = 1,
forall a > 0,
= 1,
limha(u) = 1,
a'\-O
forall a > 0,
ha(u) du
forall u
IR.
Then
lim
a'\-O
JIR
I(s
* h a )(t) -
s(t)1 dt
= 0.
20
Al.
Proof" We ean use the proof of Theorem A1.4, starting from (15). The only
plaee where the speeifie form of h" (a Gaussian density) is used is (16). We must
therefore prove that
lim ( J(u)h,,(u)
"-1-0 JIR
=0
independently. Fix e > O. Sinee limuto J(u) = 0, there exists a = aCe) such that
h,,(u) du :s -.
1-a~ J(u)h,,(u) du :s -2el~
-a
2
The last integral is, for suffieiently small a, less than e12M. Therefore, for
suffieiently small a,
JIR J(u)h,,(u) du :s
2" + 2" = e.
c2,
JIR
The last equality is symbolie and defines the Dirae generalized funetion (see Seetion B24). The first equality is obtained as in the proof of the previous lemma,
this time letting J(u) = <p(u) - <p(0).
Differentiation in the Frequency Domain
We shall see how differentiation in the time domain is expressed in the frequeney
domain.
A1.S. (a) Ifthe integrable signal set) is such that tks(t) E LU~)Jor
alt 1 :s k :s n, then its FT is in Cn , and
THEOREM
(-2imls(t) ~ s(k)(v)
(b) IJ the signal set) E
integrable, then
cn and if it is,
Proof"
Joraltl:S k:s n.
together with its n first derivatives,
Jor altl :s k :s n.
e-2invIs(t)dt,
21
we can differentiate k times under the integral sign (see Theorem 15 and the
hypothesis t ks(t) E Lb(IR)) and obtain
s(k)(v)
(b) It suffices to prove this for n = 1, and iterate the result. We first observe that
limlaltoo s(a) = O. Indeed, with a > 0, for instance,
s(a)
= s(O) +
s'(t) dt,
La
JIR
= (e-2irrvts(t)):: + i:a(2i:n:V)e-2irrvtS(t)dt.
00
EXERCISE AI.IS.
EXERCISE AI.16.
aB
at =
a2B
K a2x'
(17)
where B(x, t) is the temperature at time t and at location x of the rod with heat
conductance K, and with the given initial temperature distribution
B(x, 0) = f(x).
We assume that
(18)
is integrable.
Let ~ 1-* 8(~, t) be the Fr of x 1-* B(x, t). (We take different notations because
the variable with respect to which the Fr is taken is not the time variable t but the
space variable x.) In the Fourier domain, Eq. (17) becomes
d 8(c t)
--:-,"-'= -K(4:n:2~2)e(~, t),
dt
22
Al.
0) = F(H
= F(~)e-4n2K~2f.
1-+
or
8(x, t)
..Jrr
fex - 2-JKiy)e-yl dy .
As we mentioned earlier, Fourier considered the finite rod heat equation, which
receives a similar solution, in terms of Fourier series rather than Fourier integrals
(see Chapter A2). The efficiency of the Fourier method in solving differential or
partial differential equations of mathematical physics has been, after the pioneering
work of Fourier, amply demonstrated 12 .
12See, for instance, the classic text of 1. N. Sneddon, Fourier Transfonns, McGraw-Hill,
1951; Dover edition, 1995.
A2
Fourier Series of Locally Stable
Periodic Signals
A21
Fourier Coefficients
A periodic signal is neither stable nor of finite energy unless it is almost everywhere
null, and therefore, the theory of the preceding Chapter is not applicable. The
relevant notion is that of Fourier series. (Note that Fourier series were introduced
before Fourier transforms, in contrast with the order of appearance chosen in this
text.) The elementary theory of Fourier series of this section is parallel to the
elementary theory of Fourier transforrns of the previous section. The connection
between Fourier transforrns and Fourier series is made by the Poisson sum formula,
of which we present a weak (yet useful) version in this chapter.
A complex signal s(t) is called periodic with period T >
for all t E ~,
s(t
+ T) =
s(t).
Is(t)1 dt <
Lb([O, Tl),
00.
Is(t)1 2 dt <
00.
E L~([O,
24
One also says in this case that s(t) hasfinite power, since
lim
A-+oo
..!.. {A
A
10
Is(t)12
=..!..
T
(T Is(t)12 dt <
10
00.
As the Lebesgue measure of [0, T] is finite,L~([O, Tl) c Lt([O, Tl). (See Theorem 19 of the appendix.) In particular, a finite-power periodic signal is also locally
stable.
We are now ready for the basic definition.
DEFINITION A2.I. The Fourier transform {sn}, n
periodic signal s(t) is defined by theformula
Sn
= -I
n dt,
s(t)e- 2'lJr TI
(19)
EXERCISE
EXERCISE
show that the nth Fourier coefficient Sn ofs(t) and the FT i:;(v) OfST(t) are linked
by
(20)
EXERCISE
A2.4. Let s(t) be a T -periodic locally stable signal with nth Fourier
coefficient Sn. Show that limlnltoo Sn = O.
EXERCISE
One often represents the sequence {Sn}nEZ of the Fourier coefficients of a Tperiodic signal by "spectrallines" separated by 1/ T from each other along the
frequency axis. The spectralline at frequency n / T has the complex amplitude Sn
(see Fig. A2.1). This is sometimes interpreted by saying that the FT of s(t) is
s(v)
= I)n(V nEZ
f)'
25
THEOREM
y(t)
(21)
h(t - s)x(s)ds
is almost everywhere weil defined, T -periodie, and locaily stable. Its nth Fourier
coefficient is
Yn
A
= hA(n)
T x n,
(22)
We have
L1h(t - s)llx(s)1 ds =
where
hT(u)
=L
+ nT).
nEZ
Now
x(v)
I -q,I I I I I I
-~
-~
123
fI(v)
"
"
26
and hence by the usual argument (see the proof of Theorem A 1.1),
00,
+ T) =
=
L
L
x(t
+T -
s)h(s)ds
x(t - s)h(s)ds,
which shows that y(t) is periodic with period T. The same argument as in the proof
of Theorem AU shows that y(t) is locally stable. FinaIly,
l
=- l 1
Yn = -1
1
T
n dt
y(t)e- Z'l1!'j't
n dt ds
hT(t
- s)x(s)e- Z'l1!'j't
A22
Inversion Formula
In the proof of the Fourier series inversion formula, the Poisson kernel will play
a role similar to that of the Gaussian pulse in the proof of the Fourier transform
inversion formula of the previous seetion.
The Poisson kernel is the family of functions Pr : lR
Pr(t)
f-+
rlnleZin ~t.
nEZ
= LrneZin~t + Lrne-zin~t_l
n~O
n~O
(23)
27
and therefore,
Pr(t) :::: O.
(24)
Also,
T 2
-1 I+ / Pr(t) dt
T -T/2
(25)
1.
In view of the above expression of the Poisson kernel, we have the bound
1 [
Pr(t)dt <
T [-t,+t]\[-e,+s]
-
(1 - r 2 )
2
1 1 _ e 2i1ry 1
'
= O.
(26)
-1
1
+t
-t
cp(t)Pr(t) dt
= cp(O),
for all bounded, continuous cp : ffi. -+ <C (same proof as in Lemma ALl).
The following result is similar to the Fourier inversion formula for stable signals
(Theorem A1.4).
THEOREM A2.2. Let set) be aT -periodie localty stable complex signal with Fourier
coefficients {sn}, n E Z. lf
(27)
then, for almost alt t E ffi.,
set)
= LSne+2i1rYI.
(28)
nEZ
lfwe add to the above hypotheses the assumption that set) is a continuousfunction,
then the inversion formula (28) holds for all t.
Proof
11+[
= _
T
nEZ
-t
s(u)Pr(t - u)du,
(29)
and
!im
rtl
10r I10r
T
s(u)Pr(t - u) du - S(t)1 dt
T
= 0,
that is: The right-hand side of (29) tends to set) in Lb([O, T]) when r t 1. Since
LnEZ ISn I < 00, the function of t in the left-hand side of (29) tends toward the
28
function LnEZ sne+2irr(n/T)t, pointwise and in L~([O, T]). The result then follows
from Theorem 25.
The statement in the case where set) is continuous is proved exactly as the
corresponding statement in Theorem A1.4.
As in the case of stable signals, we deduce from the inversion formula the
uniqueness theorem.
A2.1. Two locally stable periodic signals with the same period T
that have the same Fourier coefficients are equal almost everywhere.
COROLLARY
EXERCISE
A2.6. Compute
using the expression ofthe Fourier coefficients ofthe 2-periodic signal set) such
that
fort
[-1,+1].
A2.7. Let x(t) be a T -periodic locally stable signal with nth Fourier
coefficient x n such that
EXERCISE
L InlPlxnl <
00.
nEZ
Show that x(t) is p times differentiable and that if the pth derivative is locally
integrable, its nth Fourier coefficient is (2i7T!.f)P Xn.
The Weak Poisson Formula
The Poisson sum formula takes many forms. The strong version is
(30)
This aesthetic formula has a number of applications in signal processing (see Part
B).
The next result establishes the connection between the Fourier transform and
Fourier series, and is central to sampling theory. It is a weak form of the Poisson
sum formula (see the discussion after the statement of the theorem).
'THEOREM A2.3. Let set) be a stable complex signal, and let 0 < T < 00 be
fixed. The series LnEZ set + nT) converges absolutely almost everywhere to a
T -periodic locally integrable function <I>(t), the nth Fourier coefficient of which is
(l/T)s(n/T).
We paraphrase this result as follows: Under the above conditions, the function
<I>(t) :=
L set + nT)
nEZ
(31)
29
~ I)(!!..) e2inIfI.
Sj(t) =
(32)
nEZ
(We speak of a "formal" Fourier series, because nothing is said about its convergence.) Therefore, whenever we are able to show that the Fourier series represents
the function at t = 0, that is, if <1>(0) = S j(O), then we obtain the Poisson sum
formula (30).
For now, we are saying nothing about the convergence of the Fourier series.
This is why we talk about a weak Poisson's formula. A strong Poisson's formula
corresponds to the case where one can prove the equality everywhere (and in
particular at t = 0) of <I>(t) and of its Fourier series. We shall say more about the
Poisson formula and, in particular, give strong versions of it in Seetion A33. The
version we have here, and that we shaH proceed to prove, is the one we need in the
Shannon-Nyquist sampling theorem (Chapter B2).
Proof:
{T L Is(t + nT)1 dt = L
10
nEZ
(T Is(t + nT)1 dt
nEZ
= L
nEZ
10
<n+l l T
Is(t)1 dt
nT
1
R
Is(t)1 dt <
00.
In particular,
+ nT)1
Is(t
<
00
a.e.
nEZ
Therefore, the series LnEZ set +nT) converges absolutely for almost all t. In particular, <I>(t) is weH defined (define it arbitrarily when the series does not converge).
This function is c1early T -periodie. We have
{T
10
1<I>(t)ldt =
{T ILS(t+nT)ldt
10
nEZ
:s {T L
10
Is(t
+ nT)1 dt =
nEZ
(ls(t)1 dt <
1R
= -1
T
<I>(t)e- 2"''in l dt
10
kEZ
00.
30
=~
T
{T
Ja
!LS(t + kT)e-
2i1C ',f(t+kTl!
dt
kEZ
{
2 n
1 ~ (n )
= T1 JJR
s(t)e- l1C'it dt = T S T .
We have a function as weH as its formal Fourier series. When both are equal
everywhere, we obtain the strong Poisson sum formula. The next exercise gives
conditions for this.1t will be improved by Theorem A3.12.
EXERCISE
(a)
A2.S. Let set) be a stable signal with the FT s(v), and suppose that
LnEZ
set
and
00.
lR.,
Ls(t+nT)= L
nEZ
nEZ
s(f)e
2i1C ',ft.
A3
Pointwise Convergence
of Fourier Series
A31
The inversion formula for Fourier series obtained in Chapter A2 requires a rather
strong condition of summability of the Fourier coefficients series. Moreover, this
condition implies that the function itself is almost everywhere equal to a continuous
function. In this seetion, the dass of functions for which the inversion formula holds
is extended.
Recall Kolmogorov's negative result (see the Introduction):
THEOREM A3.1. There exists a locally integrable 2rr -periodic function f : ~
for which the Fourier se ries diverges everywhere.
ce
This result challenges one to obtain conditions that a locally integrable 2rrperiodic function f must satisfy in order for its Fourier series to converge to
f. Recall that the Fourier series associated with a 2rr-periodic locally integrable
function f is the formal Fourier series
(33)
where cn(f) is the nth Fourier coefficient
cn(f)
_1
2rr
(34)
-Jr
The series (33) is calledformal as long as one does not say something about its
convergence in some sense (pointwise, almost everywhere, in LI, etc). If one has
P. Brmaud, Mathematical Principles of Signal Processing
Springer Science+Business Media New York 2002
32
no more than the condition that f is 27T -periodic and locally integrable, the worst
can happen, as Kolmogorov's theorem shows.
The purpose of this section is to find reasonable conditions guaranteeing
convergence as n -+ 00 of the truncated Fourier series
+n
sI (x) = L
ck(f)e ikx .
(35)
k=-n
We have to specify (1) in what sense this convergence takes place and (2) what
the limit iso Ideally, the convergence should be pointwise and to fitself. The next
exercise gives a simple instance where this is true.
EXERCISE
+n
Ck eikt
k=-n
converges uniformly to some function f(t). Show that in this case, for all k E Z,
Ck = ck(f).
Dirichlet's Integral
sI (x) = L+n { -1
k=-n
27T
= - 1 j+Jr
27T -Jr
j+Jr
.} .
f(s)e- ,ks ds e'kx
IL I
-Jr
+n eik(x-s)
k=-n
f(s)ds.
L eikt =
k=-n
sin((n
1
-2
)t)
(36)
sin(t /2)
(the function in the right-hand side is called the Dirichlet kerne!) and therefore,
f
Sn (x)
Performing the change of variable x - s = u and taking into account the fact that
fand the Dirichlet kernel are 27T-periodic, we obtain
f
Sn (x)
-2
7T
+ u)du.
(37)
If we let f(t)
33
27f
(38)
sin(u/2)
-Jr
I
S!(x)-A=27f
or, equivalently,
S!(x) - A
= -
27f
iJr
j+Jr sin((n
+ ! )u)
.
2
(f(x+u)-A)du
sin((n
+ ! )u)
2
sm(u/2)
(39)
sm(u/2)
-Jr
{fex
+ u) + fex
- u) - 2A}du. (40)
Therefore, in order to show that, for a given x E IR, S! (x) tends to A as n -+ 00,
it is neeessary and suffieient to show that the Diriehlet integral in the right-hand
side of (39) eonverges to zero as n -+ 00.
The localization principle states that the eonvergenee of the Fourier series is a
loeal property. More preeisely:
THEOREM A3.2. lf fand gare two locally integrable 27f -periodic complex-valued
functions such that, for a given x E IR and some 8 > 0, it holds that f (t) = g(t)
whenever t E [x - 8, x + 8], then
lim{S!(x) - S!(x)}
ntoo
Proof"
= O.
s! (x) -
S!(x)
= -2
j+Jr
7f
= 27f
-Jr
sin((n
+ !)u) Ilul:::8
fex
+ u) -
g(x
. ( /2)
sm u
+ u)
du
where
w(u)
= l lul ->8
fex
+ u) -
g(x
. ( u /2)
sm
+ u)
is integrable over [0, 27f]. The last integral therefore tends to zero as n -+
the Riemann-Lebesgue lemma.
00
by
lim S!(x) = A
ntoo
34
if,for some
< 8
:s Ti,
.1
8
~(u)
+ !)u) - du =
.
smn
11m
ntoo
~(u)
= f(x
0,
(41)
+ u) + f(x - u) - 2A.
(42)
uj2
where
lim
ntoo
[8 sinn + !)u)
10
.~(u)
Sln(uj2)
du = 0.
(43)
1
8
sinn
+ !)u) v(u)du,
(44)
where
v(u)
= ~(u) {U~2
Sin(~j2)}
Dini's Theorem
f(x
+ t) + f(x - t) - 2A
S! (x) = A.
Proof" The hypothesis says that the function ~(u)ju, where ~ is defined in (42),
is integrable, and therefore condition (41) ofTheorem A3.3 is satisfied (RiemannLebesguelemma).
COROLLARY
If(x
then limntoo
+ h) - f(x)1
S! (x) = f(x).
= O(lhIO')
as h --+ 0,
Proof
35
f(x
+ t) + f(x
- t) - 2A I < K _1_
ItI 1
-0:'
for some constant K and for all t in a neighborhood of zero, and 1/ltI 1-0: is
integrable in this neighborhood, because I-ex< 1. Dini's theoremA3.4 concIudes
the proof.
f(x
+ 0) =
+ h)
lim f(x
hW
and
exist and are finite, and further assume that the derivatives to the left and to the
right at x exist. Then
. SI() _
I1m
n X -
f(x
+ 0) + f( x
Prao!"
- 0)
ntoo
+ t) -
f(x
+ 0)
ttO
exists and is finite, with a similar definition for the derivative to the left. The
differentiability assumptions imply that
lim f(x
+ t) -
f(x
+ 0) +
f(x - t) - f(x - 0)
ttO
+ f(x
- t) - 2A
2A
= f(x + 0) + f(x
- 0).
A3.1. Apply the previous theorem to the 21T -periodic function defined
by
f(t)
=t
Onefinds
t
= 1T -
"" sin(nt)
~
nEZ
n#O
For t
2-n
36
nIl
4=1-3+:5-7+
Jordan's Theorem
DEFINITION
f-+
n-l
sup
'D
L !q;(Xi+l) -
q;(Xi)! <
(45)
00,
i=O
= {a = Xo
<
Xl
= b}.
(46)
In partieular, for all X E [a, b), q; has a limit to the right q;(x + 0); for all
X E (a, b], it has a limit to the left q;(x - 0); and the diseontinuity points of q;(t)
in [a, b] form a denumerable set, and therefore a set ofLebesgue measure zero.
A3.6. Let f be a 2n-periodic locally integrable real-valuedfunction
of bounded variation in a neighborhood of a given X E lR. Then
THEOREM
lim
st (x) =
f(x
+ 0) +
f(x - 0)
ntoo
(47)
A3.2. Let f
I-B
E L~(lR).
+ j(v)e2irrvt dv
= 2B
{ f(t
JR
and use this to study the pointwise convergence of the left-hand side as B tends to
infinity, along the lines of the current chapter.
The funetion
2B sine (2Bt)
37
EXERCISE A3.3.
by
!t(x) = x,
Compute their Fourier coefficients, and use this to compute
(_l)n,
n~l
!ao
(48)
n=l
2Jr
an = -1
rr
f(t)cos(nt)dt,
bn
= -1
rr
2Jr
f(t)sin(nt)dt.
Of course, the series in (48) is purely formal when no additional constraints are
put on f(t) in order to guarantee its convergence. Now, the function F(t) defined
for tE [0, 2rr) by
F(t) = Iat(f(X) - !ao)dx
(49)
is 2rr-periodic, is continuous (observe that F(O) = F(I) = 0), and has bounded
variation on finite intervals.
Therefore, by Jordan's theorem its Fourier series converges everywhere, and for
all x E lR,
00
F(x)
2Jr
F(t) cos(nt) dt
1 [
sin(nx)
=- F(x)-rr
n
= - -1
nrr 0
2Jr
]2Jr
0
-1
nrr
2Jr
b
f(t)sin(nt)dt = _..!:,
n
38
n=
10
= an
n
lR,
1
~
F(x) = zA
o+ ~
{an-;; sm(nx)
. - -;;
bncos(nx) }.
(50)
n=l
(51)
Since A o is finite we have shown, in particular, that L~l bn/n converges for any
sequence {b n }n2:1 of the form
bn =
10
I~ ;
+ n] by
ifx > 0,
J(x)
if x < 0,
n
2
x
2
----
if x < O.
sI
S!(~)
::: A.
(52)
39
phenomenon, which can be observed whenever the function has a point of discontinuity. The proof of (52) for this special case keeps most of the features of the
general proof, which is left for the reader. In this special case,
f
Sn (x)
Now,
sin((n
2 sin('it)
-_lX
o
+ !)t)
sin((n
+ !)t) dt -
sin(nt)
t
ll
+ -
dt
(sin(nt)Cos(!t)
- - dt
x
-.
2 sin(!t)
sm(nt)
dt
cos(!t)
1
2 sin('it)
1)
-t
dt
cos(nt)dt.
The last two integrals converge uniformly to zero (by the uniform version of the
Riemann-Lebesgue lemma). Also,
~ sin(nt)
A32
- - dt
sin(t)
- dt ::::: 1.18 -:rr:rr
> -
tot
Fejer's Theorem
sI
40
Fejer's Kernel
Take the imaginary part of the identity
n-l
' " ei(k+l/2)u
e iu / 2
1_
inu
e .
1- e lU
k=O
to obtain
sI
Starting from Dirichlet's integral expression for (t) [cf, Eq. (37)], we obtain, in
view of the identity just proven, Fejer's integral representation of a! (x),
a!(x)
{+Jr
LJr
Kn(u)f(x-u)du=
(+Jr
LJr
Kn(x-u)f(x)du,
(54)
where
I
Kn(t) =
sin2(~nt)
(55)
2 1
= 1 in (54)],
i:
(56)
Jr Kn(u) du = 1.
(57)
(58)
(59)
ntoo
ntoo
+C
Kn(u) du = 1.
-6
The last four properties make ofFejer's kernel a regularization kerneion [-:rr, +:rr]
(by definition of a regularization kernel).
lim
sup
ntoo XE[-Jr,+Jr]
= O.
(60)
Proof'
i:
41
1+
+8
-8
=A+B.
(61)
[-n,+n]\[-8,+8]
For a given 8 > 0, ehoose 8 sueh that I/(x - u) - l(x)1 ::s 8/2 when lul ::s 8.
Note that I is uniformly eontinuous and uniformly bounded (being aperiodie and
eontinuous funetion), and therefore 8 ean be ehosen independently of x. We have
A::s
-8
Kn(u) du.
J[-n,+n]\[ -8,+8]
By (57) and (59), B ::s 8/2 for n sufficiently large. Therefore, for n suffieiently
large, A + B ::s 8.
Fejer's theorem for eontinuous periodie funetions is the key to important approximation theorems. The first one is for free. We eall a trigonometrie polynomial
any finite trigonometrie sum of the form
L
+n
p(x) =
Ck eikx .
-n
THEOREM A3.8.
8
sup
8.
tE[-n,+n]
Proof'
iill.
THEOREM A3.9.
8
8.
tE[a,b]
Jf, moreover,
Proof' First, suppose that a = 0, b = 1. One ean then extend I : [0, 1]] 1-+ C
to a funetion still denoted by I, I : [-n, +n]] 1-+ C, that is eontinuous and
sueh that IHn) = I( -n) = O. By Theorem A3.8, there exists a trigonometrie
42
sup
tE[D, I]
tEl
-rr,+rr]
If(x) - p(x)1
e
:s -.
Now replace each term e ikx in p(x) by a sufficiently large portion of its Taylor
series expansion, to obtain a polynomial P(x) such that
e
sup IP(x) - p(x)1 :s -.
tE[D,I]
2
Then
sup If(x) - P(x)1
tE[D, I]
:s -.
2
To treat the general case f : [a, b] f-+ C, apply the result just proven to cp
[0, 1] f-+ ce defined by cp(t) = f(a + (b - a)t) to obtain the approximating
polynomial rr(x), and take P(x) = rrx - a)j(b - a.
Finally, to prove the last statement of the theorem, observe that
If(x) - Re P(x)1
:s
If(x) - P(x)l
Fejer's Theorem
We shall first obtain for the Fejer's sum the result analogous to Theorem A3.3.
First, from (54), we obtain
a!(x)
= -1
sm Czu)
2nrr D
(62)
a!(x)-A= 2nrr
THEOREM
sm (:zu)
ntoo
ntoo
(63)
(64)
8 .
4J(u)
sm 2(!nu) - 2 D
u
du
= 0,
(65)
where
4J(u)
Proof
= f(x + u) + f(x
(66)
- u) - 2A.
The quantity
In 18
sin 2 (!u)
- n
18
14J(u)1
sin 2 (!u)
du
tends to 0 as n
00.
n
tends to 0 as n
00.
43
sin 2( !nu)
sin 2 (!u)
fjJ(u) du
11 (1
:'S n
. 2( 1
SIll
ZU
) -
1I
12 IfjJ(u)1 du
ZU
THEOREM A3.11. Let f (t) be a 2JT -periodic locally integrable function and assume
that, for some x E ~, the limits to the right and to the left (respectively, f(x
and f(x - 0), exist. Then
lim
Proof"
u! (x) =
+ 0) + f(x
- 0) .
2
Fix 8 > O. In view of the last result, it suffices to prove (65) with
fjJ(u)
= {f(x + u) -
f(x
+ O)} + {f(x -
f(x
11
rJ(c),
sin2(!nu)
1
;
fjJ(u) du
c
n
< -
1ry
sin 2(!nu)
u2
du
+ -1
1 -IfjJ(u)1-
n ~
u2
du.
The last integral is bounded; and therefore, the last term goes to 0 as n
for the penultimate term, it is bounded by Ac, where
A33
(67)
u) - f(x - O)}.
Since fjJ(u) tends to 0 as n ~ 00, for any given c > 0 there exists rJ
rJ :'S 8, such that IfjJ(u)1 :'S c when 0 < u :'S rJ. Now,
o<
+ 0)
00
sin2(!v)
dv <
00.
00.
As
The following corollary of F6jer's theorem will play the key role for the proof of
the Poisson sum formula (Theorem A3.l2).
44
COROLLARY A3.1.
= fex).
ntoo
0'1 (x) = A.
lim
ntoo
We have already given a weak version of the Poisson sum formula in Section
A22. A most interesting situation is when the function cI>(t) defined by (31) is
equal to its Fourier series for all t E lR., that is,
Ls(t
nEZ
for all tE R
(68)
nEZ
(2)
LnEZ
00
be
Here are two important cases for which the strong Poisson sum formula holds.
COROLLARY A3.1. Let set) be a stable complex signal, and let 0 < T < 00 be
fixed. If, in addition, L set + nT) converges everywhere to a continuousfunction
that has bounded variation, then the Poissonformula (68) holds.
Praof' We must verify conditions (1) and (2) ofTheorem A3.12. Condition (1)
is part of the hypothesis. Condition (2) is a consequence of Iordan's theorem
A3.6.
EXAMPLE
s(t)
s(v)
0(1
oe
as Itl
+\tl"')
+llvl"')
~ 00,
as lvi
~ 00,
for some a > 1, then the Poisson formula (68) holds for alt
Proof
45
(69)
< T <
00.
Convergence Improvement
The Poisson formula can be used to replace aseries with slow convergence by one
with rapid convergence, or to obtain some remarkable formulas. Here is a typical
example. For a > 0,
n(a 2
+ v2 )
Since
Ls(t +n)
= Le-2nalt+nl
nEZ
nEZ
is a continuous function with bounded variation, we have the Poisson formula, that
is,
1 - e- 2na
-1,
Therefore,
1
nO": 1
Letting a
n 1 + e- 2na
a 2 + n 2 = 2a 1 - e- 2na
1
2a 2 .
0, we have
The general feature of the above example is the following. We have aseries
that is obtained by sampling a very regular function (in fact, C OO ) but also slowly
46
h(t-2T)
~
[jU[jU[j[j[j~
1
-3T -2T
-T
2T
3T
-3T -2T
-T O T
2T
3T
decreasing. However, because of its strong regularity, its Fr has a fast decay. The
series obtained by sampling the Fr is therefore quickly converging.
Radar Return Signal
s(t) =
(I>(t -
nT) f(t).
(70)
nE'L
(We may interpret h(t - nT) as a return signal of the nth pulse of a radar after
reftection on the target, and f (t) as a modulation due to the rotation of the antenna. )
The Fr ofthis signal is (see Fig. A3.1)
nE'L
(t)
!!.-).
T
(71)
References
[Al] Ablowitz, M.J. and Jokas, A.S. (1997). Complex Variables, Cambridge University
Press.
[A2] Bracewell, R.N. (1991). The Fourier Transform and Its Applieations, 2nd rev. ed.,
McGraw-Hil1; New York.
[A3] Gasquet, C. and Witomski, P. (1991). Analyse de Fourier et Applieations, Masson:
Paris.
[A4] Helson, H. (1983). Harmonie Analysis, Addison-Wesley: Reading, MA.
[A5] Katznelson, Y. (1976). An Introduetion to Harmonie Analysis, Dover: New York.
[A6] Kodaira, K. (1984). Introduetion to Complex Analysis, Cambridge University Press.
[A7] Krner, T.W. (1988). Fourier Analysis, Cambridge University Press.
References
47
[A8] Rudin, W. (1966). Real and Complex Analysis, McGraw-Hill: New York.
[A9] Titchmarsh, E.C. (1986). The Theory of Funetions, Oxford University Press.
[AlO] Tolstov, G. (1962). Fourier Series, Prentice-Hall (Dover edition, 1976).
[All] Zygmund, A. (1959). Trigonometrie Series, (2nd ed., Cambridge University Press.
Part
Signal Processing
Introduction
The Fourier transform derives its importance in physics and in electrical engineering from the fact that many devices mapping an input signal x(t) into an output
signal y(t) have the following property: If the input is a complex sinusoid e2invt,
the output is T(v)e2invt, where T(v) is a complex function characterizing the device. For example, when x(t) and y(t) are, respectively, the voltage observed at the
input and the steady-state voltage observed at the output of an Re circuit (see Fig.
BO.I), the input-output mapping takes the form of a linear differential equation:
y(t)
+ RCy(t) = x(t),
I
1+ 2irrvRC
The Re circuit is one of the physical devices that transform a signal into another
signal, that satisfy the superposition principle, and that are time-invariant. More
precisely:
R
1'~1'
C
y(t)
x(t)
l"""
,1,,)
52
(a) If Yl (t) and Y2(t) are the outputs corresponding to the inputs Xl (t) and X2(t),
then AlYl (t) + A2Y2(t) is the output corresponding to the input AlXl (t) + A2X2(t);
(b) If y(t) is the output corresponding to x(t), then y(t - -r) is the output
corresponding to x(t - -r).
Such physical devices are caIled (homogeneous linear) filters.
A basic example is the convolutional filter, for which the input-output mapping
takes, in the time domain, the form
y(t)
h(t - s)x(s)ds,
where h(t) is caIled the impulse response, because it is the response of the filter
when the Dirac pulse 8(t) is applied at the input. Indeed,
h(t) =
If the impulse response is integrable, the output is weIl defined and integrable, as
long as the input is integrable. Then, by the convolution-multiplication rule, the
expression of the input-output mapping in the frequency domain is
y(v) = T(v)x(v),
where T (v) is the frequency response, that is, the Fr of the impulse response:
T(v) =
h(t)e-2i1Cvt dt.
Observe that if the input is x(t) = e-2i1Cvt, the output is weIl defined and equal to
h(s)x(t - s)ds
= T(v)e-2i1Cvt,
Introduction
53
L s (nT) sinc (f - n) .
nEZ
54
cmumute for base-band signals. This is not a difficult result, but it is of course a
fundamental one because in signal processing, one first sampies and then performs
the filtering operation in the sampled domain, since one of the advantages of digital
processing comes precisely from the difficulty of making analog filters.
One advantage of analog processing is that it is instantaneous. To maintain
competitivity, the signal processing algorithms have to be fast. For instance, the
discrete Fourier transform is implemented by the so-called fast Fourier transform,
an algorithm whose principle we briefty explain in this Part. Subband coding also
has a fast algorithm associated with it. It is a data compression technique. The
signal is not directly quantized, but instead, it is first analyzed by a filter bank, and
the output of each filter bank is quantized separately. This allows one to dispatch
the compression resources unequally, with fewer bits allocated to the subbands
that are less informative (see the discussion in Chapter B4). Subband coding is
the last topic of Part Band introduces the sections on multiresolution analysis in
Part D.
BI
Filtering
B 11
Convolutional Filter
We introduce a particular and very important dass of filters.
DEFINITION Bl.l. The transformation fram the stable signal x(t) to the stable
signal y(t) defined by the convolution
y(t)
h(t - s)x(s)ds,
(1)
where h(t) is stable, is ca lied a convolutional filter. This filter is called a causal
filter if h(t) = 0 for t < O.
The signal y(t) is the output, whereas the signalx(t) is the inputofthe linear filter
with impulse response h(t). Informally, if x(t) is the Dirac generalized function
8(t) (an impulse at time 0), the output is (see Fig. Bl.1)
56
BI. Filtering
8(t)
f\h(t~
>
impulse
>
impulse response
[t
oo
(2)
h(t)e-2i1Cvt dt,
(3)
y(t) = T(v)e2i1Cvt.
(4)
(Note that the output is weH defined by the convolution formula, even though in
this particular case the input is not integrable.)
EXERCISE Bl.1. Let y(t) be the output of a stable and causal convolutional filter
with impulse response h(t) [see (2)]. Let
z(t)
1
t
h(t - s)x(s)ds,
~ 0,
be the output of the same filter, when the input x(t) is applied only from time t = 0
on. Show that
tt+oo
= O.
1-+
The meaning ofproperties (a) and () is the following: (a) XI (t), X2(t)
E
===}
x(t - T)
D('c).
AI, A2
+ A2X2(t)
AIXI (t)
E D('c);
and () x(t)
E D('c),
The meaning of properties (i) and (ii) is the following: (i) XI (t), X2(t)
AI, A2
E C,XI(t)~
YI(t),X2(t)
E D('c),
Y2(t)
E D('c),
e 2mvt
D('c),
lR
===}
D('c),
AIXI(tHA2X2(t) ~ AI(t)YI(tH
===}
57
===}
then
T(v)e 2mvt
(5)
T(v)
where G(v)
EXAMPLE
= G(v)ei(v),
(6)
= IT(v)1 is the amplitude gain and (v) = Arg T(v) is the phase.
Bl.l. Let
D('c) = {x(t) :
or
D('c) = {x(t)
+ e(t) :
Ix(t)1 dt <
00
and e(t)
E},
y(t)
(7)
where xCv) is the FT ofthe input x(t) E L~(lR). The right-hand side of(7) has a
meaning since T(v) and xCv) being in L~(lR) implies that T(v)x(v) is in Lt(lR)
(see Theorem 20 of the appendix).
EXAMPLE B 1.3. If T ( v) is an arbitrary function, not necessarily in L~ (lR), one can
always define a filter ,C by the input-output relation (7), provided one chooses for
domain D('c) the set of signals x(t) such that the right-hand side has a meaning.
58
B 1. Filtering
-B
+B
Low-pass (B)
~2B
----7
~2B
-VQ
----7
+VQ
Band-pass (vQ, B)
(8)
where B is the cut-offfrequency. One calls band-pass (B, va), where 0< B < va,
a filter with frequency response
T(v)
= 1[-vo-B,-vo+Bj(v) + l[vo-B,vo+Bj(v),
(9)
Hilbert's filter (see Fig. B 1.3) belongs to the category of Example B 1.3. It is the
filter with frequency response
where T(O) = O.
(10)
One possible domain for Hilbert's filter is the set of stable (resp., finite-energy)
signals whose FT has compact support. The amplitude gain of Hilbert's filter is 1
(except for v = 0, where the gain is zero), and its phase is
(v)
Jr /2
if v > 0,
if v
-Jr /2
if v < O.
+i
= 0,
r - ,- - - - - -
,0
- - - - - - - ' , -i
Hilbert filter
(11)
59
There is no function admitting the frequency response (10). There is, in fact,
a generalized function (in the sense of the theory of distributions) with Fr equal
to T(v). However, in signal theory, the Hilbert filter is used only in the theory of
band-pass signals (see Section BI2). For such signals the Hilbert filter coincides
with a bona fide convolutional filter:
EXERCISE Bl.3. Show that the output y(t) ofthe Hilbertfilter, corresponding to
a stable signal x(t) having an FT x(v) that is null outside the frequency band
[- B, + B], can be expressed as
y(t)
=-
R.
x(t - s)
2 sin 2 (n Bs)
ns
ds.
where
Ix(v)1 dv
x(t)
<
x(v)e2iJrvt dv,
and
00
Ivllx(v)1
(12)
<
00.
~ x(t) =
[(2inv)x(v)e2iJrvt dv.
(13)
JR.
dt
(Apply the theorem of differentiation under the integral sign; 15 ofthe appendix).
The mapping x(t) ~ dx(t)/dt is a linear filter, called the differentiating filter, or
differentiator, with frequency response
T(v)
= 2inv.
(14)
~ Ix(v)1 dv
JIR.
<
00
and
[lx(v)1 dv
JIR lvi
<
00.
The signal
y(t)
= [
x.(v) e2iJrvt dv
2mv
is in the domain of the preceding filter (the differentiator), and therefore,
JIR
T(v) = - . - .
2mv
(15)
60
BI. Filtering
y(t)
x(t)
y(t)
x(t)
.c2 *.c1
series
.c 2 +.c l
parallel
~----,-----:~ y (t)
We now describe the basic operations on filters (see Fig. B1.4). Let C] and C2
be two convolutional filters with (stable) impulses responses h](t) and h 2 (t) and
frequency responses T](v) and T2(V), respectively.
The series filter C = C 2 * C] is, by definition, the convolutional filter with
impulse response h(t) = (h] * h 2)(t) and frequency response T(v) = T](v)T2(V).
It operates as folIows: The input x(t) is first filtered by Cl, and the output of C] is
then filtered by C 2 , to produce the final output y(t).
The parallel filter C = C] + C 2 is, by definition, the convolutional filter with
impulse response h(t) = h](t) + h 2(t) and frequency response T(v) = T](v) +
T2(V). It operates as folIows: The input x(t) is filtered by Cl, and "in parallel," it
is filtered by C 2 , and the two outputs are added to produce the final output y(t).
The feedback filter C = CI/(1 - C] * C 2 ) is, by definition, the convolutional
filter with impulse response frequency response
T(v) =
T](v)
1 - T](v)T2 (v)
This filter will be a convolutional filter if and only if this frequency response is the
PT of a stable impulse response. 1fthis is not the case, one may define the feedback
filter by the input-output relation
A
y(v)
T](v)
1 _ T](v)T2 (v)x(v)
61
The filter 'cl is the forward loop filter, whereas 'cl is the feedback loop filter.
The forward loop processes the total input, which consists ofthe input x(t) plus
the feedback input, that is, the output y(t) processed by the feedback loop filter.
EXERCISE
Give the impulse response of the convolutional filter with the above jrequency
response T (v). Interpret the filter as a feedback filter.
Filtering of Decomposable Signals
Bl.4. The signal set) is called decomposable ifit can be put into the
form
set)
where
(16)
where /LI and /L2 are signed measures of finite total variation. A signed measure
of finite total variation is a mapping /L : B(IR.) ~ IR. of the form
/L(C)
= /L1(C n A) -
/L2(C n ),
for some A E B(IR.) and all C E B(IR.), where /LI and /L2 are measures on (IR., B(IR.))
of finite total mass.
EXAMPLE Bl.4. Let set) be a periodic signal with period T that is stable over its
period and whose Fourier coefficients satisfy the condition
L ISnl <
00.
nEZ
= LSn8q,(dv).
nEZ
62
BI. Filtering
00,
and since
THEOREM
/Lx:
x(t) =
e2invt flxCdv),
(17)
h(t - s)x(s)ds
LL
and hence
(L
=
=
L
L
L
(L
00,
00.
e2invs flxCdV )) ds
(h(t - s)
e 2inv(t-s)
(18)
T(v)flx(dv).
e- 2inv(t-s)h(t - s)
dS) flxCdv),
andhence
(19)
EXAMPLE
BI.5. In light of(18), one can interpret Eq. (22) ofTheorem A2.I:
A(n)A
T x n,
Yn = h
A
where h(t) is a stable impulse response and x(t) is a locally stable periodic signal
withperiod T: IfLnEZ l.inl < 00, then LnEZ IYnl < 00, since h(v) is bounded.
63
The two signals x(t) and y(t) are therefore decomposable, and (19) can be written
lLy(dv)
LYneq,(dv)
LXnh(f) eq,(dv)
nEZ
nEZ
= h(v)ILAdv) =
T(v)iLAdv).
P(z)
= ao + Latzt,
Q(z)
= bo + Lb1z l
1=1
(20)
1=1
For all v
(21)
Q(2inv)
T (v) = -P-(2-in-v-) .
(22)
IR, define
We define a linear time-invariant filter CC, D(I: as follows. First, we define the
domain
L~(IR)}.
(23)
We first observe that any function x(t) in the domain is differentiable up to order
q and that its jth derivative is
x(j)(t) =
I: : x(t)
y(t)
(24)
T(v)x(v)e2irrvt dv.
(25)
One has to verify that the integral in (25) is weIl defined. Indeed,
bounded because P(z) has no imaginary root. In particular,
for some K <
I/I P (2i n v) I is
00.
In fact, T(v)x(v)v k is integrable for all k, 0 .:::: k .:::: p. To check this, observe
that Ivl k /IP(2inv)1 is bounded for all k .:::: p because P(2inv) is bounded away
64
BI. Filtering
from zero (P(z) has no imaginary root) and Ivl k /IP(2iJTv)1 behaves as Ivl k 00. Therefore, the output y(t) is differentiable up to order p, and for j :::s p,
y<j)(t)
(2iJTv)jT(v)x(v)e2invt dv.
at
(26)
From (24), (26), and (22), it follows that the input x(t) and the output y(t) are
linked by the differential equation
+L
p
aoy(t)
= box(t) + L
q
aly(I)(t)
1=1
b1x(l)(t),
1=1
= Q(:t )
(27)
x(t).
Ai P(2iJTv)
The answer is "no, in general" and "yes, asymptotically" if we impose the following
condition:
P(z) is strict1y stable,
(28)
that is, the real parts of all the roots of P(z) are strict1y negative. Then, y(t), t ~ to,
the solution of (27) with arbitrary initial conditions y(to) = Yo, Y6j)(to) = Y6 j )
(1 :::s j :::s p - 1), satisfies
( l
lim y(t) -
ttoo
IR
Q(2iJTv)
.
x(v)e 2lJrvt dv
P(2m v)
A
= O.
(29)
Proof" The general solution of (27) is the sum of a particular solution of (27) and
of the general solution of the differential equation without a right-hand side,
p(:t ) y(t)
= O.
(30)
Therefore, since
( Q(2iJTv) x(v)e2invt
P(2iJTv)
JIR
is a particular solution of (27), we have to show that limttoo z(t) = 0 for the general solution of (30). This follows from the theory of linear differential equations
65
because the characteristic polynomial P(z) of (30) has all its roots in the open left
half complex plane (see [B5]).
If P(z) is not strictly stable, there are initial conditions such that
ImIR
Q(2inv) ~
.
.
x(v)e217fvt dv
P(2mv)
00.
EXAMPLE Bl.6. Consider the LRC circuit (see Fig. BI.5). Its input and output
are related through the differential equation
= x(t),
where jet) and y(t) are thefirst and second derivatives ofy(t). The roots ofthe
characteristic polynomial
P(z)
= 1 + RCz + LCz 2
z = ----'--,----'-and their real parts are always strictly negative. Therefore, the system is strictly
stable, and the permanent regime when the input is x(t) E D(C) is
y(t)
We note that Q(z)
2'
n
r
P(z) = ap
(z -
Zk)m k ,
k=l
1'~1'
x(t)
c 1-
y(t)
l111111111111I11111
66
BI. Filtering
and therefore,
1
0
-00
.
tj- 1
- - - eZkte-217rvt
(j - I)!
tj-l
00
_ _ _ ezkte-217rvt
(j - I)!
dt
=-
dt
=+
..
(2inv - Zk)1
..
(2inv - Zk)1
h(t) =
1
'""'
k tj - e Zkt
~
~('-1)'
k;Re(zkl>O j=l }
.
for t < 0,
(31)
we have
h(t)
Fr
-+
mk
L L (2inv
k=l j=l
k~ Zk)j .
q-p
k=O
lR
In particular,
y(t)
s)x(s)ds.
h(t - s)x(s) ds
(32)
(33)
if and only if q < p. If, moreover, P(z) is strict1y stable (no roots in the c10sed
right half-plane), then, as the expression (31) of the impulse response shows, the
impulse response is causal, and the filter is then called realizable.
EXERCISE
BI.5. Give the impulse response of the LRC filter in the case R 2 <
4LjC.
We observe that the input-output relationship (32) is meaningful for all x(t) E
D(,C'), where D(,C') consists of the stable complex signals that are differentiable
67
up to order max(O, q - p). Therefore, one can consider that the filter (C', D(C',
where L' is described by (32), is an extension ofthe original filter (C, D(C. We
may consider that (32) is an extension of the differential equation (27). For some
functions of the extended domain, the input-output relationship is not a differential
equation.
EXERCISE Bl.6.
~y
=
2
y" -
x" -
~x
3 .
Butterworth Filters
We consider the problem of implementing an approximation of the ideal lowpass (B) filter (with cut-off frequency B) by means of a stable and realizable
filter with real impulse response h(t). Let T(v) be the frequency response of the
approximating filter. The following family of filters, called Butterworth filters, has
been proposed:
1+
(ir'
(34)
(As n -+ 00, the filter looks more and more like an ideallow-pass filter.) One
seeks T (v) of the form
T(v)
K
P(2inv)'
where P(z) has all its roots strictly to the left of the imaginary axis in order to
guarantee stability and causality.
The roots of the polynomiall + (v / B)2n are
O::;l::;2n+1.
We reorder these roots in such a way that VI, vi, ... , vn , v~, are the 2n roots, where
VI, .. , Vn have strictly positive imaginary parts. We shall allocate VI, . , Vn to
T(v), thus proposing
T(v)
(35)
This is the frequency response of a real filter (i.e., T*(v) = T( -v because any
root among VI, ... , Vn is purely imaginary or it can be associated with another root
symmetrie with respect to the imaginary axis (see Fig. B 1.6).
In the case n = 2, we find
68
BI. Filtering
1/1
,,
1/0
,,
,
/
\
........... \
...
,,
/
/
/
,,
.........
.... I \ ....
1/*
1
...
1/*
I/i
n=2
n=3
= 2 and n = 3
-Vi V2 V3
(v - vd(v - V2)(V - V3)
EXERCISE
Bl2
0/ order
2 can be
Band-Pass Signals
In this section, we give the basic facts conceming frequency transposition and study
the phenomena associated with it, such as cross-talk in quadrature multiplexing,
and channel dispersion.
Complex Envelope
DEFINITION
It will be assumed, moreover, that set) is real and hence that its Fr is Hermitian
even:
sC-v)
= s(v)*.
We are going to show that a real band-pass signal set) has the representation
set)
= met) cos(2nvot) -
n(t) sin(2nvot),
(36)
wherem(t) andn(t) aretwo signals thatare real, andbase-band (B). The base-band
signals met) and n(t) are the quadrature components ofthe band-pass signal set).
/\/\
/\1'\
8(1/)
/\1'\
~S:(I/)
d\
69
~u(1/ )
= 2 10
00
s(v)e2iJrvt dv,
(37)
sa(v
+ vo)e2iJrvt dv.
(38)
BI.S. Show that the FT ofthe signal Re {u(t)e2iJrvot} is s(v), and thus
s(t)
= Re {u(t)e2iJrvot}.
(39)
Let m(t) and n(t) be the real and imaginary parts of u(t):
u(t)
= m(t) + in(t).
(40)
n(v)
+ u(-v)*
2
'
u(v) - u(-v)*
2i
(41a)
(41b)
and that
m(v) = {s(v
(42a)
(42b)
70
BI. Filtering
EXERCISE
Band-Pass Filtering
When the band-pass signal (36) is passed through a filter with frequency response
T(v), we may, without loss of generality, consider that T(v) = 0 if lvi fj [vo B, Vo + B], since filtering is expressed in the frequency domain by multiplication
of the Frs. Hence it will be assumed that the impulse response h(t) of the filter is
also a band-pass (vo, B) function.
The output signal y(t) has as Fr
y(v) = T(v)s(v),
cos(27rl/ot)
m(t)
2 cos(27rl/ot)
m(t)
-+-oo~
n (t) -+-OO---Y
sin(27rl/ot)
(43)
n(t)
2 sin(27rl/ot)
71
B1.12. Show that ifwe denote by v(t) and u(t) the complex envelopes
ofy(t) and s(t), respectively, then
EXERCISE
v(v)
= T(v + va)u(v).
(44)
We shall describe two effects that are specific of frequency transposition. The
first one is the phenomenon of cross-talk in quadrature multiplexed channels.
Cross-Talk
Suppose we use quadrature multiplexing; we thus send two band-pass messages
m(t) and n(t) in the form
s(t) = m(t)cos(2nvat) - n(t)sin(2nvat).
n'(v)
=-
i{s'(v
+ va) -
Va)}l[-B,+Bl(v),
s'(v - Va)}l[-B,+Bl(v).
Let us assume that the distortion s(t) -+ s'(t) is a linear filtering with frequency
response T(v). Let us note that T(v) is Hermitian symmetric, as it is the Fr of a
real impulse response h(t). We have s'(v) = s(v)T(v), and therefore,
m'(v) = {T(v
va)s(v - va)}l[-B,+Bl(v),
It depends on both m(v) and n(v), and therefore, in general, there is interference
between the two paths. However, under the condition that T(v) be Hermitian
symmetric about Va in the band of width 2B centered on Va, that is,
T(v
+ va) = T*(va -
v)
forall v
[-B, +B],
(45)
+ va) = T(v -
va)
forall v
[-B, +B],
(46)
then
m'(v) = G(v)m(v),
n'(v)
= G(v)n(v),
(47)
72
B 1. Filtering
T(v)
~+13_
~:::: !-~=~
v
-va-B -Va -va+B
va-B
Va va+B
where
G(v) = T(v
+ vo).
(48)
T(v
+ Vo) =
+ v,
T(v - Vo)
=A-
v.
We shall now study another phenomenon associated with frequency transposition, that of group delay.
Dispersive Channels
(49)
Kei(v),
where K is a complex constant that will be taken equal to unity. This channel transforms the complex sinusoid e2irrvt into the delayed complex sinusoid e i (2rrvt+ (v)),
where (v) is the phase ofthe filter at the frequency v.
Let s(t) be a real signal, band-pass (vo, B), of the form s(t) = m(t) cos 2:rrvot.
Let y(t) be the signal obtained by passing s(t) through the dispersive channel. The
corresponding base-band equivalent filter has the frequency representation
v(v)
= T(v + vo)m(v),
where v(v) is the Fr of the complex envelope v(t) of y(t) [see (43)].
Suppose that in the band [vo - B, Vo + B], the dispersion has a first-order
expansion
(v
av
,
V=Vo
[-B, +B];
73
then (approximately)
where
(vo)
(50)
T ---P 2:rrvo
and
(51)
Therefore, we have
Now,
y(t)
= Re {v(t)e2invot}.
Hence we have
y(t)
The constants
Tp
and
Tg
= m(t -
Tg ) cos 2:rrvo(t -
Tp ).
(52)
B2
Sampling
B21
THEOREM
Lls(~)1
nEZ
2B
<00.
(53)
76
B2. Sampling
Then
LS(v
ja
_1 L S( ~) e-2iJrvfs, a.e.
2B na 2B
+ j2B) =
(54)
T(v)e2iJrvt dv,
h(t) =
where T(v)
E LU~).
(55)
The signal
set) = _1
2B
LS(~) h(t - ~)
2B
2B
+ j2B)!
T(v)e2iJrvt dv.
nEZ
(56)
= [
J~
!LS(V
(57)
JEZ
LjEZ
s(v
+ j2B)
1 [
2 n
2B J~ s(v)e- "'2li V dv,
that is, since the Fourier inversion formula for set) holds (s(v) is integrable) and
it holds everywhere (s(t) is continuous), the nth Fourier coefficient of <p(v) is in
fact equal to
2~ ~s(2~)e-2iJrfsv.
nE",
In view of condition (53), the Fourierinversion formula holds a.e. (Theorem A2.2),
that is, <p(v) is almost everywhere equal to its Fourier series. This proves (54).
Since the frequency response T(v) E Lb(~), the impulse response h(t) given
by (55) is bounded and uniformly continuous, and therefore set) is bounded and
continuous (the right-hand side of (56) is a normally convergent series-by (53)of bounded and continuous functions). Also, upon substituting (55) in (56), we
obtain
set)
= _1 L
2B
=[
J~
nEZ
S ( ~) [ T(v)e2iJrv(t-fsl dv
2B J~
~)
! L _1 s(
e-2iJrvfs! T(v)e2iJrvt dv.
nEZ 2B
2B
77
)IIT(V)I dv =
krna Is( ~
2B
na
Therefore,
set)
dV)
< 00.)
g(v)e2inVI dv,
where
g(v)
IT(v)1
2B
2~ {~S(2~) e-2inviB
T(v).
THEOREM B2.2. Let s(t) be a stable and continuous signal whose FT s( v) vanishes
outside [- B, + B], and assume condition (53) is satisfied. We can then recover
s(t)from its sampies s(nj2B), n E Z, by theformula
set)
= LS(~) sine(2Bt 2B
nEZ
n), a.e.
(58)
+ j2B)! T(v) =
s(v)I[-B,+Bl(v) = s(v),
JEZ
s(v)e2invI dv
= set).
s.(t) = _1
1
2B
"s(~)
8(t - ~).
2B
2B
L;,
nEa-
(59)
78
B2. Sampling
!21 s (t)
_~~~f~f~f~f'f'f'f",
Figure B2.l. The Dirae eomb of (59)
1/2B
s (t)
s(t)
--7j)(}--+---l
lL:(n)
n)
2B gT (t - -2B'
2B
nEZ
= -r
1[0
'
Tl(t),
I1
= r
s(t - u) du.
(Observe that we eannot use Theorem B2.1 as such; why?). Condition (53) implies
that Si,T(t) is integrable and has an Fr given by
= ;(v)
(2~ ~S (2~) e-
2irrfB ) .
The signal ST(t) is obtained by low-pass (B) filtering of the stable signal Si,T(t).
Sinee the impulse response of a low-pass is not integrable, we eannot use the
eurrent version of the eonvolution-multiplieation rule as it iso However, we shall
proeeed formally beeause the result is justified by a more appropriate version of
79
s,(v) = g,(v)
(1"
2B L;s (n)
2B e- 2in 2Bn l[-B,+Bl(v) ) = g,(v)s(v).
A
nE",
The result then follows by the inversion formula and the convolution-multiplication
formula (the current version, this time).
Aliasing
What happens in the Shannon-Nyquist sampling theorem if one supposes that the
signal is base-band (B), although it is not the case in reality?
Suppose that a stable signal s(t) is sampled at frequency 2B and that the
resulting impulse train is applied to the low-pass (B) with impulse response
h(t) = 2Bsinc (2Bt), to obtain, after division by 2B, the signal
s(t)
= LS(~) sinc(2Bt 2B
nEZ
n).
What is the Fr of this signal? The answer is given by the following theorem,
which is a direct consequence of Theorem B2.l.
THEOREM B2.3. Let s(t) be a stable and continuous signal such that condition
(53) is satisfied. The signal
s(t)
= LS(~) sinc(2Bt 2B
nEZ
n)
(60)
J(v)e2inVf dv,
where
J(V)
= !LS(V + j2B)!1[-B,+Bl(V),
(61)
kEZ
~
If s(t) is integrable, then s(v) is its Fr, by the Fourier inversion theorem. This
Fr is obtained by superposing, in the frequency band [- B, +B], the translates
by multiples of 2B of the initial spectrum s( v). This superposition constitutes the
phenomenon of spectrum folding, and the distortion that it creates is called aliasing
(see Fig. B2.3).
EXERCISE
s(t)
= ( Sin(2Jl' Bt)2
Jl't
sin(2Jl' Bt)
Jl't
80
B2. Sampling
-w
~
+B +W
-B
s(v)
s(v)
3B
4B
<--~,
~(v)
+B +W
-W -B
= ~ L s (~) 8 (t - ~) .
Vo
nEZ
Vo
Vo
Passing this train through a low-pass (vo + B), one obtains a signal a(t). Passing
this train through a low-pass (B), one obtains a signal b(t).
Show that
a(t) - b(t)
= 2 s(t) cos(2Jl'vot).
(We have therefore effected the frequency transposition of the original signal.)
The following exercise gives aversion of the sampling theorem for band-pass
signals.
EXERCISE
2B
L s (~)
8 (t - ~)
2B
2B
nEZ
81
Oversampling
We have seen the effeets of inadapted sampling, that is, sampling at a too slow
rate (undersampling) that results in aliasing, or speetrum folding. We now show
that oversampling ean be exploited to obtain faster rates of eonvergenee in the
reeonstruetion formula.
Assurne that the situation of the Shannon-Nyquist theorem prevails; in partieular, we have the reeonstruetion formula (58). The quantity sine (2Bt - n) therein
is of the order of 1/ n in absolute value and of altemating sign. Therefore, the speed
of eonvergenee of the series on the right-hand side is, roughly, eomparable to that
of
L (_1)n
nEZ
(..!!:.-).
2B
In order to aeeelerate eonvergenee, one ean use oversampling in the following way.
Assurne that supp(s(v)) is eontained in the frequeney interval [- W,
some 0< W < 00. In formula (57) ofTheorem B2.1, ehoose
B
+ W]
for
(62)
(1 +a)W
for some a > 0 and take any integrable function T(v) such that
T(v)
=1
ifv
[-W, +W].
(63)
= s(v).
JEZ
Therefore, ifwe sampIe at arate 2B largerthan the Nyquistrate 2W, and then filter
the resulting train of impulses with a filter of impulse response h(t), we obtain,
after division by 2B, the signal
_1 '"'
2B L::,
nEtL.
s(..!!:.-)
h(t _ ..!!:.-)
2B
2B '
which is a replica of the original signal s(t) provided the frequency response of
the filter verifies condition (63).
EXERCISE
v+B
T(v) = B _ W 1[-B,-w]
+ 1[-w,+w] +
-v+B
B _ W 1[+B,+w].
Give the corresponding impulse response, and study the rate 0/ convergence 0/ the
series on the right-hand side 0/(60).
The series in the reconstruetion formula can decay faster by choosing a smoother
frequency response T(v), since increasing the smoothness of a function increases
the decay of its Fourier transform.
82
B2. Sampling
B22
This section presents another approach, more direct and with a broader scope,
to sampling. It acknowledges the fact that a signal is a combination of complex
sinusoids and therefore starts by obtaining the sampling theorem for this type of
elementary signals.
Sampling a Single Sinusoid
Consider the signal
set)
= e2iTrAt ,
where A E R This signal is neither stable nor of finite energy, and therefore it
does not fit into the framework ofthe L'- and L 2 -versions of Shannon's sampling
theorem. However, the Shannon-Nyquist formula remains essentially true.
THEOREM
E ~
= L e2iTrnT
e2iTrAt
nEZ
. (rr
sm
- (t - nT) )
rr T
.
(t - nT)
T
+ 1/2T),
and all t
E [ - B,
. (rr
-
+B].
(-1/2T, + 1/2T),
(64)
)
,
(65)
(A - nT)
where the series converges uniformly for all t E [-B, +B] for any B < 1/2T.
The result then follows by exchanging the roles of t and A.
Let g(t) be the 1/2T-periodic function equal to e2irrt on (-1/2T, +1/2T].
The series in (65) is the Fourier series of g(t). We must therefore show uniform
pointwise convergence of this Fourier series to the original function.
Without loss of generality, we do this for the Fourier series of the 2rr-periodic
function equal to e iat on (-rr, +rr], where the convergence is uniform on any
interval [-c, +c] C (-rr, +rr). By (39) ofSectionA31, it suffices to show that
lim
ntoo
+Tr
leia(t-s) - eiat 1
-Tr
sin((n + .! )s)
2
sin(s /2)
ds
=0
ntoo
83
=L
set)
Yke2ilrVkf,
(66)
k=!
where Yk
C,
Vk E
(67)
+N
(~(t
7/(t -
- nT)
nT)
(68)
B2.1. With a single sinusoid check that you realty need the strict
inequality in (67).
EXERCISE
o<
B <
+ B],
where
00,
set) = [
e2ilrVf J.L(dv).
(69)
[-B.+B]
=L
lR,
sin
s(nT)
nEZ
Proof"
(!f
(t -
7T:
T (t -
nT)
(70)
nT)
Sinee ,Lis finite and the eonvergenee in (64) is uniform in A E [-B, +B],
set)
=[
{L
[-B,+B]
=L
nEZ
nEZ
{[
[-B,+B]
e2ilrvnT
sm
- (t - nT
. (7T:
7T: T
- (t - nT)
T
e2ilrvnT J.L(dv) }
)}
J.L(dv)
. (7T:
sm
- (t - nT))
7T: T
.
- (t - nT)
T
This c10ses for the moment our study of the Shannon-Nyquist sampling theory.
It will be eompleted in Seetion B32 by the theorem of equivalenee of analog and
digital filtering, and in Seetion C22 by the L 2 -version of the sampling theorem.
H2. Sampling
84
B23
Intersymbol Interference
As a further illustration of the weak Poisson sum formula, we consider the problem
of intersymbol interference in digital communication. It does not belong to the
Shannon-Nyquist sampling theory, however, it does concern sampling.
Pulse Amplitude Modulation and the Nyquist Condition
In a certain type of digital communication system one transmits "discrete" information consisting of a sequence {an}nEZ, of real or complex numbers, in the form
of an analog signal
s(t)
= I>n g(t -
nT),
(71)
nEZ
where g(t) is areal or complex function (the "pulse"). Such a "coding" of the
information sequence is referred to as pulse amplitude modulation. Here, T > 0
determines the rate of transmission of information and also the rate at which the
information is extracted at the receiver.
B2.2. Assume that g(t) is a stable signal with FT g(v) and that an is
stable, with transferfunction A(z). Show that s(t) is stable, andgive its FTin terms
of g(v) and A(z).
EXERCISE
s(kT) =
L ang(kT -
nT),
nEZ
that is,
akg(O) +
L ak-j g(jT).
jEZ
NO
If one only wants to obtain ak from the sampie s(kT), the term
L ak_jg(jT)
jEZ
NO
g(jT)
=0
for all j
i= O.
(72)
The weak version of the Poisson sum formula of Section A22 is actually all
that is needed to prove the result. Indeed,
THEOREM B2.6. Let g(t) be a continuous and integrable function, and assume
that its FT g( v) is in LUIR). The following two conditions are equivalent:
85
= Oforall j E Z, j f. 0;
(b) LnE:d{v + !f) = const. almost everywhere.
(a) g(jT)
Proof By the weak version of the Poisson sum formula of Section A22,
Tg( - nT) is the nth Fourier coefficient ofL g(v +n/T) (Note that the continuity
condition on g(t) is used here.) Therefore, if (b) is true, then (a) is necessarily
true. Conversely, if (a) is true, then the sequence {Tg( - nT)}nEZ is the sequence
of Fourier coefficients of two functions, the constant function equal to T g(O),
and LnEZ g( v + n / T), and therefore the two functions must be equal almost
everywhere.
T'
(74)
2W=2B~ ~
T
In this case, there is no other choice for the corresponding pulse than
I
g(v) = 2B 1[-B,+Bj(v),
that is,
sin(27r Bt)
(75)
.
27r Bt
One dis advantage of such a pulse is linked to questions of numerical stability.
Indeed, let us assurne that the sampling of s(t) is not carried out at the time kT but
at the time kT + ~, where ~ > O. We obtain
sin(27r B~)
sin(27r B~)"
s(kT +~) = ak 27rB~
+ f;;;oa k- j 27rB(~ _ jT)'
g(t) =
+~)
- akl
(76)
does not stay bounded for all bounded sequences {ad, because
I . I
L
NO I~ - JT
=00.
(77)
86
B2. Sampling
= sine (2Bt)
eos(27f Bt)
1 _ 16B 2 t 2
'
(78)
whose Fr is
g(v) = eos 2(7fV)
4B 1[-2B,+2Bj(v).
A
(79)
In fact,
sekT
+ D.) =
sine (4BD.)
ak 1 _ 16B 2D. 2
'"
sin(47fBD.)
f#oa n- j 47f B(D. - jT)(1 - 16B2(D. - jT)2)'
and the error (76) is seen to remain bounded whatever the bounded sequenee {ad.
Partial Response Signaling
Another disadvantage of the pulse (75) is that one eannot realize signals with an
Fr that has an "infinite slope" (at - B and
+ B).
We shall see that, with clever encoding, we ean attain the Nyquist limit (74)
(which says that in order to transmit a "symbol" an every T seeonds without
intersymbol interferenee, a bandwidth of at least 2W = 2B ~ 1fT is needed),
without resorting to an unrealizable pulse (with a very large slope).
For example, in the duobinary encoding teehnique, instead of transmitting (7),
one transmits
S'(t) = L(an
+ an+l)g(t -
nT),
(80)
nEZ
that is,
S'(t) = Lang'(t - nT),
(81)
nEZ
where
g'(t) = g(t)
+ g(t + T).
(82)
With the pulse (75) ofminimal bandwidth 2B, starting from (80) we obtain
s'(kT)
= ak + ak-l =
Cb
and from the sequence {Ck} and the initial datum ao we recover the sequenee {ak}'
The interest of this teehnique is that we do not seek to implement Si (t) in the form
(80) using the unrealizable pulse g(t), but rather in the form (81) with a realizable
pulse g'(t). Indeed,
8'(V)
= (1 + e-2iJrvT)g(v)
= 2T eos(7fvT)e-2iJrvT 1[-B,+Bj(v).
87
This pulse has minimal bandwidth 2B, and, furthermore, it is easier to realize, not
having an infinite slope.
The above is a particular case of the technique of partial response signaling. 3
The general principle is the following: We pretend to use the unrealizable pulse
g(t) given by (75), but in (71) we replace the symbol an by an encoding Cn , say, a
linear encoding
(83)
which gives
S'(t)
= I>ng(t - nT).
nEZ
= .~:::>ng'(t - nT),
nEZ
where
g'(t)
(84)
= T(v)g(v),
(85)
where
k
(86)
j=!
and
k
P(z)
= 1 + LYjz j .
(87)
j=!
= kT we obtain
s'(kT) = p(z)ak = Ck.
We shall see in Seetion B32 that the sequence fad is deduced from the sequence
{ Ck} by inverse filtering
ak =
1
P(z) Ck
(88)
(we assume that 1/ P(z) is stable and therefore that the corresponding filter is
causal; these notions are discussed in detail in Section B32).
3See A. Lender (1981), Correlative (Partial Response) Teclmiques and Applications
to Radio Systems, in Feher, K. (ed.), Digital Communications: Microwave Applications
(Prentice-Hall: Englewood Cliffs, Ni), Ch. 7..
88
B2. Sampling
qJ(t)o(t) dt
= qJ(O),
for all funetions qJ(t). They are aware that there exists no such funetion in the
usual sense with such property, and they take the above formula as a symbolie
way of dealing with a limit situation. In the "prelimit," o(t) is replaeed by a proper
funetion, depending on a parameter, say, n. There are many ehoiees for this proper
funetion on(t), the simplest one being
on(t)
= nree~(t).
ntoo
JIR
= qJ(O).
Thus, in this point of view, the Dirae funetion is the limit of proper funetions
beeoming more and more eoneentrated around the origin of times while their
integral remains equal to 1. Another eandidate with these properties is the Gaussian
pulse that we have already eneountered in the proof of the inverse Fourier formula:
ha(t)
,2
= - - e--,;;z,
a"fEi
where this time the positive parameter a tends to zero. Observe that the Fr ofboth
on(t) and ha(t) (whieh we have previously eomputed) eonverge pointwise, as the
4Theorie des Distributions, Vols. 1 and 2,1950-1, Hermann, Paris.
89
8(t)e-2irrvI dt
= e-2irrvO = 1.
nT).
8(t -
neZ
n dt
8(t)e- 2''" TI
= 1.
-1
Le
2irr!!.v
T
neZ
'
1 +N
Le
2irr!!.v
-N
'
1 sin(2rr(N + T)
T
sin(rrT)
GraphieaIly, up to a multiplieative faetor 1/ T, such a funetion looks in the vicinity
of 0 like a Dirae funetion: As N ~ 00, it becomes more and more coneentrated
around 0, and its integral in a neighborhood of 0 tends to 1. Therefore, at the limit
we have, invoking the 1/ T -periodicity, the Fourier transform of the Dirac comb
~T(V) = ~ L 8 (v - !!..) .
T
neZ
This overdose of heuristics may weIl be fatal for the more critical mind. However,
in most basic courses in signal analysis, it is administered with the best intentions,
with the exeuse that it saves the student from a painful exposition to distributions
theory. This apology of mathematical euthanasy is founded on wrong premiees.
The first question that one should ask is: Do we need the Dirae comb in signal
analysis? Looking back at the previous chapters, we ean immediately answer NO.1t
is not needed to derive the Shannon-Nyquist theorem, because the Poisson formula
is all that is needed there. Is the Poisson formula harder than distributions theory?
Again, the answer is NO, without surprise, because the distributions theory version
90
B2. Sampling
of the Poisson sum formula is only a small ehapter of distributions theory. (I shall
add that the heuristie derivation of the Poisson sum formula-see the eomment
following the statement of Theorem A2.3 of Chapter l-is mueh more eonvineing
than the usual heuristie derivation of the Fr of the Dirae eomb.)
In fact, the reader may skip this ehapter and proeeed to Chapters 3 and 4 without
damage. On the other hand, the Fourier transform of the Dirae eomb is part of a
well-established tradition in signal analysis that is bound to be etemal due to its
aesthetie appeal. I have therefore devoted the next seetion to the expression of the
classical results of Fourier analysis in the Dirae formalism. It is, however, a purely
symbolie analysis.
The Dirac Generalized Function
The principal formal objeet of the Dirae formalism is the Dirae generalized funetion
8(t), and the first formal rule is the symbolie formula
qy(t)8(t - a)dt
EXAMPLE
= qy(a).
(Dl)
e-2iJrvt8(t - a)dt
= e-2iJrva,
that is,
Fr
8(t ...,.. a) -+
e- 2IJrva.
EXAMPLE
e2iJrvt xCv) dv
=L
nEZ
J[{
e2iJrvtxn8(V -
-f)
= LXne2iJrft,
nEZ
dv
91
EXAMPLE B2.3. Let x(t) be as in the previous example. /fit is the input of afilter
with (stable) impulse response h(t) and withfrequency response T(v) = h(v),
symbolic calculations give for the output
y(t) =
h(v)x(v)e2iJrvt dv
nEZ
that is,
The sequence
A(n)A
T xn,
Yn = h
A
L 8(t - nT).
nEZ
The seeond symbolie fomula, that we now introduee, gives the FT of this
generalized funetion:
(D2)
EXAMPLE
B2.4.
equality
cp(t)I:lT(t) dt =
qJ'(v)Lr;:.(v) dv
= ~L
nEZ
(v - -f)'
Multiplication Rule
The third symbolie formula of the Dirae formalism eoneerns the multiplieation of
a Dirae generalized funetion by a funetion in the usual sense:
s(t)8(t - a)
== s(a)8(t - a).
(D3)
92
B2. Sampling
EXAMPLE
= s(a)cp(a) =
s(t)8(t - a)cp(t)dt
s(a)8(t - a)cp(t)dt.
Si(t)
= Ls(nT)8(t -
nT)
netz:
that is,
Sie v)
"~(
= -1 ~
s vT netz:
= LS(V netz:
n) .
-f)
l[-t,+tl(v).
set)
= ( L h(t -
nT)) J(t)
netz:
V(fL)J(V - fL)dfL
= h(v)~r(v)
=~
T
L h(v)8(v netz:
!!.-)
T
93
Thuswe have
The examples above show how the Dirae symbolie ealeulus formally aeeounts
for ea1culations of Fourier transforms. This symbolie ea1culus retrieves formulas
already proven in the framework of the classical Fourier theory in LI, formulas
that have been proved under eertain eonditions of regularity, and of integrability
or summability. The symbolie ea1culus does not say under what eonditions the
final symbolie formulas have a meaning, nor in what sense they must be interpreted (equalities almost everywhere? in LI?). For this reason, the Dirae symbolie
ea1culus must be used with preeaution. From a mnemonic point of view, it ean
be useful, as it allows one to obtain some formulas very quiekly, and "generally"
these formulas are eorreet under eonditions that are "almost always" satisfied in
praetiee.
However, let us emphasize onee more the fact that these formulas have been
obtained rigorously within the framework of Fourier transforms in LI.
B3
Digital Signal Processing
B31
TheDFT
Suppose we need to compute numerically the Fr of a stable signal s(t). In practice
only a finite vector of sampies is available,
s = (so, ... , SN-t>,
0/ s
(so, . .. ,SN-i) is
Wk
the vector S =
N-i
Sk =
sn e - i (21rkn/N).
n=O
The DFr is an approximation of the Fr, the quality of which depends on the
parameters N and .1.. The first question to ask is: How to choose these parameters
to attain a given precision? As we shall see, the answer is given by the Poisson
sum formula. For the time being, we shall give the basic properties of the DFr
without reference to a sampled signal.
Let a = (ao, ... , aN-i) be a finite sequence of complex numbers. For the Nth
root of unity, we adopt the following notation:
96
(89)
Am = Lanw';rn
n=O
is the DFT of a = (ao, ... , aN-d.
THEOREM
1 N-I
N LAmw'Nmn .
m=O
(90)
Proof'
N-I
N-I
~
m(k-n)
= L...t ak L...t w N
.
k=O m=O
~
But if k =1= n,
N-I
~
m(k-n)
L...t w N
=
m=O
since
WN(k-n) _ 1
N
w Nk-n
=0
N-I
L w~(k-n) = L I = N.
m=O
m=O
If we consider the periodic extensions of the finite sequences a = (ao, ... , aN - d
and A = (Ao, ... , AN-d, defined by
an+kN
= an,
(91)
= w';rn.
The sequences an and Am being N -periodie, the domains of the sums (89) and
(90) can be shifted arbitrarily. In particular, with N = 2M + 1,
+M
Am = L an w';rn ,
n=-M
an =
L A mWN-mn .
2M + 1 m=-M
+M
(92)
In the sequel, we use the above periodie extensions. The relation between the
sequences {an} and {Am} will be symbolized by
(93)
We observe that
(94)
B3.2.
multiplication rule
If an
97
(95)
ProoJ-
=r
gives
N-I
N-I-k
"~ b n-kwN
mn = w mk"
mr
N ~ b rWn
n=O
r=-k
With an
(97)
1 N-I
L
lall = N L IA
k=O
m=O
m 2
(98)
98
where one unit corresponds to one multiplication. The fast Fourier algorithm, 5 , also
called the fast Fourier transform (FFT), considerably reduces the computational
complexity. It is based on the following remark.
Let an ~ Am be a DFT pair (note that we are considering a DFT of order 2N
2N
(the latter DFTs are of order N). A direct calculation shows that
B m, Cm+N
Cm, and
m+N
W 2N
(99)
-wm
1
2N , we can sp lt
(O::s m ::s N - 1)
(100)
and
(101)
+N
- 1 = (N - 1)(2N - 3)
units instead of (2N - 1)2 for the direct method. If we have to calculate a DFT of
order N such that
N = 2s ,
(102)
the FFT will take F(N) ::S N 10g2 N computational units. The result is obtained
by induction. Indeed, F(2) = 1, and the considerations above show that
F(2N)
= 2F(N) + N
- 1 ::S 2F(N) + N.
The gain in computational complexity with respect to the direct method is thus
of the order of
1 10g2 N
--2 N
5Cooley, J.w., Lewis, P.A.w., and Welch, P.D., The Fast Fourier Transform Algorithm, eonsiderations in the ealeulation of sine, eosine, and Laplaee transforms, 1. Sound
Vibrations, 1970, 12(3),315-337.
99
The above discussion just gives the basic idea of the FFT. For a detailed account
of the algorithmic aspects of the discrete-time Fourier transform, see, for instance,
[B8]. We now turn to the numerical issues behind the DFf.
Numerical Analysis of the DFT
The Poisson sum formula is useful in numerical analysis when approximating a
Fourier integral by a Darboux sum, and this is of course related to the finite Fourier
transform.
Let us recall the Poisson sum formula, assuming that the conditions of validity
are satisfied:
(103)
The expression (103) elucidates the relation between the Fr s(v) of the signal
s(t) and the DFr ofits sampled and truncated version (s( - M ll), ... , s( +M ll,
+M
s(nll)e-2irrn2.J+,.
n=-M
In fact, letting v
Ls(nll)e-i2rr2~~' = ~ LS(~ +
II
nEZ
THEOREM
nEZ
II
(2M
+ l)ll
(104)
).
B3.3. Let s(t) be a signal with support contained in [-M ll, +M ll].
We then have
n~M
Proof:
s(nll)e-2irr 2!:t:,
II
fas
II
(2M
k)
+ l)ll
(105)
If the terms corresponding to the indices n =1= 0 in the right-hand side of (105)
were null, only the central term
~(
II s (2M
+ l)ll
would remain. The DFf of (s( -M ll), ... , s( +M ll would then be a sampled
version of the PT, that is,
( II1~s(-Mvd,, II1~)
s(+MvI) ,
where VI = Ij[(2M + l)ll]. But one cannot have a signal s(t) with bounded
support which has FT s(v) also with bounded support. There will thus always be
an error, equal to
100
This error is the aliasing error. It can be controlled by choosing t:.. small enough for
= [-1/2t:.., 1/2t:..]. But then
M must be adjusted so that s(t) remains zero outside [-M t:.., +M t:..]. Increasing
M increases the computational complexity.
(106)
Band T are chosen such that s(t) is negligible outside [-T /2, +T /2] and s(v) is
negligible outside [- B, + B]. Precision requires large T and large B, in order to
capture a large amount of the time-frequency content of the signal. This results in
large complexity (measured by N) of the DFT. This in turn requires sophisticated
algorithms such as the FFT in order to reduce the computationalload.
B32
The Z-Transform
DEFINITION
(107)
= LXke-ikW.
(108)
kEZ
Proof"
= -1 1+11" i(w)e .
lnW
2JT
dw.
(109)
-11"
EXERCISE
i(w)
= 2Bs~(W 2B)
2JT .
(110)
101
B3.2. Give the impulse response of the filter with frequency response
exp(cos(w))ei sin(w).
(111)
ifn = 0,
otherwise,
(112)
and, in particu1ar,
L
IXkllhn-kl <
00
for al1 n
Z.
kE71
(113)
The filter is called causal because if the input Xn is zero for n .::: no the output
Yn is zero for n .::: no. The input-output relation (111) takes, for a causal filter, the
form
n
Xkhn-k.
(114)
= Lhne-inW
(115)
Yn = L
k=-oo
DEFINITION
nE71
is the frequency response of the convolutional filter with stahle impulse response
hn.
102
Ifwe write i(w) and y(w), respectively, for the Fourier sums ofthe input X n and
the output Yn, the input-output relation (6) reads
y(w) = h(w)i(w).
(116)
Indeed,
y(w)
LYne-inW
nEZ
"
" h n-ke -inw
~~Xk
nEZ kEZ
_" 1
h - -i(n-k)w
Xk e -ikw "
~ n-k e
kEZ
nEZ
EXAMPLE B3.1
Yn
defined by
= Xn-k
Yn
hn
1o
1 ifn
ifn
= k,
=1=
k,
= e- ikw .
1
Yn
= 2N + 1
k=-N
Xn-k
hN(w)
where h(O)
1 sin{(N + !)w}
= - - - ------'=----2N + 1
sin{w/2}
103
Let s(t) be a stable continuous signal, base-band (B), sampled at the Nyquist
frequency 2B. We obtain the sampled signal
S
. n s(v)dv
( - n) = j+B e 2lJrV2Jj
2B
-B
= _1 j+Jt eiJtW2BS(!!.. w) dw
2Jr
-Jt
Jr
~ Is (2~) I <
00,
(117)
(118)
Prao!"
2~ t; h (2~ )x(n 2~ k )
Yn =
Hence we have
Yn
j+Jt 2Bh~ (B
-Jt
Jr
= -1
2Jr
W)
. dw
x (B
- w ) e lnW
Jr
and therefore Yn
y(nj2B).
h(t - s)x(s)ds,
+B
-B
(119)
e2iJtvtx(v)h(v)dv,
104
Transfer Functions
H(z)X(z).
(122)
Note, however, that the z-transform of a signal only takes a meaning as a function
of z E C if one gives the domain of convergence of the series defining it.
We use the unit delay operation z defined symbolically by
iXn = Xn-k
Lhk(ixn)
nEZ
(123)
In some cases (see the examples below) a function H(z) holomorphic in a ring
{rl < Izl < r2} containing the unit circle {Izl = I} is given. This function defines
a convolutional filter whose impulse response h n is given by the Laurent expansion
(see [B6], Theorem 1.22, p. 53)
H(z) = L
hnz n
(124)
nEZ
Recall that the Laurent expansion is explicitly given by the Cauchy formula
hn = - 1
-H(z) dz,
2irr c zn+l
(126)
105
where C is a c10sed path without multiple points that lies within the interior of the
ring of convergence, for example the unit circ1e, taken in the anti-c1ockwise sense.
The method of residues can be used to compute the right-hand side of formula
(126). This equality also takes the form
(127)
The integral in (126) can also, have been computed by the method of residues:
If C is a simple c10sed contour on which f is analytic, except for a finite number
of isolated singular points Z I, ... , ZN, then
1. f(z)dz = 2irr
tab
k=1
where ak is the residue of f at Z = Zk (see [B 1], Chapter 4, pp. 207 and following).
This is the Cauchy residue theorem. In the case where f has a pole of order m at
Z = Zk, the residue at this point is given by formula
ak
EXERCISE
= (m
1
dm- I
_ I)! dz m- I [f(z)(z - Zk)m-I]lz=Zk
B3.4. Compute
1.
3z + 1 d
~ z(z - 1)3 Z.
H(z)
HI(Z)
1 - H I (Z)H2(Z)
This filter will be a convolutional filter if and only if this frequency response is the
FT of a stable impulse response.
106
Xn
Yn
Xn
.cl
.cl
The filter
is the forward loop filter, whereas
is the feedback loop filter.
The forward loop processes the total input, which consists of the input X n plus the
fed-back input, that is, the output Yn processed by the feedback loop filter.
EXERCISE
hn =
(~r
l{n:::O).
P(z)
= 1 + L:>jzj,
Q(z)
j=l
= 1 + Lbel
(128)
e=l
r2
rl
The function
H( ) = Q(z)
z
P(z)
(129)
107
is holomorphic in the ring Cr j,r2 = {rl < Izl < r2} (in the open disk {Izl < r2}
if rl = 0) which contains the unit circle since r2 > 1. We thus have a Laurent
expansion in Crj ,r2
(130)
H(z) = I>nzn,
nEZ
which defines a filter with stable impulse response h n and frequency response
Q(e- iW )
P(e-' W )
(see [BI], Section 3.3, or [B6], Theorem 1.22, p. 153).
H(e iW )
EXAMPLE
(z -
yy
First Case: lyl > 1. The ring 0/ convergence is defined by r2 = Iyl and rl = 0
(thus, in/act we have a disk 0/ convergence {Izl < lylJ that contains the unit
circle). The Laurent expansion is in this case a power-series expansion in the
neighborhood 0/ zero
z- y
y
Y
y2
and there/ore,for Izl < y,
(-Ir-l(r - I)!(z _ y)-r-l
L n(n y n=r-l
=- -
yn
00
1) ... (n - r
= _ 2. ~ (j + ~ yko
}!
zn-r+l
+ 2) - - ,
yn
Finally,
(z -
yY
(- IY
~ (j + r - I)!
1 ~
'1'
(
1)
yr r- . j=O
}.yJ
= 0 if n <
= (-Ir
O.
Z,
(n + r -1)! (2.)r+n,
n!(r - I)!
Y
n 2: 0,
Izl
< y,
108
into
(z - r)'
l/ s,
nO":O
Isl< - .
Irl
(n - I)!
(r - 1)!(n - r)!
-n-r
,n
r,
Q(e-iw)x(w).
Now P(e-iw)y(w) is the Fourier surn of the signal Yn + L:~=l ajYn-j, and
Q(e-iw)x(w) is the Fourier surn of X n + L:i=l bexn-e. Therefore,
Yn
j=l
e=l
+ LajYn-j = X n + Lbexn-e,
(131)
or, syrnbolically,
P(Z)Yn
= Q(z)xn.
The general solution of the recurrence equation (131) is the surn of an arbitrary
solution and of the general solution of the equation without right-hand side
p
Yn
+ LajYn-j
j=l
= O.
This latter equation has for a general solution a weighted surn of terms of the form
r(n)p-n,
where p is aroot of P(z) and r(n) is a polynornial of degree equal to the multiplicity
of this root minus one. If we are given X n , n E Z, and the initial conditions
Yo, Y-l, ... , Y-p+l, the solution of (131) is cornpletely deterrnined.
109
In order that the general solution never blows up (it is said to blow up if
limlnltoo /Yn/ = (0) whatever the stable input X n, n E Z, and for any initial conditions Y-p+l, ... , Y-l, Yo, it is necessary and sufficient that all the roots of P(z)
have modulus strict1y greater than unity.
A particular solution of (131) is
Yn
=L
k::::O
hkxn-k .
The output Yn is stable when the input X n is stable since the impulse response h n
is itself stable, and therefore Yn does not blow up.
Therefore, we see that in order for the general solution of (131) with stable input
to be stable, it is necessary and sufficient that the polynomial P(z) has all its
roots with modulus strict1y greater than 1.
Xn
B3.1. The rational filter Q(z)/ P(z) is said to be stable and causal
P(z) has all its roots outside the closed unit disk {/z/ ::: I}.
DEFINITION
if
Causality arises from the property that if P(z) has roots with modulus strictly
greater than unity Q(z)/ P(z) = H(z) is analytic inside {/z/ < rz} where rz > 1.
The LaUfent expansion of H(z) is then an expansion as an entire series H(z) =
Lk::::O hkz k, and this means that the filter is causal (hk = 0 when k < 0).
DEFINITION
invertible
B3.2.
if Q(z) has all its roots outside the closed unit disk {/z/ :::
In fact, writing the analytic expansion of P(z)/ Q(z) in the neighborhood of zero
as Lk::::O WkZ k, we have
that is,
X
B33
n=
L WkYn-k .
(132)
k::::O
All-Pass Filters
A particular case of a rational filter is the all-pass filter.
THEOREM B3.3. Let Zi (l ::: i ::: L) be complex numbers with modulus strictly
greater than 1. Then the transfer function
L
*_ 1
H(z)=n~
i=l
Z - Zi
(133)
110
satisfies
IH(z)1
Proof"
<I
if Izl < 1,
= 1
iflzl = 1,
> 1
iflzl > 1.
(134)
Let
ZZi* - 1
Hi()
Z=--Z - Zi
-~) = -~zlz *
I 2
'
1, using
(135)
which is true for Izl = 1, E <C, =I- O. On the other hand, Hi(Z) is holomorphic
on Izl < Iz;J and IHi(O)1 = Iz;J-l < 1. Therefore, we must have IHi(z)1 <
1 on {Izl < Iz;J}' otherwise the maximum modulus theorem for holomorphic
functions would be contradicted. (Recall the maximum modulus theorem: If I is
analytic in a bounded region D and III is continuous in the closure of D, then III
takes its maximum on the boundary of D; see [BI], Theorem 2.66, p. 97, or [B6],
Theorem 1.21, p. 51.) Observing that
IHiCI* ) 1= IHiI(z) I
we see that the resultjust obtained implies that IHi(z)1 > 1 if Izl > 1.
A filter with frequency response H(e- iw ) is a pure phase filter, or all-pass filter,
by definition.1t is called all-pass because its gain is unity: IH(e-iW)1 = 1.
Consider a signal X n such that
O-::;.n-::;.N,
otherwise.
It can be represented by its polynomial z-transform
N
A(z) =
L anZn .
n=O
Let Zl,
Z2, ... , ZN
= aN
n(z N
Zj).
j=l
The effect offiltering X n with an all-pass filter (zrz - I)/(z - Zl) is to replace the
factor z - Zl in A(z) by zrz - 1, but it does not change the energy of the signal.
111
* = A(z) zlz
Z - Zl
is such that
and therefore,
Thus,
N
n=O
la n l2 =
L Ib l
(136)
n 2
n=O
At a time 0 ::::: k ::::: N the two signals (ao, ... , aN) and (b o, ... , b N) have already
dissipated the energies
k
Ea(k)
= L lajl2
and
Eb(k)
j=O
=L
Ibj
j=O
l2 .
B(z)
= (zrz -
I)F(z),
where
F(z)
we have
where I-I
subtracting yields
and therefore,
(137)
This shows that if Izii < 1, then (ao, ... , an) is always late with respect to
(b o, ... , bN ) in dissipating its energy.
112
Fejer's Lemma
EXERCISE B3.2. Let X n be a stable signal with z-transJorm X(z). Define its
autocorrelationfunction Cn by
cn =
Xn+k X;
kEZ
where
R(z) = X(z)X(z)*.
The 2:rr-periodic function R(e- iUJ ) in the above exercise has the following
properties:
i:
(138)
n
(139)
00.
(140)
The next result is Fejer's lemma, which is also called the spectralJactorization
theorem.
'THEOREM B3.1. Let R(z) be a rational Jraction in z with complex coefficients
such that (138) and (139) are satisfied. Then there exist two polynomials in Z with
complex coefficients, P(z) and Q(z), and a constant c 2: 0, such that P(O)
Q(O) = 1 and
(141)
Moreover, one can choose P(z) to be without roots inside the closed unit disk, and
Q(z) to be without roots inside the open unit disko
Proof"
= az mo T1 (z -
Zk)m k ,
kEK
zDm k = a*(z-I)m o
kEK
T1 (Z-I -
kEK
C and ro
Z such that
T1 ( z - --;Zk1 )m
kEK
ZDmk.
Z. If
113
Therefore, if Izi = 1,
Two rational fractions that coincide when Izi = 1 coincide for all Z E <C. In
particular, a = b, and whenever we have in R(z) the factor (z - zd with IZk I i= 1,
then we also have the factor (z - 1.). We therefore have
Zk
= bzro n(z -
ZjYj
(z _
jE]
eEL
Using Fejer's identity (135), we therefore find that R(z) can be put under the form
R(z)
= ciIG(z)1 2 ,
where
G(z)
= n(z jE]
Zj)Sj
n(z -
ze)".
tEL
The function R(e- iw ) can remain real and nonnegative if and only if c ~ 0 and
d = O. Finally, we can always suppose that IZj I < 1 for all j E J (a root Zj is
paired with another root l/zj).
EXERCISE
5 - 2cos(w)
1 Q(e- i "')
----)=c
.
3 - cos(w)
P(e-'''')
The proof ofTheorem B3.1 can be specialized to obtain that for any polynomial
p(z) such that p(e- i"') ~ 0 for all w E lR, there exists a polynomial A(z) with
A(O) = 1 and no root inside the closed unit disk, and a constant c ~ 0, such that
p(e- i"')
= cIA(e- i"')1 2
Looking at the proof of B3.1, we see if there exist another polynomial B(z) with
B(O) = 1 and a constant c' ~ 0, such that p(e- iw ) = eil B(e- i "')1 2 , then c = c' and
B(z) = H(z)A(z)
B4
Subband Coding
B41
Let x(t) be a stable base-band (B) real signal that we seek to analyze in the
following sense. For fixed N = 2k we wish to obtain for all I ::::: i ::::: 2k the signals
Xi(t) with Fourier transforms
i- I
= [T
B '2 kB .
From a theoretical point of view the problem is stated with its solution: For each
i, do no more than filter x(t) with a pass-band filter offrequency response IB/v)!
From the practical point of view of digital processing, in the sampIe domain, an
ideal band-pass filter has an infinite impulse response-actually one with rather
slow decay-and this makes the above pure band-pass filters of poor value from a
numerical point of view.
A solution consists of replacing the pure band-pass filters by approximations
with "good" impulse responses, and if possible finite impulse responses (FIR).
However, FIR filters with short impulse response have in general a poor frequency
resolution, and therefore the analysis will not be satisfactory without a careful
choice of the approximate band-pass filters. One also requires perfect synthesis,
P. Brmaud, Mathematical Principles of Signal Processing
Springer Science+Business Media New York 2002
116
that is,
2k
x(t) = LXi(t),
i=l
where Xi (t) is obtained from x(t) by approximate band-pass filtering on the band Bi.
This means that leakage between contiguous bands must be mutually compensated.
The above is a summary of the numerical problem associated with subband
decomposition of a signal by a filter bank. The second problem is algorithmic:
How to perform efficiently analysis and synthesis? The standard example of an
efficient algorithm is the FFf, which involves successive splitting, and subband
decomposition is another avatar of this idea: The basic block of the algorithm consists of splitting a given band in two, that is, of solving the subband decomposition
problem for N = 2.
Subband coding is one way of performing data compression. Instead of sampling the original signal and then quantifying the resulting sampies with a view
of digitizing them, one performs the sampling and quantifying operations on each
of the outputs Xi(t). If a subband Bi is deemed unimportant it will be allocated
fewer compression resources, that is, only coarsely quantified. The appraisal of
the importance of each subband is generally based on psychological experiments.
The subjective difference between subbands is very marked in two-dimensional
signal processing, where it has been observed that low-frequency components are
the most important from a subjective point of view.
The Basic Algorithm
Since all signals and filters considered in the present chapter are real, we need only
consider positive frequencies, those in the frequency band [0, B]. Ideal splitting of
the frequency band [0, B] uses two ideal band-pass filters, one for the band [0, B 12]
and the other for the band [BI2, B]. We call To(v) and Tl(V) their frequency
responses. Then, as the Shannon-Nyquist theorem suggests, we sampie each output
at rate B, and reconstruction is perfo,Pled by t~o ideal band-pass filters, [0, B 12]
and [B 12, B], respectively. We call To(V and Tl (v) their fr~quency responses (of
course, ifweuse ideal pass-band filters, To(v) = To(v), and Tl (v) = Tl (v); wekeep
different notations because in the nonideal case, the analysis and reconstruction
filters need not be the same).
Consider Fig. B4.1. In the ideal case (ideal pass-band filters), the signals in the
upper branch at levels (){ (Xl (t and Y (Yl (t are identical and equal to the original
signal x(t) filtered by the band-pass [0, B 12]. This follows from the theory of
sampling of Chapter B2, and the details of the operations in the lower branch are
shown in Figure B4.2. Similarly, in the lower branch of Fig. B4.1, the signals at
levels ()( (X2(t and Y (Y2(t are identical and equal to the original signal x(t)
filtered by the band-pass [B 12, B].
As we explained ~efore, the ideal band-pass filters will be replaced by approximations To( v) and To( v) that have most of their energy inside the band [0, B 12],
117
J3
-B/2
B/2
-13
-B/2
B/2
13 //\f'......//\f'......//\f'.....
~~
~~
~
0
+B
B/2
,
~
+2B
and Tl (v) and Tl (v) that have most of theirs inside [B /2, B]. We insist once more
on the fact that we do not require that To(v) = To(v) nor that Tl(v) = Tl(V),
because we need some freedom in the choice of To(v) and Tl(v) to guarantee
perfect reconstruction. Analysis of the original signal yields the decomposition
(Xl (t), xz(t, whereas synthesis reconstructs y(t) = Xl (t) + xz(t). Synthesis is
called perfect when y(t) = x(t).
The signal at level a is
Xl(t)
L>(~)
ho(t - ~),
2B
2B
_1
2B JEa.
. ~
where ho(t), h l (t), ho(t), h l (t) are the r~spective imp~lse responses corresponding to the frequency responses To(v), To(v), Tl(V), Tl(V). Sirnilarly the signal
Yl (t) at level y in Fig. B4.1 is
Yl(t)
= ~Xl(~)ho(t-
;).
118
YI(n2B)
LX(~)
ho(~B
2B
= Lk
_1
2B j
- ~).
B
~)
iio(!!.2B
2B
YI(n)
=L
(142)
with a similar expression for the output Y2(t) ofthe lower branch ofFig. B4.1.
Down- and Up-sampling
We shall now express the resuIts in terms of the operations of down-sampling and
up-sampling, and then go back to (142).
Let {xnlnez be a sequence of complex numbers and let m
sequences {Ynlnez and {znlnez defined by
Yn
= Xnm '
= Xn,
=0
nE Z,
N. Consider the
nEZ
and
{ Znm
Zj
For example, with m
if j is not divisible by m.
= 2,
Xo
X2
Xl
Yo
X3
X4
Xs
Y2
YI
and
Xo
Xl
X2
X3
X4
Xs
Zo
Zl
Z2
Z3
Z4
Zs
Z6
Z7
Zg
Z9
ZIO
The sequence {YnlneZ is said to be obtained from the original sequence {xnlnez
by down-sampling by a factor m. The corresponding operation is denoted as m,/...
Up-sampling by a factor m, denoted as mt, is the operation that transforms {xnlnez
into {znlnez.
In this chapter, we are concemed with the case m = 2. For future use, we shall
express the operation of down-sampling by 2 followed by up-sampling by 2 in
terms of z-transforms (see Figure B4.3).
Denote X(z) and R(z) thez-transforms ofthe sequences {x(n)lnez and {r(n)}neZ,
respectively. The sequence {r(n)}neZ is therefore obtained from {x(n)}neZ by
119
@1--_Y--'l(~_)_--1CWI-_~)_r_(n_)
x( n))
XW
RW
r--------------I
" - - - - - - - - ______ 1
ANALYSIS
SYNTHESIS
~ !LX(n)zn + Lx(n)(-zt) ,
nEZ
nEZ
that is,
R(z)
= !{X(z) + X( -
(143)
z)}.
Going back to (142) and the similar expression for the lower branch of Fig. B4.1,
we see that the whole system is equivalent in the z-domain to Fig. B4.4.
From (143) we see that
y(z)
= ! {X(z)Ho(z) + X( +
!{X(z)H1(z)
+ X(Z)Hl( -
Z)}Hl(Z).
! X(z){Ho(z)Ho(z) + H 1(z)H1(z))
(145)
Ho(z)Ho(z)
- (z) = 2.
+ H 1(Z)Hl
(146)
120
B4 2
~o(z)
H I (z)
= Ho(z),
= - Ho( -
(147)
z).
Assume that the filter Ho is symmetrie, that is, it has a symmetrie impulse response
= ho(n), n E Z). Then
(h o( - n)
Ir ho( -
= L (-
n)zn
(symmetry of Ho)
nEZ
= L ( - l tho(n) (~)n
Z
nEZ
Therefore, if Ho is symmetrie,
= Ho(e-i(Jr-w).
This means that the pulsation speetrum of HI is symmetrie with respeet to that of
Ho with respeet to the frequeney n /2. This is why in this ease Ho and H I are said
to be quadrature mirror filters (QMFs).
Going back to (147)-and without assurning that Ho is symmetrie-the perfeet
reeonstruetion eondition (146) beeomes, in terms of Ho:
Ho(d - Ho( - Z)2 = 2.
(148)
One drawback of the solution (145) is the nonexistenee of a finite impulse response
filter Ho satisfying it. However, we ean relax eondition (146) to
Ho(z)Ho(z)
+ HI(z)HI(z)
= 2z k
(149)
6Esteban, D., and Galand, C. (1977), Applications of quadrature mirror filters to splitband voice-coding schemes, Proc. IEEE Inf. Conf ASSP, Hartford, Connecticut, 191-195.
121
for some K ::: 1, which means that we accept a delay of K time units to recover
the input, and in this case FIR filters do exist.
B4.1. Taking the no-aliasing condition (147) into account, the relaxed
condition (149) with K = 1 gives
EXAMPLE
Ho(z)2 - Ho( -
= 2z.
(150)
= -J2 (1 + z).
(151)
1---------------------1
1
I ____________________ J
~---------------------~
ANALYSIS
SYNTHESIS
122
stage 1
stage 2
stage J
stage 1
stage 2
stage J
Ho(z)
1
= -J2
(1 + z),
123
Another Solution
Another class of solutions7 for the no-aliasing condition (145) is
(152)
2.
(153)
= H(z)*
for z
= e-ia>,
+ IHo(- e-i a1 2 =
2.
(154)
L hnzn,
neZ
ja>
mo (w) = ,J2Ho(e- ),
ml (w)
1
ja>
= ,J2Hl(e).
7Smith, M.J.T., and Barnwell, m T.P. (1986), Exact reconstruction techniques for treestructured subband coders, IEEE Transactions ASSP, 34, 434-441.
124
= ei"'mo(w + JT)*,
+ Imo(w + JT)1 2 = 1.
(155)
The solution (152) is in terms of Ho(z), and therefore it suffices to obtain mo(w)
satisfying (155). We seek a finite impulse response filter Ho(z), in which case
mo(w) is a polynomial in e- i "'. We shall in fact look for a solution in the form
1 + ei"')N
mo(w) = ( - 2 L(w),
we have
Mo(w)
+
1+
2N
i", 1
IL(w)1 2
= (cos2 (~))
IL(w)1 2.
= (COS2(~))N P(sin2(~)),
for some polynomial P. Condition (155) must be satisfied for all w, and therefore
it is equivalent to
(156)
for all y E [0, 1]. Since two polynomials identical on [0, 1] are identical
everywhere, the latter equality is for all y E IR.
The polynomials yN and (1 - y)N have no common roots, and therefore, by
Bezout's theorem, there exist two unique polynomials a and b, of degree::: N -1,
such that
(1 - y)N a(y)
+ yNb(y) = 1.
(1 - y)Nb(1 - y)
+ yN a(1 -
y)
= 1.
Therefore, (157) is
(1 - y)N a(y)
+ yN a(1 -
y)
= 1.
(157)
125
Therefore, P(y) = a(y) is a solution of (156). We have thuse proven that (156)
admits at least one solution, and by the uniqueness in Bezout's theorem, this
solution is the only one of degree :s N - 1. We have
N-l
(f+k-l) l + G(yN).
k=O
:s N -
= L (f+k-l) l
N-l
a(y)
k=O
This solution is the unique one with degree :s N - 1. Observe that it is nonnegative
for aH y E [0, 1], and therefore a solution to the initial problem. Call it PN and let
P be the general solution. We have
+ yN (P(1
- y) - P N(1 - y
= 0.
yN Q(y),
= 0,
which implies Q(y) + Q(1- y = 0. That is, Q is symmetrie with respect to 1/2,
and therefore ofthe form Q(y) = R(1/2 - y) for an odd polynomial R.
In summary, the general solution of (156) is
P(y)
= N-l
L (f+k-l) l + yN R
k=O
(1 )
--
(158)
where R(y) is any odd polynomial such that P(y) so defined remains nonnegative
for all y E [0, 1].
Having obtained Mo(w), it remains to extract its square root mo(w). But this can
be done by spectral factorization, using Fejer's lemma.
We shall elose this chapter on the basic principles of subband coding. Note, however, that other solutions were proposed, most notably "biorthogonal solutions,"8
which are more versatile and yield finite impulse response subband filters with
better properties (of symmetry, for instance). We refer to the monograph [B12],
where the reader will find a full and detailed treatment of this topic, as weH as
additional references.
126
References
[BI]
[B2]
[B3]
[B4]
[B5]
[B6]
[B7]
[B8]
[B9]
[BIO]
[B 11]
[BI2]
Part C
Fourier Analysis in L2
Introduction
The modem era of Fourier theory started when the tools of functional analysisin particular, Lebesgue's integral and Hilbert spaces-became available. Fourier
theory then seemed to have reached the promised land, which is called L 2, the
space of square-integrable complex functions, indeed a Hilbert space.
F. Riesz and E. Fischer were the first to study Fourier series in the L 2 framework. 1
Many ideas of the modem theory of Hilbert spaces were already contained in the
work of these two mathematicians, and they had a c1ear view of the geometrie
aspect ofthe L 2-spaces. They were inspired by aseries of articles by David Hilbert
written after 1904 on the theme of integral equations and in which he gives the
properties of 4(Z). Note, however, that the notion of abstract Hilbert spaces made
its appearance much later than one usually believes, in the years 1927-1930, with
the work of John von Neumann, who was motivated by quantum mechanics. 2
In short, a Hilbert space is a vector space H on the field <C (or lR.), with a
Hermitian (or scalar) product, denoted (., .) or (., .) H, and a special topological
property that we shall now briefly introduce. The Hermitian product induces a
norm, the norm IIxll, or IIxIlH, ofthe vector x E H being
IIxll = (x,x}'1.
I
IF. Riesz, Sur les systemes orthonormaux de fonctions, CRAS Paris, 144, 1907,615619; and E. Fischer, Sur la convergence en moyenne, CRAS Paris, 144, 1907, 1022-1024;
Applications d'un theoreme sur la convergence en moyenne, CRAS Paris, 144, 1907, 11481151.
2His theory was published in the reference text Mathematische Grundlagen der Quantum
Mechanik in 1932.
130
This allows us to define a limit in H: We say that lim n-4oo x n = x iflimn-4oo IIxn xII = O. Having this notion of a limit, we have the notion of a Cauchy sequence:
A sequence {xnlnEN in H is called a Cauchy sequence if
lim IIxm - xnll
m,n~oo
= O.
= O.
Note that for any positive integer k, Ck , considered as a vector space on C with the
usual Hermitian product, is indeed a Hilbert space. B ut there are more sophisticated
Hilbert spaces. For instance, L~(lR), the space of functions f : lR --+ C that are
square-integrable:
If(t)1 2 dt <
00.
In L~(lR), one does not distinguish two functions that are almost everywhere equal.
The Hermitian product is
(f, g)
=f
= (f, gh
2 (ITt)
= (
lITt
f(t)g(t)* dt.
= O.
Another example of a Hilbert space is the space of functions f : lR --+ C that are
2rr-periodic, and in L~([ -rr, +rr]), that is, square-integrable on [-rr, +rr):
+Jr
-Jr
If(t)1 2 dt <
L~([-Jr,+Jr]) =
00,
L:
Jr
f(t)g(t)* dt.
In L~([-rr, +rr]), one also does not distinguish two functions that are almost
everywhere equal.
A third example is .e~(Z), the set of complex sequences a = {xnlnEz such that
L Ix l
n 2
nEZ
<
00,
Introduction
131
L
anb~.
neZ
The Hilbert space LUlR.) is a paradise ofFourier transforms, since every function
thereof admits a Fourier transform, and moreover the mapping that associates to a
function its Fourier transform is a bijection from L~(lR.) to itself, and the inversion
formula for Fourier transforms, which gives the latter in terms of the former is
f(t) =
This is not apreeise statement. In particular, the integrals appearing in the definition
of the transform and in the inversion formula are in some extended sense, and the
equality in the inversion formula is "almost everywhere." To be exact,
f(t)
lim jb
a,btoo -a
where the limit is in the sense of L~(lR.). A similar interpretation is needed for the
integral defining the Fourier transform.
The beautiful formula of the L 2- theory is the Plancherel-Parseval's formula
j(v)g(v)* dv
f(t)g(t)* dt,
in other words,
(f, g) L~(IR)
where f, g
= (j, g) L~(IR)'
E L~(lR.).
The above results are stated for the Fourier integral transform, but similar results
hold for the Fourier series of periodic functions: Let f be a 2n -periodic function
square-integrable on [0, 2n]; then it admits the representation
f(t)
=L
cn(f)e int .
neZ
This is the inversion formula for Fourier series. Similarly to the Fourier transform
in L~ (lR.), this equality is only almost everywhere, and the sum has to be interpreted
in an extended sense:
where the limit is in the sense of L~([ -n, +n]). This result is in fact a particular
case of the Hilbert basis theorem, which gives the orthonormal expansion
= L(x, en)en
neZ
132
= lk=n,
and "complete" means that the closure of the vector space consisting of the finite
linear combination of elements of {en }nE!\! is H. (See Chapter C2 for precise definitions.) In this case, the above orthonormal expansion is valid (the series in the
right-hand side converging with respect to the distance induced by the Hermitian
product of H), and moreover, we have Plancherel-Parseval's identity
IIxll 2 =
L 11 (x, e }1I
n
2.
nEZ
The Fourier series development is a particular case of the above very general resuIt,
where H == L~([-x, +x]), and
en(t)
==
~elnt.
",2x
2x
nEZ
where f and gare 2x-periodic functions in L~([ -x, +x]. In terms ofHermitian
products,
Cl
Hilbert Spaces
CII
Basic Definitions
Pre-Hilbert Spaces
DEFINITIONCl.l. Let E beavectorspaceoverC(resp., lR)andlet(x, y) -+ (x, y)
be a mappingjrom Ex E to C (resp., IR) such that,forall x, y E E and all a E C
(resp., IR),
It is then said that Eis a complex (resp., real) pre-Hilbert space with the Hermitian product (resp., scalar product) (., .). The complex (resp., real) number
(x, y) is the Hermitian (resp., scalar) product ojx and y.
In the above definition and in the sequel, 0 represents the zero of IR or C, or the
neutral element of addition in E. The context will remove ambiguity.
From now on, we shall consider complex pre-Hilbert (and later Hilbert) spaces.
The other choice for the scalar field, IR, leads to formally analogous results.
P. Brmaud, Mathematical Principles of Signal Processing
Springer Science+Business Media New York 2002
134
For any x
E, denote
(1)
(2)
(3)
= 4{lIx +
(4)
Consequently, two Hermitian products (., h and (., h on E such that 11 . 111 =
11 . 112 are identical.
THEOREM CI.I. For all x, y
I(x, y)1
:s IIxll
(5)
x lIyll,
C such that ax + y
= O.
Proof" We do the proof for the real case and leave the complex case to the reader.
We may assume that (x, y) =1= 0; otherwise, the result is trivial. For all E lR,
= IIx +
(x, Y)YIl2 ~ O.
:s 0,
and thus the inequality (5) holds. Equality in (5) corresponds to a null discriminant,
and this in turn implies a double root of the polynomial. For such a root, IIx +
(x, Y)YIl2 = 0, that is, by Property (d) in Definition CU,
x
... , X n E
+ (x, y)y = O.
E
= O.
0, and IIxll
E,
135
lIaxll = lalllxII,
IIx + ylI ::: IIxll + lIyll
Proof
+ (y, x)
:::
2l1xllllyll,
11 . 11
IIx - ylI.
(7)
= y,
A metric space is a set E endowed with a distance d. One then says: the metric
space (E, d), or, for short and when the context is sufficiently explicit as to the
choice of the distance, the metric space E. A pre-Hilbert space E is therefore a
metric space for the distance d induced by the Hermitian product.
C1.3. A Hilbert space is a pre-Hilbert space that is complete with
respect to the distance d defined above.
DEFINffiON
Recall that a metric space (E, d) is called complete if any Cauchy sequence in E
converges; that is, if {Xn}n:o:l is a sequence in E such that limm.ntoo d(x n , x m ) = 0,
then there exists x E E such that limntoo d(xn , x) = O.
When considered as a Hilbert space relative to the norm 11 . 11, E will be denoted
H.1f necessary, the notation for the Hermitian product and the norm will explicitly
refer to the space H: We then write (., ')H and 11 . IIH.
C1.1. Let (X, X, f.L) be a measure space. It is proven in the appendix
(Theorem 26) that LUf.L) is a Hilbert space relative to the Hermitian product
EXAMPLE
136
EXAMPLE
4(Z) =
{{Xn}nEZ : X n E
Cforall nE Z and
L Ix l
n 2
< oo}
nEZ
Y}e~(z) = LXnY;.
nEZ
C1.2. Show that if h(t) and x(t) are both in L~(lR), then Y = h * x is
weil defined. Find h E L~(lR) such that IIh 11 = 1 and maximizing y(T)for a given
time T. What is the corresponding maximum?
EXERCISE
Cl 2
Continuity Properties
Closed Subspaces
A subset G is said to be c10sed in H if every convergent sequence of G has a limit
in G.
EXERCISE Cl.3.
137
12 = span {XI,
G
t E
= span {XI, t
Tl.
E T}.
Paraphrasing the above result, we see that X E span {XI> t E T} if and only if X is
the limit in H of a sequenee of finite linear eombinations of elements of {XI, t E T}.
Continuity of the Hermitian product
CI.4. Let H be a Hilbert space over C with the Hermitian product
( " .). The mappingfrom H x H into C defined by (x, y) t-+ (x, y) is bicontinuous.
THEOREM
Proof:
We have
II h l II.IIh2 11-1-0
I(x + h l , Y + h 2 ) -
t-+
(X,
y)1 = O.
EXERCISE C1.5. Let (X, X, fL) be a measure space, where fL is a finite measure.
Let {fn}n ::: 1 be a sequence of LUfL) converging to f. Apply Theorem Cl.4 to
prove that limntoo fL(fn) = fL(f). Give a counterexample ofthis property when the
hypothesis that fL is finite is dropped. (Hint: f = 1[0, I] , fn = (1-1/ n) 1[0, 1] +- .. .)
Show that when fL is finite,
Note that when fL is not finite, G need not be a Hilbert subspaee of L~(fL).
Wavelet multiresolution analysis will provide a speetacular eounterexample.
Isometry Extension Theorem
DEFINITION CI.5. Let Hand K be two Hilbert spaces with Hermitian products
denoted ( " .) Hand ( " .) K, respectively, and let q; : H t-+ K be a linear mapping
such that,for all x, y E H,
(8)
Then q; is called a linear isometry from H into K. If, moreover, q; isfrom H onto
K, then Hand Kare said to be isomorphie.
Note that a linear isometry is neeessarily injeetive, sinee q;(x) = q;(y) implies
q;(x - y) = 0, and therefore,
0=
138
Proof We sha11 first define (l(x) for x EH. Since V is dense in H, there exists
a sequence {xn}ne:! in V converging to x. Since cp is isometric,
== x).
ntoo
= (l(ax + y).
However,
Cl 3
139
Projection Theorem
(9)
forallx
ntoo
H.
THEOREM
(10)
UEG
Proof"
that
{Yn}n~l
be a sequence in G such
Gi :s d(x, Yni
1
:s d(x, G)2 + -.
(*)
Since ~(Yn
+ Ym)
+ IIx -
Ym 11 2 )
411x - ~(Ym
+ Yn)1I 2
G,
therefore,
IIYn - Ymll 2
:s 2 (~ + ~).
The sequence {Yn}n~l is thus a Cauchy sequence in G, and it consequently converges to some Y E G since G is c1osed. Passing to the limit in (*) gives
(10).
IIx - Y'II
= IIx -
Yll
= d(x, G),
140
CL Hilbert Spaces
+ y')
411x _ !(y
!(y + Y')11 2
+ y')1I 2.
E G,
Therefore,
lIy - y'1I 2 ~ 0,
forallZEG.
(x-y,z)=O
IIx -
(y
+ Az)1I 2
:::
=j::.
O. Because
d(x, Gi,
that is,
Since
wehave
- 2A Re {(x - y, z)}
which implies Re {(x - y, z)}
(pure imaginary) leads to
+ A211z 11 2 ::: 0
.for allA
IR,
Im{(x - y, z)} =
o.
Therefore,
(x - y, z) = O.
That y is the unique element of G such that y - x E G.L follows from the
observation made just before the statement of Theorem C 1.6.
Projection Principle
The projection theorem states, in particular, that for any x E G there is a unique
decomposition
x = Y +z,
G,
Z E
G.L,
(11)
141
xeH
The next result features two useful properties of the orthogonal projection operator
Pa
'THEOREM
forall xe H.
Proof:
Pa
= PF. In
where Wj e Gl. (i
= Pa(Xj) + Wj
(i
= 1,2),
= 1,2). Therefore,
Xl
+ X2 =
=
+ Pa(X2) + WI + W2
Pa(xt} + Pa(X2) + W,
Pa(XI)
= y+w,
where W e Gl., y e G: namely, y = Pa(XI + X2). Therefore,
Pa(XI + X2) = Pa(XI) + Pa(X2).
Xl
+X2
= aPa(x)
for all a e G, X e H.
142
Thus PG is linear.
From Pythagoras' theorem applied to x = PG(x) + w,
IIPG(x)1I
+ IIwII 2 = IIx1I 2 ,
and therefore,
Hence, PG is continuous.
() The unique decompositions of x on G and G.L and of PG(x) on F and F.L
are
x
= PG(X) + w,
The next result says that the projection operator PG is "continuous" with respect
to G.
THEOREM CI.S. (i) Let {Gn }n:::l be a nondecreasing sequence ofHilbert subspaces
ofH. ThentheclosureGofUn:::l Gn isaHilbertsubspaceofH and,for all X E H,
ntoo
ntoo
Proof: (i) The set Un>l G n is evidently a vector subspace of H (in general,
however, it is not closedflts closure, G, is a Hilbert subspace (Theorem C1.3). To
any Y E Gone can associate a sequence {Yn}n:::h where Yn E G n, and
lim IIY-Ynll=O.
n->oo
= lI(x -
PG(x - (x - PG n(x1I 2
= 211x -
- 411x - !(PGn(x)
+ PG(xII
143
+ PG(x) is a vectorin G,
IIx - !(PGn(x)
+ PG(x))1I 2 ~
IIx - PG(x)1I 2,
and therefore,
IIPGn(x) - PG(x)1I 2 S 211x - PGn (x)1I 2 - 211x - PG(x)1I 2
Gl..
= clos
(U G;) .
n~1
EXERCISE
H,
GI ffi G2 := {z = XI
+ X2
: XI E G, X2 E G2}
I/(xd - I(X2)1
AlixI - x211
H.
(12)
C1.3. Let Y
= (x, y).
(13)
144
x2)1
Proof"
Uniqueness.
Let y, y'
H be such that
for an X EH.
In particular,
(x, y - y')
The choice x
=y-
y' leads to
=0
for all x
H.
For an x
= (z,
f(z)*
11 zZIl 2 ) = f(z).
Therefore, the mappings x --+ f(x) and x --+ (x, y) coincide on the Hilbert
subspace generated by N and z. But this subspace is Hitself. Indeed, for an
xE
H,
x = (x -
where u
f(x)
f(z)
N and w is colinear to z.
z) + f(x)
z= u + w,
f(z)
C2
Complete Orthonormal Systems
C21
Orthonormal Expansions
The result ofthis section is the pillar ofthe L 2-theory ofFourier series and wavelet
expansions. It concems the possibility of decomposing a vector of a Hilbert space
along an orthonormal base.
The Gram-Schmidt Orthonormalization Procedure
The central notion is that of an orthonormal system:
DEFINITION
An orthonormal system {en}n~O isfree in the sense that an arbitrary finite subset
of it is linearly independent. For example, taking (eI, ... , ek), the relation
k
Lcxiei =0
i=1
implies that
CXiei )
=0
1:::
e ::: k.
EXERCISE
146
Ip-
L(fp, ej}ej
.,,-11-------;;-11
I
j=!
en +!
Show that
{en}n~O
p - "t(fp, ej}ej
is an orthonormal system.
Hilbert Basis
The following theorem gives the preliminary results that we shall need for the
proof of the Hilbert basis theorem.
THEOREM C2.1. Let {en}n2:0 be an orthonormal system 01 Hand let G be the
Hilbert subspace 01 H generated by {en}n~!. Then:
(a) For an arbitrary sequence {an }n~O 01complex numbers, the se ries Ln>o anen
is convergent in H if and only if {an}n~! E e~, in which case
(14)
(15)
n~O
L(x,en}en = PG(x),
(16)
n~O
= (PG(X), PG(y)}
(17)
n~O
Prool:
147
verges if and only if Ln>O lan 12 < 00. In this case equality (14) follows from the
continuity of the norm, by letting n tend to 00 in the last display.
(b) Accordingto (a) ofTheorem Cl.7, IIxll :::: 11 PGn (x)lI, where G n is theHilbert
subspace spanned by {el, ... , en }. But
n
PGn(x)
= ~)x, ei}ei,
i=O
IlpGn (x)11
L I(x, ei}1
2.
i=O
Therefore,
n
IIxll 2 ::::
L I(x, ei}1
2,
i=O
00.
(c) From (15) and result (a), it follows that the series Ln>o (x, en}en converges.
For any m :::: 0 and for all N :::: m,
=0
G.
= L(x, en}en
n;::O
148
Leuing N -+
product).
DEFINITION
00,
{Wn}n~O
In other words, the finite linear combination of the elements of {w n }n~O forms
a dense subset of H.
==}
(z = 0).
(18)
We are now ready for the fundamental result: the Hilbert basis theorem.
THEOREM C2.2. Let {en}n~O be an orthonormal system oJ H. The Jollowing
properties are equivalent:
IIxII 2 =
L I(x, e }1
n
2;
(19)
n~O
(c)Jor alt x
H,
(20)
x = L(x, en}en.
n~O
Proof
(a)=}(c)
Accordingto(c)ofTheoremC2.1,
L(x, en}en = PG(x),
n~O
n~O
IIxll 2 =
IIPG(x)+x - PG(x)1I 2
PG(x)1I 2
PG(x)1I 2 ;
149
therefore,
IIx -
PG (x)1I 2 = 0,
which implies
x
PG(x).
==
A sequence {en }n;::O satisfying one (and then an) of the conditions of Theorem
C2.2 is caned a (denumerable) Hilbert basis of H.
VI
EXERCISE
1/
Biorthonormal Expansions
DEFINITION
a biorthonormal system
if
(0:) (e n , d k ) = Oforall n
() (e n , dn ) = Ifor all n
=f. k,
~
O.
This system is ca lIed complete if, in addition, each ofthe sequences {en}n;::O and
{dn}n;::oforms a total subset of H.
x = L(x, dn}en
n;::O
whenever these series converge. Indeed, with the first series, for example, calling
its sum y, we have for any integer m ~ 0,
= L(x, en}(dn , em }
n;::O
(x, e m ).
Therefore,
(x - y, em )
=0
for an m
O.
150
if it
Proof Let {fn }n:::O be a sequence defined in Definition C2.4. Construct from
it the orthonormal sequence {en}n:::O by the Gram-Schmidt orthonormalization
procedure. It is a Hilbert basis because (a) of Theorem C2.2 is satisfied. Indeed,
forany zEH,
(en , Z)
(fp, z)
z=Q
C22
{en ( )} def {
1 2i Jr
../Te
!l. . }
T
'7J
fLj,
+N
{T
L Icn(f)1 + Jo
n=-N
2
If(t) - fN(t)1 2 dt
(T
= Jo
If(t)1 2 dt.
(21)
151
Ntoo
[T
10
= 0.
Jex
+ t)Jet)* dt,
where
J(t)
=L
f(t
+ nT)l(o.nCt + nT),
nEZ
defines a T -periodic and continuous function qJ. Its nth Fourier coefficient is
Cn(qJ)
- + t)f(t)*
- dt ) e- 2.IITTXn dx,
= T1 10[T( fex
1
{T _
= T 10
=
f(t)*
{{T _
10
fex
+ t)e-
2.
- 2
T1 10{T f(t)* {ft+T f(s)et
IITTX
n
IITTS
dx dt
ds e
n
IITTt
dt
Since LnEZ Icn (f)1 2 < 00 and qJ(X) is continuous, it follows from the Fourier
inversion theorem for locally integrable periodic functions that, for all x E IR,
qJ(x)
=L
Icn(f)1 2e 2i Jl"j-x.
nEZ
In particular, for x = 0,
Ntoo
(T
10
If(t) - fN(t) 12 dt
= 0.
It remains to pass from the continuous functions to the square-integrable functions. Since the space C([O, Tl) of continuous functions from [0, T] into C is dense
in L~([O, Tl) (Theorem 27), with any 8 > 0, one can associate qJ E C([O, Tl) such
that IIf - qJlI ::::: 813. By Bessel's inequality, IIfN - qJNII 2 = 1I(f - qJ)N1I 2 :::::
152
11
f -
cP 11 2 , and therefore,
+ 211f - cpll
8
IIcp - CPNII + 2 3,
IIf-fNIIS8.
lim ( Iset) -
Ntoo
llR
bn sine (2Bt - n
n=-N
where
bn
L+N
+B
j2 dt = 0,
(23)
-B
Proof' Let L~(lR; B) be the Hilbert subspaee of LUlR) consisting of the finiteenergy eomplex signals with a Fourier transform having its support eontained in
[ - B, +B]. The sequenee
(24)
where h(t) == 2B sine (2Bt), is an orthonormal basis of L~(lR; B). Indeed, the
functions of this system are in L~(lR; B), and they form an orthonormal system
sinee, by the Planeherel-Parseval formula,
+B
-B
e2irrvkz-;
dv
= 2B
x ln=k.
153
It remains to prove the totality of the orthonormal system (24) (see Theorem C2.2).
We must show that if g(t) E L~(IR.; B) and
2~) dt = 0
g(t)h(t -
for all n
Z,
(25)
LB
=0
for all n
Z.
(26)
But we have proven in the previous section that the system {e2irrvn/2B }nEZ is total in
LUIR.; B); therefore, (26) implies g(v) = 0 almost everywhere, and consequently,
g(t) = 0 almost everywhere.
Expanding s(t)
E L~(IR.;
n
h(t - - ) ,
v2B
2B
Mn
(27)
where the limit and the equality in (27) are taken in the L 2-sense (as in (23, and
Cn
[s(t)
Jrw.
~h(t -
..!!...-)dt.
2B
v2B
+B
-B
./fii
An Apparent Paradox
Note that since s( v) is in L 2 and of compact support, it is also in LI, and therefore
the Fourier inversion formula is true and the reconstruction formula takes the
farniliar form
s(t) = Ls(..!!...-) sinc(2Bt - n).
nEZ
2B
(28)
154
The flaw in the above "proof' is that the Fourier inversion formula holds only
almost everywhere, and maybe not at the sampling times. Therefore, formula (28)
is true only if the Fourier inversion formula can be applied at all the times of the
form nj2B. This is the case if s(t) is continuous, because the inversion formula
then holds everywhere.
We see that the continuity hypothesis always pops up. We cannot expect a much
better version of the sampling theorem in the LI or L 2 framework. Indeed, since
s(v) is integrable, the right-hand side of
s(t) =
s(v)e2invtdv
C3
Fourier Transforms of Finite Energy
Signals
C31
Fourier Transform in L 2
A stable signal as simple as the rectangular pulse has a Fourier transform that is not
integrable, and therefore one cannot use the Fourier inversion theorem for stable
signals as it iso However, there is aversion of this inversion formula that applies
to all finite-energy functions (for instance, the rectangular pulse). The analysis
becomes slightly more involved, and we will have to use the framework ofHilbert
spaces. This is largely compensated by the formal beauty of the results, due to the
fact that a square-integrable function and its Fr play symmetrical roles.
+ .)
is uniformly continuous.
Proof:
+ h + u) -
set
+ u)1 2 du
= L1S(h
+ u) -
s(u)1 2 du
156
+ .), s( . ) L~(R)
t -+ LS(t +x)s*(x)dx
and called the autocorrelation function of the finite-energy signal s(t). Note that
it is the convolution s(t) * s(t), where s(t) = s( -t)*.
THEOREM C3.2. lfthe complex signal s(t) lies in L~(~) n L~(~), then its FT s(v)
belongs to L~(~) and
L Is(t)1 2 dt
=L
(30)
Is(v)1 2 dv.
Praof: The signal s(t) admits s(v)* as FT, and thus by the convolutionmultiplication rule,
(31)
,2
= --e-2,;2.
a,J2ii
=L
(s
* s)(x)ha(x)dx.
(32)
Since ha(v) = e- 2:rr 2a2x 2 t I when a .,j.. 0, the left-hand side of (32) tends to
Is(v)1 2 dv, by dominated convergence.
IR
On the other hand, since the autocorrelation function (s * s)(t) is continuous and
bounded, the quantity
L (s
tends when a
.,j..
* s)(x)ha(x)dx =
L (s
* s)(ay)hj(y)dy
0 toward
L (s *s)(O)hj(y)dy
by dominated convergence.
= (s *s)(O) = L
Is(t)1 2 dt,
157
From the last theorem, we have that the mapping cp: s(t) -+ s(v) from LboR.) n
L~(lR) into L~(lR) thus defined is isometrie and linear. Sinee LI n L 2 is dense
in L 2, this linear isometry ean be uniquely extended into a linear isometry from
LUlR) into itself (Theorem C1.5). We will eontinue both to denote by s(v) the
image of s(t) under this isometry and to eall it the FT of s(t).
EXERCISE
Ttoo
J~
-1
+T
s(t)e-2irrVldt 1 dv.
(33)
-T
SI(t)S2(t)* dt =
EXERCISE
(34)
SI(V)S2(V)* dv.
1.
THEOREM
(35)
h(t - s)x(s) ds
(36)
Ix(s)1 2)ds
= L1h(t)1 dt + L1h(t -
s)llx(s)1 2 ds,
00,
sinee Ih(t)1 and Ix(t)1 2 are in Lb(lR). Therefore, for almost all t,
00,
and y(t) is almost everywhere well defined. We now show that y(t)
LUlR).
158
LIL
h(t - S)X(S) dS
=
=
LIL
LL{L
2
I
dt
h(u)x(t - U)dUr dt
00.
* XIlL~(lR) ~
(37)
IIhIlLt(lR)lIxIlL~(lR)'
The signal (35) is thus in L~(IR) when h(t) E LUIR) and x(t)
furthermore, x(t) is in Lb(IR), then y(t) is in Lb(IR). Therefore,
x(t) E Lb(lR)
n L~(IR) -+
E LUIR).
If,
(38)
isinLb(lR)nL~(IR)andlimxA(t)
in L~(IR). Introducing
YA(t)
= x(t)inL~(IR).Inparticular,limxA(v) = x(v)
h(t - s)xA(s)ds,
= h(V)XA(V), Also, lim YA(t) = y(t) in L~(IR) [use (37)], and thus
limYA(v) = y(v) in L~(IR). Now, since limxA(v) = x(v) in L~(IR) and h(v) is
EXERCISE
7t:
ab(a
+ b)'
+ IItwll~2 + IIvwll~2P .
Show that the subset 0/ H consisting 0/ the C oo -functions with compact support
is dense in H. Hint: Select any q; in a Coo-function with compact support, with
159
C32
Inversion Fonnula in L 2
So far, we know that the mapping cp : L~(~) 1-+ L~(~) defined in Seetion C31
is linear, isometrie, and into. We shall now show that it is onto, and therefore
bijeetive.
THEOREM
E L~(~).
Then
= lim
Atoo
+A
-A
(39)
(40)
We shall prepare the way for the proof with the following result.
LEMMA
u(x)v(x)d.x
u(x)v(x)d.x.
(41)
Proof' If (41) is true for u(t), v(t) E L~(~) n L~(~), then it also holds for
u(t), v(t) E L~(~). Indeed, denoting XA(t) = x(t)I[-A,+Aj(t), we have
that is, (UA, VA) = (UA, VA)' Now UA, VA, UA, and VA tend in LU~) to u, V, u,
and V, respeetively, as A t 00, and therefore, by the eontinuity of the Hermitian
produet, (u, v) = (u, v).
The proof of (41) for stable signals is aeeomplished by Fubini's theorem:
u(x)v(x)d.x
L {L
= L {L
=L
=
u(x)
v(y)
u(x)e- 2i JI'XY d Y } dy
v(y)u(y)dy.
160
Proof of(39); Let g(t) be a real signal in L~(lR), and define f(t)
g-(t) = g(-t). We have j(v) = g(v)*. Therefore, by (41):
g(x)j(x)dx =
= (g~)(t), where
g(x)f(x)dx
(x)(x)* dx.
Therefore,
IIg -
But IIgll 2
f112
j) + IIfII 2
= IIgll 2 -
2Re (g,
= IIgll 2 -
211g11 2 + IIfII 2 .
(42)
In other words, every real (and, therefore, every complex) signal g(t) E L~(lR)
is the Fourier transform of some function of LUIR). Hence, the mapping q; is
onto.
EXERCISE
if its FT has
We dose this seetion by showing how the LI Fourier inversion theorem was
lirnited in scope, since it does not take much for a stable signal not to have an
integrable FT.
C3.6. Show that if a stable signal is discontinuous at a point t
FT is not integrable.
EXERCISE
= a, its
C4
Fourier Series of Finite Power Periodic
Signals
C41
= {an}, n
Z, such
= L>nb~
f:
and the Hilbert space L~([O, T], dt/T) of complex signals x = {x(t)}, t
such that
Ix(t)1 2 dt < 00, with the Hermitian product
(x, Y}L~([O,T],~) =
THEOREM
C4.1. Formula
Sn
= -1
T
(43)
nEZ
dt
Jorx(t)y(t)* T'
T
n dt
s(t)e- 2"17r'i/
lR.,
(44)
(45)
defines a linear isometry sO --+ {Sn} lram L~([O, T], dt / T) onto e~, the inverse
01 which is given by
s(t)
= L::Sne2i1l"f/ ,
(46)
nEZ
where the se ries on the right-hand side converges in L~([O, T], dt / T), and the
equality is almost everywhere. This isometry is summarized by the PlancherelP. Brmaud, Mathematical Principles of Signal Processing
Springer Science+Business Media New York 2002
162
Parseval identity:
LXnY~ = ~
nEZ
(47)
(Tx(t)y(t)* dt.
10
Prao!"
{e n (-)}
nE Z,
L~([O, Tl)
-N
Ntoo
This result shows that the situation for finite-power periodic signals is pleasant,
in contrast with the situation prevailing locally stable periodic signals (remember
Kolmogorov's result, Theorem A3.1). The proof ofCarleson's result is omitted; it
is rather technical. It also shows that the L 2 framework is very adapted to Fourier
series, since everything works "as expected."
Discrete-Time Fourier Transform of Finite-Energy Signals
Let lb be the space of sequences In, n
discrete-time signals).
THEOREM
C4.3. lb
00
(stable
Prao!" Let A = {n: IX n I ~ I}. Since LnEZ IX n I < 00, then necessarily
card(A) < 00. On the other hand, if Ixnl ::::: 1, Ix n l2 ::::: Ixnl, whence
163
Let L~(2n) be the Hilbert space of functions j: [-n, +n] ~ C such that
f~: Ij(w)1 2 dw < 00 provided with the Hennitian product
(],
C4.4.
THEOREM
g)L~(2n) =
l+
_1
2n
-n
j(w)g(w)* dw.
dejined by
fn =
l+
-n
j(w)e inw dw ,
2n
j(w)
=L
fn e- inw .
(48)
nEZ
L fng: = -2n1 l+
f(w)g(w)*
dw.
(49)
-n
nEZ
Proof:
C4 2
We give a necessary and sufficient condition in the frequency domain for a system
of shifted functions to be orthonormal.
THEOREM C4.5. Let g(t) be afunction of L~(lR.) andjix 0 < T <
and sufficient condition for the family offunctions
00.
{g(. - nT)}nEZ
A necessary
(50)
Ig(v
nEZ
-f )1
= T
almosteverywhere.
(51)
g(t)g(t - nT)* dt
=T
Ig(v)12e-2invnT dv
= I n=o.
The proof then follows the argument in the proof of Theorem B2.6.
164
Riesz's Basis
The following notion will play an important role in multiresolution analysis.
DEFINITION
(52)
AL 1ck1 2 ::s
k
00
(IL
Jffi.
kT)1
CkW(t -
2
dt
::s B L
kEZ
if
Icd,
(53)
kEZ
(IL
Jffi.
Cke-2itrkTVW(V)12 dv
kEZ
Also,
Now, any function c(v) E L~([O, I/T]) has the form LkEZcke-2itrkTv, where
LkEZ ICkl 2 < 00, and (53)is therefore equivalent to
AT
1fT
Ic(v)1 2 dv::s
1 IC(V)121~
1fT
::s BT Jo
fT
Iw(v
Ic(v)1 2dv
f )1 1
2
dv
E L~([O,
165
a.e.
(54)
THEOREM
(55)
Proof"
We must now show that the Hilbert space Vo spanned by (g(. - nT)}neZ is in
fact identical to Vo.1t suffices to show that the generators of Vo belong to \10, and
vice versa. Define
a(v) = L ane-2i7CvnT,
neZ
E l~(Z).
Since
w(v) = a(v)g(v),
it foIlows that
w(t) = L ang(t - nT).
neZ
Therefore, the generators of Vo are in Vo. The converse is true by the same argument
since
g(v) = a(v)-Iw(v),
where a(vr l is also, in view of condition (54), a (ljT)-periodic of finite
power.
EXERCISE
{k g (t -
~),
2B
166
where {cn} nEZ is in CU7-), unless only one of the Cn is nonzero. Show that if the
Fourier sum c(w) ofthe sequence {cn}nEZ is such that A < !c(w)!2 < B for some
o<
:s
B <
L~(lR; B).
then
{k g (t -
2~)}
nEZ
is a Riesz basis of
References
[Cl]
[C2]
[C3]
[C4]
[C5]
Part
Wavelet Analysis
Introduction
where w(t) is the time window function, a function negligible outside a relatively
small interval around zero.
Given a window w(t), the local information at time b is obtained by computing
the Fourier transform of the last display:
Wf(v, b):=
f(t)w*(t - b)e-2invt dt
(j, Wv,b),
where
Wv,b(t)
= w(t -
b)e2invt.
170
We see that the time information is collected in a time interval around time b of
width of the order of that of the time window. Now, from the Plancherel-Parseval
identity,
(Note that as a varies, the energy of lai! !(at) remains the same.) The so-called
Heisenberg uncertainty principle makes the above limitation more explicit and
states that
2 Sampling theory and wave propagation, in NATO ASI series, Vol. 1, Survey in Acoustic
Signal! Image Processing and Recognition, C.H. ehen, ed., Springer-Verlag, Berlin, 233-
261,1983.
Introduction
171
In the wavelet transfonn, the role the family of functions Wv,b(t) plays in the
windowed Fourier transfonn is played by a family
(t - b)
a, b
IR, a
=I- 0,
where 1ft(t) is called the mother wavelet. The wavelet transfonn (WT) of the
function f E L~(IR) is the function
C j(a, b) = (j, 1fta,b) =
f(t)1ft;,b(t) dt,
= (j, :(fra,b) =
a, b
IR, a
=I- 0.
!(V):(fr;,b(V) dv,
where
[b
+ am -
au, b + am
8- m 8-]
+ au] x [-m - -,
- + -
m
m/
(where m is the center of 1ft, u is its width; and 8- are the corresponding objects
relatively to :(fr; see the details in the main text). Assuming
> 0, we see that
the frequency window is centered at v =
a and has width 28- / a; therefore, the
ratio center/width
Q=
28-
m/
The interest of the wavelet transfonn for signal analysts is that they can "read"
it to extract infonnation about the time-frequency structure that is otherwise
blurred in the brute signal by concurrent phenomena and subsidiary effects. For instance, they can detect the appearance time of a phenomenon linked to a particular
frequency (e.g., the time at which a particular atom starts to be excited).
A fonnula, called the identity resolution, allows us to reconstruct, under mild
conditions that we shall make precise in due time, the function from its wavelet
transfonn:
f(t)
172
L
LU, 1J!j,k)1J!j,k.
JEZ kEZ
where
j
3Multiresolution approximation and wavelets, Trans. Amer. Math. Soc., 315, 1989,6988.
4Reproduced with the kind permission of Martin Vetterli.
Introduction
173
0.5
of---------'
o
100
200
300
400
500
600
700
800
900
1000
100
200
300
400
500
600
700
800
900
1000
200
300
400
500
600
700
800
900
1000
0.5
o
o
0.5
01------"--"--"'\
100
(a)
0.5
01------'
100
200
300
400
500
600
700
800
900
1000
300
400
500
600
700
800
900
1000
300
400
500
600
700
800
900
1000
0.5
o
o
0.5
01------'
100
200
(b)
Dl
The Windowed Fourier Transform
D 11
= -1
mfjj
= _1_
Ew IR
Ew
t Iw(t)1 2 dt,
JIR
v Iw(v)1 2 dv,
Iw(t)1 2 dt =
Then define
and
Iw(v)1 2 dv.
176
w,
The numbers o"w and 0"1ii are the root mean-square (RMS) widths of w and
respectively. Note that m w and mlii are not always defined. When they are weIl
defined, o"w and 0"1ii are always defined but may be infinite. Therefore, we shall
always assurne that
and
Ivllw(v)1 2 dv
<
00,
to guarantee at least the existence of the centers of w(t) and of its Fr.
EXERCISE D1.I. Check if the centers m w and mlii are well defined, and then
compute o"w and 0"1ii and the product O"wO"Iii in thefollowing cases:
w(t)
w(t)
= l[-T,+Tl
= e-a1tl ,
a > 0,
w(t) = e- at2 ,
a > O.
EXERCISEDI.2.
L 2 iswell defined.
It - toI 2 Iw(t)1 2 dt
is minimized by to = m w .
Heisenberg's Inequality
THEOREM
DI.I.
inequality
(1)
Proof: We assurne that the window and its Fr are centered at 0, without loss of
generality (see Exercise D1.3). Denoting the L 2 -norm of a function f by IIfII, we
have to show that
~
and therefore,
= (2inv) w(v),
(2)
177
IItwll
(3)
By Schwarz's inequality,
x IIw'lI :::
IItwll
I(tw,
Now,
2Re{(tw, w')}
t(ww'*
+ w'w*)dt
tlw(t)121~~ -lIW(t) 12 dt = 0 -
IIw1l 2
This gives (3) in the case where w E Coo . We now show that (3) is true in the
general case. To see this, we first observe that it suffices to prove (3) in the case
where w belongs to the Hilbert space
H = {w
with the norm 11 w 11 H = (11 W11 2 + IItw 11 2 + 11 vwll2) 2: (if w(t) is not in this space,
Heisenberg's inequality is trivially satisfied). Then we use the fact that the subset of
H consisting ofthe Coo -functions with compact support is dense in H (see Exercise
C3.4). The result then follows from the continuity ofthe Hermitian product.
Equality in (1) takes place if and only if w(t) is proportional to a Gaussian signal
e- ct2 , where c > O. We do the proof only in the case where it is further assumed
that WES and is real. Observe that all the steps in the first part of the proof remain
valid since for such a function, tlw(t)121~~ = O. Equality is attained in (1) if and
only if the functions tw(t) and w'(t) are proportional, say,
w'(t)
=-
ctw(t),
L2.
~
1
lI(v - vo)wll ::: 4n
IIwll .
(4)
The above resulttells us that in Heisenberg's inequality (1), the numbers a w and
can be taken to be, respectively, the root mean-square width of w around any
time to and the root mean-square width of w around any frequency vo. In particular,
to can be the center of w(t), and Vo can be the center of w(v). This version is the
most stringent, since the RMS widths around the centers are the smallest RMS
widths (see Exercise D1.2).
aliJ
178
D 1 2
Windows
Let f(t) be the signal to be analyzed. The local information at (around) time t = b
is contained in the time-Iocalized signal
f(t)w*(t - b),
(5)
where w(t) is the time window function, the support of which is included in a
relatively small interval about zero. Typical examples are the rectangular window
w(t) = l[-a,+aj(t)
wherea > O.
Given a window w(t), the local information at time b is obtained by computing
the Fourier transform of (5):
Wf(v, b):=
(6)
= (j, Wv,b),
(7)
(8)
where we assume, of course, that fand w are complex-valued functions that have
finite energy.
L 2. One caUs the function Wf : lR x lR -+ C
defined by (6) the windowed Fourier transform (or WFT) of f associated with the
window w.
DEFINITION
DI.I. Let w, f
(a) w E Lb(lR)
(b)
n LUlR),
Iw(t)1 2 dt
= 1,
179
E L~(lR),
(9)
Atoo
JR.
-1 JR.(
= O.
Ivl:",A
(10)
Proof
1 JR.(
Ivl:",A
= (I, Wv,b),
where
Wv,b(f.1,)
(11)
(12)
L IWf(v, b)1 2 db = L
=L
IL
db
and, therefore,
L L IWf(v, b)1 2 dbdv = L L II(Jl)1 2Iw(Jl- v)1 2 dJldv
=L
=L
180
Iw(t)1 2 dt = 1,
Iw(v)1 2 dv =
Ij(JL)1 2 dJL
IJ(t)1 2 dt (Plancherel-Parsevalidentity).
We show that the function JA is weH defined, that is, (v, b) -+ Wj(v, b)Wv,b(t) is
integrable over [- A, + A] x R In view of (12),
IA(t) := j+A { IWj(v, b)llwv,b(t)1 dv db
-A JIR
By Schwarz's inequality and the Plancherel-Parseval identity, and using assumption (b),
:s (
=
2 )1/2 (
l.r-I{j(.)w(. - v)*}1 db
(L
Iw(t - b)1 2 db
)1/2
Ij(JL)w(JL -
v)*1 2 dJL
(lj12 * Iw 12 ) (v)
:= h(v).
This function h(v) is in LI, being the convolution product oftwo LI-functions. In
particular,
IA(t):s
+A
-A Ih(v)1 dv < 00,
where
g(v):=
+A
-A g(v) dv,
181
=
Therefore,
fACt) =
i: (L
A
In order to change the order of integration in the above integral, we first verify that
(v, f.1.) -+ 1!(f.1.)llw(f.1. - v)1 2 is integrable over [- A, + Al x lR. But I!I E L 2 and
Iwl 2 E L 1 ; therefore, I!I * Iwl 2 E L 2 , and the integral of an L 2-function over a
finite interval is finite. But this integral is just
i: (L
A
i: (L
A
L(i:
L (i:
=L
A
= F-
E L 2,
and
(jC{JA).
We now show that limA too fA = f in L 2. For this, we write, using the PlancherelParseval identity,
IIf -
fAlli2 = IIF- 1!
- F- 1!C{JAlli2
= IIF- 1{!(1 -
C{JAmi2
Wehave
/L-A
-00
Iw(Y)1 2 dy
1+00
/L+A
Iw(Y)1 2 dy,
182
o ::; 1 ::;
((JA(/L)
-A 12
-00
Iw(Y)1 2 dy
= y(A) -+
1+00 Iw(Y)1
AP
dy
(A -+ (0).
Also,
-+ 0
sinee
(A -+ (0)
L 2 . Therefore, finally,
lim
Atoo
111 - IAlli2 = O.
From (7) and the Planeherel-Parseval identity, we obtain the two expressions
for the windowed Fourier transform:
(13)
tlw(t)1 2 dt
=0
and
vlw(v)1 2 dv
= O.
(14)
awaJij:::: 4n '
==
Ae-ct2 ,
where c
D1.3. The Gabor window is optimal, in the sense that it minimizes the
uncertainty aJijaw.
THEOREM
183
lj!m,n(t)
= e2irrmtw(t -
n),
(15)
01.1. Show that the answer to the previous question is positive for the
rectangular window w(t) = 1[0,1](t).
EXERCISE
Although such "atomic" windowed Fr bases do exist, they turn out to be very
bad from the view point of time-frequency resolution, as the following result,
called the Balian-Low theorem,5, shows.
THEOREM 01.4.
t 2Iw(t)1 2dt =
EXERCISE
00
or
v 2Iw(v)1 2dv =
00.
D2
The Wavelet Transform
D21
b, v E IR,
(t - b)
-a-
a, b
IR, a =I 0,
(16)
where 1/I(t) is called the mother wavelet. The function 1/Ia,b is obtained from the
mother wavelet 1/1 by successively applying a change of time scale (accompanied
by a change of amplitude scale in order to keep the energy constant) and a time
shift (see Fig. D2.1).
DEFINITION D2.1. The wavelet transform ofthe function f E
C j : (IR - {On x IR
1-*
C defined by
Cj(a, b)
= (j, 1/Ia,b) =
f(t)1/I:,b(t)dt.
(17)
186
1lV2
1
1
1
1
1
1
1
1
1
3.51 1
1 1
,-------, 1
1
-1
1y'2
1
0 11
1+1
1
1
-1 L - J
17.5
1
-y'2
1 1
-V2 L.....
'ljJ(t)
'ljJ2,7.5(t)
C j(a, b) =
(j, Vla,b)
(18)
where
(19)
Let m", and a", be, respectively, the center and RMS width of the mother wavelet
1/1 , respectively defined by
m",
= -1
E",
1lR
I
11/I(t)1 2 dl,
aJ = _1_
r(I E", llR
and similarly define m;;; and a;;;, where
and width of 1/1a,b are, respectively,
m",)211/1(1)1 2 dt,
b+am""
aa""
1
a
We shall simplify notations by writing
-m;;;,
1
a
-a;;;.
[b+am-aa,b+am+aa]x
[ma -
~a
m
+ ~J.
a
a
in the time-
187
center frequency
fii
-bandwidth
- 2a
a,
We shall see in the next subsection that in order to guarantee perfect reconstruction of the signal from its wavelet transform, the center of :(f must be zero. Also,
the center of the wavelet itself can be taken equal to zero without loss of generality.
The Fourier transform of a wavelet has bumps at positive and negative frequencies
(see Example D2.3, the Mexican hat). The centers of the bumps then play the role
of the center of the wavelet in the first part of the above discussion (where fii was
assumed to be nonzero ).
D22
Under mild conditions, there exists a wavelet inversion formula similar to the WFT
inversion formula.
THEOREM
f-+
11jJ(t)1 2 dt
1,
n L 2,
(20)
188
and
1~(v)12 dv =
K <
lvi
JR.
00.
(21)
K JR.\/OjJR.
(22)
and
(C j(a, b)1/Ia,b(t) da ~b ,
J(t) = K1 {
(23)
JR.\/OjJR.
dadb
C j(a, b)1/Ia,b(t) 2a
R.x/lal:::eJ
= lal 1j2
f !(v)~*(av)e-2i7rvb
dv
The function inside the curly brackets is in L I because it is the product of two
L 2-functions, and it is in L 2 as it is the product of an L 2-function with a bounded
function (~ is bounded because 1/1 E LI). By the Plancherel-Parseval identity,
= laILIF;I{!(v)~*(av)}12 (b) db
= laILI!(v)121~(av)12 dv,
and therefore,
11
R. R.
dbda
IC j(a, b)1 2 2- =
a
11
R. R.
~
2
da
IJ(v)1 2 11/I(av)1
dvla I
A
= LI!(V)1 2 K
dv
=K
LIJ(t)1 2 dt.
189
C j(a, b)1/!a,b(t) db
l(a) =
lal l / 2
L
L
F;I{!(v)V;*(av)}(b)1/!a,b(t)db
= lal l / 2
!(v)V;*(av)F;I{1/!a,b(t)}(v)dv,
F;I{1/!a,b(t)}(V)
and therefore,
l(a)
K J11al?}
d~
a
lai
K J11al?} JR
With a view to applying Fubini's theorem we must check that the function
~
(a, v) -+ IJ(v)II1/!(av)1 ~
11
R lal?
IJ(v)II1/!(av)1 -
lai
JR
1+ + {
11
1+
J1VI?1
Ix I
= 11
1V;(x)1 2
J1xl?IVI
+1
-I
1!(v)1 dv <
dx) dv
+h
1!(v)1 ( (
-I
:s K
1V;(x)1 2
(
J1xl?lvl
-I
But
lai
J1al?
= { 1!(v)1 (
da dv
1V;(av)1 2 da) dv
= { 1!(v)1 ( (
JR
00
lxi
dX) dv
(25)
190
because!
lz =
l!ev)1
Ivl:::l
(1
L 2 e[ - 1, + 1]). Also,
2
11/Iex)1 dX) dv
Ixl:::slvl Ix I
But
l11/let)1 2 dt = 1,
and, therefore, using Schwarz' inequality,
lz
< -1
(1
<
l!ev)1 dv
lvi
- e Ivl:::l
- c
IJev)1 2 dv )1/2
A
Ivl:::l
(1
-dV)I/2
Ivl:::l v 2
KJset)
= F-1{!gs}(t).
K 2 11J
1 1
= { IK - gsev)1 2 1!ev)1 2 dv =
+
= A + B.
JlR
Ivl:o:eIvl>e1/ 2
On
{lvi
:s c- I / 2 },
gsev) = {
Jlal:::s
l1freav)1 2 da
lai
= {
Jlxl:::SIVI
l1frex)1 2 dx
lxi
1/ 2
191
where
Ke
:s K
and
lim K e
e-+O+
= K.
Therefore,
Also,
since
IIf - fell
--+ 0
as
--+ 0+.
Proof"
If f
L 2 n LI and
[Ca) -da
Ja
a2
11
Ja Ja
~
~
2 2
dvda
f(v)Il{!(av)1
e "rvt -
a2
= [ j(v)e2irrvt ( [
lJa
= K
f to be in L 2
lJa
1~(av)12 da) dv
lai
g(v)e2irrvt dv.
This quantity is almost everywhere equal to K f (t) by the Fourier inversion formula
in LI.
Recall that if f is continuous the equality in the Fourier inversion formula holds
for all t and, therefore, (23) is then true for all t E R
Oscillation Condition
l{!(t) dt
= o.
(26)
192
In most situations it suffices to verify (26), and then (21) follows. For example, if
1/I(t) and t1/l(t) are integrable, then Vi is Cl; therefore, if Vi(o) = 0, the quantity
IVi(v)1 2 /lvl is integrable in a neighborhood of zero and therefore on ~, since at
infinity there is no problem, due to the hypothesis 1/1 E L 2 (which implies that
Vi E L2 ).
EXAMPLE
= ye-
t2 / 2
where y is a normalization factor that makes the energy equal to unity. The
theoretical problem here is that
Vi(O) =
1/I(t) dt > O.
However, the numerical results obtained with this wavelet were satisfactory
because the value of Vi(O) is in fact very smalI.
EXAMPLE
otherwise,
satisfies the conditionsfor the reconstructionformula (23) to be valid. Here
~
1/I(v)
= le. -;rrv
1 - cos(7l' v)
7l'V
IR
2 2 2
rr v
We shall now give a pictorial example. Fig. D2.4 shows a simple signal and
Fig. D2.5 shows its wavelet transform. The mother wavelet used is not given,
since it is irrelevant to this qualitative illustration. In the latter figure, the time
axis is horizontal, and the time axis vertical, the bottom part corresponding to
high frequencies. We observe the good time localization and the fact that sharp
discontinuities are represented in the bottom part.
193
~ilr-----~----II
(a)
-10
-8
-6
-4
-2
JJ(/:::
10
(b)
-10
-8
-6
-4
-2
1
10
(c)
j:~
-10
-8
-6
-4
-2
10
194
0.8
0.6
0.4
0,2
01 - - - - - - - '
~.2
__
500
L _ _ _ _ _- - - '
__
__
___
__
__
__
__
1000
1500
2500
4000
2000
3000
3500
~
+ sinusoid
~~
D3
Wavelet Orthonormal Expansions
D31
Mother Wavelet
2n
JR
JR
2n
JR
Also, the necessary and sufficient condition for {cp( . - n) }nEZ to be an orthonormal
sequence of L~(IR) (Theorem C4.5) now reads
L 1<7J(w + 2kn)1
= 1, a.e.
kEZ
196
One reason for abandoning the definition in terms of the frequency v is that
the topic of MRA involves a mixture of analog signals and of digital filtering,
and digital signal processing is traditionally-as in the present text---dealt with in
terms of the pulsation w.
Scaling Function and MRA
DEFINITION D3.1. A multiresolution analysis (MRA) of L 2 = L~(IR) consists of a
function cp E L 2 together with afamily {Vj}jez of Hilbert subspaces of L 2 such
that
Z, Vj
{::=}
f(2 j .) E Vj ,
The function cp is called the scaling function of the MRA. The index j represents
a resolution level: The projection Pj f of a function f E L 2 on Vj is interpreted
as the observation of this function at the resolution level j.
Usually, the projection on Vo is the function itself, in which case the projections
at all levels j 2: 0 are identical. The projection at level 0 is, in applications, the
one offered by the recording device.
Observe that, since the mapping f -+ ,J2 f(2 ) is an isometry from Vo onto VI
and since (cp(. - n) }neZ is an orthonormal basis of Vo, the set (,J2 cp(2 . - n) }neZ
is an orthonormal basis of VI. More generally, {CPj,n}nez is an orthonormal basis
of Vj , where
(27)
nV
jeZ
= 0
{::=}
lim P_j/ = 0
J .... +OO
for all fE L 2
(28)
197
](1/)
-7f
j(.) E Va
1/
+7f
}(I/)
~.
o
+27f
1/
Vi
j(.)
j(.)
E V- 1
](1/)
1/
Shannon multiresolution
Figure D3.2. Nesting in the Shannon MRA
and
closU Vj = L 2
jE71
EXERCISE
{:::::::}
.lim Pj / = /
J-->+OO
Vj
= {f E
L 2 : supp (1)
for all /
L2
(29)
c [- 2 j 1T, + 2 j 1T]}.
ll[-rr,+rrj(w)
(see Fig. D3.2). Verify that {Vj }jE71 is a multiresolution analysis 0/ L 2 associated
with the scaling function rp.
EXERCISE
Vj
= {f E
L2
: /
+ l)Tj]).
Define
rp(t)
ll(o,l](t)
(see Fig. D3.3). Verify that {Vj }j E71 is a multiresolution analysis 0/ L 2 associated
with the scaling function rp.
We shall see later that some regularity of the scaling function is desirable.
198
,...........J--I
L...I
Vo
j(.) E V-I
16
j(.)
Haar multiresolution
Figure D3.3. Nesting in the Haar MRA
D3.2. The function ({J is said to belong to Sr for some rEN if ({J is r
times continuously differentiable with rapidly decreasing derivatives, in the sense
that
DEFINITION
I({J(k)(t)1
:s
C kp
= 0, 1, ... , r,
and all p
N.
sr is called
(1
+ Itl)P
for k
(30)
The Haar and Shannon scaling functions are not in Sr (for any rEN).
Conditions (a), (b), (e), and (d) of Definition 4.3 are not independent. In fact,
the first part of (d) is always true under conditions (a), (b), (e), whereas the latter
conditions are almost sufficient for the second part of (d). The result below makes
this statement precise.
D3.1. Suppose that eonditions (a), (b), and (e) of the definition of an
MRA are satisfied. Then njEZ Vj = 0. Moreover, ifep is eontinuous at the origin,
then
THEOREM
lep(O) I =1= 0
{::=>
closU Vj
= L~(l~).
(31)
jEZ
clos ( Uj EZ Vj ) is
= L c{q;(2 j
199
f E W.
- h 112 :s
-k),
kE'lL
L c{q;(2
j .
-2 j - Cm - k)
kE'lL
is therefore in Vj if j :::
and Tm 2-( h E Vj This means that Tm 2-( f is c:-c1ose to Vj for all j :::
and the arbitrariness of c:, we deduce that Tm 2-( f E W.
Let now a
c E (a - 8, a
:::
e,
e. From this
+ 8),
(use Theorem C3.1 stating that the map a 1--+ Taf is uniformly continuous). In
particular, we can find a dyadic number c for which the above inequality is satisfied.
Since Tc! E W and c: is arbitrary, we deduce that Ta f E W.
We now proceed to the proof of (31). We assurne that ~is continuous at 0 and
that I~(O)I -=1= O. Therefore, ~(w) -=1= 0 on (-c, +c), for some c > O. Consider any
function g orthogonal to W, that is, orthogonal to all f E W. Since W is invariant
under translations, for all x E lR and for all f E W,
L
L
0=
f(x
+ t)g(t)* dt.
for all x ER The function jg* E L~(lR), and therefore, by the Fourier inversion
theorem in L I , j g* = 0 almost everywhere.
In particular, with f(t) = 2 j q;(2 j t) (indeed, such f E Vj C W), we obtain
~(Tjw)g(w)*
= 0,
a.e.
200
By (29),
lim 111 - Pj/1I2
jt+ oo
= 0,
.hm
Jt+ oo
1
rr
that is,
Itim 11
+00
ke'Z JIR
For large enough j, [-2- j, +2- j] c [-rr, +rr], and therefore the last displayed
expression is 2 j times the sum of the squared absolute values of the Fourier
coefficients of 1[-2-i,+2-ijr. Therefore, by the appropriate Plancherel-Parseval
identity,
ke'Z
2rr
-2-j
2rr
-2-j
rr
Therefore,
201
Wavelet Expansion
We shall suppose in the sequel that the scaling functions cp satisfy 1qJ'(0) I > 0, and
then take (without further loss of generality)
qJ'(0) = 1.
(32)
DEF1NITION D3.3.
= L~(lR) is an orthonormal
(33)
The function
expansion
1/1 is then called the mother wavelet of the wavelet basis. The
1 = LL(f, 1/Ij,k)1/Ij,k
jeZ
(34)
keZ
Vj+l = Vj EB Wj.
From property (d) of the definition of MRA,
L2
= EBWj .
(36)
jeZ
1E
Wo ~
1(2 j .) E
Wj.
for all
1E
L2.
(37)
Pj+I! - Pj/
= L(f, 1/Ij,k)1/Ij.k
keZ
is the additional detail required to pass from the resolution level j to the higher
resolution level j + 1.
A first issue is: How to compute the mother wavelet 1/1 from the scaling function
cp? The next question is: How to obtain a scaling function cp? Finally, one would
like to obtain a mother wavelet with "good" numerical properties, that is fast
convergence of the wavelet expansion (34).
202
D32
We address the first issue, that of explicitly finding a mother wavelet given a scaling
function.
1/I(t)
!],
+1
ift
(0,
-1
ift
(!' 1],
does it. To see this, it suffices to verify that any f E VI with support (0, 1] is a
linear combination of cp and 1/1 and that cp and 1/1 are orthogonal. Orthogonality
is obvious. Any fE VI such that supp(f) E (0, 1] is oftheform
f(t)
!],
ift
(0,
ift
(!' 1].
a-
f = - 2 - cp + -2-1/1
The function 1/1 is called the Haar mother wavelet.
4" ----,
l---
f(t)
L --
4"
01
1
2
~x
4
'Cl
'ljJ(t)
+1
lx
2
-1
21
1
1
1
1
--~
203
L I$(w + 2kn)1
= 1,
a.e.
(38)
kEZ
The scaling function ({J E Vo and therefore, ({J E VI. Requirements (a) and (e) in
the definition of an MRA imply that {({JI,n}nEZ is a Hilbert basis of VI. and we
therefore have the expansion ({J = LnEZ hn({JI,n, that is,
({J
= v'2
L h n({J(2 . -n),
(39)
nEZ
where
(40)
~
((J(w)
1M ~hne'""'
inW~(W)
'i({J "2
-v2
nEZ
'
that is,
(41)
where mo(w) is the 2n-periodic function defined by
mo(w)
1M ~hne
'""'
-v2
-inw .
(42)
nEZ
It is called the low-pass filter MRA, because mo(O) = 1 (recall the running
assumption that $(0) = 1; see (32)). Substituting identity (41) in (38) gives
204
300~
200
100
O~
__
____- L____
50
100
::~'
____L -_ _
150
200
'~'
_ _ _ _- L____~____L-~~____~
250
100
150
400
450
500
AM' '~
,~-, LJJ
- J_ _ _ _~_ _ _ _- L_ _ _ _
50
350
'''A~'
100~~-,
__
-L____
O~
300
200
250
300
350
400
450
500
300~
200
100
O~
__- J_ _ _ _
50
100
150
200
250
300
350
400
450
500
300~
200
100
O~
__
____- L____
50
100
150
____L -__
200
____- L____
250
300
____L -_ _
350
400
____
450
500
300~
200
100
O~
__- J_ _ _ _
50
100
150
200
250
300
350
400
450
500
300
350
400
450
500
(a)
50
100
150
200
250
,oo~
-10:
50
100
150
200
250
300
350
400
450
500
,oo~
-10:
50
100
150
200
250
300
350
400
450
500
100~
-10:
50
100
150
200
250
300
350
(b)
400
450
500
205
Therefore,
or, equivalently,
(43)
The filter with frequeney response eiwmo(w + n)* is ealled the high-pass filter
of the MRA. Eqn. (43) shows that the high-pass and the low-pass filters altogether
extraet the whole energy eontained in the band [-n, +n].
We now eharaeterize the spaees V-I and Vo. This will be a preliminary to the
eharaeterization of W_I, the orthogonal eomplement of V-I in Vo. Onee this is
done, we shall obtain the eharaeterization of Wo and then the mother wavelet
itself.
LEMMA
D3.1. f E V-I
= m(2w)mo(w)qJ'(w),
E l~.
+n]).
f(t)
where {Cn}nEZ
E L~ ([ -n,
(44)
1
I>kCP( -t - k),
",2 kEZ
2
(45)
= hI>ke-i2kWqJ'(2w).
kEZ
JR
=L
kEZ
-Jr
-Jr
Ih(w)1 dw <
+00.
dw
206
LEMMA
D3.2.
Vo
(46)
= L dkfP(t -
I(t)
k),
(47)
kEZ
where {dn}nEZ
f(w) =
L dkeikwqJ(w) = d(w)$(w),
kEZ
where d E L~([-rr, +rr]). Arguing as in the proof ofLemma D3.l, we can show
= IIdlli~([_rr,+rr]) = 2rr
I, g E
L Id l
k 2
kEZ
= 2rrll/ll~.
(48)
Vo,
(49)
We are now ready to state and prove the Fourier characterization of Wo, the
Hilbert space of details at level O.
THEOREM
Wo
if and only if
(50)
only if
(51)
0=
(2rr
10
d(w)m(2w)*mo(w)* dw,
E L~([ -Jr,
207
E L~([O,
+Jr]),
(52)
almost everywhere in [0, +Jr] (and therefore almost everywhere in [-Jr, +Jr)).
Define
mo(w) = (mo(w), mo(w + Jr.
In view of the identity (43), this is a unitary veetor in C 2 eonsidered as a 2dimensional veetor spaee (with sealar field C). The veetor
m'o(w) = (mo(w
+ Jr)*, -mo(w)*)
do
= (w)m'o(w),
where
(w) =
(do, m'o(w)
In partieular,
(w + Jr)
= -(w + 2Jr),
a.e.
or, equivalently,
(w) = -(w + Jr), a.e.,
whieh implies in partieular that is 2Jr-periodie. It is also in L~([ -Jr, +Jr)).
Indeed,
j(w) = d(w)qJ(w) = (w)mo(w + Jr)*qJ(w),
and therefore,
208
+ n)*v(2w)qy(w),
where
d(w)
= eiwmriw + n)*.
Since Imo(w)1 ::: 1, this implies that d(w) E L~([-n, +n]). Therefore, I E Vo
(Lemma D3.2). Also, from the expression of d(w), do(w) = eiwv(w)m'o(w), and
therefore
do..lmo(w),
that is, d(w)mo(w)* + d(w + n)mo(w + n)* = O. By Lemma D3.1 and Eq. (48),
this implies that 1.1 V-I. But also I E Vo. Therefore, I E Wo.
We are now ready for the main result of this subsection, the Fourier characterization of the mother wavelet in terms of the scaling function and of the high-pass
filter.
THEOREM
= eiW/2mo(~ + n V(w)qy(~) ,
(53)
= 1 almost everywhere.
Proof" Since:V; is of the form (50) with lvi = 1, it is in L~(IR) (by the now
standard argument) and, therefore, it is the Fr of a function 'tjJ E L~(IR), which is
in Wo by Lemma D3.2. By Lemma D3.2 again, any function g E Wo has an Fr of
the form
g(w)
= eiW/2mo(~ + n s(w)qy(~) ,
Since sv*
E L~([ -n,
= s(w)v(w)*:V;(w).
+n],
s(w)v(w)*
= LCke-inw,
nEZ
v-I
= v*,
E l~,
and therefore,
g(t)
= I:>k1fr(t -
209
n).
nEZ
1fr
L 11fr(w + 2br)1
= 1,
{1fr(. -
n) }nEZ is
a.e.,
kEZ
L 11fr(w + 2krr)1
= Iv(w)1 2 ,
kEZ
= 1.
= 1). Werecall
(55)
==
1 leads to
1fr(w)
=.J2
L(-It-1h"'-n_lcp(2t - n),
(57)
210
(a)
r-
0.5
0.5
-0.5
-0.5
-1
-1
-3
-2
-1
-3
-2
-1
(d)
(c)
1.5
1.5 ,-----------~--~--~--____,
0.5
0.5
OL-------~--~--~--~
0.1
0.2
0.3
0.4
0.5
0.1
0.2
0.3
0.4
0.5
Figure D3.6. Haar scaling function and the corresponding wavelet (left: scaling
function; right: wavelet; top: time domain; bottom: frequency domain)
and, therefore,
hn
= v'2
forn = 0, I,
otherwise,
and, using (57),
1jI(t) = q;(2t) - q;(2t - I).
Here
fi(w) = I[-n,+nj(w),
and, therefore,
q;(t) =
sin(nt)
nt
211
Therelore, necessarily,
mo(w)
= $(2w)
on
[-n, +n],
that is,
By periodicity,
=L
mo(w)
$(2w + 2kn).
kEZ
= - e-iwmo(w + n)*$(w)
= - e- iw
(L
$(2w + 2kn
kEZ
=-
e- iW ($(2w
= _e- iw
+ 1) $(w)
+ n) + $(2w -
(1 [_lI"._lj](w) + 1[+lj,+lI"](w).
D33
!
2
sin( !n(t 2
!
2
~n(t _ ~)
Mallat's Algorithm
Mallat's algorithm6 is a fast algorithm for obtaining from the projection at a given
level the wavelet's coefficients at coarser levels of resolution.
Let 1 be a function in L~(IR).lts projection on Vj , the resolution space at level
j, is
Pjl
= LCj,n'Pj,n,
nEZ
where
(58)
6Mal1at, S. A theory of multireso1ution signal decomposition: The wave1et representation, IEEE Transactions on Pattern Analysis and Machine Intelligence, 11, 1989,
674-693.
212
(b)
0.5
0.5
-0.5
-0.5
-1
-15
-10
-5
10
-1
-15
15
(c)
1.5
-10
-5
10
15
(d)
1.5
0.5
0.5
0.1
0.2
0.3
0.4
0.5
0.1
0.2
0.3
0.4
0.5
Figure D3. 7 Shannon sealing funetion and the eorresponding wavelet (left: sealing
funetion; right: wavelet; top: time domain; bottom: frequeney domain)
= Ldj,nVrj,n,
nEZ
where
(59)
and wehave
(60)
Denote by Cj and d j the sequenees {Cj,n}nEZ and {dj,n}nEZ, respeetively. The purpose of Mallat's algorithms is to decompose the funetion f, that is, to pass from
CM to dM-I, dM-I, ... , do, Co, and to reconstruct that is to pass, from co, do,d l ,
" " d M to CM.
The sequenee d M - lo d M - lo ... , do, Co is the wavelet encoding ofthe wavelet data
CM' We shall explain the interest of this eneoding onee we have derived Mallat's
algorithm.
213
Since the function cp(t/2) is in V-I, and V-I C Vo, and since {cp(. - n)}nEZ is a
Hilbert basis of Vo, we have the decomposition
2CP(2 t ) = L
ancp(t
+ n),
nEZ
where
11m cp(-t)cp(t+n)dt.
1
a n =2
lR
Therefore,
=2
L::!.
2
= 2-9- LakCP(2jt -
2n
kEZ
+ k),
that is,
CPj-l.n
= V2L a kCPj,2n-k.
(61)
kEZ
Sirnilarly, since the function 1/1'(t/2) is in W_I, and W_I C Vo, and since {cp(. n)}nEZ is a Hilbert basis of Vo, we have the decomposition
where
n
= ~ [ 1/1'(~t)cp(t+n)dt.
21lR
= V2LkCPj,2n-k.
kEZ
=L
ane inw
nEZ
= Lneinw,
nEZ
(62)
214
In Theorem D3.3, we now make the particular choice of the mother wavelet
corresponding to v(w) = 1:
that is,
L
= L ( _l)n+la;_neinw.
n einw
nEZ
nEZ
Therefore,
(63)
(64)
hLaZcj,Zn-k.
kEZ
(65)
hLZCj,Zn-k.
kEZ
These are the basic recursions of the decomposition algorithm (see Fig. D3.8).
The recursions for the reconstruction algorithm (see Fig. D3.9) are obtained from
(60), (61), and (62). This gives
Cj,n
EXERCISE
=h
[azk-nCj-l,k
L
kEZ
(66)
+ Zk-ndj-l,k]'
(64)
(64)
+ Cj,Zk
h
Cj,Zk-l
(64)
(64)
~CM-I ~CM-2 ~
(~
dM-I
(6~
dM-2
Cl
(64)
~Co
(6~
~CM-I~CM
215
and
dj -
I k
Cj,2k-1 -
"fi
Cj.2k
N)
+ K 4' + ... + K 2 M + K 2M = K N
D4
Construction of an MRA
D41
The Fourier structure of an MRA is now elucidated, and we know how to obtain a
wavelet basis when an MRA is given. This chapter gives two methods for obtaining
anMRA.
In the previous chapter, we started from a nested family of resolution spaces
{Vj bEZ and we discovered a scaling function q; in rather simple examples. Now,
obtaining the scaling function from a given nested family of resolutions spaces can
be a difficult task in general. However, if we are interested in a wavelet basis rather
than in a given family of resolution spaces, we might as weH start from a given
function q; E L 2 with the property that {q;(. - n)}nEZ is an orthonormal system,
and define the resolution spaces in an ad hoc manner guaranteeing that q; is indeed
the corresponding scaling function.
If q; is to be the scaling function, there is but one choice for the resolution spaces,
namely,
Vj
= span {q;j,n
:n
Z}.
An inspired choice of q; will make the Vj 's nested as required, and this has to
be verified because there is no reason why it should be so when one starts from an
arbitrary orthonormal system {q;(. - n)}nEZ, A necessary and sufficient condition
for this is that
q;(t)
= L cnq;(2t nEZ
n),
(67)
218
{cn}nEZ E
(~) fi(~)
fi(w) = mo
(68)
holds for some 2JT -periodic function mo in L~( -JT, +JT). We must also verify that
conditions (d) in the definition of, an MRA are satisfied. By Theorem D3.1, it
suffices that fibe continuous at the origin and that Ifi(O) I = 1.
Meyer's Wavelet
Define cp by
fi(w)
2JT
lflwl S
3'
. 2JT
3 -
4JT
3 '
(69)
otherwise,
(C k or C OO ) such that
v(x)
{~
ifx SO,
ifx:::l
(70)
and
v(x)
+ v(l
- x)
1.
(71)
L lfi(w + 2kJT)1
1,
kEZ
and, therefore, {cp(. - n)}nEZ is an orthonormal system. We must now verify that
the Vj are nested, and for this it suffices to verify that Vo c Vj or, equivalently,
that cp E Vj. But this is true if and only if there exists a 2JT -periodic function mo
of finite power such that
fi(w) = mo
(~) fi(~) .
mo(w)
= Lfi(2w+4kJT)
kEZ
~(w)
=~
cp(w)cp
"2 '
since the supports of $(w + 2klr) and of $(w /2) do not overlap if k
$(~) = 1
if w
=1=
219
O. But since
supp(~,
we have
$(w)$(~) = $(w),
as desired. We obtain a mother wavelet by formula (53) of with v(w)
gives
1. This
which gives
e iw / 2 sin(
:;j;(w)
~ v (2~ Iwl -
1))
~ v (2~ Iwl -
1))
e iw / 2 cos(
0
EXERCISE D4.1.
.f 2JT
3
1 -
4JT
if3
::slwl::s
::s Iwl ::s
4JT
3'
8JT
(72)
3'
otherwise.
Check that q;(t) is indeed in LUlR) and that the system {q;(. -n ]}nEZ is orthonormal.
Check that the dilation equation (55) holds with
mo(~) =
4JT
q;(w)
iflwl::S 3'
otherwise.
Show that q;(t) so defined is the scaling function 0/ some multiresolution analysis
and that a mother wavelet is given by its Fourier trans/orm
220
D42
= I>nw(2t -
(73)
n),
nEZ
where {cn}nEZ
= span {Wj,n
:n
(74)
Z}.
Of course, condition (73) guarantees that these spaces are nested. In order to obtain
a Hilbert basis of Vo, we use Theorem C4.6 which says that under the "frame"
condition
o< a
L Iw(w + 2krr)1
~<
(75)
00,
kEZ
cp(w)
w(w)
(76)
kEZ
D4.1. Let W
E L~(lR.)
o< a
satisfy
L Iw(w + 2krr)1
~ <
(77)
00,
kEZ
and define
Vj
(78)
nVj
(79)
= 0.
JEZ
Proof The inequalities (77) are equivalent to the existence of A > 0, B <
such that
0< AllfII 2 ~
L l(f, wo,k)1
~ BllfII 2 ,
kEZ
for all
kEZ
00,
00
(80)
221
Pjlll
Z,
By (80),
M=supll(x)l,
and
XEIR
we have
1(1, wjk)1 2 = ITj
Ixl<c
II(x)w(Tjx - k)1
dxl2
:::: TjM22cl
Ixl<c
::::
where
A(c,j)
= I)k-Tjc,
kEZ
k+Tjc].
222
By the dominated convergence theorem this term tends to 0 as j -+ 00. In particular, there exists a j such that 11 Pj 111 ::: 8, and therefore 11 f II ::: 28. Since 8 is
={
I - Ix I if 0 ::: Ix I ::: 1,
(81)
otherwise,
1) + w(2x)
( sinz'-~(' I )
)2
!w(2x - 1).
(82)
Wehave
L Iw(w + 2krr)1
= ~
~ cos(w)
kE'L
= ~ ( 1 + 2 cos2 (~) )
(One way to prove this is to compute the Fourier coefficients of the left-hand side
w(t)w(t
+ n)* dt,
and this immediately gives the result. Note the generality of the method and its
interest when w(t) is compactly supported.) The mother wavelet is then obtained
from (76). This gives
cp(w)
= w(w)
.f3
1 + 2cos 2
(W))1/2 .
"2
(83)
223
= L::>nW(t -
n).
nEZ
mo(w)
(jJ(2w)
= -:::::-- =
q;(w)
cos 2
W))1/2
1 + 2cos (
(
(2) 1+2cos (w)2 ,
2 -
and this leads to an expression for the mother wavelet's Fourier transform. Again
the (numerical) evaluation of the Fourier coefficients of the function factoring (jJ( w )
yields an evaluation of 'ifr(t) in terms of the translates of q;(2x).
D43
Spline Wavelets
= 1[O,1](t)
For n
[I
* Bn)(t) =
Bn(x) dx.
I-I
= 3, we have
ifO~t~1,
if 1
~ t ~
2,
if t < 0,
the rest of the function being obtained by symmetry around 2.
In the general case, Bn(t) is (for n :=:: 1) in cn- I , its support is the interval
[0, n + 1], and
Bn(x)dx
Wehave
Bo(w)
= e-i~sinc
1.
(2:)'
and, therefore, in the Fourier domain, the recurrence defining the B-splines
becomes by the convolution-multiplication rule
Bn+I(W)
= e-i~sinc
(2:) Bn(w).
224
= (.W
e-'Tsinc
Bn(w)
A
(w))n+!
2Jr
(84)
=0
(IWI~+! ) .
(85)
We shall now seek a scaling function for the B-spline of order n. From the
observation
e-iIsinc
it follows that
where
mo(w) =
(1 +
-iw )n+!
;)
.fi n+!
hk = 2n+! (k)'
0~k~ n
+ 1,
1 n+!
Bn(t)
= --;; L(~+!)Bn(2t 2
k).
k=O
IBn(w + 2kJr)1 2
(86)
kEZ
-->-,
w
Jr
- 2'
O<w< -
Jr,
A ~
IBn(w + 2kJr)1 2 ~ B,
kEZ
and {Bn (. - k)} kEZ constitutes a Riesz basis of the Hilbert subspace that it generates.
In order to compute the scaling function of the MRA, we need the following lemma.
225
L IBn(w + 2br)1
= Pn(cos(w.
(87)
kEZ
Moreover, the coefficients of this polynomial are rational and can be computed
recursively.
Proof
Denote the left-hand side of (87) by Fn(w). Inserting (86) in(84) gives
Fn(w)
where
Gn(w)
=L
kEZ
h- + 1Tk) 2n+2'
(J)
"
= n(2n2+ 1) Gn(w),
and, therefore,
Fn(w)
We introduce the new variable y = cos(w), and define the function Pn by Fn(w) =
Pn(y). Since Fo(w) = 1, we have Po(y) = 1. The recursion in the last display
becomes
Pn(y)
= n(2n + 1) (1 -
y)
n+ld (Pn-1(y)
dw (1 _ y)n .
d
dy
- = (- sin(wand
d2
d2
-dw 2 = (-y)+ (1 - l ) - ,
dy
dy 2
we obtain, after simplification and rearrangement,
Pn(y)
2
(n(n
n(2n + 1)
+ 1 + nY)Pn-l(Y) + (1 + (1
y)(2n
- y)2(1
+ (2n -
l)y)Pn- 1
+ y)P~'--l)'
226
The general method of the previous section gives for scaling function (fi = (fin
Therefore,
cp(t)
L CkBn(t -
k),
kEZ
r = LCkl.
(Pn(z+r' )
Observe that Ck =
Ck.
ICkl
for some
Ipl
kEZ
~ plk l ,
We now proceed to compute the mother wavelet. We have to compute the impulse
response of the low-pass filter mo(w). We have
1
2:
that is,
.
l+e- WW )
mo(w) = (
n+!
(Pn(COS(W)2
Pn(cos(2w
( Pn(cos(w )
Pn(cos(2w
where
qk
= q-k = -
rr (
2:
"
-ikw
= '~qke
,
kEZ
Pn(cos(w )
Pn(cos(2w
2:
cos(kw)dw.
Therefore,
k).
= L bk B n (2t -
k),
kEZ
where
br = hL(-ll-ILk-1qr-k.
kEZ
227
D5
Smooth Multiresolution Analysis
D51
q;
)1
(k)(
X
<
-
(1
Ck,n
+ Ix Dn
(88)
DEFINITION
L q;*(x -
n)q;(t - n).
neZ
(89)
230
wehave
Iq(x, t)1
:s L
al,
nEZ
""
CO,k+2
nDk+2 (1
:s nEZ
~ (l + Ix -
<C 2
- O,k+2 nEIU'" (1
It foUows that, for aU k
that, for all x, t E IR,
+ Ix -
CO,k+2
nDk+2
+ It -
nD 2 (l
+ It -
nl)2 (1
+ Ix -
tDk '
(90)
In particular, for each t E IR, the function qt : IR --+ C defined by qt(x) = q(x, t)
is in L 2 , and the development of any function f E Vo along the orthonormal basis
{qJn}nEZ = {qJ(' - n)}nEZ
q(x, t)f(x)dx.
(91)
DEFINITION D5.3.
We know (Theorem D3.1) that 1$(0)1 = 1, and we can assume without loss of
generality that $(0) = 1. Therefore, in view of property (38),
$(2klr)
= l[k=o}.
(93)
231
It follows from this and the weak Poisson formula (Theorem A2.3) that
L qJ(x - n) = 1.
qm(x, t)dx
for m
(94)
(95)
1,
= I[o,l](t) and of
In general, the kernel of an MRA does not have a c10sed form, and the examples
in the previous exercise are exceptions.
D52
Let f
E L~(lR),
qm(x, t)f(x)dx,
where qm is the autoreproducing kernel of Vm, defined by (92). This kernel representation allows us to obtain pointwise convergence results, in the manner
of Dirichlet's pointwise convergence analysis of Fourier series. We need some
preliminary results on the kernel.
(96)
lim
mtoo
t+c
t-c
m(x, t)dx
1,
(97)
= 0.
(98)
tim
mtoo Ix-tlo::y
t)
= 2rr (m + I tsm2 e(
) I[-n,+nj(x '2 x - t
sin 2 (m+l(x -
8m (x, t)
t)
232
Proof
Iqm(x, t)ldx
L
=L
=
2mlq(2mx, 2m t)1 dx
Iq(x, 2m t)1 dx
<C2
J[{ (1 + lxi?
dx=K<oo
'
t+c
t-c
qm(X, t)dx
12m (t+c)
q(x,2mt)dx
~Q-~
+ Ixl)2 dx '
and this quantity tends to zero as m tends to infinity. A similar conclusion holds
for h Therefore, property (b) ofDefinition D504 is satisfied.
For property (c) ofDefinition D504, it suffices to observe that
in view of (90).
We can now state the main result.
THEOREM D5.2.
(a, b), as m -+
00.
233
LEMMA D5.1. Let {8m }mEZ be a quasi-positive delta sequence. Let f E Lb(lR) be
continuous on (a, b), and define for all mEZ thefunction fm by
fm{t)
8m(x, t)f(x)dx.
Then
tim fm{t)
m--+oo
= f(t)
Proof"
= f(t)
+f
t-y
HY
Hy
-00
8m(x, t) dx
t-y
+ (f+oo + jtt+y
y)
-00
= A+B +(C).
Let [a, ]
o<
ICI
t-y
J'R.
18m(x, t)1 dx
~ 8K
and
sup 18m(x, t)1 { If(x)1 dx
J'R.
Ix-tl2:Y
~8
J'R.
If(x)1 dx
1-
(use property (b) of the definition of quasi -delta sequences, and the fact that
lim ft+c 8m(x, t)dx
mtoo t+y
uniformly with t
8.
=0
234
::s I/(t) -
AI
where M
= SUPtE[a.1I/(t)l.
D53
1= L(f, 1{Ij.n}1{Ij,n =
j,n
L d j,n1{lj,n,
j,n
where
it is highly desirable from a numerical point ofview that the coefficients dj,n decay
rapidlyas IJ I, Iml -+ 00, thus ensuring fast convergence ofthe wavelet expansion.
This is not the case, however, even for smooth functions (say, I E C oo n L 2 ) if
no further conditions are imposed on the mother wavelet 1{1. To understand this
and see what type of conditions 1{1 should satisfy, let us examine the asymptotic
behavior of
dj,o
as
J -+
00.
= 2 j /2
= JN
l(x)1{I(2 j x)* dx
l(x)1{I(Nx)* dx
= JNa(N).
A Taylor expansion of I (assumed to be C OO ) with Lagrange residue gives
where
RK(X)
(x
t)K
K!
lK+l)(t) dt.
We assurne that the scaling function has a Fourier transform at 0 equal to 1, which
implies that the mother wavelet has a null Fourier trans form at 0 or, equivalently,
that it integrates to O. Therefore,
a(N)
1'(0) JLl
N2 1!
1"(0) JL2
N3 2!
+ ... +
I(K)(O) JLK
N K +1 K!
+r
(K)
N
235
x k1/l(x)*dx,
rN(K) .:::
NK+2 '
for some finite nonnegative c. In particular, a wavelet with moments that are zero
up to order K implies
deN) .:::
-JN N~+2'
THEOREM
LUIR), where
Then
(99)
Let N be a dyadic integer (that is, N = 2- j oko) such that 1/I(N) #- 0 (the
existence of N follows by the density of dyadic integers in IR and by the fact that
1/1 is continuous and not identically zero).
Let j > 1 be sufficiently large for 2j N to be an integer. By orthogonality
o=
2j
=
Passing to the limit j
~ 00
1/1 (x )1/t(2 j x
- 2 j N) dx
+ N)1/I(y)dy.
1/t(Tjy
(*)
1/I(y)dy
= O.
Suppose that (99) is true for k = 1, ... , n -
= O.
expansion
1/t(x)
=L
n
k=O
1/I(k)(N)
(x
N)k
k!
+ rn(x)
(x
N)n
n!
LI~
1/I(k)(N)
#- O. Substituting
236
= 1jI(n\N)
{ Tjn yn 1jI(y)dy
J1I?
n!
+(
J1I?
r n (2- j y
+ N) 2- jn y n 1jI(y)dy.
n!
yn1jl(y)dy
00,
and therefore,
= 0 .
= 0,
(100)
Im(x)dx = 0,
= Pvml satisfies
Im(x)dx = O.
(101)
mt oo
J1I?
= (
J1I?
Im(x)dx.
In Mallat's algorithm one first computes the projection Po I that is the approximation of I at the resolution level 0, and then the coarser resolution approximations
Pj I, j :s -1. As we have just seen, the moment conditions on 1jI are useful for
the first part of the algorithm. For the second part fast decay of the coefficients
hn =
<p(x)<p(2x - n)* dx
is needed for rapid numerical convergence. An ideal situation is when only a finite
number of h n are nonzero, which is guaranteed ifthe scaling function has compact
support. Note that if this is the case, then the compactness of the scaling function
carries over to the mother wavelet, and this is why one usually talks of compact
wavelets rather than compact scaling functions.
Let us mention at this point that if we start from a Riesz basis of Vo, as in the
method explained in Section D42, the compactness of w (there defined) does not
imply compactness of the scaling function. In the face of this negative statement
one needs to be reassured about the transmission of exponential decay from w to
<p. As a matter of fact the situation is not too bad, and a result in this direction is,
for example, Proposition 5.4.1 in [D3]. We end this section by showing how the
decay ofthe scaling function is transmitted to the coefficients h n Localization of
References
237
the scaling function can be taken in many related senses. We mentioned previously
one of them, namely ({J E Sr. Another definition of localization could be
L+
(1
00,
forallm E N.
(102)
Ixl~A
(103)
../211
: : . /2 (1
Ihni:::
({J(x)({J(2x -n)dxl
Ixl~A
+../2 (1
1({J(x)1 2 dx)1/2 ,
Ixl::::A
Ixl~A
and therefore with a proper choice of A, saya = n, in view of the tail majorization
(103), we obtain
Ihn I :::
Dm
(1
+ n)m
for all m E N,
(104)
where the Dm are finite. Thus, the Fourier coefficients of mo are rapidly decaying
and this implies that mo E C oo .
The topic of compact wavelets is an important one, but it is rather technical. The
interested reader is refered to [D3] for the detailed theory.
References
[01]
[02]
[03]
[04]
[05]
[06]
[07]
[08]
[09]
Appendix
Introduction
Integration is almost as old as mathematics. It is at least as old as Greek
mathematics,8 since Eudoxus and Archimedes used the exhaustion method to
compute the volume ofvarious solids, in particular, the pyrarnid and the cone. 9
The modem theory of integration is intimately linked to Fourier series. Indeed,
Bernhard Riemann (1826-1866) developed his theory of integration as a tool for
studying Fourier series, the theme of his memoir of habilitation to professorship at
the University of Gottingen. Also, Renri Lebesgue (1875-1941), who conceived
his theory of integration in the period from 1902 to 1906, stated in a 1903 artic1e: "[
am going to apply the notion of integral to the study of the trigonometrie expansion
offunetions that are not integrable in the sense of Riemann."
The Riemann integral has a few weak points, the two main ones being that
8Sir Thomas Heath, A History of Greek Mathematics; Vol. I: From Thales to Euclid,
Clarendon Press, Oxford, 1921; Dover edition, 1981.
9Exhaustion is the procedure by which we compute, for instance, the volume of the cone
of height h and circular base of radius R, as the limit of a heap of circular tiIes:
lim
h
2:>' (k)2
-R -.
n
n
n
ntoo k=!
242
Appendix
(1) The dass of nonnegative functions which are Riemann-integrable is not large
enough. Indeed, some functions have an "obvious" integral, and Riemann's integration theory denies it, while Lebesgue's theory recognizes it (see Example 9), and
its stability properties under the limit operation are too weak.
(2) The Riemann integral is defined with respect to the Lebesgue measure (the
"volume" in ffi.n), whereas the Lebesgue integral can be defined with respect to a
general abstract measure, a probability for instance.
The last advantage is an excellent argument to convince a student to invest a
little time in the study of the Lebesgue integral, because the return is considerable.
Indeed, the Lebesgue integral ofthe function f with respect to the measure p, (see
the meaning in the first chapter), modestly denoted by
Ix
fex) p,(dx),
contains a variety of mathematical objects, for instance, the usual Lebesgue integral
on the line,
f(x)dx,
can also be viewed (with profit) as a Lebesgue integral with respect to the counting
measure on Z. The Stieltjes-Lebesgue integral
f(x)dF(x)
E[Z]
are also in the scope ofLebesgue's integral. For the student who is reluctant to give
up the expertise dearly acquired in the Riemann integral, it suffices to say that any
Riemann-integrable function is also Lebesgue-integrable and that both integrals
then coincide.
Is Lebesgue's theory hard to grasp? Not at all, because most of the results are
very natural, and in that respect, the Lebesgue integral is much easier to manipulate
correctly than the Riemann integral. A tedious (but not difficult) part is the step-bystep construction of the Lebesgue integral. However, if one just gives a summary
of the main steps without going into the details, this is usually not a cause of
frustration for the student interested in applications. The really difficult part is the
proof of existence of certain measures, but students usually do not mind admitting
such results. For instance, there is an existence theorem for the Lebesgue measure
243
e (the "length") on JEt It says: There exists a unique measure e on JR: that gives to
the intervals [a, b] the measure b - a. Of course, in order to understand what all the
fuss is about, and what kind of mathematical subtleties hide behind such a harmless
statement, we shall have to be more precise about the meaning of "measure". But
when this is done, one is very much ready to approve the statement although the
proof is not immediate. Of course, in this appendix, the proofs of such "obvious"
results are not given. In fact, the goals of this appendix are to provide a tool and to
give a few tips as to how to use it safely. The reader who has no previous knowledge
ofintegration theory will therefore be very much in the situation of the new recipient
of a driving license who takes the road in spite of her inexperience. Experience is
best acquired on the road, and the main text contains many opportunities for the
student to check her reflexes and to apply the roles that are briefly explained in the
appendix. The student wishing to purehase good insurance is directed to the main
companies, a few of which are listed in the bibliography of this appendix.
Farewell and bon voyage!
P(X) is the collection of all subsets of an arbitrary set X; card(X) is its cardinal,
that is, the "number" of elements in it.
DEFINITION
1. Afamily X
(a) X E X,
() (A E X) ===} (:4 EX),
(y) (An E X for all n E N) ===}
if
(U~oAn EX).
244
Appendix
If X = ]Rn is endowed with the Euclidean topology, the Borel sigma-field B(]Rn)
is denoted Bn. For n = 1, we write B(]R) = B. For I =
I j , where I j is
a general interval of]R (I is then called a general reetangle of ]Rn), the Borel
sigma-field B(I) on I consists of all the Borel sets contained in I.
Hi=-l
THEOREM
Measurable Functions
One ofthe central notions ofLebesgue's integration theory is that of a measurable
function.
DEFINITION
f-1(C)
forall
CE
e.
f: (X,X)
(E,e),
or
fEeiX,
or
fEX,
where the third notation will be used only when (E, e) = (I, B(I, I being a
general rectangle of]Rn, provided the context is clear enough as to the choice of I.
If f : (X, X) ~ (]Rk, Bk) one says that f is a Borelfunetion from X to ]Rk.
(However, this is not quite standard terminology; in the standard terminology,
(X, X) must be some (]Rn, Bn).)
Let B be the sigma-field on i: generated by the intervals of type ( - 00, a], a E :IR.
A function f : (X, X) ~ (i:, B), where (X, X) is an arbitrary measurable space,
is called an extended Borel funetion, or simply a Borel funetion. As for functions
f : (X, X) ~ (]R, B), they are called real Borelfunetions. In general, in a sentence
such as "f is a Borel function defined on X," the sigma-field X is assumed to be
the obvious one in the given context.
It seems difficult to prove measurability since most sigma-fields are not defined
explicitly (see the definition of Bn, for instance). However, the following result
often simplifies the task.
245
THEOREM 2. Let (X, X) and (E, E) be two measurable spaces, where E = a(C)
for some collection C of subsets of E. Then f : (X, X) f-+ (E, E) if and only if
f-I(C)
for all C
C.
{f:S a}:= {x
IR n : !;(x):s a;joralll:s
i:s n}.)
3. Let (X, X), (Y, y), and (E, E) be three measurable spaces, and let
f-+ (Y, y), g : (Y, y) f-+ (E, E). Then f := g 0 q; : (X, X) f-+ (E, E).
q; : (X, X)
f-+
(ii) Let fn : (X, X) f-+ (IR, 8), n E N. Then lim infntoo fn and lim SUPntoo fn
are (possibly extended) Borel functions, and the set
Measures
DEFINITION 5. Let (X, X) be a measurable space and let /-L : X f-+ [0,00] be
a setfunction such thatforany denumerablefamily {An}n:::1 ofmutually disjoint
sets in X,
(105)
246
Appendix
The setfunction JL is called a measure on (X, X), and (X, X, JL) is ca lied a measure
space.
Property (105) is the sigma-additivity property.
The following three properties are easy to check:
JL(0) = 0;
(A ~ B and A, B E X)
(An E X for all nE N)
===}
===}
(JL(A):::: JL(B));
(JL(U~oAn):::: L~o JL(An)).
EXAMPLE
L aj lai(C),
00
JL(C) =
j=O
where aj
N, is a measure denoted JL
= L~o aj Bai.
5. There exists one and only one measure l on (~, S) such that
la, b]) = b - a.
(106)
EXAMPLE 5.
1. Fis nondecreasing;
2. F is right-continuous;
3. F admits a left-hand limit, denoted F(x-), at all x
247
f-LO, tD
ift 2:
0,
-f-Lt,OD
ift <
0.
(107)
= f-La, bD,
From the last formula, we deduce that any point set {a}, a E lR has null Lebesgue
measure, and therefore, any countable subset of lR (Ql, for instance), has null
Lebesgue measure.
The following lemma features the sequential continuity properties of measures.
LEMMA
f-L
(0
n=1
An)
= lim t
ntoo
f-L(A n).
(l08)
Let {Rn }n~ 1 be a nonincreasing (that is, Rn+1 ~ Rn for all n 2: 1) sequence of X
such that f-L(R no ) < 00 for some no E N+. Then
f-L
Proof:
(n
n=1
Rn)
= lim +f-L(Rn).
(l09)
n,!,oo
since
n-I
f-L(A n) = f-L(Ad
+L
f-L(Ai+1 - Ai)
i=1
and
f-L(QAn)
= f-L(A 1) + ~f-L(Ai+I-Ai)'
The necessity of the condition f-L(Rno ) < 00 for some no is illustrated by the
following counterexample. Let v be the counting measure on Z, and for all n 2: 1
define Rn = {i E Z : lil 2: n}. Then v(R n) = + 00 for all n 2: 1, and
v
(0
Rn)
= v(0) = 0.
248
Appendix
6. Let F : lR
(lR, ) such that Ffl- = F.
THEOREM
f--+
This result is easily stated, but it is not trivial.lt is typical of the existence results,
which ans wer the following type of question: Let C be a collection of subsets of X
with C c X, where X is the sigma-field on X generated by C. Given a set function
u : C f--+ [0,00], does there exist a measure fL on (X, X) such that fL(C) = u(C)
for all C E C, and is it unique?
The reason why such results are nontrlvial is that the sigma-field generated by
C is not explicitly constructed. It is therefore not easy to say what fL( C) should be
when one does not really know what a typical C E X should look like!
Negligible Sets
A very important concept in measure and integration theory is that of negligible
sets, with the correlated notion of "almost everywhere."
For instance, if I and g are two Borel functions defined on X, the expression
I Sg
fL-a.e.
means that
= o.
f--+
Praol: Let t E lR be such that I(t) =j:. g(t). For any c > 0, there exists s E
[t - c, t + c] such that I(s) = g(s). (Otherwise, the set {t; I(t) =j:. g(t)} would
contain the whole interval [t - c, t + c] and therefore could not be of null Lebesgue
measure.) Therefore, one can construct a sequence {tnk:::l converging to t and
such that l(tn) = g(tn) for all n ~ 1. Letting n tend to 00 yields l(t) = g(t), a
contradiction.
The Integral
Having defined measures and measurable functions, we are ready to construct the
abstract Lebesgue integral.
249
THEOREM 9.
Proof"
Take
In(x)
n2- n -l
kr n 1Ak ,n(x),
L
k=O
where
Ak,n
= {x
+ l)r n }.
I : (X, X)
1-+
(lR., B) of the
I(x)
= Lai lA/X),
i=l
where ai E lR.+, Ai E X for all i E {l, ... , k}, one defines the integral of I with
respect to J1" denoted
Ix
I dJ1"
or
Ix
I(x) J1,(dx),
or
J1,(f),
by
(110)
Jx{ I
dJ1,
= lim t {
ntoo
Jx In dJ1"
(111)
where {fn}n2:l is a nondecreasing sequence ofnonnegative elementary Borel functions In : (X, X) 1-+ (lR., B) such that limntoo t In = I. This definition can be
shown to be consistent, in that the integral so defined is independent of the choice
250
Appendix
of the approximating sequence. Note that the quantity (111) is nonnegative and
can be infinite. It can be shown that if I ::: g, where I, g : (X, X) 1--+ (~, 13) are
nonnegative, then
In particular, if
1+ = max(f, 0)
and
1- = max( - I, 0),
wehave
and therefore,
(112)
Integrable Functions
DEFINITION 9. A measurable function
JL-integrable function if
Ix III
I : (X, X)
dJL <
1--+
(113)
00.
with
Ix
Ix
(115)
This leads to one of the forms "finite minus finite," "finite minus infinite," and
"infinite minus finite." The case JL(f+) = JL(f-) = + 00 is rigorously excluded
from the definition, because it leads to the indeterminate form "infinite minus
infinite."
Counting Measure and Dirac Measure
251
EXAMPLE 7.
=L
00
1J.(f)
anl(n).
n=!
EXAMPLE 8.
t-+
= I(a).
Ix
lA dlJ.
= IJ.(A).
(116)
t-+
(X, X)
(117)
The extension to complex Borel functions of the properties (a), (b), (d), and (f) in
Theorem lO is immediate.
Riemann and Lebesgue
The following result tells us that all the time spent learning about the Riemann
integral has not been in vain.
THEOREM 11. Let I : (lR., ) t-+ (R ) be Riemann-integrable. Then it is
Lebesgue-integrable with respeet to l, and the Lebesgue integral is equal to the
Riemann integral.
EXAMPLE 9. The eonverse is not true: The funetion I defined by I (x) = 1 if x E Ql
and I (x) = 0 if x f/. Ql is a Borel funetion, and it is Lebesgue-integrable with its
integral equal to zero beeause {I =I- O} = Ql, has l-measure zero. However, I is
not Riemann-integrable.
252
Appendix
EXAMPLE
fex)
(R 8) defined by
x
1 +x2
+
f (x)
- - 2 1[0
l+x
'
oo)(x)
f-(x)
and
=-
- - 2 1(-00 Oj(X)
l+x
'
+A
-A
x
--2
l+x
dx
= O.
THEOREM
lim
ntoo
fn = f
/L-a.e.
and
Jx[ f
d/L = lim
ntoo
t [
Jx fn d/L.
The next result is a useful technical tool called Fatou 's lemma.
THEOREM
all n
(1
x
fn d/L) .
(118)
The domina ted convergence theorem is also called the Lebesgue theorem:
14. Let fn : (X, .1:') ~ (i, B), n ~ 1, be such that, for some function
f: (X,.1:') ~ (i, B) and some /L-integrablefunction g : (X,.1:') ~ (i, B),
THEOREM
(i) lim fn
ntoo
= f,
/L-a.e.,
:s
Igl JL-a.e.lorall n
253
1.
Then
Ix
= !~~
I dJL
(Ix
In dJL ) .
The results in Theorems 12 and 14 ensure that under certain circumstances limit
and integration may be interchanged (that is, JL(lim In) = lim JL(fn. The classical
counterexample that shows this is not always true is the following:
EXAMPLE
In(x)
=0
1
n
Ixl>-
if
1
n
:s x :s 0,
1
n
One has
lim In(x)
ntoo
that is, limntoo In
lor alt n ~ 1.
= 0 JL-a.e.
=0
if
x =j:. 0,
= O. However,
JL(fn)
=1
Ix
I(t, x) JL(dx).
(119)
THEOREM 15. Assume thatlor JL-almost alt x thefunction t "rl I(t, x) is continuousatto E (a, b)andthatthereexistsaJL-integrablefunctiong : (X, X) f-+ (i:, )
such that I/(t, x)1 :s Ig(x)1 JL-a.e. lor alt t in a neighborhood V 01 to. Then
I : V f-+ IR. is welt defined and is continuous at to. Furthermore, assume that
JL-a.e ..
1-
al (to, x) JL(dx).
x at
(120)
254
Appendix
Proof" Let (tn}n~l be a sequence in V \ {to} such that limntoo tn = to, and define
fn(x) = f(t n, x), f(x) = f(to, x). Then, by dominated convergence,
lim I(tn) = I(to).
ntoo
Also,
l(tn) - I(to)
=
tn - to
---'--'-~
f(tn, x) - f(to, x) dx
JL( ),
x
tn - to
to), X)I
) JL(dx)
{ af
= Jx
(to, x) JL(dx).
at
JL(A l x A2)
for all Al
= JLl(A l )JL2(A2)
(121)
Xl> A 2 E X 2.
The measure JL is the product measure of JLl and JL2, and is denoted JLl JL2.
The above result extends in an obvious manner to a finite number of sigma-finite
measures.
EXAMPLE 12. The typical example of a product measure is the Lebesgue measure
on the space (Rn, Bn): It is the unique measure in on that space that is such that
in( Ai) =
B.
255
17. Let (X I, XI, Ji,1) and (X I, X 2, Ji,2) be two measure spaces in which
Ji,1 and Ji,2 are sigma-finite. Let (X, X, Ji,) = (XI x X2, XI X X 2, Ji,1 Ji,2).
(A) ToneIli. lf I is nonnegative, then, lor Ji,1-almost all XI, the function X2 --+
I(XI, X2) is measurable with respect to X 2, and
THEOREM
XI
--+ (
X2
(123)
(B) Fubini. If I is Ji,-integrable, then, lor Ji,1-almost all XI, the function X2 --+
I(XI, X2) is Ji,2-integrable and XI --+ JX2 I(XI, X2) Ji,2(dx2) is Ji,2-integrable, and
(123) is true.
In this text we shall refer to the global result as the Fubini-Tonelli theorem.
Part (A) says that one can integrate a nonnegative Borel function in any order
of its variables. Part (B) says that the same is true of an arbitrary Borel function if
that function is Ji,-integrable. In general, in order to apply Part (B), one must use
Part (A) with I = 1I1 to ascertain whether or not J 1I1 dJi, < 00.
EXAMPLE 13. Consider thefunction I defined on XI x X2
thelormula
Wehave
= h(X2)
However;
1
1
h(X2) dx2
1=
00
(-
0,
h(xI)) dxl,
since h ~ f-a.e. on (0, 00). We therelore see that successive integrations yield
different results according to the order in which they are perjormed. As a matter
ollact, I(XI, X2) is not integrable on (0,1) x (1,00).
256
Appendix
Integration by Parts
THEOREM
18. Let
interval (a, b)
(R 8).
For any
lR,
= {
~a.bl
(124)
Observe that in the first integral we have (a, t] (c1osed on the right), whereas in
the second integral we have (a, t) (open on the right).
Proof' The proof consists of computing the l1-measure of the square (a, b] x
(a, b] in two ways. The first one is obvious and gives the left-hand side of (124).
The second one consists of observing that 11a, b] x (a, b]) = I1(Dd + I1(D2),
where D I = {(x, y);a < y ::::: b, a < X::::: y} and D 2 = (a, b] x (a, b] \ D I . Then
I1(D 1) and I1(D2) are computed using Tonelli's theorem. For instance,
I1(Dd =
and
L(L
I D ,(x, y)111(dx)
l\a<x:o;yjI11(dx)
= 111a, y]).
Let 11 be aRadon measure on (lR, 8) and let F/i- be its c.dJ. The notation
g(x) F/i-(dx)
stands for IIR g(x) l1(dx). When this integral is used, it is usually called the
Lebesgue-Stieltjes integral of g with respect to F w With this notation, (124)
becomes
F 1(b)F2(b) - F1(a)F1(b)
= (
~.bl
F 1(x)dF2(x)
+(
~a.~
F2(X-) dFl(X),(125)
The Spaces LP
For a given P :::: 1, L~(I1) is, roughly speaking (see the details below), the collection
of complex-valued Borel functions J defined on X such that
IfIP dl1 < 00.
We shall see that it is a complete normed vector space over C, that is, a Banach
space. Of special interest to Fourier analysis is the case P = 2, since L~(I1) has
additional structure that makes of it a Hilbert space.
Ix
Let (X, X, 11) be a measure space and let J, g be two complex-valued Borel
functions defined on X. The relation R defined by
(fRg)
(f = g l1-a.e.)
257
(Ix
===}
III P dJL
Ix Igl
dJL) .
+ {g} = {f + g},
{f}*
= {f*},
a{f}
= {af},
= {fg}.
The first equality means that {f} + {g} is, by definition, the equivalence dass
consisting of the functions 1+ g, where land g are abritrary members of {f}
and {g}, respectively. A similar interpretation holds for the other equalities.
By definition, for a given p ::: 1, L~(JL) is the collection of equivalence dasses
{f} such that
IIIP dJL < 00. Clearly, it is a vector space over C (for the proof
recall that
Ix
CII;
Igl)
s ~ III P + ~ Igl P
Proof' From the inequality la Iq S 1 + la IP, true for all a E C, it follows that
JLOIlq) S JL(I) + JLOIIP). Since JL(I) = JL(lR.) < 00, JL(IIlq) < 00 whenever
JL(IIIP) < 00.
20. Let p and q be positive real numbers different Iram 1 such that
-p + -q = 1
(p and q are then said to be conjugate), and let I, g : (X, X) t-+ (i,8) be
nonnegative. Then, we have Hlder's inequality
(127)
In particular,
if I, g
E L~(lR.),
then Ig E L~(lR.).
258
Proof'
Appendix
Let
A
(Ix
Ix FP dJl Ix
=
00,
Gq dJl = 1.
The inequality
F(x)G(x)::::: - F(x)P
p
+ -1 G(x)q
(*)
= p In(F(x)),
= q In(G(x)).
tex)
From the convexity of the exponential function and the assumption that 1/ p
l/q = 1,
es(x)/p+t(x)/q :::::
and this is precisely the inequality (*). Integrating this inequality yields
1
x
1
p
1
q
+ - = 1,
(FG)dJl::::: -
that
Ix
1-+
Ix gP dJl <
00.
Ix J(f +
[Ix jP dJl
r/ [Ix
p
(f + g)(p-l)q
r/
259
and
Ix g(f
+ g)p-l dlL
Adding together the above two inequalities and observing that (p - l)q = p, we
obtain
One may assume that the right-hand side of (128) is finite and that the lefthand side is positive (otherwise the inequality is trivial). Therefore, !x(f +
g)P dlL E (0,00). We may therefore divide both sides of the last display by
[Jx (f + g)P dlL q. Observing that 1 - 1/q = 1/P yields the desired inequality
(128).
For the last assertion of the theorem, take p = q = 2.
L~(IL)
1-+
[0,00) defined by
)I IP
(129)
UX
C, I
E L~(IL).
Riesz-Fischer Theorem
We shall denote vp(f) by IIfll p. Thus L~(IL) is a normed vector space over C,
with the norm 11 . 11 p and the induced distance
dp(f, g)
= 111 -
gllp.
THEOREM 23. Let p ::: 1. The distance d p makes 01 L~ a complete normed space.
11 .
I p
To show completeness one must prove that for any Cauchy sequence
Since {fn}n~1 is a Cauchy sequence (that is, limm,ntoo dp(fn, Im) = 0), one can
select a subsequence (fn,}i~1 such that
gk
=L
i=1
I/ni+l - Inil,
(*)
260
Appendix
00
g=
;=1
I/ni+' - Ini I
By (*) and Minkowski's inequality we have IIgk I p :s 1. Fatou's lemma applied to the sequences {gfk~:1 gives IIgli p :s 1. In particular, any member of
the equivalence c1ass of g is finite /L-almost everywhere, and therefore
L (jni+' (X) 00
In, (X) +
ln/x))
;=1
converges absolutely for /L-almost all x. Call this limit I(x) (set I(x)
this limit does not exist). Since
= 0 when
k-I
In,
+L
;=1
we see that
One must show that I is the limit in Lt(/L) of Unkk~:I' Let e > O. There exists an
integer n = N(e) such that II/n - Im I p :s e whenever m, n 2: N. For all m > N,
by Fatou's lemma we have
Jx[ 1I -
I~OO
m--+oo
111 - Imllp
= O.
lim f,n
;too
'
=I
/L- a.e.
(130)
Note that the statement in (130) is about functions and not about equivalence
c1asses. The functions thereof are any members of the corresponding equivalence
c1ass. In particular, since when a given sequence of functions converges /L-a.e. to
two functions, these two functions are necessarily equal/L-a.e.
References
THEOREM 25.
261
=g
f-L-a.e.
IIfII =
[Ix
Ifl 2 df-L
f/2
Ix
fg* df-L
IIfII 2
= (j, f).
Approximation Theorems
We now quote the approximation results used in the main text.
27. Let f E L~(lR.), P ~ 1. There exists a sequence {fnln:o:! of continuous functions fn : IR t-+ C with compact support that converges to f in
THEOREM
L~(IR).
(To have compact support means, for a continuous function, to be null outside
some c10sed bounded interval.)
Let f E L~(IR), P ~ 1. There exists a sequence {fnln:o:! offunctions
fn : IR t-+ C which are finite linear combinations ofindicatorfunctions ofintervals,
that converges to f in L~(IR).
THEOREM 28.
THEOREM
References
[Dl]
[D2]
[D3]
[D4]
[D5]
Glossary of Symbols
card (X), or
N, the integers.
N+, the positive integers.
Z, the relative integers.
IR, the reals.
IR+, the positive reals.
C, the complex numbers.
C.
(a, b], interval of IR open to the left, c10sed to the right; and similar notation for
the other types of intervals.
Re(z), the real part of z E C.
J : IR
1-+
C.
= J.
J(t - s)g(s)ds =
g(t - s) J(s)ds.
264
Glossary of Symbols
* f*n.
A, =
otherwise.
= l(t)l[o,Tj(t).
l[_~,+~j(t),
=b-
a.
that
flR
I/(t)IP dt <
f:
1 : lR
such that
~ C such
00.
I/(t)IP dt <
1 : Ca, b]
~ C
00.
1 : lR ~
C such
{Xn}nEZ
such that
LnEZ IXn 12
<
00.
1 : lR ~ C.
C.
1 : lR ~ C.
C~, the set of continuous functions 1 : lR ~ C with bounded support.
C([O, T]), the set of continuous functions 1 : [0, T] ~ C.
Co, the set of continuous functions
1 : [0, T]
decreasing.
Sr. the set of functions 1 : [0, T]
order r rapidly decreasing.
C in
S', the set of tempered distributions on lR; the set oflinear forms on S.
(x, Y) H, the Hermitian product of x, y EH, H Hilbert space.
IIx 11 H
265
cn(f)
S(f)
j(v)
H(z)
CPj,n(t)
= 2 j / 2cp(2 j t -
n).
Index
convolution, 14
convolution-multiplication rule, 14,25
convolutional filter, 55, 101
counting measure, 246
cut-off frequency, 58
Decomposable signal, 61
dense, 138
differentiating filter, 59
dilation equation, 209
Dini's theorem, 34
Dirac comb, 91
Dirac measure, 246
Dirichlet integral, 32
Dirichlet kerneI, 32, 36
dispersive channel, 72
distance, 135
distribution function, 246
dominated convergence, 252
down-sampling, 118
Elementary Borel function, 249
Fejer's idenlity, 110
Fejer's kerneI, 40
Fejer's lemma, 112
Fatou's lemma, 252
268
Index
localization principle, 33
locally integrable, 8, 23
locally square-integrable, 8
locally stable, 23
low-pass, 58
Mallat's algorithm, 211
measurable function, 244
measure, 246
measure space, 246
metric space, 135
Mexican hat, 192
Meyer's wavelet, 218
Minkowski's inequality, 258
monotone convergence, 252
Morlet's pseudo-wavelet, 192
mother wavelet, 185,201
MRA,196
multiresolution analysis, 196
Norm, 134
norm of a linear form, 143
Nyquist condition, 85
Octave band filter, 122
orthogonal, 134
orthogonal complement, 139
orthogonalprojection,140
orthogonalsum, 143
orthonormal system, 145
Parallelogram identity, 134
partial response signaling, 87
periodic signal, 23
phase, 57
phase delay, 73
Plancherel-Parseval identity, 157, 162
Poisson sum formula, 91
polarization identity, 134
pre-Hilbert space, 133
probability measure, 246
product measure, 254
product sigma-field, 254
projection principle, 141
projection theorem, 140
pulse amplitude modulation, 84
Pythagoras' theorem, 134
QMF,120
Index
quadrature components, 68
quadrature mirror filter, 120
quadrature multiplexing, 70
quasi-positive delta sequence, 231
Radon measure, 246
realizable filter, 55, 101
rectangular pulse, 9
regularization lemma, 19
regularizing function, 19
reproducing kernel, 230
residue theorem, 105
resolution level, 196
Riesz basis, 164
Riesz's representation theorem, 144
Riesz-Fischer theorem, 259
root mean-square width, 176
Sampie and hold, 78
scalar product, 133
scaling function, 196
Schwarz inequality, 134
Shannon wavelet, 210
short-time Fourier transform, 178
sigma-additivity, 246
sigma-fie1d, 243
sigma-finite measure, 246
spectral decomposition, 62
spectral factorization, 112
spectrum folding, 79
stable, 7, 100
synchronous detection, 70
Tonelli's theorem, 255
total, 148
transfer function, 104
triangular pulse, 14
Uncertainty princip1e, 175
up-sampling, 118
Wavelet orthonormal basis, 201
Weierstrass theorem, 41
WFT,178
WFT inversion formula, 179
window function, 169, 178
windowed Fourier transform, 178
WT, 185
Z-transform, 104
269