2007-JSSC-Equalization and Clock and Data Recovery Techniques for 10-Gbps CMOS Serial-Link Receivers
2007-JSSC-Equalization and Clock and Data Recovery Techniques for 10-Gbps CMOS Serial-Link Receivers
Abstract—Two equalizer filter topologies and a merged equal- with other equalization methods [6] so as to mitigate both
izer/CDR circuit are described that operate at 10 Gb/s in 0.13- m effects.
CMOS technology. Using techniques such as reverse scaling, pas- Section II presents various gain peaking methods that are
sive peaking networks, and dual- and triple-loop adaptation, the
prototypes adapt to FR4 trace lengths up to 24 inches. The equal- used in Section III to develop two equalizer filter topologies.
izer/CDR circuit retimes the data with a bit error rate of 10 13 Section IV describes an adaptive equalizer architecture and
while consuming 133 mW from a 1.6-V supply. Section V the merged equalizer/CDR circuit. Section VI sum-
Index Terms—Adaptive equalization, analog equalization, marizes the experimental results for the two prototypes.
broadband receivers, DFE, FFE, high-speed links, lossy channel,
reverse scaling. II. GAIN PEAKING TECHNIQUES
A. General Considerations
I. INTRODUCTION
Copper traces designed as transmission lines on FR4 sub-
strates suffer from both skin effect and dielectric loss. For ex-
Fig. 1. Complex pole peaking circuit. (a) Implementation. (b) Magnitude response. (c) Phase response. (d) transient response.
concluding that the product of the gain, the boost factor, and the
bandwidth of the stage is limited by the of the technology.1
It is important to appreciate the impact of the limited
(about 75 GHz in 0.13- m CMOS technology) on equaliza-
tion. For a small-signal gain of 2, an undegenerated differen-
tial pair with an overdrive voltage of 300 mV and fanout of
unity yields a bandwidth of less than 12.5 GHz. The degener-
ated structure trades this gain-bandwidth product for the boost
factor, the low-frequency gain, and the linear range.
With the performance envelope imposed by (2), a cascade
of stages such as the circuit of Fig. 2(a) fails to provide the
required boost factor while accommodating a data rate of 10
Gb/s and a reasonable low-frequency gain (e.g., 6 dB–0 dB).
Fig. 2. RC-degenerated differential pair. (a) Circuit implementation. (b) Fre- As the number of stages in the cascade increases to achieve a
quency response: actual response (solid line); bode approximation (dashed line).
higher boost factor, the overall bandwidth tends to drop unless a
greater low-frequency loss is allowed in each stage. Simulations
indicate that, for a total boost factor of 22 dB at 5 GHz and an
low- complex poles. Of course, inductors can still act as shunt- overall bandwidth of 5.5 GHz,2 three degenerated stages with a
peaking elements to broaden the bandwidth of equalizer stages. low-frequency loss of 6 dB per stage are required. Such a high
An efficient method of boosting by means of real loss results in a sensitivity degradation of about 3 dB.
zeros is capacitive degeneration. Fig. 2(a) shows a de-
generated differential pair that yields a zero at C. Peaking By Passive Networks
and poles at and We propose the use of passive high-pass networks to pro-
, with a low-frequency gain vide boost at the front end of an equalizer, thus relaxing the
of . Fig. 2(b) depicts linearity and gain peaking of the subsequent active stages and
the frequency response. Improving the linearity of the stage, saving power consumption. Fig. 3(a) depicts an example where
degeneration nonetheless creates a trade-off between the the zero and pole frequencies are respectively given by
low-frequency gain and the boost factor, . Interestingly, and and the boost factor by
one can write if is neglected. To avoid degrading the input
1Here, C includes the input capacitance of the next stage.
2The overall bandwidth refers to that of the FR4 trace along with the equalizer
(2) circuit.
GONDI AND RAZAVI: EQUALIZATION AND CLOCK AND DATA RECOVERY TECHNIQUES FOR 10-GB/S CMOS SERIAL-LINK RECEIVERS 2001
Fig. 3. (a) Passive network. (b) Proposed passive network with series-inductive peaking.
Fig. 4. Illustration of the reverse scaling technique. (a) Block diagram. (b) Single-ended circuit realization.
(CM) levels. This mitigates the voltage headroom issue and, range of 1–4 GHz. The pole provides additional adjustment of
more importantly, avoids variability in the CM level seen by the overall phase response in the all-pass path.7
- due to the preceding stage, thus maintaining a constant The use of series inductive peaking (with 3-nH inductors)
tuning range. The capacitors 0.25 pF are realized using in the passive boost stage allows wide input transistors (
multi-finger fringe structures having a parasitic component of m) in the first differential pair and hence reverse scaling
about 3%. The CM level is generated using a resistive divider. through the cascade. The low-frequency loss of 7 dB in the
The corner frequency associated with this capacitive coupling passive network degrades the sensitivity to some extent. With
is around 3 MHz, resulting in negligible droop with encoded inductive peaking and negative Miller capacitances, the band-
data. width of each active stage reaches 18 GHz. The summing stage
incorporates active feedback [10] to improve the speed while
avoiding inductors. Since active feedback trades low-frequency
B. Equalizer Filter II
gain for bandwidth, it has been applied to only one stage.
In Section II, the high-pass passive network was introduced to The choice between the two types of equalizers described
provide a peaking profile. The use of this network in the equal- above somewhat depends on the application. The former does
izer filter can save power consumption by relaxing the linearity not incorporate a passive network at the input, providing greater
and gain peaking requirements of the active stages. Fig. 8 shows sensitivity but consuming a higher power. The latter achieves a
the second equalizer filter architecture [12], which incorporates higher linearity and wider tuning range.
both passive peaking and reverse scaling. In contrast to the first
IV. ADAPTIVE EQUALIZER
equalizer, this design performs boost tuning by interpolation be-
tween a peaking path and an all-pass path (set by coefficients Fig. 9 shows the first adaptive equalizer architecture [9],
and ), thus achieving a wider tuning range than that ob- where the equalizer filter is followed by a slicer, and two loops
tained by means of MOS varactors and variable resistors. Also, a control the boost in the filter and the swing in the slicer. The
constant (linear) degeneration resistance in the differential pairs equalized data, , is sensed at node A rather than B because
yields higher linearity and a more accurately-defined low-fre- the slicer incorporates some peaking to improve convergence
quency gain. of the loops, thereby leading to larger jitter at B than at A when
The use of interpolation nonetheless presents a difficulty for the loops reach steady state. The need for the swing control
intermediate line lengths, i.e., if . The disparate de- loop is justified as follows.
lays through the two paths result in substantial ISI after their High-speed adaptive equalizers set the peaking in the filter
corresponding outputs are summed. Realized as a degenerated stages so as to compensate for the high-frequency loss of the
differential pair, the phase shift block in the all-pass path par- channel. To this end, the equalizer output is sharpened by a slicer
tially corrects this error. The zero of this differential pair is po- and the adaptation loop adjusts the peaking according to the
sitioned such that the phase response approximates the effective 7Simulations indicate that a 20% delay mismatch produces less than 5 ps of
response of the three zeros in the peaking path for the frequency ISI jitter at the equalizer output.
2004 IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 42, NO. 9, SEPTEMBER 2007
Fig. 10. Spectrum before and after slicer with swing mismatch.
Fig. 14. Simulated dynamics of (a) boost and swing control signals, and (b) the
Fig. 12. Response of the adaptive equalizer to a 25% mismatch in the line. control voltage of the VCO in the CDR loop.
(a) Frequency response. (b) Eye diagram.
Fig. 19. Measured results before and after equalization at 10 Gb/s for FR4 traces, (horizontal scale: 20 ps/div.): (a) before equalization for 30-in FR4 (vertical
scale: 100 mV/div.), (b) after equalization for 30-in FR4 (vertical scale: 100 mV/div.), (c) after equalization for 6-in FR4 (vertical scale: 100 mV/div.).
VII. CONCLUSION
The high loss of long traces on FR4 boards can be compen-
sated through the use of equalization techniques such as pas-
sive peaking networks, reverse scaling, and capacitive degener-
ation. To adapt to the line length, equalizers must incorporate
both boost and swing control while guaranteeing smooth, con-
flict-free convergence. Finally, the equalization and CDR func-
tions can be merged to eliminate slicers. This work has demon-
strated these concepts for a data rate of 10 Gb/s and trace lengths
Fig. 20. Die photograph of merged adaptive equalizer/CDR.
of 24 inches in 0.13- m CMOS technology.
Fig. 21. Measured results before and after equalization/data recovery at 10 Gb/s for FR4 traces (horizontal scale: 20 ps/div.): (a) before equalization/data recovery
for 24-in FR4 (horizontal scale: 10 ps/div., vertical scale : 75 mV/div.), (b) after equalization/data recovery for 24-in FR4 (horizontal scale: 20 ps/div., vertical
scale : 20 mV/div.), (c) after equalization/data recovery for 6-in FR4 (horizontal scale: 20 ps/div., vertical scale: 20 mV/div.).
Fig. 22. BER sensitivity graph for equalization/data recovery for 24-in FR4 at
10 Gb/s. Fig. 24. (a) Loss profile of 30-in microstrip on FR4 board. (b) Narrowband loss
model.
Fig. 23. Measured results after equalization/clock recovery for 24-in FR4 at
10 Gb/s. (a) Recovered clock spectrum. (b) Recovered clock jitter histogram
(horizontal scale: 5 ps/div.).
Fig. 26. Profiles of actual microstrip (dashed lines) and scalable model (solid
lines) for a 30-in trace. (a) Magnitude profile. (b) Phase profile. Fig. 27. (a) Error due to additive noise based on the sampling points in the eye.
(b) BER as a function of clock skew.
[Fig. 25(b)]. The conductance to the substrate, therefore, in- where denotes the rms value of noise. Carrying out the inte-
creases with frequency. Accurate modeling of the dielectric loss gration in (6) and neglecting higher-order terms, we have
necessitates at least two such sections: the loss in the –
branch becomes appreciable above 0.4 GHz and that in the
– branch above 20 GHz, thus providing good accuracy up
to 7 GHz. The above model can simply be cascaded to represent
longer traces. Fig. 26 compares the loss and phase profiles for
a 30-in trace as predicted by field simulations and the model,
demonstrating a reasonable fit.9
(10)
APPENDIX II
BER ESTIMATION For example, with mV, mV,
mV ,10 and sampling in the middle of the eye, we obtain
The BER of equalizer/CDR chains is a function of additive .
noise, ISI, and CDR skew and jitter. In this section, we propose In the presence of clock skew and jitter, the sampling point de-
a method of estimating the BER based on these parameters. The viates from the maximum eye opening, rapidly raising the BER.
objective is to develop an intuitive understanding of the tradeoffs (Since the ISI-induced jitter in the data waveform contains pre-
and hence arrive at a reasonable compromise. dominantly high-frequency components [19], the CDR circuit
Consider the eye diagram shown in Fig. 27(a), where fails to track the corresponding phase variations.) For
and denote the peak received swing and the peak eye to in Fig. 27(a), we approximate the eye opening as
opening, respectively. We first exclude clock jitter. The ISI and plot as a function of the
due to limited bandwidth or incomplete equalization leads to a clock skew [Fig. 27(b)]. Here, the above values of , ,
roughly uniform distribution of the amplitude between and are used and .
. Denoting this distribution by and that of additive We can now incorporate the effect of clock jitter by weighting
Gaussian noise by , we observe that an error occurs if (10) according to the jitter distribution, , where
noise causes a level between and to fall below zero denotes the random departure of the sampling point from the
(or a level between and to exceed zero). Thus, the center of the eye. As illustrated in Fig. 28(a), if places the
probability of error is given by sampling instant at , then is lower and the effect of
more pronounced. The error probability density function across
the bit period can therefore be expressed as
(6)
(11)
Also, Plotted in Fig. 28(b) for a Gaussian jitter distribution having
an rms value of ps, this result reveals that errors are
(7) most likely to occur in the vicinity of ps, where the
dramatic rise in still compensates for the fall in . Beyond
(8) this point the clock jitter becomes so improbable that errors are
less and less likely to occur.
and The framework developed above can be used to formulate
the trade-offs between required swings, additive noise, ISI, and
(9) tolerable clock jitter. For example, with mV and
, the plots in Fig. 29 can be generated, where
9Simulations indicate that if the proposed model is altered to incur one more
dB of error, the equalizer output suffers from 2.6 dB of additional vertical clo- 10The additive noise is obtained by integrating the noise at the output of the
sure and 0.03 UI of additional jitter. equalizer path up to 100 GHz.
2010 IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 42, NO. 9, SEPTEMBER 2007
The authors wish to thank V. Pathak and Kawasaki Micro- Srikanth Gondi received the B.E. (Hons.) degree in
electronics America for support of this work, Dr. J. Lee for de- electrical and electronics engineering from Birla In-
sign assistance, and TSMC for fabrication support. stitute of Technology and Science, Pilani, India, in
1997, the M.Sc. degree in computer engineering from
Iowa State University, Ames, in 2002, and the Ph.D.
degree in electrical engineering from University of
REFERENCES California, Los Angeles, in 2006.
He was a Research Assistant at Iowa State Uni-
[1] K. Krishna, D. A. Yokoyama-Martin, S. Wolfer, C. Jones, M. versity until 1999, where he focused on mixed-signal
Loikkanen, J. Parker, R. Segelken, J. L. Sonntag, J. Stonick, S. Titus, circuits for the gigabit ethernet transceiver and the
and D. Weinlader, “A multigigabit backplane transceiver core in 1394 home-networking applications. From 2000 to
0.13 m CMOS for serial data transmission across high loss legacy 2002, he was a Mixed-Signal Design Engineer at Texas Instruments, Dallas, TX,
backplane channels,” IEEE J. Solid-State Circuits, pp. 2658–2666, working on high-speed interface architectures and circuits for wireline applica-
Dec. 2005. tions, realized in BiCMOS technologies. He was a Graduate Student Researcher
[2] G. Zhang and M. M. Green, “A BICMOS 10 Gb/s adaptive cable equal- at the University of California, Los Angeles, from 2002 to 2006, where he devel-
izer,” in IEEE Int. Solid-State Circuits Conf. Dig. Tech. Papers, Feb. oped circuits and architectures for serializer-deserializer (SerDes) applications.
2004, pp. 482–483. He is currently a Senior Design Engineer in the R&D Division of Kawasaki
[3] Y. Tomita, M. Kibune, J. Ogawa, W. W. Walker, H. Tamura, and Microelectronics America, working on analog front-ends for both wireless and
T. Kuroda, “A 10-Gb/s receiver with series equalizer and on-chip wireline applications. His current research interests include mixed-signal equal-
ISI monitor in 0.11 m CMOS,” IEEE J. Solid-State Circuits, pp. izers, data converters, phase-locking and clock recovery for data communica-
986–993, Apr. 2005. tions.
GONDI AND RAZAVI: EQUALIZATION AND CLOCK AND DATA RECOVERY TECHNIQUES FOR 10-GB/S CMOS SERIAL-LINK RECEIVERS 2011
Behzad Razavi (S’87–M’90–SM’00–F’03) re- includes wireless transceivers, frequency synthesizers, phase-locking and clock
ceived the B.S.E.E. degree from Sharif University of recovery for high-speed data communications, and data converters.
Technology, Tehran, Iran, in 1985 and the M.S.E.E. Prof. Razavi received the Beatrice Winner Award for Editorial Excellence at
and Ph.D.E.E. degrees from Stanford University, the 1994 ISSCC, the best paper award at the 1994 European Solid-State Circuits
Stanford, CA, in 1988 and 1992, respectively. Conference, the best panel award at the 1995 and 1997 ISSCC, the TRW Inno-
He was with ATT Bell Laboratories and Hewlett- vative Teaching Award in 1997, and the best paper award at the IEEE Custom
Packard Laboratories until 1996. Since 1996, he has Integrated Circuits Conference in 1998. He was the co-recipient of both the Jack
been Associate Professor and subsequently Professor Kilby Outstanding Student Paper Award and the Beatrice Winner Award for Ed-
of electrical engineering at the University of Cali- itorial Excellence at the 2001 ISSCC. He was recognized as one of the top 10
fornia, Los Angeles. He was an Adjunct Professor authors in the 50-year history of ISSCC. He received the Lockheed Martin Ex-
at Princeton University from 1992 to 1994, and at cellence in Teaching Award in 2006 and the UCLA Faculty Senate Teaching
Stanford University in 1995. He is the author of Principles of Data Conversion Award in 2007. He served on the Technical Program Committees of the Inter-
System Design (IEEE Press, 1995), RF Microelectronics (Prentice Hall, 1998) national Solid-State Circuits Conference (ISSCC) from 1993 to 2002 and VLSI
(translated to Chinese and Japanese), Design of Analog CMOS Integrated Cir- Circuits Symposium from 1998 to 2002. He has also served as Guest Editor
cuits (McGraw-Hill, 2001) (translated to Chinese and Japanese), Design of Inte- and Associate Editor of the IEEE JOURNAL OF SOLID-STATE CIRCUITS, IEEE
grated Circuits for Optical Communications (McGraw-Hill, 2003), and Funda- TRANSACTIONS ON CIRCUITS AND SYSTEMS, and International Journal of High
mentals of Microelectronics (Wiley 2006), and the editor of Monolithic Phase- Speed Electronics. He is an IEEE Distinguished Lecturer and a Fellow of IEEE.
Locked Loops and Clock Recovery Circuits (IEEE Press, 1996) and Phase-
Locking in High-Performance Systems (IEEE Press, 2003). His current research