Ecen720: High-Speed Links Circuits and Systems Spring 2021: Lecture 12: Cdrs
Ecen720: High-Speed Links Circuits and Systems Spring 2021: Lecture 12: Cdrs
Sam Palermo
Analog & Mixed-Signal Center
Texas A&M University
Announcements
• Lab 6 due Apr 12
• Project Preliminary Report due Apr 19
• Project Final Report due Apr 29
2
Agenda
• CDR overview
• CDR phase detectors
• Single-loop analog PLL-based CDR
• Dual-loop CDRs
• Phase interpolators
• CDR jitter properties
3
Embedded Clock I/O Circuits
• TX PLL
• TX Clock Distribution
• CDR
• Per-channel PLL-based
• Dual-loop w/ Global PLL &
• Local DLL/PI
• Local Phase-Rotator PLLs
• Global PLL requires RX
clock distribution to
individual channels
4
Clock and Data Recovery
[Razavi]
• A clock and data recovery system (CDR) produces the clocks to sample
incoming data
• The clock(s) must have an effective frequency equal to the incoming
data rate
• 10GHz for 10Gb/s data rate
• OR, multiple clocks spaced at 100ps
• Additional clocks may be used for phase detection
• Sampling clocks should have the proper phase relationship with the
incoming data for sufficient timing margin to achieve the desired bit-
error-rate (BER)
• CDR should exhibit small effective jitter
5
Embedded Clocking (CDR)
PLL-based CDR Dual-Loop CDR
VCTRL Frequency 800MHZ
CP PFD Ref Clk
Synthesis
PLL Vctrl
4
PLL[0]
PLL[4:0] PLL[4:0]
proportional (3.2GHz)
RX[n:0] gain
5:1 5:1
5 Mux/ MUX MUX 10
integral Interpolator
Din gain
Pairs 15
early/
late early/
RX late sel
FSM
Loop Filter (16Gb/s) PD
Phase-Recovery Loop
• Clock frequency and optimum phase position are extracted from incoming data
• Phase detection continuously running
• Jitter tracking limited by CDR bandwidth
• With technology scaling we can make CDRs with higher bandwidths and the jitter tracking
advantages of source synchronous systems is diminished
• Possible CDR implementations
• Stand-alone PLL
• “Dual-loop” architecture with a PLL or DLL and phase interpolators (PI)
• Phase-rotator PLL
6
Agenda
• CDR overview
• CDR phase detectors
• Single-loop analog PLL-based CDR
• Dual-loop CDRs
• Phase interpolators
• CDR jitter properties
7
CDR Phase Detectors
[Perrott]
Tb/2
ref
[Razavi]
Late
Tb/2 ref
Tb/2
ref
[Razavi]
Late
[Lee]
“-1” “-1”
• Mueller-Muller phase
detector commonly used
16
Mueller-Muller Baud-Rate Phase Detector
MM-PD is measuring the effective “1”
[Musa]
ଵ ିଵ
which can be computed by
“-1” “-1”
ିଵ ାଵ
20
Analog PLL-based CDR
“Linearized” KPD
[Lee]
21
Analog PLL-based CDR
[Lee]
architecture
1-tap FIR, 1-tap
PAM4 Equalizer IIR DFE
+ Z-1
• 3 data samplers
MUX
DFE
DFE DFE
IIR Slicer
FIR IIR
Time
for PAM4
Weight Amp. Thr.
Const.
Input 2-bit Flash ADC
CTLE 3 12
D
detection
+
DFE adaptation
Divider 14 GHz
and LC-VCO
Buffers
• 1 error sampler
BBPD
for threshold
adaptation [Roshan-Zamir JSSC 2019]
23
56Gb/s PAM4 Analog PLL-based CDR
• PLL-based CDR to In Dn[1:3]
24 PAM4 Early
Loop Filter
BBPD
reduce power
12 Charge
4:8 Late
En Pump
consumption
8
4
14 GHz
• Bang-bang phase 4 4
2X Oversampling
LC-VCO
detector works on
Clock Generators
Data I
Phase
symmetric PAM4
CLK CLK0
Calibration 4 I Q
CLK45
CML
VCNT
Divider
transitions to
4 IB
Edge CLK90 Q
CLK QB
errors
CMOS
• Parallel charge
pumps minimize
logic and loop delay
12 Charge
4:8 Late
En Pump
LC filter improves
8
4
phase noise
14 GHz
LC-VCO
2X Oversampling
4 4
• 8-phase quarter-
Clock Generators
Data I
Phase
rate clock
CLK CLK0
Calibration 4 I Q
CLK45
CML
VCNT
IB Divider
• CML divider
4
Edge CLK90 Q
CLK QB
• 2X oversampling CML
to
clock CMOS
26
Single-Loop CDR Issues
PLL-based CDR
VCTRL
proportional
RX[n:0] gain
integral
Din gain
early/
late
Loop Filter
• Can be difficult to
distribute multiple phases
long distance
• Need to preserve phase
spacing
• Clock distribution power
increases with phase number
• If CDR needs more than 4
phases consider local phase
generation
28
DLL Local Phase Generation
• Only differential clock is
distributed from global PLL
• Delay-Locked Loop (DLL)
locally generates the
multiple clock phases for
the phase interpolators
• DLL can be per-channel or
shared by a small number (4)
• Same architecture can be
used in a forwarded-clock
system
• Replace frequency synthesis
PLL with forwarded-clock
signals
29
Phase Rotator PLL
• Phase interpolators can be
expensive in terms of power
and area
• Phase rotator PLL places
one interpolator in PLL
feedback to adjust all VCO
output phases
simultaneously
• Now frequency synthesis
and phase recovery loops
are coupled
• Need PLL bandwidth greater
than phase loop
• Useful in filtering VCO noise
30
Agenda
• CDR overview
• CDR phase detectors
• Single-loop analog PLL-based CDR
• Dual-loop CDRs
• Phase interpolators
• CDR jitter properties
31
Phase Interpolators
[Weinlader]
• Phase interpolators realize
digital-to-phase conversion
(DPC)
• Produce an output clock that is
a weighted sum of two input
clock phases
• Common circuit structures
• Tail current summation
interpolation
• Voltage-mode interpolation
• Interpolator code mapping
techniques
• Sinusoidal
[Bulzacchelli]
• Linear
32
Sinusoidal Phase Interpolation
X I A sin(t )
X Q A sin(t / 2) A cost
Y A sin t
A cos sin t A sin cost 0
2
cos X I sin X Q a1 X I a2 X Q
33
Sinusoidal vs Linear Phase Interpolation
[Kreienkamp]
small output
KDL
[Sidiropoulos]
40
DLL Delay Transfer Function
[Maneatis]
41
Agenda
• CDR overview
• CDR phase detectors
• Single-loop analog PLL-based CDR
• Dual-loop CDRs
• Phase interpolators
• CDR jitter properties
• Jitter transfer
• Jitter generation
• Jitter tolerance
42
CDR Jitter Model
“Linearized” KPD
[Lee]
43
Jitter Transfer
“Linearized” KPD
[Lee]
44
Jitter Transfer Measurement
[Walker]
45
Jitter Transfer Specification
[Walker]
46
Jitter Generation
[Mansuri]
out(s)
20log 10
vcon(s)
[McNeill]
• SONET specification:
• rms output jitter ≤ 0.01 UI
48
Jitter Tolerance
• How much sinusoidal jitter can the CDR “tolerate” and still achieve a
given BER? [Sheikholeslami]
[Lee]
Maximum tolerable e
out s Timing Margin
e s 1
n.in s
in s 2
TM
JTOLs 2n.in s
out s
1
in s
49
Jitter Tolerance Measurement
[Lee]
• Random and sinusoidal jitter are added by modulating the BERT clock
• Deterministic jitter is added by passing the data through the channel
• For a given frequency, sinusoidal jitter amplitude is increased until the
minimum acceptable BER (10-12) is recorded
50
Jitter Tolerance Measurement
[Lee]
(within CDR
bandwidth)
TM
JTOLs 2n.in s
Flat region is beyond out s
1
CDR bandwidth in s
51
CDR Take-Away Points
• CDRs extract the proper clock frequency and phase
position to sample the incoming data symbols
• Specialized phase detectors suited for random data
symbols are required
• Dual-loop CDRs are often used to both optimize
jitter performance and provide robust frequency
acquisition
• Jitter tolerance is an important CDR metric that is
improved with increased loop bandwidth
52
Next Time
• Forwarded-Clock Deskew Circuits
• Clock Distribution Techniques
53