Delay: Introduction To CMOS VLSI Design
Delay: Introduction To CMOS VLSI Design
Chapter 4
Delay
Delay Definitions
tpdr: rising propagation delay
– From input to rising output
crossing VDD/2
tpdf: falling propagation delay
– From input to falling output
crossing VDD/2
tpd: average propagation delay
– tpd = (tpdr + tpdf)/2
tr: rise time
– From output crossing 0.2
VDD to 0.8 VDD
tf: fall time
– From output crossing 0.8 Inverter
VDD to 0.2 VDD
1.5
1.0
(V)
tpdf = 66ps tpdr = 83ps
Vin
Vout
0.5
0.0
2C 2C
2C 2C
2 Y 2
A Y
1 1 R C
C
R C C
C output
capacitance
d = 6RC
Chapter 4 CMOS VLSI Design 10
Delay Model Comparison
(Example 4.1, p.145)
2 2 2
3
3
2C 2C 2C
2C 2C 2C
2 2 2
2C 2C 2C
3 3C
3C
3C
3
3C
3C
3
3C
3C
2 2 2
9C
3
5C
3C
3
5C
3C
3
5C
3 3C
3C
3C
3
3C
3C
3
3C 2 2 2
3C
9C
3
5C
3C
3
5C
3C
3
5C
C1 C2 C3 CN
t pdf 3C R3 3C R3 R3 9 5h C R3 R3 R3
t pdr 9 5h RC
11 5h RC
rising output (nạp tụ)
falling output (xả tụ)
2 2 2 Y
A 3 9C 5hC
n2
B 3 n1 3C
C 3 3C
R 5
tcdr 9 5h C 3 h RC
3 3
rising output (nạp tụ)
3C 3C 3C 3 3C
VDD VDD
A B A B
Y Y
GND GND
4:16 Decoder
Decoder specifications:
16 words
16
Register File
– 16 word register file
– Each word is 32 bits wide
– Each bit presents load of 3 unit-sized transistors
– True and complementary address inputs A[3:0]
– Each input may drive 10 unit-sized transistors
needs to decide:
– How many stages to use?
– How large should each gate be?
– How fast can decoder operate?
Chapter 4 CMOS VLSI Design 24
Delay in a Logic Gate
Express delays in process-independent unit d d abs
Delay has two components: d = f + p
3RC
f: effort delay = gh (a.k.a. stage effort)
3 ps in 65 nm process
– Again has two components 60 ps in 0.6 mm process
g: logical effort
– Measures relative ability of gate to deliver current
– g 1 for inverter
h: electrical effort (or fanout) = Cout / Cin
– Ratio of output to input capacitance
– Sometimes called fanout
p: parasitic delay (normally ~1)
– Represents delay of gate driving no load
– Set by internal parasitic capacitance
Normalized Delay: d
5 p=2
What about d = (4/3)h + 2
4 g=1
NOR2? p=1
3 d=h+1
2 Effort Delay: f
1
Parasitic Delay: p
0
0 1 2 3 4 5
Electrical Effort:
h = Cout / Cin
10
x z
y
20
g1 = 1 g2 = 5/3 g3 = 4/3 g4 = 1
h1 = x/10 h2 = y/x h3 = z/y h4 = 20/z
h BHi
Path Delay D d i DF P
fˆ gi hi F
1
N
fˆ gh g CCoutin
gi Couti
Cini
fˆ
Working backward, apply capacitance
transformation to find input capacitance of each gate
given load it drives.
Check work by verifying input cap spec is met.
y
x
45
A 8
x
y B
45
y
x
45
A 8
x
y B
45
y
x
45
45
A P:
84 P:
x 4
N: 4 P:
y 12 B
B
N: 6 45
N: 3 45
8 4 2.8
D = NF1/N + P 16 8
= N(64)1/N + N
23
Datapath Load 64 64 64 64
N: 1 2 3 4
f: 64 8 4 2.8
D: 65 18 15 15.3
Fastest
D NF pi N n1 pinv
1
N Path Effort F
i 1
D 1 1 1
F N ln F N F N pinv 0
N
F
1
Define best stage effort N
pinv 1 ln 0
D(N) /D(N)
1.4
1.26
1.2 1.15
1.0
(=6) ( =2.4)
0.0
0.5 0.7 1.0 1.4 2.0
N/ N
4:16 Decoder
Decoder specifications:
16 words
16
Register File
– 16 word register file
– Each word is 32 bits wide
– Each bit presents load of 3 unit-sized transistors
– True and complementary address inputs A[3:0]
– Each input may drive 10 unit-sized transistors
needs to decide:
– How many stages to use?
– How large should each gate be?
– How fast can decoder operate?
Chapter 4 CMOS VLSI Design 47
Number of Stages
Decoder effort is mainly electrical and branching
Electrical Effort: H = (32*3) / 10 = 9.6
Branching Effort: B=8
10 10 10 10 10 10 10 10
y z word[0]
y z word[15]
effort delay f DF f i
parasitic delay p P pi
delay d f p D di DF P
gi Couti
6) Find gate sizes Cini
fˆ