Handout Vlsi Ch3 2024
Handout Vlsi Ch3 2024
Speed (Delay)
Instructor: 江建學
Date:2024/11/21
Chapter 3
1
『注意:請遵守智慧財產權觀念。不得非法影印教科書。』
Speed - Outline
3.1 Introduction
2
Speed – Introduction - Definitions
傳播延遲
汙染,弄髒;玷汙
3
Propagation and Contamination Delay
-Combinational logic is characterized by its propagation delay and contamination delay. The propagation delay tpd is the
maximum time from when an input changes until the output or outputs reach their final value. The contamination delay
tcd is the minimum time from when an input changes until any output starts to change its value.
-When designers speak of calculating the delay of a circuit, they generally are referring to the worst-case value (the
propagation delay), unless it is clear otherwise from the context.
-Figure illustrates a buffer's propagation delay and contamination delay in blue and gray, respectively. The figure shows
that A is initially either HIGH or LOW and changes to the other state at a particular time; we are interested only in the fact
that it changes, not what value it has. In response, Y changes some time later. The arcs indicate that Y may start to
change tcd after A transitions and that Y definitely settles to its new value within tpd.
https://www.sciencedirect.com/topics/computer-science/contamination-
delay 4
Speed – Introduction - Definitions
-The timing analyzer computes the arrival times at each node and checks that the outputs arrive.
鬆弛;懈怠
-The slack is the difference between the required and arrival times.
→Positive slack means that the circuit meets timing.
→Negative slack means that the circuit is not fast enough
*If the outputs are all required at 200 ps, the circuit has 60 ps of slack.
(choose the max. delay as the main delay path)
5
Speed – Introduction - Definitions
-There will be a number of critical paths that limit the operating speed of the system and
require attention to timing details
6
Speed – Transient Response
-The most fundamental way to compute delay (transient response)
1.develop a physical model of the circuit
2.write a differential equation describing the output voltage as a function of input voltage and time,
3.solve the equation.
→The solution of the differential equation is called the transient response,(對t時間作微分)
7
Speed – Transient Response
-Find step response of inverter driving load cap
8
Example 4.1
9
Speed – RC Delay Model - Switch-level RC Delay Models
10
Speed – RC Delay Model - Switch-level RC Delay Models
-The unit inverters of Figure 4.6(a) are composed from an nMOS transistor of unit size and a pMOS transistor of twice unit
width to achieve equal rise and fall resistance
* The capacitors shorted between two constant supplies are also removed
because they are not charged or discharged
11
Speed – Example 4.2
-Sketch a 3-input NAND gate with transistor widths chosen to achieve
effective rise and fall resistance equal to that of a unit inverter (R).
-The three nMOS transistors are in series so the resistance is three times that of a single
transistor.
→each must be three times unit width to compensate.
→each transistor has resistance R/3
→the series combination has resistance R.
12
Speed – Example 4.2
- Figure 4.7(c) redraws the gate with these capacitances deleted and the
remaining capacitances lumped to ground
13
Speed – Example 4.2
- Figure 4.7(d) shows the equivalent circuit for the falling output transition.
→The output pulls down through the three series nMOS transistors.
-Figure 4.7(e) shows the equivalent circuit for the rising output transition.
→In the worst case, the upper two inputs are 1 and the bottom one falls to 0.
→The output pulls up through a single pMOS transistor.
→The upper two nMOS transistors are still on, so the diffusion capacitance
between the series nMOS transistors must also be considered.
WHY?
14
Speed – Elmore Delay
15
Speed – Example 4.4
Estimate tpd for a unit inverter driving m identical unit inverters
Inv_m
16
Speed – Example
-A NAND driving h identical NAND gates What is the worst case
→ Compute the worst rising and falling propagation delay - Estimate rising propagation delay
17
Speed – RC Delay Model
-A NAND driving h identical NAND gates What is the worst case
→ Compute the worst rising and falling propagation delay
- Estimate Falling propagation delay
18
Speed – RC Delay Model
https://slideplayer.com/slide/10652619/ 19
Speed – Linear Delay Model
-The RC delay model showed that delay is a linear function of the fanout of a gate
-The normalized delay of a gate p: internal parasitic delay driving no load
-f is the effort delay or stage effort that depends on the complexity and fanout of the gate.
-A gate driving h identical copies of itself is said to have a fanout or electrical effort of h.
→h: electrical effort = Cout / Cin (fanout)
20
Speed – Linear Delay Model
21
Speed – Logical Effort
22
Speed – Logical Effort
-Logical effort of a gate is defined as the ratio of the input capacitance of the gate to the
input capacitance of an inverter that can deliver the same output current.
→Equivalently, logical effort indicates how much worse a gate is at producing output
current as compared to an inverter, given that each input of the gate may only present as
much input capacitance as the inverter.
(相比的gate可產生與反向器相同輸出電流時的最差的延遲效果(more input capacitance))
23
Speed – Logical Effort
24
Speed – Parasitic Delay
-The parasitic delay of a gate is the delay of the gate when it drives zero load.
-The inverter has three units of diffusion capacitance on the output, so the parasitic delay is
→the normalized parasitic delay is 1
-Increasing transistor sizes reduces resistance but increases capacitance correspondingly, so
parasitic delay is independent of gate size
25
Speed – Parasitic Delay (More refined estimates)
-Figure 4.23 shows a model of an n-input NAND gate in which the upper inputs were all 1 and the
bottom input rises.
-The gate must discharge the diffusion capacitances of all of the internal nodes as well as the output
-The Elmore delay is
26
Speed – Parasitic Delay
Example 4.10
-Use the linear delay model to estimate the delay of the fanout-of-4 (FO4) inverter from Example 4.6. Assume the
inverter is constructed in a 65 nm process with τ=3ps
The total delay is d gh p 1 × 4 1 5 in normalized terms
→ tpd 15
* As a rough rule of thumb, the FO4 delay for a process (in picoseconds) is 1/3 to 1/2 of
the drawn channel length (in nanometers).
→ For example, a 65 nm process with a 50 nm channel length may have an FO4 delay of 16–25 ps
27
Speed – Parasitic Delay
Example 4.11
-A ring oscillator is constructed from an odd number of inverters, as shown in Figure 4.24. Estimate the
frequency of an N-stage ring oscillator.
*A 31-stage ring oscillator in a 65 nm process has a frequency of 1/(4 × 31 × 3 ps) = 2.7 GHz.
28