SlideShare a Scribd company logo
VLSI Digital Circuits
Lecture 15: Timing
Issues
[Adapted from Prof. Mary Jane Irwin’s slides, Rabaey’s Digital Integrated
Circuits, Second Edition, ©2003 J. Rabaey, A. Chandrakasan, B. Nikolic]
Timing Classifications
 Synchronous systems
 All memory elements in the system are simultaneously updated
using a globally distributed periodic synchronization signal (i.e., a
global clock signal)
 Functionality is ensure by strict constraints on the clock signal
generation and distribution to minimize
- Clock skew (spatial variations in clock edges)
- Clock jitter (temporal variations in clock edges)
 Asynchronous systems
 Self-timed (controlled) systems
 No need for a globally distributed clock, but have asynchronous
circuit overheads (handshaking logic, etc.)
 Hybrid systems
 Synchronization between different clock domains
 Interfacing between asynchronous and synchronous domains
Review: Synchronous Timing Basics
 Under ideal conditions (i.e., when tclk1 = tclk2)
T  tc-q + tplogic + tsu
thold ≤ tcdlogic + tcdreg
 Under real conditions, the clock signal can have both
spatial (clock skew) and temporal (clock jitter) variations
 skew is constant from cycle to cycle (by definition); skew can be
positive (clock and data flowing in the same direction) or negative
(clock and data flowing in opposite directions)
 jitter causes T to change on a cycle-by-cycle basis
D Q
R1
Combinational
logic
D Q
R2
clk
In
tclk1 tclk2
tc-q, tsu,
thold, tcdreg
tplogic, tcdlogic
Sources of Clock Skew and Jitter in Clock Network
PLL
1
2
4
3
5
6
7
clock
generation
clock drivers
power supply
interconnect
capacitive load
capacitive
coupling
temperature
 Skew
 manufacturing device
variations in clock drivers
 interconnect variations
 environmental variations
(power supply and
temperature)
 Jitter
 clock generation
 capacitive loading and
coupling
 environmental variations
(power supply and
temperature)
Positive Clock Skew
D Q
R1
Combinational
logic
D Q
R2
clk
In
tclk1 tclk2
delay
  > 0: Improves performance, but makes thold harder to
meet. If thold is not met (race conditions), the circuit
malfunctions independent of the clock period!
T
T + 
 > 0
 + thold
T +   tc-q + tplogic + tsu so T  tc-q + tplogic + tsu - 
thold +  ≤ tcdlogic + tcdreg so thold ≤ tcdlogic + tcdreg - 
1
2
3
4
 Clock and
data flow in
the same
direction
T :
thold :
Negative Clock Skew
D Q
R1
Combinational
logic
D Q
R2
clk
In
tclk1 tclk2
delay
 Clock and
data flow in
opposite
directions
T
T + 
 < 0
T +   tc-q + tplogic + tsu so T  tc-q + tplogic + tsu - 
thold +  ≤ tcdlogic + tcdreg so thold ≤ tcdlogic + tcdreg - 
1
2
3
4
  < 0: Degrades performance, but thold is easier to meet
(eliminating race conditions)
T :
thold :
Clock Jitter
 Jitter causes T to
vary on a cycle-by-
cycle basis
R1
Combinational
logic
clk
In
tclk
T
-tjitter +tjitter
T - 2tjitter  tc-q + tplogic + tsu so T  tc-q + tplogic + tsu + 2tjitter
 Jitter directly reduces the performance of a sequential
circuit
T :
Combined Impact of Skew and Jitter
D Q
R1
Combinational
logic
D Q
R2
In
tclk1 tclk2
 Constraints
on the
minimum
clock period
( > 0)
  > 0 with jitter: Degrades performance, and makes thold
even harder to meet. (The acceptable skew is reduced
by jitter.)
T
T + 
 > 0
1
6 12
-tjitter
T  tc-q + tplogic + tsu -  + 2tjitter thold ≤ tcdlogic + tcdreg –  – 2tjitter
Clock Distribution Networks
 Clock skew and jitter can ultimately limit the performance
of a digital system, so designing a clock network that
minimizes both is important
 In many high-speed processors, a majority of the dynamic power
is dissipated in the clock network.
 To reduce dynamic power, the clock network must support clock
gating (shutting down (disabling the clock) units)
 Clock distribution techniques
 Balanced paths (H-tree network, matched RC trees)
- In the ideal case, can eliminate skew
- Could take multiple cycles for the clock signal to propagate to the
leaves of the tree
 Clock grids
- Typically used in the final stage of the clock distribution network
- Minimizes absolute delay (not relative delay)
H-Tree Clock Network
Clock
Clock
Idle
condition
Gated
clock
Can insert clock gating at
multiple levels in clock tree
Can shut off entire subtree
if all gating conditions are
satisfied
 If the paths are perfectly balanced, clock skew is zero
Clock Grid Network
 Distributed buffering reduces absolute delay and makes
clock gating easier, but is sensitive to variations in the
buffer delay
Clock
secondary clock buffers
local logic
area
main clock
buffer
 The secondary buffers
isolate the local clock
nets from the upstream
load and amplify the
clock signals degraded
by the RC network
 decreases absolute skew
 gives steeper clocks
 Only have to bound the
skew within the local
logic area
DEC Alpha 21164 (EV5) Example
 300 MHz clock (9.3 million transistors on a 16.5x18.1
mm die in 0.5 micron CMOS technology)
 single phase clock
 3.75 nF total clock load
 Extensive use of dynamic logic
 20 W (out of 50) in clock distribution network
 Two level clock distribution
 Single 6 inverter stage main clock buffer at the center of the
chip
 Secondary clock buffers drive the left and right sides of the
clock grid in m3 and m4
 Total equivalent driver size of 58 cm !!
Secondary Clock Buffers
Clock Skew in Alpha Processor
 Absolute skew smaller than 90 ps
 The critical
instruction and
execution units all
see the clock within
65 ps
Dealing with Clock Skew and Jitter
 To minimize skew, balance clock paths using H-tree or
matched-tree clock distribution structures.
 If possible, route data and clock in opposite directions;
eliminates races at the cost of performance.
 The use of gated clocks to help with dynamic power
consumption make jitter worse.
 Shield clock wires (route power lines – VDD or GND – next to
clock lines) to minimize/eliminate coupling with neighboring
signal nets.
 Use dummy fills to reduce skew by reducing variations in
interconnect capacitances due to interlayer dielectric
thickness variations.
 Beware of temperature and supply rail variations and their
effects on skew and jitter. Power supply noise fundamentally
limits the performance of clock networks.
Major Components of a Computer
Processor
Control
Datapath
Memory
Devices
Input
Output
 Modern processor architecture styles (CSE 431)
 Pipelined, single issue (e.g., ARM)
 Pipelined, hardware controlled multiple issue – superscalar
 Pipelined, software controlled multiple issue – VLIW
 Pipelined, multiple issue from multiple process threads -
multithreaded
Basic Building Blocks
 Datapath
 Execution units
- Adder, multiplier, divider, shifter, etc.
 Register file and pipeline registers
 Multiplexers, decoders
 Control
 Finite state machines (PLA, ROM, random logic)
 Interconnect
 Switches, arbiters, buses
 Memory
 Caches, TLBs, DRAM, buffers
MIPS 5-Stage Pipelined (Single Issue) Datapath
Read
Address
I$
Add
PC
4
0
1
Write Data
Read Addr 1
Read Addr 2
Write Addr
Register
File
Read
Data 1
Read
Data 2
Sign
Extend
16 32
ALU
1
0
Shift
left 2
Add
D$
Address
Write Data
Read
Data
1
0
IF/Dec
Dec/Exec
Exec/Mem
Mem/WB
pipeline
stage
isolation
register
Fetch Decode Execute Memory WriteBack
clk
Icache
precharge
Dcache
precharge
RegWrite
Datapath Bit-Sliced Organization
Control Flow
Bit 0
Bit 1
Bit 2
Bit 3
Tile identical bit-slice elements
Register
File
Pipeline
Register
Adder
Shifter
Pipeline
Register
Multiplexer
Multiplexer
Data Flow
Pipeline
Register
From
I$
Pipeline
Register
To/From D$
Ad

More Related Content

Similar to vlsi digital circuits full power point presentation (20)

Synchronization and timing loop presentation -mapyourtech
Synchronization and timing loop presentation -mapyourtechSynchronization and timing loop presentation -mapyourtech
Synchronization and timing loop presentation -mapyourtech
MapYourTech
 
Vsync track c
Vsync   track cVsync   track c
Vsync track c
Alona Gradman
 
Te442 lecture02-2016-14-4-2016-1
Te442 lecture02-2016-14-4-2016-1Te442 lecture02-2016-14-4-2016-1
Te442 lecture02-2016-14-4-2016-1
colman mboya
 
Digital Electronics-ppt.pptx
Digital Electronics-ppt.pptxDigital Electronics-ppt.pptx
Digital Electronics-ppt.pptx
GopinathD17
 
INTERRUPT DRIVEN MULTIPLEXED 7 SEGMENT DIGITAL CLOCK
INTERRUPT DRIVEN MULTIPLEXED 7 SEGMENT DIGITAL CLOCKINTERRUPT DRIVEN MULTIPLEXED 7 SEGMENT DIGITAL CLOCK
INTERRUPT DRIVEN MULTIPLEXED 7 SEGMENT DIGITAL CLOCK
Santanu Chatterjee
 
1999 si pi_dws_training_course
1999 si pi_dws_training_course1999 si pi_dws_training_course
1999 si pi_dws_training_course
Piero Belforte
 
Clock tree synthesis in Physical Design flow
Clock tree synthesis in Physical Design flowClock tree synthesis in Physical Design flow
Clock tree synthesis in Physical Design flow
cottilard1995
 
Ch 1 overview
Ch 1 overviewCh 1 overview
Ch 1 overview
Er Ankita Kapoor
 
Range unlimited delay interleaving and –recycling clock skew compensation and...
Range unlimited delay interleaving and –recycling clock skew compensation and...Range unlimited delay interleaving and –recycling clock skew compensation and...
Range unlimited delay interleaving and –recycling clock skew compensation and...
jpstudcorner
 
A continuous time adc and digital signal processing system for smart dust and...
A continuous time adc and digital signal processing system for smart dust and...A continuous time adc and digital signal processing system for smart dust and...
A continuous time adc and digital signal processing system for smart dust and...
eSAT Publishing House
 
A continuous time adc and digital signal processing system for smart dust and...
A continuous time adc and digital signal processing system for smart dust and...A continuous time adc and digital signal processing system for smart dust and...
A continuous time adc and digital signal processing system for smart dust and...
eSAT Journals
 
A continuous time adc and digital signal processing system for smart dust and...
A continuous time adc and digital signal processing system for smart dust and...A continuous time adc and digital signal processing system for smart dust and...
A continuous time adc and digital signal processing system for smart dust and...
eSAT Journals
 
Sequential Logic Circuit Design Unit-4 VLSI.pptx
Sequential Logic Circuit Design Unit-4 VLSI.pptxSequential Logic Circuit Design Unit-4 VLSI.pptx
Sequential Logic Circuit Design Unit-4 VLSI.pptx
tamil arasan
 
specialtechniqueppt - 190626092926.pptx
specialtechniqueppt  - 190626092926.pptxspecialtechniqueppt  - 190626092926.pptx
specialtechniqueppt - 190626092926.pptx
vasudeva873639
 
Accurate Synchronization of EtherCAT Systems Using Distributed Clocks
Accurate Synchronization of EtherCAT Systems Using Distributed ClocksAccurate Synchronization of EtherCAT Systems Using Distributed Clocks
Accurate Synchronization of EtherCAT Systems Using Distributed Clocks
Design World
 
Performance Comparison of Various Clock Gating Techniques
Performance Comparison of Various Clock Gating TechniquesPerformance Comparison of Various Clock Gating Techniques
Performance Comparison of Various Clock Gating Techniques
iosrjce
 
lowpower consumption and details of dfferent power pdf
lowpower consumption and details of dfferent power pdflowpower consumption and details of dfferent power pdf
lowpower consumption and details of dfferent power pdf
ManiBharathNuti1
 
Lecture37_Logic_families_dcd_complete.pdf
Lecture37_Logic_families_dcd_complete.pdfLecture37_Logic_families_dcd_complete.pdf
Lecture37_Logic_families_dcd_complete.pdf
sonakshipuriji
 
Clock Distribution
Clock DistributionClock Distribution
Clock Distribution
Abhishek Tiwari
 
[IJET-V1I3P5] Authors :Dushyant Kumar Soni, Ashish Hiradhar
[IJET-V1I3P5] Authors :Dushyant Kumar Soni, Ashish Hiradhar[IJET-V1I3P5] Authors :Dushyant Kumar Soni, Ashish Hiradhar
[IJET-V1I3P5] Authors :Dushyant Kumar Soni, Ashish Hiradhar
IJET - International Journal of Engineering and Techniques
 
Synchronization and timing loop presentation -mapyourtech
Synchronization and timing loop presentation -mapyourtechSynchronization and timing loop presentation -mapyourtech
Synchronization and timing loop presentation -mapyourtech
MapYourTech
 
Te442 lecture02-2016-14-4-2016-1
Te442 lecture02-2016-14-4-2016-1Te442 lecture02-2016-14-4-2016-1
Te442 lecture02-2016-14-4-2016-1
colman mboya
 
Digital Electronics-ppt.pptx
Digital Electronics-ppt.pptxDigital Electronics-ppt.pptx
Digital Electronics-ppt.pptx
GopinathD17
 
INTERRUPT DRIVEN MULTIPLEXED 7 SEGMENT DIGITAL CLOCK
INTERRUPT DRIVEN MULTIPLEXED 7 SEGMENT DIGITAL CLOCKINTERRUPT DRIVEN MULTIPLEXED 7 SEGMENT DIGITAL CLOCK
INTERRUPT DRIVEN MULTIPLEXED 7 SEGMENT DIGITAL CLOCK
Santanu Chatterjee
 
1999 si pi_dws_training_course
1999 si pi_dws_training_course1999 si pi_dws_training_course
1999 si pi_dws_training_course
Piero Belforte
 
Clock tree synthesis in Physical Design flow
Clock tree synthesis in Physical Design flowClock tree synthesis in Physical Design flow
Clock tree synthesis in Physical Design flow
cottilard1995
 
Range unlimited delay interleaving and –recycling clock skew compensation and...
Range unlimited delay interleaving and –recycling clock skew compensation and...Range unlimited delay interleaving and –recycling clock skew compensation and...
Range unlimited delay interleaving and –recycling clock skew compensation and...
jpstudcorner
 
A continuous time adc and digital signal processing system for smart dust and...
A continuous time adc and digital signal processing system for smart dust and...A continuous time adc and digital signal processing system for smart dust and...
A continuous time adc and digital signal processing system for smart dust and...
eSAT Publishing House
 
A continuous time adc and digital signal processing system for smart dust and...
A continuous time adc and digital signal processing system for smart dust and...A continuous time adc and digital signal processing system for smart dust and...
A continuous time adc and digital signal processing system for smart dust and...
eSAT Journals
 
A continuous time adc and digital signal processing system for smart dust and...
A continuous time adc and digital signal processing system for smart dust and...A continuous time adc and digital signal processing system for smart dust and...
A continuous time adc and digital signal processing system for smart dust and...
eSAT Journals
 
Sequential Logic Circuit Design Unit-4 VLSI.pptx
Sequential Logic Circuit Design Unit-4 VLSI.pptxSequential Logic Circuit Design Unit-4 VLSI.pptx
Sequential Logic Circuit Design Unit-4 VLSI.pptx
tamil arasan
 
specialtechniqueppt - 190626092926.pptx
specialtechniqueppt  - 190626092926.pptxspecialtechniqueppt  - 190626092926.pptx
specialtechniqueppt - 190626092926.pptx
vasudeva873639
 
Accurate Synchronization of EtherCAT Systems Using Distributed Clocks
Accurate Synchronization of EtherCAT Systems Using Distributed ClocksAccurate Synchronization of EtherCAT Systems Using Distributed Clocks
Accurate Synchronization of EtherCAT Systems Using Distributed Clocks
Design World
 
Performance Comparison of Various Clock Gating Techniques
Performance Comparison of Various Clock Gating TechniquesPerformance Comparison of Various Clock Gating Techniques
Performance Comparison of Various Clock Gating Techniques
iosrjce
 
lowpower consumption and details of dfferent power pdf
lowpower consumption and details of dfferent power pdflowpower consumption and details of dfferent power pdf
lowpower consumption and details of dfferent power pdf
ManiBharathNuti1
 
Lecture37_Logic_families_dcd_complete.pdf
Lecture37_Logic_families_dcd_complete.pdfLecture37_Logic_families_dcd_complete.pdf
Lecture37_Logic_families_dcd_complete.pdf
sonakshipuriji
 

Recently uploaded (20)

theory-slides-for react for beginners.pptx
theory-slides-for react for beginners.pptxtheory-slides-for react for beginners.pptx
theory-slides-for react for beginners.pptx
sanchezvanessa7896
 
International Journal of Distributed and Parallel systems (IJDPS)
International Journal of Distributed and Parallel systems (IJDPS)International Journal of Distributed and Parallel systems (IJDPS)
International Journal of Distributed and Parallel systems (IJDPS)
samueljackson3773
 
Machine learning project on employee attrition detection using (2).pptx
Machine learning project on employee attrition detection using (2).pptxMachine learning project on employee attrition detection using (2).pptx
Machine learning project on employee attrition detection using (2).pptx
rajeswari89780
 
Artificial Intelligence (AI) basics.pptx
Artificial Intelligence (AI) basics.pptxArtificial Intelligence (AI) basics.pptx
Artificial Intelligence (AI) basics.pptx
aditichinar
 
MAQUINARIA MINAS CEMA 6th Edition (1).pdf
MAQUINARIA MINAS CEMA 6th Edition (1).pdfMAQUINARIA MINAS CEMA 6th Edition (1).pdf
MAQUINARIA MINAS CEMA 6th Edition (1).pdf
ssuser562df4
 
ELectronics Boards & Product Testing_Shiju.pdf
ELectronics Boards & Product Testing_Shiju.pdfELectronics Boards & Product Testing_Shiju.pdf
ELectronics Boards & Product Testing_Shiju.pdf
Shiju Jacob
 
Structural Response of Reinforced Self-Compacting Concrete Deep Beam Using Fi...
Structural Response of Reinforced Self-Compacting Concrete Deep Beam Using Fi...Structural Response of Reinforced Self-Compacting Concrete Deep Beam Using Fi...
Structural Response of Reinforced Self-Compacting Concrete Deep Beam Using Fi...
Journal of Soft Computing in Civil Engineering
 
some basics electrical and electronics knowledge
some basics electrical and electronics knowledgesome basics electrical and electronics knowledge
some basics electrical and electronics knowledge
nguyentrungdo88
 
Process Parameter Optimization for Minimizing Springback in Cold Drawing Proc...
Process Parameter Optimization for Minimizing Springback in Cold Drawing Proc...Process Parameter Optimization for Minimizing Springback in Cold Drawing Proc...
Process Parameter Optimization for Minimizing Springback in Cold Drawing Proc...
Journal of Soft Computing in Civil Engineering
 
IntroSlides-April-BuildWithAI-VertexAI.pdf
IntroSlides-April-BuildWithAI-VertexAI.pdfIntroSlides-April-BuildWithAI-VertexAI.pdf
IntroSlides-April-BuildWithAI-VertexAI.pdf
Luiz Carneiro
 
RICS Membership-(The Royal Institution of Chartered Surveyors).pdf
RICS Membership-(The Royal Institution of Chartered Surveyors).pdfRICS Membership-(The Royal Institution of Chartered Surveyors).pdf
RICS Membership-(The Royal Institution of Chartered Surveyors).pdf
MohamedAbdelkader115
 
Metal alkyne complexes.pptx in chemistry
Metal alkyne complexes.pptx in chemistryMetal alkyne complexes.pptx in chemistry
Metal alkyne complexes.pptx in chemistry
mee23nu
 
Introduction to FLUID MECHANICS & KINEMATICS
Introduction to FLUID MECHANICS &  KINEMATICSIntroduction to FLUID MECHANICS &  KINEMATICS
Introduction to FLUID MECHANICS & KINEMATICS
narayanaswamygdas
 
Introduction to Zoomlion Earthmoving.pptx
Introduction to Zoomlion Earthmoving.pptxIntroduction to Zoomlion Earthmoving.pptx
Introduction to Zoomlion Earthmoving.pptx
AS1920
 
AI-assisted Software Testing (3-hours tutorial)
AI-assisted Software Testing (3-hours tutorial)AI-assisted Software Testing (3-hours tutorial)
AI-assisted Software Testing (3-hours tutorial)
Vəhid Gəruslu
 
Value Stream Mapping Worskshops for Intelligent Continuous Security
Value Stream Mapping Worskshops for Intelligent Continuous SecurityValue Stream Mapping Worskshops for Intelligent Continuous Security
Value Stream Mapping Worskshops for Intelligent Continuous Security
Marc Hornbeek
 
ADVXAI IN MALWARE ANALYSIS FRAMEWORK: BALANCING EXPLAINABILITY WITH SECURITY
ADVXAI IN MALWARE ANALYSIS FRAMEWORK: BALANCING EXPLAINABILITY WITH SECURITYADVXAI IN MALWARE ANALYSIS FRAMEWORK: BALANCING EXPLAINABILITY WITH SECURITY
ADVXAI IN MALWARE ANALYSIS FRAMEWORK: BALANCING EXPLAINABILITY WITH SECURITY
ijscai
 
Development of MLR, ANN and ANFIS Models for Estimation of PCUs at Different ...
Development of MLR, ANN and ANFIS Models for Estimation of PCUs at Different ...Development of MLR, ANN and ANFIS Models for Estimation of PCUs at Different ...
Development of MLR, ANN and ANFIS Models for Estimation of PCUs at Different ...
Journal of Soft Computing in Civil Engineering
 
211421893-M-Tech-CIVIL-Structural-Engineering-pdf.pdf
211421893-M-Tech-CIVIL-Structural-Engineering-pdf.pdf211421893-M-Tech-CIVIL-Structural-Engineering-pdf.pdf
211421893-M-Tech-CIVIL-Structural-Engineering-pdf.pdf
inmishra17121973
 
The Gaussian Process Modeling Module in UQLab
The Gaussian Process Modeling Module in UQLabThe Gaussian Process Modeling Module in UQLab
The Gaussian Process Modeling Module in UQLab
Journal of Soft Computing in Civil Engineering
 
theory-slides-for react for beginners.pptx
theory-slides-for react for beginners.pptxtheory-slides-for react for beginners.pptx
theory-slides-for react for beginners.pptx
sanchezvanessa7896
 
International Journal of Distributed and Parallel systems (IJDPS)
International Journal of Distributed and Parallel systems (IJDPS)International Journal of Distributed and Parallel systems (IJDPS)
International Journal of Distributed and Parallel systems (IJDPS)
samueljackson3773
 
Machine learning project on employee attrition detection using (2).pptx
Machine learning project on employee attrition detection using (2).pptxMachine learning project on employee attrition detection using (2).pptx
Machine learning project on employee attrition detection using (2).pptx
rajeswari89780
 
Artificial Intelligence (AI) basics.pptx
Artificial Intelligence (AI) basics.pptxArtificial Intelligence (AI) basics.pptx
Artificial Intelligence (AI) basics.pptx
aditichinar
 
MAQUINARIA MINAS CEMA 6th Edition (1).pdf
MAQUINARIA MINAS CEMA 6th Edition (1).pdfMAQUINARIA MINAS CEMA 6th Edition (1).pdf
MAQUINARIA MINAS CEMA 6th Edition (1).pdf
ssuser562df4
 
ELectronics Boards & Product Testing_Shiju.pdf
ELectronics Boards & Product Testing_Shiju.pdfELectronics Boards & Product Testing_Shiju.pdf
ELectronics Boards & Product Testing_Shiju.pdf
Shiju Jacob
 
some basics electrical and electronics knowledge
some basics electrical and electronics knowledgesome basics electrical and electronics knowledge
some basics electrical and electronics knowledge
nguyentrungdo88
 
IntroSlides-April-BuildWithAI-VertexAI.pdf
IntroSlides-April-BuildWithAI-VertexAI.pdfIntroSlides-April-BuildWithAI-VertexAI.pdf
IntroSlides-April-BuildWithAI-VertexAI.pdf
Luiz Carneiro
 
RICS Membership-(The Royal Institution of Chartered Surveyors).pdf
RICS Membership-(The Royal Institution of Chartered Surveyors).pdfRICS Membership-(The Royal Institution of Chartered Surveyors).pdf
RICS Membership-(The Royal Institution of Chartered Surveyors).pdf
MohamedAbdelkader115
 
Metal alkyne complexes.pptx in chemistry
Metal alkyne complexes.pptx in chemistryMetal alkyne complexes.pptx in chemistry
Metal alkyne complexes.pptx in chemistry
mee23nu
 
Introduction to FLUID MECHANICS & KINEMATICS
Introduction to FLUID MECHANICS &  KINEMATICSIntroduction to FLUID MECHANICS &  KINEMATICS
Introduction to FLUID MECHANICS & KINEMATICS
narayanaswamygdas
 
Introduction to Zoomlion Earthmoving.pptx
Introduction to Zoomlion Earthmoving.pptxIntroduction to Zoomlion Earthmoving.pptx
Introduction to Zoomlion Earthmoving.pptx
AS1920
 
AI-assisted Software Testing (3-hours tutorial)
AI-assisted Software Testing (3-hours tutorial)AI-assisted Software Testing (3-hours tutorial)
AI-assisted Software Testing (3-hours tutorial)
Vəhid Gəruslu
 
Value Stream Mapping Worskshops for Intelligent Continuous Security
Value Stream Mapping Worskshops for Intelligent Continuous SecurityValue Stream Mapping Worskshops for Intelligent Continuous Security
Value Stream Mapping Worskshops for Intelligent Continuous Security
Marc Hornbeek
 
ADVXAI IN MALWARE ANALYSIS FRAMEWORK: BALANCING EXPLAINABILITY WITH SECURITY
ADVXAI IN MALWARE ANALYSIS FRAMEWORK: BALANCING EXPLAINABILITY WITH SECURITYADVXAI IN MALWARE ANALYSIS FRAMEWORK: BALANCING EXPLAINABILITY WITH SECURITY
ADVXAI IN MALWARE ANALYSIS FRAMEWORK: BALANCING EXPLAINABILITY WITH SECURITY
ijscai
 
211421893-M-Tech-CIVIL-Structural-Engineering-pdf.pdf
211421893-M-Tech-CIVIL-Structural-Engineering-pdf.pdf211421893-M-Tech-CIVIL-Structural-Engineering-pdf.pdf
211421893-M-Tech-CIVIL-Structural-Engineering-pdf.pdf
inmishra17121973
 
Ad

vlsi digital circuits full power point presentation

  • 1. VLSI Digital Circuits Lecture 15: Timing Issues [Adapted from Prof. Mary Jane Irwin’s slides, Rabaey’s Digital Integrated Circuits, Second Edition, ©2003 J. Rabaey, A. Chandrakasan, B. Nikolic]
  • 2. Timing Classifications  Synchronous systems  All memory elements in the system are simultaneously updated using a globally distributed periodic synchronization signal (i.e., a global clock signal)  Functionality is ensure by strict constraints on the clock signal generation and distribution to minimize - Clock skew (spatial variations in clock edges) - Clock jitter (temporal variations in clock edges)  Asynchronous systems  Self-timed (controlled) systems  No need for a globally distributed clock, but have asynchronous circuit overheads (handshaking logic, etc.)  Hybrid systems  Synchronization between different clock domains  Interfacing between asynchronous and synchronous domains
  • 3. Review: Synchronous Timing Basics  Under ideal conditions (i.e., when tclk1 = tclk2) T  tc-q + tplogic + tsu thold ≤ tcdlogic + tcdreg  Under real conditions, the clock signal can have both spatial (clock skew) and temporal (clock jitter) variations  skew is constant from cycle to cycle (by definition); skew can be positive (clock and data flowing in the same direction) or negative (clock and data flowing in opposite directions)  jitter causes T to change on a cycle-by-cycle basis D Q R1 Combinational logic D Q R2 clk In tclk1 tclk2 tc-q, tsu, thold, tcdreg tplogic, tcdlogic
  • 4. Sources of Clock Skew and Jitter in Clock Network PLL 1 2 4 3 5 6 7 clock generation clock drivers power supply interconnect capacitive load capacitive coupling temperature  Skew  manufacturing device variations in clock drivers  interconnect variations  environmental variations (power supply and temperature)  Jitter  clock generation  capacitive loading and coupling  environmental variations (power supply and temperature)
  • 5. Positive Clock Skew D Q R1 Combinational logic D Q R2 clk In tclk1 tclk2 delay   > 0: Improves performance, but makes thold harder to meet. If thold is not met (race conditions), the circuit malfunctions independent of the clock period! T T +   > 0  + thold T +   tc-q + tplogic + tsu so T  tc-q + tplogic + tsu -  thold +  ≤ tcdlogic + tcdreg so thold ≤ tcdlogic + tcdreg -  1 2 3 4  Clock and data flow in the same direction T : thold :
  • 6. Negative Clock Skew D Q R1 Combinational logic D Q R2 clk In tclk1 tclk2 delay  Clock and data flow in opposite directions T T +   < 0 T +   tc-q + tplogic + tsu so T  tc-q + tplogic + tsu -  thold +  ≤ tcdlogic + tcdreg so thold ≤ tcdlogic + tcdreg -  1 2 3 4   < 0: Degrades performance, but thold is easier to meet (eliminating race conditions) T : thold :
  • 7. Clock Jitter  Jitter causes T to vary on a cycle-by- cycle basis R1 Combinational logic clk In tclk T -tjitter +tjitter T - 2tjitter  tc-q + tplogic + tsu so T  tc-q + tplogic + tsu + 2tjitter  Jitter directly reduces the performance of a sequential circuit T :
  • 8. Combined Impact of Skew and Jitter D Q R1 Combinational logic D Q R2 In tclk1 tclk2  Constraints on the minimum clock period ( > 0)   > 0 with jitter: Degrades performance, and makes thold even harder to meet. (The acceptable skew is reduced by jitter.) T T +   > 0 1 6 12 -tjitter T  tc-q + tplogic + tsu -  + 2tjitter thold ≤ tcdlogic + tcdreg –  – 2tjitter
  • 9. Clock Distribution Networks  Clock skew and jitter can ultimately limit the performance of a digital system, so designing a clock network that minimizes both is important  In many high-speed processors, a majority of the dynamic power is dissipated in the clock network.  To reduce dynamic power, the clock network must support clock gating (shutting down (disabling the clock) units)  Clock distribution techniques  Balanced paths (H-tree network, matched RC trees) - In the ideal case, can eliminate skew - Could take multiple cycles for the clock signal to propagate to the leaves of the tree  Clock grids - Typically used in the final stage of the clock distribution network - Minimizes absolute delay (not relative delay)
  • 10. H-Tree Clock Network Clock Clock Idle condition Gated clock Can insert clock gating at multiple levels in clock tree Can shut off entire subtree if all gating conditions are satisfied  If the paths are perfectly balanced, clock skew is zero
  • 11. Clock Grid Network  Distributed buffering reduces absolute delay and makes clock gating easier, but is sensitive to variations in the buffer delay Clock secondary clock buffers local logic area main clock buffer  The secondary buffers isolate the local clock nets from the upstream load and amplify the clock signals degraded by the RC network  decreases absolute skew  gives steeper clocks  Only have to bound the skew within the local logic area
  • 12. DEC Alpha 21164 (EV5) Example  300 MHz clock (9.3 million transistors on a 16.5x18.1 mm die in 0.5 micron CMOS technology)  single phase clock  3.75 nF total clock load  Extensive use of dynamic logic  20 W (out of 50) in clock distribution network  Two level clock distribution  Single 6 inverter stage main clock buffer at the center of the chip  Secondary clock buffers drive the left and right sides of the clock grid in m3 and m4  Total equivalent driver size of 58 cm !!
  • 14. Clock Skew in Alpha Processor  Absolute skew smaller than 90 ps  The critical instruction and execution units all see the clock within 65 ps
  • 15. Dealing with Clock Skew and Jitter  To minimize skew, balance clock paths using H-tree or matched-tree clock distribution structures.  If possible, route data and clock in opposite directions; eliminates races at the cost of performance.  The use of gated clocks to help with dynamic power consumption make jitter worse.  Shield clock wires (route power lines – VDD or GND – next to clock lines) to minimize/eliminate coupling with neighboring signal nets.  Use dummy fills to reduce skew by reducing variations in interconnect capacitances due to interlayer dielectric thickness variations.  Beware of temperature and supply rail variations and their effects on skew and jitter. Power supply noise fundamentally limits the performance of clock networks.
  • 16. Major Components of a Computer Processor Control Datapath Memory Devices Input Output  Modern processor architecture styles (CSE 431)  Pipelined, single issue (e.g., ARM)  Pipelined, hardware controlled multiple issue – superscalar  Pipelined, software controlled multiple issue – VLIW  Pipelined, multiple issue from multiple process threads - multithreaded
  • 17. Basic Building Blocks  Datapath  Execution units - Adder, multiplier, divider, shifter, etc.  Register file and pipeline registers  Multiplexers, decoders  Control  Finite state machines (PLA, ROM, random logic)  Interconnect  Switches, arbiters, buses  Memory  Caches, TLBs, DRAM, buffers
  • 18. MIPS 5-Stage Pipelined (Single Issue) Datapath Read Address I$ Add PC 4 0 1 Write Data Read Addr 1 Read Addr 2 Write Addr Register File Read Data 1 Read Data 2 Sign Extend 16 32 ALU 1 0 Shift left 2 Add D$ Address Write Data Read Data 1 0 IF/Dec Dec/Exec Exec/Mem Mem/WB pipeline stage isolation register Fetch Decode Execute Memory WriteBack clk Icache precharge Dcache precharge RegWrite
  • 19. Datapath Bit-Sliced Organization Control Flow Bit 0 Bit 1 Bit 2 Bit 3 Tile identical bit-slice elements Register File Pipeline Register Adder Shifter Pipeline Register Multiplexer Multiplexer Data Flow Pipeline Register From I$ Pipeline Register To/From D$

Editor's Notes

  • #3: Skew delta is tphi’’ – tphi’ where tphi’s are the local clock times. Skew can be positive or negative depending on the routing direction of the clock. tmin’s are the best case delays of the logic, tmax are the worst case delays (assume register set up time is included in the combinational logic time). ti is the propagation delay of the interconnect. Clock skew is due to spatial variations in the arrival time of a clock transition skew is constant from cycle to cycle (by definition) can be positive (clock and data flowing in the same direction) or negative (clock and data flowing in the opposite direction) Clock jitter is due to temporal variations in the clock period (T changes on a cycle-by-cycle basis)
  • #4: clock is distributed using multiple matched paths to the sequential elements. The clock path includes the wiring and the associated distributed buffers required to drive the interconnect and loads. What matters is the relative arrival time at the register points at the end of each path – not the absolute delay through the clock distribution path. Both systematic (nominally identical from chip to chip and predictable – so easy to model and correct for at design time) and random (due to manufacturing variations – so are difficult to model and eliminate)
  • #5: For class handout
  • #6: For lecture clock skew has the potential to improve the performance of the circuit. Unfortunately, increasing skew makes the circuit more susceptible to race conditions! If the minimum delay of the combinational logic block is small, the inputs to R2 may change before R2’s first rising edge. To avoid races, we must ensure that the minimum delay through the register and logic must e long enough that the inputs to R2 are valid for a hold time after that edge. Reducing the clock frequency can’t fix it!
  • #7: For lecture a negative skew adversely impacts the performance of the system. However, assuming thold + delta < tcdreg + tcdlogic, a negative skew implies that the system never fails since edge 2 happens before edge 1 – i.e., there is never a race condition. Unfortunately, for general logic signals flow in both directions so skew can be both positive and negative in the same circuit
  • #8: for lecture
  • #11: Things to consider – interconnect material used for routing clock, shape of network, clock drivers and buffers used, load on clock lines, rise and fall time of clock (may have to consider transmission line effects as well!!) The H-tree network also helps with the clock skew problem (helps to minimize skew between neighboring elements) since only relative skew is important.
  • #13: 0.55 micron CMOS, 4 layer metal Clock load accounts for 40% of the total effective capacitance of the chip
  • #17: That is, any computer, no matter how primitive or advance, can be divided into five parts: 1. The input devices bring the data from the outside world into the computer. 2. These data are kept in the computer’s memory until ... 3. The datapath request and process them. 4. The operation of the datapath is controlled by the computer’s controller. All the work done by the computer will NOT do us any good unless we can get the data back to the outside world. 5. Getting the data back to the outside world is the job of the output devices. The most COMMON way to connect these 5 components together is to use a network of busses. Workstation Design Target: 25% of cost on Processor, 25% of cost on Memory (minimum memory size), rest on I/O devices, power supplies, box
  • #19: Five stage pipeline (originally for performance, but also helps with energy) Talk a bit about a vertical design approach versus a horizontal approach