Lecture-2 Low Power VLSI Design: Instructor: Rajesh Bathija, Hod-Ece, Mewar University, Chittorgarh
Lecture-2 Low Power VLSI Design: Instructor: Rajesh Bathija, Hod-Ece, Mewar University, Chittorgarh
T
E(T) = ∫ P(t) dt
0
Vin Vout
CL
Vdd -Vt
CL
E0 = CL Vdd V dd – Vt
1
Vin Vout
CL
p01
AND (1 - pApB)pApB
OR (1 - pA)(1 - pB)(1 - (1 - pA)(1 - pB))
XOR (1 - (pA +pB – 2pApB))(pA + pB – 2pApB)
Vout
Drain junction
OFF leakage
Sub-threshold
Gate leakage current
Independent of switching
12/08/21 Rajesh Bathija, Mewar University, Chittorgarh 11
Dynamic vs Static Power
1E+4
1E+2
Power Density (W/cm^2)
Active Power
1E+0
Shrinking Margin
1E-2
1E-4
SubThreshold
1E-6 Power
1E-8
0.01 0.1 1 10
Gate Length (microns)
Source: Leon Stok, DAC 42©
• System partitioning
• Busses/Memory/IO devices /interfaces
• Choice of components
• Coding
• System states (sleep/snooze etc)
• 12/08/21
DVS/DFS/.. Rajesh Bathija, Mewar University, Chittorgarh 16
Algorithmic/sub-system Level
Choice of algorithm (operation count etc.)
Word length choices
Module interfaces
Implementation technology
SW: Processor selection
HW: ASIC/FPGA/..
Behavioral synthesis constraints and trade-off
Technology node
180 130 90 60 40 30
[nm]
Supply [V] 1.5-1.8 1.5-1.8 1.2-1.5 0.9-1.2 0.6-0.9 0.5-0.6 0.3-0.6
Wiring levels 6-7 6-7 7 8 9 9-10 10
Max frequency 14.9
1.2 1.6-1.4 2.1-1.6 3.5-2 7.1-2.5 11-3
[GHz],Local-Global -3.6
Max P power [W] 90 106 130 160 171 177 186
Bat. power [W] 1.4 1.7 2.0 2.4 2.1 2.3 2.5
1
10
0
10
-1
10
-2
10
1960 1970 1980 1990 2000 2010
Year
tp decreases by 13%/year
50% every 5 years!
Propagation Delay
12/08/21 Rajesh Bathija, Mewar University, Chittorgarh 28
Technology Scaling Models
• General Scaling
most realistic for todays situation —
voltages and dimensions scale with different factors
30
210
25
160
20
110
15
60
10
30
25
20
15
10
5
Frequency
0
5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35
Demux
F1 F2 Fn
F1 F2 Fn
Function Mux
Select Function Mux
Select
Parameter extraction
from high-level
simulation
Extracted parameters
Parameter extraction
Operator from high-level
simulation
Models
Power estimates
12/08/21 Rajesh Bathija, Mewar University, Chittorgarh 47
Retiming
Leiserson[1] first proposed retiming for
optimizing synchronous circuits and
Monteiro[2] modified for low power design
Basic observation is that positioning a flip-flop
can stop propagation of “glitches” and thus
unnecessary transitions
This implies they can be positioned not only
to minimize delays (classical retiming) but
also to reduce transitions
Eg Eg ER
Logic Logic FF
CL CL
CR
P1 = k * Eg * CL P2 = k * (Eg * CR + ER * CL)
CR C2
C1
P1 = k* (E0 * CR + E1 * CL+ E2 * C2 )
E0 E2 E3
C1 CR C2
P2 = k* (E0 * C1 + E2 * CR+ E3 * C2 )
f1 > f0
Case2: Logic FF Logic
Logic FF v1 = v0
Pipelining for
performance
f2 = f0
Case 3: Logic FF Logic
Logic FF v2 < v0
Pipelining for
low power
12/08/21 Rajesh Bathija, Mewar University, Chittorgarh 53
Increasing Parallelism/
Concurrency
Chandrakasan[4] first showed that concurrency can
be used to reduce power instead of increasing
performance
Primary idea is to reduce the frequency of operation
and/or voltage to meet a certain throughput
Power consumed by additional logic required to
distribute computation and multiplex results needs
to be accounted for
reg FU
Case 3: M f2 < f0
Two FUs for U v2 < v0
reducing x
reg FU T2 = T0
power
Logic Logic
Logic Logic
Encoder/ Encoder/
decoder decoder
s0 000 s0 000
10 5 30 80
s1 s2 s3 s4 s1 s2 s3 s4
mux mux
mux mux
y y
A
X
B Z
C
Z
Glitch
Gate Delay
A,B A,B
C,D C,D
X X
Y Y
Z Z