Sorin System
Sorin System
Winter 2010
Low power implementation
A system perspective
Website: http://cseweb.ucsd.edu/classes/wi10/cse241a/
System optimization
Optimization at the system with the goal of moving most of the signal processing
(data transformation) in the digital domain. The power consumption in the digital
domain is scalable with the process technology scaling and with the system use
mode requirements.
Architectural optimization
Memory hierarchy
Processor architecture
Architectural optimization
Hardware accelerators:
Graphic 2D, 3D
Multimedia display
Modem baseband
Bus architecture
AXI
Clocking architecture:
PLLs
Frequency planning
Clock architecture
Architectural optimization
IO interfaces
Vddx
Interface block
x
Leakage current
Ileak
Virtual GND
(vssfx)
EN
ln=60nm
GND
ECE240B/CSE241A Low power techniques 9
(vssx)
Voltage droop
(-)
Area overhead.
(-)
Global PG Mesh
v
v
v v
v v
v v
v v
v
v
v v
v v
Local PG Mesh
GDFS
vssfx
En_few_in
Mf
En_rest_in
FS/HS ring
En_few_out
En_rest_out
Mr
vdd_ext Isub
D
Vgate HI Ig
vddx
HI
Vnwell
Isb
Vsup
HI
nz
S
vddx: collapsible vdd
buffer chain
In
Out
vssx
Itotal
0V Vgnd
In
Itotal
Vsup
HI
Out
vssfx : collapsible ground
D
Igate
Isb
G
B
Vgate
0V
S
Isub
vssx
Vgnd 0V
0V
Vsub
Courtesy of Krishnan Sundaresan, Aravind Oommen, Doug Meserve, Hemango Das, Jaewon Oh, Mohd Jamil
A tool for exploring advanced RTL Clock Gating Opportunities in Microprocessor Design
ECE240B/CSE241A Low power techniques 16
Clock gating propagation. When F1 is gated no new data will propagate downstream to
F2 and F3. Next clock cycle we can gate F2 and F3. We introduce in the design the
staging flops F5 and F6.
Courtesy of Krishnan Sundaresan, Aravind Oommen, Doug Meserve, Hemango Das, Jaewon Oh, Mohd Jamil
A tool for exploring advanced RTL Clock Gating Opportunities in Microprocessor Design
ECE240B/CSE241A Low power techniques 17
Low power technique used to adjust the frequency of operation and the
voltage supply of the system to the software application needs.
Using silicon measurements a (f, V) look-up table is created. In the
look-up table we define pairs of frequency of operation and the
corresponding voltage supply.
DCVS can be hardware or software controlled. Most of the
implementations are software (SW) due to simplicity of implementation.
DCVS can be easily implemented for CPUs, DSPs (IPs) which have a
dedicated power domain (power rail). If in the same power domain we
have multiple functional blocks (CPU, Graphic core, Bus etc) in order to
perform DCVS we will need to find a common denominator in terms of
pairs of frequencies (performance requirements) and voltage supply for
all the IPs, which does not happen very often. (diminishing return)
DCVS provides the biggest power saving for IPs which have dedicated
power rails.
For DCVS and AVS a very important quality metric is system latency.
How long it takes to adjust the frequency and voltage of the IP ? The
lower is the latency the bigger are of power savings.
The design will be timed closed during design phase to all the
predefined corners (PV1T), (PV2T) , (PV3T) and modes (SDC1),
(SDC2), (SDC3). (SDC = Synopsys Design Constraints)
SVS does not require silicon calibration
SVS is software controlled. The transition between different
modes of operation is controlled by software (SW) and/or the
Power Management Unit implemented on the die.
The opportunity of frequency and voltage scaling is bounded to
the predefined use modes of the device.(small number of
modes)
Memories
Standard cells
IOs (Pads)
PLLs
Mix signal blocks : ADCs, DACs
Standard interfaces: DDR, USB, MIPI, SDIO, HDMI
The power optimization at the IP level targets both active power and
leakage power:
Library customization:
Clk
pn1
Clk_en
test_en
40% lower power (enabled), 60% lower power (gated) than a standard CGC implementation
Memory design:
The periphery and the memory array have different power rail
supplies
Source biasing
synthesis:
Vt selection
Clock gating insertion
Flop/latch tray clustering
Power verification
RTL verification:
- Power verification and profiling
Power linting
Gate netlist power verification (post place and route)
PDN analysis:
- Voltage noise analysis
- Frequency domain analysis