Algorithm and Architectural Level Methodologies
Power has become a critical design parameter in the design of low-power devices.
It has been demonstrated that decisions made at the algorithm and architecture levels have a major impact on power consumption. Using known synthesis, optimization and estimation techniques, we can analyse circuits at different stages of the design with increasing accuracy.
A design environment must include optimization and estimation tools at all levels of the design flow. The most effective decisions are made at the highest levels of abstraction.
Estimates made at the algorithm level are not very accurate; they are therefore refined at the architectural level, which gives more accurate results.
Vector quantization
It is a data compression method used in voice recognition and video systems.
The image is broken into a sequence of 4x4-pixel blocks. Each pixel is represented by an 8-bit word, so each block is a vector of 16 words, each 8 bits long. The vectors are compared against a previously generated codebook containing 256 different code vectors.
Compression produces an 8-bit word that gives the address of the code vector approximating the 4x4 image block. This corresponds to a compression ratio of 16:1, since 16 8-bit words are represented by a single 8-bit word.
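A minimal sketch of the encoding step described above. The codebook here is filled with random vectors purely for illustration; a real codebook would be trained (e.g. with the LBG algorithm), and all names below are hypothetical.

```python
# Vector quantization of a 4x4 block of 8-bit pixels (illustrative sketch).
import random

random.seed(0)
CODEBOOK_SIZE = 256          # 256 code vectors -> an 8-bit index
VECTOR_LEN = 16              # one 4x4 block = 16 pixel words

# Hypothetical (untrained) codebook of 256 random 16-element vectors.
codebook = [[random.randrange(256) for _ in range(VECTOR_LEN)]
            for _ in range(CODEBOOK_SIZE)]

def encode_block(block):
    """Return the 8-bit index of the closest code vector (full search)."""
    best_index, best_dist = 0, float("inf")
    for idx, code in enumerate(codebook):
        dist = sum((x - c) ** 2 for x, c in zip(block, code))  # MSE distortion
        if dist < best_dist:
            best_index, best_dist = idx, dist
    return best_index

block = [random.randrange(256) for _ in range(VECTOR_LEN)]
index = encode_block(block)   # one 8-bit word now stands for 16 8-bit words
print(index)                  # 16*8 bits in, 8 bits out: 16:1 compression
```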
The power consumption of a CMOS chip consists of dynamic, short-circuit and leakage power. The dominant dynamic component is

Power = C_eff * V^2 * f

where f is the frequency of operation, V the supply voltage and C_eff the effective switching capacitance. C_eff combines two factors:
C, the capacitance being charged or discharged, and the corresponding switching probability.
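A back-of-the-envelope use of the dynamic power relation P = C_eff * V^2 * f. All numerical values below are illustrative assumptions, not measurements:

```python
# Dynamic power estimate, P = Ceff * V^2 * f (numbers are made up).
def dynamic_power(c_eff_farads, v_supply, freq_hz):
    """Dynamic power of a CMOS circuit."""
    return c_eff_farads * v_supply ** 2 * freq_hz

# Lowering the supply from 5 V to 3 V cuts dynamic power by (3/5)^2 = 0.36x.
p_5v = dynamic_power(100e-12, 5.0, 50e6)   # 100 pF effective, 50 MHz
p_3v = dynamic_power(100e-12, 3.0, 50e6)
print(p_5v, p_3v, p_3v / p_5v)
```

The quadratic dependence on V is why voltage scaling is the single most effective power lever at this level.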
At the algorithm level we can predict the relative merit of design decisions, but we cannot make absolute claims about power consumption.
Algorithm-inherent dissipation is the power needed for the basic functionality of the algorithm; it cannot be avoided, irrespective of the implementation. It serves as the prime basis for comparison between different algorithms. It can be modelled as a weighted sum of the number of operations in the algorithm.
The implementation overhead depends on the specific architectural platform. Ideally this overhead power should not be greater than the algorithm-inherent dissipation. First-order predictions of the overhead components can be obtained given some properties of the algorithm and the hardware architecture.
The distortion measure is the standard mean square error (MSE), and the best match is found by a full search through the entire codebook (FSVQ):

MSE(X, C) = sum over i of (X_i - C_i)^2

where C is a codebook code vector, X the 4x4 input vector and i the index of an individual pixel word.
A first-order approximation of complexity is obtained by counting the operations (e.g. multiplications, additions) required to search the codebook.
Computing the MSE between two vectors requires 16 memory accesses, 16 multiplies and 16 additions. In FSVQ this is done for each of the 256 vectors in the codebook.
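The per-lookup operation counts above can be tabulated directly:

```python
# Operation count for one full-search VQ (FSVQ) lookup, per the text:
# each MSE needs 16 memory accesses, 16 multiplies and 16 additions,
# repeated for all 256 code vectors.
VECTOR_LEN = 16
CODEBOOK_SIZE = 256

per_mse = {"memory_access": VECTOR_LEN, "multiply": VECTOR_LEN, "add": VECTOR_LEN}
fsvq = {op: n * CODEBOOK_SIZE for op, n in per_mse.items()}
print(fsvq)  # {'memory_access': 4096, 'multiply': 4096, 'add': 4096}
```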
Algorithm-inherent dissipation: the operation count can be used to estimate the switching capacitance of the targeted hardware architecture.
Using a black-box capacitance model of the hardware, a first-order estimate of the capacitance can be made.
This first-order analysis produces an overview stating which functions are candidates for optimization.
A ripple-carry adder dissipates less power than a carry-select adder (CSA), but it fails to meet the required throughput below 5 V.
The CSA continues to meet the required throughput when the voltage is reduced to 3 V.
Estimation tools must be integrated with design-space exploration and optimization tools to provide an easy-to-use environment for the designer. This gives the designer quick feedback on the effect of design choices.
Functional pipelining, algebraic transformations and loop transformations can be used to maintain speed at low voltages.
These techniques result in larger silicon area, hence the term trading area for power.
Strength reduction
Replacing an energy-consuming operation by a combination of simpler operations (e.g. expanding multiplication by a constant into shift and add operations).
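A concrete instance of the shift-and-add expansion just mentioned:

```python
# Strength reduction: multiplication by the constant 10 is replaced by
# 10*x = 8*x + 2*x = (x << 3) + (x << 1), i.e. two shifts and one add,
# which are cheaper than a full multiply in hardware.
def times_10(x):
    return (x << 3) + (x << 1)

# Sanity check against the original multiplication.
assert all(times_10(x) == 10 * x for x in range(1000))
print(times_10(7))  # 70
```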
Algorithms that possess structural properties such as locality and regularity map onto smaller chip area, and the reduced area translates into reduced bus capacitance.
Tree-structured vector quantization (TSVQ) requires less computation than FSVQ. It performs a binary search of the vector space instead of a full search, so the computational complexity is proportional to log2(N) rather than N.
At each level of the tree the input vector is compared with two codebook entries. The branch closer to the input vector is assigned 0 and the other branch 1; the losing branch is not considered for further analysis. Hence only 2 * log2(256) = 16 distortion computations have to be made, instead of 256 in the case of FSVQ.
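A sketch of this binary search, assuming a hypothetical randomly filled tree (a real one would be trained); the point is only that 2 distortion computations per level times 8 levels gives 16 instead of 256:

```python
# Tree-structured VQ (TSVQ) lookup sketch with a made-up codebook tree.
import random
random.seed(1)

DEPTH = 8                       # log2(256) levels
VECTOR_LEN = 16

def mse(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))

# Hypothetical binary tree stored in an array: node i has children 2i+1, 2i+2.
tree = [[random.randrange(256) for _ in range(VECTOR_LEN)]
        for _ in range(2 ** (DEPTH + 1) - 1)]

def tsvq_encode(block):
    node, comparisons = 0, 0
    for _ in range(DEPTH):
        left, right = 2 * node + 1, 2 * node + 2
        comparisons += 2        # two distortion computations per level
        node = left if mse(block, tree[left]) <= mse(block, tree[right]) else right
    return node, comparisons

block = [random.randrange(256) for _ in range(VECTOR_LEN)]
leaf, n = tsvq_encode(block)
print(n)  # 16 distortion computations instead of 256
```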
It involves rearranging the difference between the distortions of the input vector X and the two code vectors Ca and Cb.
Since the comparison is made between the two code vectors, the X_i^2 terms cancel and the test can be written under a single summation, with the code-vector-dependent terms precomputed.
The number of online multiplications is thereby reduced from 32 to 16, and the same holds for additions and subtractions.
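A sketch verifying this rearrangement: expanding ||X-Ca||^2 < ||X-Cb||^2, the X_i^2 terms cancel, leaving a test on sum_i [(Ca_i^2 - Cb_i^2)/2 - X_i*(Ca_i - Cb_i)], whose code-vector terms can be precomputed offline (function names here are illustrative):

```python
# Naive branch decision: 32 online multiplies (16 per distortion).
def naive_choice(x, ca, cb):
    da = sum((xi - ci) ** 2 for xi, ci in zip(x, ca))
    db = sum((xi - ci) ** 2 for xi, ci in zip(x, cb))
    return da < db            # True -> take branch Ca

# Rearranged decision: only the 16 products x_i * diff_i are computed online.
def rearranged_choice(x, ca, cb):
    diff = [a - b for a, b in zip(ca, cb)]                  # precomputable
    offset = [(a * a - b * b) / 2 for a, b in zip(ca, cb)]  # precomputable
    return sum(o - xi * d for o, d, xi in zip(offset, diff, x)) < 0

import random
random.seed(2)
for _ in range(100):
    x, ca, cb = ([random.randrange(256) for _ in range(16)] for _ in range(3))
    assert naive_choice(x, ca, cb) == rearranged_choice(x, ca, cb)
print("rearranged comparison agrees with the naive one")
```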
The capacitance of an RTL module (adder, multiplier, etc.) can be expressed as a function of its complexity parameters.
E.g. the switching capacitance of a multiplier is proportional to the square of its input word length.
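A toy black-box capacitance model along these lines. The quadratic dependence for the multiplier follows the text; the coefficients are made-up assumptions:

```python
# Illustrative capacitance-vs-word-length models (coefficients are invented).
def adder_cap(n_bits, c0=0.05e-12):
    return c0 * n_bits            # roughly linear in word length

def multiplier_cap(n_bits, c0=0.05e-12):
    return c0 * n_bits ** 2       # array multiplier: ~N^2 cells

# Doubling the word length roughly quadruples multiplier capacitance:
print(multiplier_cap(16) / multiplier_cap(8))  # 4.0
```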
The average power dissipation of a module is a function of the applied signal, and it is infeasible to build a capacitance model for all possible input patterns, so the power factor approximation is employed to analyse power dissipation.
It uses an experimentally determined weighting factor, called the power factor, to find the average power consumed by a given module. A more accurate model can be built for two's-complement data words, whose bits can be divided into two regions according to their behaviour:
activity in the higher-order bits depends on the temporal correlation of the data, while the lower-order bits behave like white noise.
The overall module is then characterized by separate capacitance models for the MSB and LSB regions. The breakpoints between the regions are determined from the applied signal statistics, obtained from theoretical analysis.
The power consumption of the final implementation of an algorithm depends on the quality of its mapping onto the architecture. The mapping process must exploit the relevant properties of the algorithm so that it preserves data correlation.
Spatial locality can be exploited during the binding of operations to hardware units.
Resource sharing is allowed only between operations in the same cluster. At the final stage this can be used to reduce the size, and hence the access capacitance, of the register files.
Each tree node requires 17 memory accesses, 16 multiply/accumulate instructions, and a final add operation for the comparison that determines the location of the next node in the tree.
A total of 18 clock cycles is required per node, and the same computation is required at each node. Hence 8 x 18 = 144 clock cycles are required per input vector.
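The cycle budget works out as follows:

```python
# Cycle budget for one TSVQ lookup, using the per-node counts from the text.
CYCLES_PER_NODE = 18      # memory accesses + MACs + final compare/add
TREE_DEPTH = 8            # log2(256) levels, one node visited per level
print(CYCLES_PER_NODE * TREE_DEPTH)  # 144 clock cycles per input vector
```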
The locality of reference enables partitioning of the memory into smaller memories, each associated with a single level of the tree.
Distributed memory
There are 8 controllers and processors, clocked at 1/8 of the original frequency; the capacitance switched per vector by these elements is unchanged.
Because there is less overhead in reading from a smaller memory, the switching capacitance of the memory accesses is reduced.
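A toy comparison of the two memory organizations. The per-access energy values are made-up assumptions, chosen only to illustrate that the access count stays the same while the cost per access drops:

```python
# Centralized vs. distributed memory for one TSVQ lookup (invented energies).
ACCESSES_PER_NODE = 17    # per the cycle budget above
LEVELS = 8                # one node visited per tree level

E_LARGE = 1.0             # assumed energy units/access, one big memory
E_SMALL = 0.4             # assumed energy units/access, eight small memories

centralized = ACCESSES_PER_NODE * LEVELS * E_LARGE
distributed = ACCESSES_PER_NODE * LEVELS * E_SMALL   # same access count
print(centralized, distributed)   # energy falls even though accesses don't
```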