0% found this document useful (0 votes)

54 views

Unfolding Basic Good One

Unfolding is a transformation technique that creates a new program describing multiple iterations of the original program by unfolding it by a factor J. This increases parallelism for implementation. The unfolded data flow graph (DFG) contains J times as many nodes and edges as the original DFG. Unfolding preserves the precedence constraints and number of delays between operations in the original DFG. The algorithm for unfolding a DFG draws J copies of each node and connects them with the same delays as the original edges.

Uploaded by

Sameer Nandagave

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

54 views

Unfolding Basic Good One

Uploaded by

Sameer Nandagave

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 18

Unfolding basics

ArunachalamV
AP SG, SENSE
Unfolding - Basics
Transformationtechniquethat canbeappliedto DSP program
to createanewprogramdescribingmorethanoneiterationof
theoriginal program.
Unfolding is also referred to as loop unrolling, used in
compiler theory.
UnfoldingaDSPprogrambytheunfoldingfactor, J createsa
new program that describes J consecutive iterations of the
original program.
An Example
Consider
Replacing the index n with 2k
Replacing the index n with (2k+1)
Program (1) is described by (2) and (3) as two consecutive
iterations.
( ) ( ) ( ) ( ) 1 0 9 . = + = to n for n x n y a n y
( ) ( ) ( ) ( ) 2 0 2 9 2 . 2 = + = to k for k x k y a k y
( ) ( ) ( ) ( ) 3 0 1 2 8 2 . 1 2 = + + = + to k for k x k y a k y
J=2 unfolded version
(2) & (3) together describes a J=2 unfolded version of (1)
Why do graph based unfolding needed?
It canoftenbetedioustowritetheequationsfor theoriginal &
J-unfolded programs and then draw the corresponding
unfoldedDFG, especiallyfor larger valuesof J.
Therefore we describe a graph-based technique for directly
unfolding the DFG to createthe DFG of J-unfolded program
without explicitly writing equations describing theoriginal or
unfoldedprogram.
Applications of unfolding
Unfolding has applications in designing high speed and low
power VLSI architectures.
Unfold the program to reveal hidden concurrencies so that the program
can be scheduled to a smaller iteration period, thus increasing the
throughput of the implementation.
Design parallel architectures at the word level and bit level.
Some common notations used
Floor of x , largest integer less than or equal to x.
a % b : a mod b, remainder after dividing a by b ; also a & b
are integers.
Basic properties of unfolding
For each node U in the original DFG, there are J nodes with
same function as U in the J-unfolded DFG.
For each edge in the original DFG, there are J edges in the J-
unfolded DFG.
The J-unfolded DFG (program) contains J -times as many
nodes and edges as the original DFG (program).
Algorithm for unfolding a DFG by a factor J
1. For each node U in the original DFG, draw the J nodes U
0
,
U
1
, U
2
,, U
J-1
.
2. For each edge with w delays in the original DFG, draw J
edges with delays for i=0,1,2,3,, J-1.
V U
e

( ) J w i
e
i
V U
% +

+
J
w i
Example -1
( )
( )
( )
( ) 0 0 2 % 0 0 0
1 0 2 % 9 0 0
0 0 2 % 0 0 0
0 0 2 % 0 0 0
0
C D C D
D C D C
B C B C
C A C A
i

=
+
+
+
+
) 1 2 ( , 0 ; 2 = = i J
( )
( )
( )
( ) 1 1 2 % 0 1 1
0 1 2 % 9 1 1
1 1 2 % 0 1 1
1 1 2 % 0 1 1
1
C D C D
D C D C
B C B C
C A C A
i

=
+
+
+
+
( )
delays with
D C D C
i
5
2
9 1
1
0 1 2 % 9 1 1
=
(

+

=
+
( )
delays with
D C D C
i
4
2
9 0
0
1 0 2 % 9 0 0
=
(

+

=
+
w > J
Example -2
3 , 2 , 1 , 0 ; 4 = = i J
( )
delays with
V U V U
i
9
4
37 0
0
1 0 4 % 37 0 0
=
(

+

=
+
( )
delays with
V U V U
i
9
4
37 1
1
2 1 4 % 37 1 1
=
(

+

=
+
( )
delays with
V U V U
i
9
4
37 2
2
3 2 4 % 37 2 2
=
(

+

=
+
( )
delays with
V U V U
i
10
4
37 3
3
0 3 4 % 37 3 3
=
(

+

=
+
w > J
Example -3
InUV, w < J
In such cases J -w edges with
nodelayandwedgeswithone
delayeach.
2 , 1 , 0 ; 3 = = i J
Unfolding preserves precedence constraints
Unfoldingpreservesprecedenceconstraintsof aDSPprogram.
The e edges in the original DFG explicitly show the
precedence constraints for 1iterationof theoriginal program,
and Je edges in the J-unfolded DFG explicitly show the
precedenceconstraintsfor J iterationsof theprogram.
Proof
The edge with delays in the unfolded DFG
corresponds to the edge with w delays in the original
DFG.
k
th
iteration of the node U
i
in the J-unfolded DFG executes the
(Jk+i)
th
iteration of the node U in the original DFG.
Due to the delays on the edge , the output of the
k
th
iteration of the node U
i
is consumed by the -th
iteration of the node in the unfolded DFG.
V U
e

( ) J w i
e
i
V U
% +

+
J
w i
(

+
J
w i
( ) J w i
e
i
V U
% +

|
|
.
|

\
|
(

+
+
J
w i
k
( ) J w i
V
% +
Proof
k
th
iteration of the node U
i
corresponds to the (Jk+i)
th
iteration
of the node U, and the -th iteration of
corresponds to -th iteration of node
V.
Therefore in the original DFG, the output of the (Jk+i)
th
iteration of the node U is consumed by the
-th iteration of node V.
|
|
.
|

\
|
(

+
+
J
w i
k ( ) J w i
V
% +
( ) | |
|
|
.
|

\
|
+ +
(

|
|
.
|

\
|
(

+
+ J w i
J
w i
k J %
( ) | |
|
|
.
|

\
|
+ +
(

|
|
.
|

\
|
(

+
+ J w i
J
w i
k J %
k
th
iteration
Unfolded DFG Original DFG
k
th
iteration of U
i
node (Jk+i)
th
iteration of U node
- thiteration by
node
- th
iteration by node V
On edge
(unfolded)
On UV edge
(original)
Output of U
i
is consumed in
- thiteration by
node .
Output of U is consumed in
- th
iteration by node V
|
|
.
|

\
|
(

+
+
J
w i
k
( ) J w i
V
% +
( ) | |
|
|
.
|

\
|
+ +
(

|
|
.
|

\
|
(

+
+ J w i
J
w i
k J %
( ) J w i
e
i
V U
% +

|
|
.
|

\
|
(

+
+
J
w i
k
( ) J w i
V
% +
( ) | |
|
|
.
|

\
|
+ +
(

|
|
.
|

\
|
(

+
+ J w i
J
w i
k J %
U V
w
U
i
( ) J w i
V
% +
(

+
J
w i
( ) | | ( ) i Jk J w i
J
w i
k J +
|
|
.
|

\
|
+ +
(

|
|
.
|

\
|
(

+
+ %
( ) | | i J w i
J
w i
J + +
(

+
%
( ) | | i J w i
J
w i
J =
|
|
.
|

\
|
+ +
(

+
%
So the number of delays on the edge UV is (i+w)-i = w
Unfolding preserves the number of delays in a DFG
( ) | | i Jk J w i
J
w i
J Jk + +
(

+
+ %
k
J
w i
k
(

|
|
.
|

\
|
(

+
+
k
J
w i
k
(

+
+
(

J
w i
Original
Unfolded
PROPERTIES OF UNFOLDING
Next Class

26 Samss 088
100% (1)
26 Samss 088
8 pages
Vlsi DSP Chapter 3 Solution
No ratings yet
Vlsi DSP Chapter 3 Solution
29 pages
Innovus Foundation Flows Guide: Product Version 20.12 September 2020
No ratings yet
Innovus Foundation Flows Guide: Product Version 20.12 September 2020
258 pages
DSPA Solution Manual Chap 5 - KK Parhi
100% (1)
DSPA Solution Manual Chap 5 - KK Parhi
7 pages
Convex Optimization Approach To The Optimal Power Flow Problem in PDF
No ratings yet
Convex Optimization Approach To The Optimal Power Flow Problem in PDF
92 pages
Sales Force Notes
100% (2)
Sales Force Notes
47 pages
Chapter5 Unfolding Parhi Book
No ratings yet
Chapter5 Unfolding Parhi Book
30 pages
FPGA - Ch5 - Unfolding
No ratings yet
FPGA - Ch5 - Unfolding
76 pages
9
No ratings yet
9
18 pages
Chapter 7 Unfolding
No ratings yet
Chapter 7 Unfolding
18 pages
Chapter 5: Unfolding: Keshab K. Parhi
No ratings yet
Chapter 5: Unfolding: Keshab K. Parhi
13 pages
Chap5 PDF
No ratings yet
Chap5 PDF
13 pages
Unfolding
No ratings yet
Unfolding
11 pages
Xu Ly Tin Hieu So Fpga Ho Trung My Dsp Fpga Ch05 Unfolding Hk192 [Cuuduongthancong.com]
No ratings yet
Xu Ly Tin Hieu So Fpga Ho Trung My Dsp Fpga Ch05 Unfolding Hk192 [Cuuduongthancong.com]
75 pages
Chapter 5: Unfolding: Z Introduction
No ratings yet
Chapter 5: Unfolding: Z Introduction
27 pages
CSE4210 Architecture and Hardware For DSP: Unfolding
No ratings yet
CSE4210 Architecture and Hardware For DSP: Unfolding
14 pages
VSP Lec02 Unfolding
No ratings yet
VSP Lec02 Unfolding
47 pages
Unfolding Unfolding: Parallel Processing
No ratings yet
Unfolding Unfolding: Parallel Processing
13 pages
DSP Design - Lecture 6: Unfolding
No ratings yet
DSP Design - Lecture 6: Unfolding
44 pages
0.1 Unfolding: X (N) N y (N) X (n+1) N y (n+1)
No ratings yet
0.1 Unfolding: X (N) N y (N) X (n+1) N y (n+1)
6 pages
DSP-FPGA - Ch06 - Folding - HK202
No ratings yet
DSP-FPGA - Ch06 - Folding - HK202
84 pages
170 Dis
No ratings yet
170 Dis
5 pages
22 - Elementary Graph Algorithms
No ratings yet
22 - Elementary Graph Algorithms
55 pages
Combining Extended Retiming and Unfolding For Rate-Optimal Graph Transformation
No ratings yet
Combining Extended Retiming and Unfolding For Rate-Optimal Graph Transformation
31 pages
Lecture Notes On Petrinet Prof. Javier
No ratings yet
Lecture Notes On Petrinet Prof. Javier
119 pages
Properties of Context-Free Languages: Reading: Chapter 7
No ratings yet
Properties of Context-Free Languages: Reading: Chapter 7
61 pages
Quick Introduction Into SAT/SMT Solvers and Symbolic Execution
No ratings yet
Quick Introduction Into SAT/SMT Solvers and Symbolic Execution
85 pages
Iterative Data Flow Analysis
No ratings yet
Iterative Data Flow Analysis
88 pages
Conversion of CFG To PDA Conversion of PDA To CFG
No ratings yet
Conversion of CFG To PDA Conversion of PDA To CFG
23 pages
FPGA Lec04 Unfolding
No ratings yet
FPGA Lec04 Unfolding
26 pages
April+9 +Augmenting+DFS
No ratings yet
April+9 +Augmenting+DFS
64 pages
Ch-6 CNF and GNF
100% (1)
Ch-6 CNF and GNF
33 pages
Quiz2 3510 Cheat-Sheet
100% (1)
Quiz2 3510 Cheat-Sheet
4 pages
Compl Construction Past q
No ratings yet
Compl Construction Past q
11 pages
VLSI SP (EC6248) - R
No ratings yet
VLSI SP (EC6248) - R
2 pages
CMPE371 Lecture - 7 2324 FALL PART I
No ratings yet
CMPE371 Lecture - 7 2324 FALL PART I
37 pages
Conversion of CFG To PDA Conversion of PDA To CFG
No ratings yet
Conversion of CFG To PDA Conversion of PDA To CFG
22 pages
Chapter 4 Retiming: 1 ECE734 VLSI Arrays For Digital Signal Processing
No ratings yet
Chapter 4 Retiming: 1 ECE734 VLSI Arrays For Digital Signal Processing
24 pages
Chap2 PDF
No ratings yet
Chap2 PDF
25 pages
Vlsi Signal Processing
No ratings yet
Vlsi Signal Processing
455 pages
Chap2 PDF
No ratings yet
Chap2 PDF
25 pages
VLSI Digital Signal Processing Systems: Keshab K. Parhi
No ratings yet
VLSI Digital Signal Processing Systems: Keshab K. Parhi
25 pages
VLSI Digital Signal Processing Systems by Keshab K Parhi
50% (4)
VLSI Digital Signal Processing Systems by Keshab K Parhi
25 pages
Chap2 PDF
No ratings yet
Chap2 PDF
25 pages
Model-Driven Search-Based Loop Fusion Optimization For Handwritte
No ratings yet
Model-Driven Search-Based Loop Fusion Optimization For Handwritte
61 pages
Daa Notes (Final)
No ratings yet
Daa Notes (Final)
41 pages
Retiming
No ratings yet
Retiming
24 pages
Theory of Computing
No ratings yet
Theory of Computing
118 pages
L13 Graph Part02
No ratings yet
L13 Graph Part02
41 pages
Acd Unit-4
No ratings yet
Acd Unit-4
23 pages
Graphs
No ratings yet
Graphs
43 pages
FPGA - Ch0 - Folding
No ratings yet
FPGA - Ch0 - Folding
84 pages
Directed Acyclic Graph (DAG)
No ratings yet
Directed Acyclic Graph (DAG)
16 pages
FLAT 2
No ratings yet
FLAT 2
15 pages
Continuation-Passing, Closure-Passing Style: Andrew W. Appel Trevor Jim
No ratings yet
Continuation-Passing, Closure-Passing Style: Andrew W. Appel Trevor Jim
11 pages
Graphs
No ratings yet
Graphs
29 pages
1.L01-intro
No ratings yet
1.L01-intro
39 pages
FINAL_PAPER
No ratings yet
FINAL_PAPER
7 pages
Shortcuts to College Calculus Refreshment Kit
From Everand
Shortcuts to College Calculus Refreshment Kit
Juan Acevedo
No ratings yet
MCS-011: Problem Solving and Programming
From Everand
MCS-011: Problem Solving and Programming
Dr. DK Sukhani
No ratings yet
Computer Engineering Laboratory Solution Primer
From Everand
Computer Engineering Laboratory Solution Primer
Karan Bhandari
No ratings yet
Math Practice Tests For The ACT
From Everand
Math Practice Tests For The ACT
Vibrant Publishers
No ratings yet
Amazing Java: Learn Java Quickly
From Everand
Amazing Java: Learn Java Quickly
Andrei Besedin
No ratings yet
Complex Variables II Essentials
From Everand
Complex Variables II Essentials
Alan D. Solomon
No ratings yet
OpTransactionHistory10 03 2021
No ratings yet
OpTransactionHistory10 03 2021
2 pages
Evt MCQ
No ratings yet
Evt MCQ
11 pages
Stellar Examples
No ratings yet
Stellar Examples
22 pages
Ecsm
No ratings yet
Ecsm
22 pages
Standard Low Power Methods
No ratings yet
Standard Low Power Methods
7 pages
Unified Power Format
No ratings yet
Unified Power Format
3 pages
Steiner Tree Construction Heuristic
No ratings yet
Steiner Tree Construction Heuristic
19 pages
DDDC87ECAA084E5DA4D2D77F7DC80391
No ratings yet
DDDC87ECAA084E5DA4D2D77F7DC80391
5 pages
WINSEM2013-14 CP0760 07-May-2014 RM01 Ipdesignforlowpower
No ratings yet
WINSEM2013-14 CP0760 07-May-2014 RM01 Ipdesignforlowpower
9 pages
Cache Memory Management
No ratings yet
Cache Memory Management
7 pages
I2C Serial Communication Protocol: Inter Integrated Circuit
No ratings yet
I2C Serial Communication Protocol: Inter Integrated Circuit
7 pages
Remiting For Clockperiod Minimization
No ratings yet
Remiting For Clockperiod Minimization
15 pages
Fiche - Technique - TFC 300 - ACI - Resin TFC M - en - v03
No ratings yet
Fiche - Technique - TFC 300 - ACI - Resin TFC M - en - v03
2 pages
Hydraulic Functions
100% (1)
Hydraulic Functions
96 pages
Quantum Notes Os - Original
No ratings yet
Quantum Notes Os - Original
54 pages
Soc Details
No ratings yet
Soc Details
11 pages
Evaluating Smog Awareness and Preventive Practices Among Pakistani General Population: A Cross-Sectional Survey
No ratings yet
Evaluating Smog Awareness and Preventive Practices Among Pakistani General Population: A Cross-Sectional Survey
15 pages
4.quadratic Equations MCQs
No ratings yet
4.quadratic Equations MCQs
5 pages
NPTI (SR) Scheme Tracing Report TSII
75% (4)
NPTI (SR) Scheme Tracing Report TSII
62 pages
Timing Errors and Jitter: Background
No ratings yet
Timing Errors and Jitter: Background
6 pages
MAAB Style Guideline Version2p2
No ratings yet
MAAB Style Guideline Version2p2
113 pages
Herion 26230, 80107 NAMUR Series: Process Control Valves
No ratings yet
Herion 26230, 80107 NAMUR Series: Process Control Valves
4 pages
Switching and Amplifier Applications
No ratings yet
Switching and Amplifier Applications
4 pages
Applied Physics Lab Manual
No ratings yet
Applied Physics Lab Manual
81 pages
Research On The Application of Digital Twin in Aerospace Manufacturing Based On 3D Point Cloud
No ratings yet
Research On The Application of Digital Twin in Aerospace Manufacturing Based On 3D Point Cloud
6 pages
Nonstructural Components: Seismic Capacity and Demand: G. Magliulo, C. Petrone, G. Manfredi
No ratings yet
Nonstructural Components: Seismic Capacity and Demand: G. Magliulo, C. Petrone, G. Manfredi
46 pages
Unit 1 Signal Degradation
No ratings yet
Unit 1 Signal Degradation
50 pages
Soluções exercícios Szabo chap2
No ratings yet
Soluções exercícios Szabo chap2
14 pages
PeopleLink Epodium Elite - PeopleLink
No ratings yet
PeopleLink Epodium Elite - PeopleLink
4 pages
2012 SUSPENSION Front and Rear Suspension (Inspection) - TL
No ratings yet
2012 SUSPENSION Front and Rear Suspension (Inspection) - TL
19 pages
Child Development - 2022 - Tolmatcheff - The Effectiveness of Moral Disengagement and Social Norms As Anti Bullying
No ratings yet
Child Development - 2022 - Tolmatcheff - The Effectiveness of Moral Disengagement and Social Norms As Anti Bullying
16 pages
Intended Use: Ichroma™ TSH Is A Fluorescence Immunoassay (FIA) For
No ratings yet
Intended Use: Ichroma™ TSH Is A Fluorescence Immunoassay (FIA) For
4 pages
Advanced Modal Jazz Harmony Applied To Twentieth Century Music Compositional Techniques in Jazz Style
100% (5)
Advanced Modal Jazz Harmony Applied To Twentieth Century Music Compositional Techniques in Jazz Style
199 pages
Pilz
No ratings yet
Pilz
9 pages
JCTM New - Thomas PDF
No ratings yet
JCTM New - Thomas PDF
12 pages
Epistemology As Hermeneutics
No ratings yet
Epistemology As Hermeneutics
20 pages
BCM Holiday Package Edited 1
No ratings yet
BCM Holiday Package Edited 1
57 pages
45ps Phase
No ratings yet
45ps Phase
2 pages
M22A-EQP-MAN-MC6 Manual Magnet GTS MaplaSystem Operation
No ratings yet
M22A-EQP-MAN-MC6 Manual Magnet GTS MaplaSystem Operation
106 pages

Unfolding Basic Good One

Uploaded by

Unfolding Basic Good One

Uploaded by

Unfolding basics

You might also like