SlideShare a Scribd company logo
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056
Volume: 04 Issue: 06 | June-2017 www.irjet.net p-ISSN: 2395-0072
© 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 86
An Area Efficient Mixed Decimation MDF Architecture for Radix
Parallel FFT
Reshma K J1, Prof. Ebin M Manuel2
1M-Tech, Dept. of ECE Engineering, Government Engineering College, Idukki, Kerala, India
2Professor, Dept. of ECE Engineering, Government Engineering College, Idukki, Kerala, India
---------------------------------------------------------------------***---------------------------------------------------------------------
Abstract - Fast Fourier Transform (FFT) has got many
applications in the field of digital signal processing. In this
project, an area efficient Mixed decimation Multipath Delay
Feedback ( DF) methodology have been presented for the
radix FFT computation. The DF architecture can be
employed by using the principle of folding transformation.
Thereby the idle period of arithmetic units in Multipath Delay
Feedback (MDF) architecture can bemobilized. Thisis done by
the integration of Decimation-In-Time (DIT) operations into
the Decimation-In-Frequency(DIF)operatedcomputing units.
The DF architecture system design was modified by using
Han Carlson adder which is efficient in area and fast in
operation. Besides, the DF system is compared with
proposed DF system using Han Carlson adder both
theoretically and experimentally. From the obtained
expressions and statistics, it can be concluded that the
proposed system can be used as an area efficient system since
it achieves improved efficiency in the consumption of
arithmetic resources. The hardware simulation is done in
Xilinx Virtex-6 Field-Programmable Gate Array (FPGA), using
the programming software ISE 14.2 Vivado Design Suite.
Key Words: Fast Fourier Transform (FFT), Multipath
Delay Feedback (MDF), Decimation-In-Time (DIT),
Decimation-In-Frequency (DIF), Pipelined architecture.
1. INTRODUCTION
Fast Fourier Transform (FFT) is an efficient algorithm for
Discrete Fourier Transform (DFT) computation. Therefore,
an efficient implementation of FFT has attracted much
consideration and various schemes have been put forward
by the hardware designers to achieve reasonable tradeoffs
between area and performance. When compared to other
hardware structures, pipelined architectures [1]-[6] have a
characteristic advantage over other efficient hardware
structures in providing high throughputs. Single path Delay
Commutator (SDC) structure [1] is one of the most
conventional approaches to perform the pipelined FFT
computation in the Serial Input Serial Output (SISO)
scenario. Single path Delay Feedback (SDF) architecture [1]
is proposed in order to decrease the memory banks in SDC
pipelines. SDF architecture is composed of many feedback
connections in the circuits. These architectures can be used
with any algorithm such as radix-2, radix-4, and especially
radix- algorithm in order to execute the DFT operation.
The radix- pipeline is equipped with simpler butterfly
units when compared to radix-4 approach while making a
better utilization of complex multipliers than the typical
radix-2 scheme.
From the perspective of hardware design, radix-
algorithm acts as an effective alternative to theconventional
computation methods. Certainly, the extension of
communication service has encouraged a dramatic rise of
throughput requirements. The demand for high throughput
can be achieved by using another upgraded structures [2].
These structures can be used to calculate the FFT when
several samples of the same sequence are received in
parallel. So Multipath Delay Commutator (MDC) [3] and
Multipath Delay Feedback (MDF) [4] are proposed to
improve the throughput rate. MDC and MDF work as the
upgrade of SDC and SDF respectively.
In general, multiple interconnected SDF paths are joined
together to form MDF structure. Each SDF path is used for
managing one of the parallel input streams. This design
contributes to efficientutilizationofmemoryresourcesbutit
has got only 50 % utilization of adders and provides only
less throughput. By contrast, the hardware efficiency of
arithmetic units (AUs) can be improved by using the MDC
approach. But for either regrouping the samples or folding
the streams, additional memories have tobeconsumed. This
will again additionally leads to an increase of computing
delay. The feedback structures such as SDF and MDF design
afford possible solutions to make a balance between the
consumption of hardware resources and the reachable
performance [4].
As we move forward the discussion, the hardware
resources can be further divided into two categories:
arithmetic resources and memory resources. Arithmetic
resources are associated with logical or arithmetic
operations and memory resources are responsible for
caching samples. The MDF scheme [4] has been a great
success in a variety of applications due to the outstanding
performance in the efficient use of memory resources.
Beneath the triumph, the underutilization of arithmetic
resources is still an important problem for feedback design
and has not been determined satisfactorily.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056
Volume: 04 Issue: 06 | June-2017 www.irjet.net p-ISSN: 2395-0072
© 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 87
The objective of this project is to design a new area
efficient mixed decimation MDF architecture which can
achieve improved efficiency in utilization of arithmetic
resources while maintaining the advantages of feedback
structures. The theory of foldingtransformationis employed
to derive the proposed scheme namely, the mixed
decimation Multi path Delay Feedback ( DF)architecture.
The integration of Decimation-In-Time(DIT)operationsinto
the Decimation-In-Frequency(DIF)operatedbuildingblocks
can activate the idle period of arithmetic units in MDF
architectures. Thus significant decrease in the consumption
of arithmetic resources can be achieved by DF
architecture.
2. MIXED DECIMATION MDF SYSTEM
DF architecture is derived from the theory of folding
transformation. This will activates the idle period of
arithmetic units in MDF architecture. The operations in SDF
pipeline are rescheduled to reverse the underutilization of
arithmetic modules whichisaccomplishedbyintegratingthe
DIT operations into the DIF operated computing units. So
there by better efficiency in the consumption of arithmetic
resources [4].
2.1 SDF DIF Scheme Using Folding Transformation
The folding transformation offers a methodical procedure
to derive many FFT architectures. In folding transformation,
several algorithm operationsaretimemultiplexedonasingle
computing device [5]. Thealgorithmcanbepresentedusinga
data flow graph shown in Fig 1, where the nodes represent
computations and the directed edges represent data paths.
The data flow graph consists of four stages and eight
operations are executed within each stage. When , i = 0, …,
15 arrives in serial, multiple operations in each stage can be
time multiplexed to a single computing unit without any
collisions and the control circuits are determined
systematically by folding transformation.Inthisway,theFFT
Flow graph shown in Fig 1 can be converted into a pipelined
form as shown in Fig 2 , where operations A0, …, A7, B0, …,
B7,C0, …, C7 and D0, …, D7 are performed in the computing
unit , , and respectively [4]. The multiple
operations included in a computing module are arranged by
folding sets. A folding set is an ordered set of operations
executed by the same computing unit. Each folding set
contains many operations and some of which may be
generally called as null operations, Ф.
Fig -1: Data flow graph of 16 Point R 2^2 DIF FFT
Fig -2: Data flow graph converted into a pipelined version
through folding transformation
The folding set corresponding to , , and can
be written as :
= {Ф Ф Ф Ф Ф Ф Ф Ф A0 A1 A2 A3 A4 A5 A6 A7}
= {Ф Ф Ф Ф B0 B1 B2 B3 Ф Ф Ф Ф B4 B5 B6 B7}
= {Ф Ф C0 C1 Ф Ф C2 C3 Ф Ф C4 C5 Ф Ф C6 C7 }
= {Ф D0 Ф D1 Ф D2 Ф D3 Ф D4 Ф D5 Ф D6 Ф D7}
When obtaining the folding sets associated with , ,
and , the supporting circuits used to fulfill the time
multiplexing of computing units can be obtained through
folding transformation [5]. By introducing the register
minimization technique, the implementation scheme can be
further optimized, which eventually generates the SDF
hardware structure which is shown in Fig 3. So that
hardware efficiency can be improved.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056
Volume: 04 Issue: 06 | June-2017 www.irjet.net p-ISSN: 2395-0072
© 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 88
Fig -3: 16 Point R 2^2 SDF DIF FFT structure
Each stage in radix- SDF FFT [6] contains of butterfly I,
butterfly II, complex multipliers with twiddle factors.
Butterfly I operate on the input data, butterfly II operate on
the output data from butterflyI,thenmultipliedwithtwiddle
factors to get the result of the current stage. Shiftregisters or
Random Access Memory (RAM) are used to store the output
data from butterfly elements. Only one complexmultiplieris
used, since, input data sequence followsa singleoutputpath.
Butterfly 1 structure [6] is shown in Fig 4. The input for this
butterfly comes from the previous component which is the
twiddle factor multiplier. The output data from butterfly 1
goes to the next stage which is usually the butterfly II. The
control signal C1 has two options. That is either C1=0 or
C1=1. When C1=0, multiplexers direct the input data to the
feedback registers until they get filled. The other option is
C1=1, the multiplexers select the output of the adders and
subtracters.
Fig -4: Butterfly 1 structure
Butterfly 2 structure is shown in Fig 3. The butterfly 2
structure [6] is same as that of butterfly 1. But in additional,
there is a swap-mux provided for –j multiplication. The
multiplication by –j contains swapping between real part
and imaginary part and sign inversion. Swap-MUX is
efficiently used for swapping real andimaginarypartand the
sign inversion is handled by switching between the adding
and the subtracting operations by mean of swap-MUX. The
control signals C1 and C2 will be one when there is a need
for multiplication by –j .
Fig -5: Butterfly 2 structure
2.2 SDF DIT Scheme Using Folding Transformation
The DIT algorithm can also be represented using a data
flow graph shown in Fig 6. Similar processes as in DIF canbe
applied to analyze DIT algorithm. As shown in the Fig 6, the
flow graph contain four stages and eight operations are
performed within each stage. When , i = 0, . . . ,15 arrives
in serial, multiple operations in each stage can be time
multiplexed to a singlecomputingunitwithoutanycollisions
and the control circuits are determined systematically by
folding transformation [5]. In this way, the FFT Flow graph
shown in Fig 6 can be converted into a pipelined form
through folding transformation as shown in Fig 7, where
operations A0, . . . , A7, B0, . . . , B7,C0, . . . ,C7 and D0,……D7
are executed in the computing unit , , and
respectively. The folding set corresponding to , ,
and can be written as :
= {Ф A0 Ф A1 Ф A2 Ф A3 Ф A4 Ф A5 Ф A6 Ф A7}
={Ф Ф B0 B1 Ф Ф B2 B3 Ф Ф B4 B5 Ф Ф B6 B7 }
= {Ф Ф Ф Ф C0 C1 C2 C3 Ф Ф Ф Ф C4 C5 C6 C7}
={Ф Ф Ф Ф Ф Ф Ф Ф D0 D1 D2 D3 D4 D5 D6 D7}
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056
Volume: 04 Issue: 06 | June-2017 www.irjet.net p-ISSN: 2395-0072
© 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 89
Fig -6: Data flow graph of 16 Point R 2^2 DIT FFT
Fig -7: Data flow graph converted into a pipelined version
through folding transformation
When obtaining the folding sets associated with , ,
and , the supporting circuits used to fulfill the time
multiplexing of computing units can be obtained through
folding transformation [5]. By using register minimization
method implementation system can be further optimized
which eventually generates the SDF hardware structure
which is shown in Fig 8. There by hardware efficiencycanbe
enhanced. The DIT SDF hardware structure is exactly the
reverse of DIF hardware scheme.
Fig -8: 16 Point R 2^2 SDF DIT FFT structure
The folding sets conveys the fact that , whether the
designers adopt the DIF approach or DIT scheme to
construct the SDF circuit, the existence of null operations in
folding sets will always reduce the efficiency of arithmetic
components. This will lead to an approximate 50%
utilization of complex adders merely. To address this issue,
we rearrange the operations in SDF pipelines to activate the
idle intervals of computing units [4]. From the SDF DIF and
SDF DIT hardware structures wecanclearlyunderstandthat
both DIF and DIT approach uses same hardware resources
.But only difference is that DIT is the reverse of DIF scheme.
So by integrating the DIT operations into DIF operated
computing units, the standby time of arithmetic modules in
feedback architectures can be eliminated. The integration of
DIF and DIT pipelined processor to fully utilize the
computing units is shown in Fig 9.
Fig -9: Integration of DIF and DIT pipelined processor
The SDF hardware structure can be upgraded by using
another hardware structure called Type 1 [4] which is
shown in Fig 11. The multiplexors andselectorsinthecircuit
can be divided into two categories, white for category I and
gray for category II .When arithmetic modulesareinservice,
the components within each category share the identical
logic signal. The control scheme, which is synchronizedwith
the DIF computing stream is summarized as follows: For the
first M samples of the stream, the components belonging to
category I are controlled by logic 1, while others in category
II are controlled by logic 0, where M is the length of shift
register sets in the arithmetic module. During the next M
samples, the control signals should be inverted.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056
Volume: 04 Issue: 06 | June-2017 www.irjet.net p-ISSN: 2395-0072
© 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 90
Fig -10: Type 1 structure
Final system DF design can be constructed by using
this Type 1 structure. Fig 12 shows the system design of two
parallel DF design for 32 point FFT. During 32 point DFT
processing, and execute DIF and DIT scheme
respectively. represent first 16 input samples and
represent next 16 input samples. In addition, as input
samples required to be rearranged as bit reversal before
participating in DIT computation. The DIT output samples
are multiplied by necessary twiddle factors for 32 point FFT
computation. Finally parallel streams are connected to
simple radix 2 butterfly unit to get 32 output samples for
FFT [4].
Fig -11: System design of two parallel DF architecture
for 32 Point FFT
3. PROPOSED DF SYSTEM
In any of FFT architectures, Butterfly unit is the important
block to calculate data addition and substraction.Anadderis
the main building block of butterfly element. The key
requirement of adder is that the adder should be fast in
operation and efficient in terms of chip area. So the
performance of butterfly unit can be further enhanced by
using high speed and an area efficient adder [7].
3.1 DF Design by Using Han Carlson Adder
The simplest method ofdoingbinaryadditionistoconnect
the carry-out from the previous bit to the carry-inofthenext
bit. Each bit takes carry-in as one of the inputsandgivessum
and carry-out as output bit and therefore named as ripple
carry adder. This type of adders is made by cascading 1 bit
full adders. But ripple carry adder has high propagation
delay. So DF system designed by usingripplecarryadder
have only small performance [7].
The DF system design is modified by using highspeed
and an area efficient Han Carlson adder. Han Carlson adder
is a parallel prefix tree [8]. Parallel prefix adders are high
performance adders. Han Carlson adder is shown in Fig 13
and it uses only even bits for performing carry-merge
operations. Generate and propagate signals of odd bits are
transferred down the prefix tree. The true carry bits are
produced by the recombination of generate and propagate
signals with even bit carry signals. So by using high speed
and area efficient Han Carlson adder, system designcangive
better performance.
Fig -12: 16 Bit Han Carlson adder
4. RESULTS
Simulation results of DF system are givenbelow.Coding
was done using Verilog HDL (Hardware Description
Language). The hardware simulationisdonein Xilinx Virtex-
6 Field-Programmable Gate Array (FPGA), XC6VLX240T-
3FF784 using the programming software ISE 14.2.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056
Volume: 04 Issue: 06 | June-2017 www.irjet.net p-ISSN: 2395-0072
© 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 91
4.1 DF System
Fig 14 shows the simulation result of two parallel 32point
R 2^2 DF FFT. a1 and a2 represent two 16 point parallel
DIF and DIT input samples respectively. t_r and t_img are
the 32 point output samples for given input samples and
clock signal,clk.
Fig -13: Simulation result of two parallel 32 point R 2^2
DF FFT
4.2 Proposed DF System
DF architecture is modified by using highspeedand an
area efficient Han Carlson adder. So the Proposed DF
System is found to be area efficient. Fig 15 shows the
simulation result of proposed two parallel 32 point R 2^2
DF FFT. The simulation result obtained is same for both
system design and proposed scheme.
Fig -14: Simulation result of two parallel 32 point R 2^2
DF FFT
4.3 Performance Analysis
The performanceanalysiswasdoneonbothsystemdesign
and proposed DF system design. The occupationofslices
is evaluated for comparing hardware efficiency. Table 1
shows the design summary of system design and proposed
DF system for 32 point FFT.
Table -1: Device utilization summary of system design
and proposed DF system
Structures Slice
LUTs
Slice
registers
LUT FF
pairs
Total
DF
Architecture 1561 7722 1082 10365
Proposed
DF
Architecture
1527 7717 1078 10322
From the device utilization summary, we can understand
that when DF system was modified byusingareaefficient
Han Carlson adder, there is decrease in the utilization ofslice
registers, slice LUTS and no of fully used LUT FF pairs than
the DF system designed withripple carryadder.Thereby
we obtained a new area efficient hardware for DF
architecture. Chart 1 shows the graphical representation of
above obtained test results in table 1.
Chart -1: Device utilization summary of system design and
proposed DF system
5. CONCLUSIONS
A new area efficient DF architectureforradix parallel
FFT has been designed. The inefficient use of adders and
multipliers is an important problem in most of feedback
architectures. By using this DF architecture, the standby
time of arithmetic modules can be eliminated. The system
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056
Volume: 04 Issue: 06 | June-2017 www.irjet.net p-ISSN: 2395-0072
© 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 92
was designed by using Verilog HDL in Xilinx ISE 14.2 Design
Suite. The performance of adders can again be enhanced by
using high speed adder. The DF architecture system was
then modified by using high speed and an area efficient Han
Carlson adder. Performance of both system design and
proposed system was analyzed. Area and delay are taken as
the parameters for performance analysis. Hardware is
obtained to be efficient for proposed system by 0.4%.
ACKNOWLEDGEMENT
I would like to thank my project guide Prof. Ebin M Manuel
for his constant support and timely advice throughout my
work. He consistently steered me in the right direction
especially during my research and writing which helped me
to complete my work successfully.
REFERENCES
[1] A. Cortes, I. Velez, and J. F. Sevillano, “Radix r^k FFTs:
Matrical representation and SDC/SDF pipeline
implementation, ” IEEE Trans. Signal Process., vol. 57, no. 7,
pp. 2824–2839, Jul. 2009.
[2] C. Cheng and K. K. Parhi, “High-throughput VLSI
architecture for FFT computation, ”IEEETrans.CircuitsSyst.
II, Exp. Briefs, vol. 54, no. 10, pp. 863–867, Oct. 2007.
[3] M. Garrido, J. Grajal, M. A. Sanchez, and O. Gustafsson,
“Pipelined radix-2^k feedforward FFT architectures, ” IEEE
Trans. Very Large Scale Integr. (VLSI) Syst., vol. 21, no. 1,
pp. 23–32, Jan. 2013.
[4] Jian Wang, Chunlin Xiong, Kangli Zhang and Jibo Wei, “A
mixed decimation MDF architecture for radix 2^k parallel
FFT, ” IEEE Trans.on very large scale integration, vol. 24, no.
1, pp. 414–426, Jan 2016.
[5] M. Ayinala, M. Brown, and K. K. Parhi, “Pipelinedparallel
FFT architectures via folding transformation, ” IEEE Trans.
Very Large Scale Integr. (VLSI) Syst., vol. 20, no. 6, pp. 1068–
1081, Jun. 2012.
[6] Yazan Samir Algnabi, Rozita Teymourzadeh, Masuri
Othman, Md Shabiul Islam”FPGAImplementationofPipeline
Multiplier-Less Radix 2^2 DIF SDF Butterfly for Fast Fourier
Transform Structure”, MicroEngineering VLSI Design
Department, June 2012
[7] Mallapu Santhosh Kumar1 and K.Dhanunjaya (Ph.d),
“Comparison of Various 32-Bit Parallel Prefix Adders,
“International Journal for Research in Applied Science &
Engineering Technology (IJRASET), ISSN: 2321-9653,
Volume 3, Issue VI, Jun. 2015.
[8] Gayathri.G, Raju S.S and Suresh’s, “Parallel Prefix
Speculative Han Carlson Adder “, IOSR Journal ofElectronics
and Communication Engineering(IOSR-JECE),e-ISSN:2278-
2834,p-ISSN: 2278-8735, Volume 11, Issue 3, PP 38-43,May
2016.

More Related Content

What's hot (15)

PDF
High Performance MAC Unit for FFT Implementation
IJMER
 
PDF
EFFICIENT ABSOLUTE DIFFERENCE CIRCUIT FOR SAD COMPUTATION ON FPGA
VLSICS Design
 
PDF
International Journal of Computational Engineering Research(IJCER)
ijceronline
 
PDF
IRJET- Low Complexity Pipelined FFT Design for High Throughput and Low Densit...
IRJET Journal
 
PDF
FPGA based Efficient Interpolator design using DALUT Algorithm
cscpconf
 
PDF
IRJET- MAC Unit by Efficient Grouping of Partial Products along with Circular...
IRJET Journal
 
PDF
Transpose Form Fir Filter Design for Fixed and Reconfigurable Coefficients
IRJET Journal
 
PDF
Ad04606184188
IJERA Editor
 
PDF
Braun’s Multiplier Implementation using FPGA with Bypassing Techniques.
VLSICS Design
 
PDF
Compare Efficiency of Different Multipliers Using Verilog Simulation & Modify...
IJLT EMAS
 
PDF
High Speed and Time Efficient 1-D DWT on Xilinx Virtex4 DWT Using 9/7 Filter ...
IOSR Journals
 
PDF
F1074145
IJERD Editor
 
PDF
LOGIC OPTIMIZATION USING TECHNOLOGY INDEPENDENT MUX BASED ADDERS IN FPGA
VLSICS Design
 
PDF
IMPLEMENTATION OF SDC - SDF ARCHITECTURE FOR RADIX-4 FFT
VLSICS Design
 
High Performance MAC Unit for FFT Implementation
IJMER
 
EFFICIENT ABSOLUTE DIFFERENCE CIRCUIT FOR SAD COMPUTATION ON FPGA
VLSICS Design
 
International Journal of Computational Engineering Research(IJCER)
ijceronline
 
IRJET- Low Complexity Pipelined FFT Design for High Throughput and Low Densit...
IRJET Journal
 
FPGA based Efficient Interpolator design using DALUT Algorithm
cscpconf
 
IRJET- MAC Unit by Efficient Grouping of Partial Products along with Circular...
IRJET Journal
 
Transpose Form Fir Filter Design for Fixed and Reconfigurable Coefficients
IRJET Journal
 
Ad04606184188
IJERA Editor
 
Braun’s Multiplier Implementation using FPGA with Bypassing Techniques.
VLSICS Design
 
Compare Efficiency of Different Multipliers Using Verilog Simulation & Modify...
IJLT EMAS
 
High Speed and Time Efficient 1-D DWT on Xilinx Virtex4 DWT Using 9/7 Filter ...
IOSR Journals
 
F1074145
IJERD Editor
 
LOGIC OPTIMIZATION USING TECHNOLOGY INDEPENDENT MUX BASED ADDERS IN FPGA
VLSICS Design
 
IMPLEMENTATION OF SDC - SDF ARCHITECTURE FOR RADIX-4 FFT
VLSICS Design
 

Similar to An Area Efficient Mixed Decimation MDF Architecture for Radix 22 Parallel FFT (20)

DOCX
A combined sdc
sakthi1986
 
DOCX
s.Magesh kumar DECE,BTECH,ME (ASAN MEMORIAL COLLEGE OF ENGINEERING AND TECHNO...
sakthi1986
 
PDF
IRJET - Design and Implementation of FFT using Compressor with XOR Gate Topology
IRJET Journal
 
PDF
427 432
Editor IJARCET
 
PDF
Iaetsd pipelined parallel fft architecture through folding transformation
Iaetsd Iaetsd
 
PDF
HIGH PERFORMANCE SPLIT RADIX FFT
AM Publications
 
PDF
Efficient fpga mapping of pipeline sdf fft cores
Nxfee Innovation
 
PDF
Gn3311521155
IJERA Editor
 
PDF
Aw25293296
IJERA Editor
 
PDF
Performance Analysis of OFDM Transceiver with Folded FFT and LMS Filter
idescitation
 
PDF
IRJET- VLSI Architecture for Reversible Radix-2 FFT Algorithm using Programma...
IRJET Journal
 
PDF
Design Radix-4 64-Point Pipeline FFT/IFFT Processor for Wireless Application
International Journal of Engineering Inventions www.ijeijournal.com
 
PDF
My paper
Venkat Malai Avichi
 
PDF
A Pipelined Fused Processing Unit for DSP Applications
ijiert bestjournal
 
PDF
IRJET - Distributed Arithmetic Method for Complex Multiplication
IRJET Journal
 
DOCX
A mixed decimation mdf architecture for radix-2k parallel fft
I3E Technologies
 
PDF
IRJET- Implementation of Reversible Radix-2 FFT VLSI Architecture using P...
IRJET Journal
 
PDF
High Speed Area Efficient 8-point FFT using Vedic Multiplier
IJERA Editor
 
PDF
FPGA Implementation of Mixed Radix CORDIC FFT
IJSRD
 
PDF
journal for research
chaitanya451336
 
A combined sdc
sakthi1986
 
s.Magesh kumar DECE,BTECH,ME (ASAN MEMORIAL COLLEGE OF ENGINEERING AND TECHNO...
sakthi1986
 
IRJET - Design and Implementation of FFT using Compressor with XOR Gate Topology
IRJET Journal
 
Iaetsd pipelined parallel fft architecture through folding transformation
Iaetsd Iaetsd
 
HIGH PERFORMANCE SPLIT RADIX FFT
AM Publications
 
Efficient fpga mapping of pipeline sdf fft cores
Nxfee Innovation
 
Gn3311521155
IJERA Editor
 
Aw25293296
IJERA Editor
 
Performance Analysis of OFDM Transceiver with Folded FFT and LMS Filter
idescitation
 
IRJET- VLSI Architecture for Reversible Radix-2 FFT Algorithm using Programma...
IRJET Journal
 
Design Radix-4 64-Point Pipeline FFT/IFFT Processor for Wireless Application
International Journal of Engineering Inventions www.ijeijournal.com
 
A Pipelined Fused Processing Unit for DSP Applications
ijiert bestjournal
 
IRJET - Distributed Arithmetic Method for Complex Multiplication
IRJET Journal
 
A mixed decimation mdf architecture for radix-2k parallel fft
I3E Technologies
 
IRJET- Implementation of Reversible Radix-2 FFT VLSI Architecture using P...
IRJET Journal
 
High Speed Area Efficient 8-point FFT using Vedic Multiplier
IJERA Editor
 
FPGA Implementation of Mixed Radix CORDIC FFT
IJSRD
 
journal for research
chaitanya451336
 
Ad

More from IRJET Journal (20)

PDF
Enhanced heart disease prediction using SKNDGR ensemble Machine Learning Model
IRJET Journal
 
PDF
Utilizing Biomedical Waste for Sustainable Brick Manufacturing: A Novel Appro...
IRJET Journal
 
PDF
Kiona – A Smart Society Automation Project
IRJET Journal
 
PDF
DESIGN AND DEVELOPMENT OF BATTERY THERMAL MANAGEMENT SYSTEM USING PHASE CHANG...
IRJET Journal
 
PDF
Invest in Innovation: Empowering Ideas through Blockchain Based Crowdfunding
IRJET Journal
 
PDF
SPACE WATCH YOUR REAL-TIME SPACE INFORMATION HUB
IRJET Journal
 
PDF
A Review on Influence of Fluid Viscous Damper on The Behaviour of Multi-store...
IRJET Journal
 
PDF
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...
IRJET Journal
 
PDF
Explainable AI(XAI) using LIME and Disease Detection in Mango Leaf by Transfe...
IRJET Journal
 
PDF
BRAIN TUMOUR DETECTION AND CLASSIFICATION
IRJET Journal
 
PDF
The Project Manager as an ambassador of the contract. The case of NEC4 ECC co...
IRJET Journal
 
PDF
"Enhanced Heat Transfer Performance in Shell and Tube Heat Exchangers: A CFD ...
IRJET Journal
 
PDF
Advancements in CFD Analysis of Shell and Tube Heat Exchangers with Nanofluid...
IRJET Journal
 
PDF
Breast Cancer Detection using Computer Vision
IRJET Journal
 
PDF
Auto-Charging E-Vehicle with its battery Management.
IRJET Journal
 
PDF
Analysis of high energy charge particle in the Heliosphere
IRJET Journal
 
PDF
A Novel System for Recommending Agricultural Crops Using Machine Learning App...
IRJET Journal
 
PDF
Auto-Charging E-Vehicle with its battery Management.
IRJET Journal
 
PDF
Analysis of high energy charge particle in the Heliosphere
IRJET Journal
 
PDF
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...
IRJET Journal
 
Enhanced heart disease prediction using SKNDGR ensemble Machine Learning Model
IRJET Journal
 
Utilizing Biomedical Waste for Sustainable Brick Manufacturing: A Novel Appro...
IRJET Journal
 
Kiona – A Smart Society Automation Project
IRJET Journal
 
DESIGN AND DEVELOPMENT OF BATTERY THERMAL MANAGEMENT SYSTEM USING PHASE CHANG...
IRJET Journal
 
Invest in Innovation: Empowering Ideas through Blockchain Based Crowdfunding
IRJET Journal
 
SPACE WATCH YOUR REAL-TIME SPACE INFORMATION HUB
IRJET Journal
 
A Review on Influence of Fluid Viscous Damper on The Behaviour of Multi-store...
IRJET Journal
 
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...
IRJET Journal
 
Explainable AI(XAI) using LIME and Disease Detection in Mango Leaf by Transfe...
IRJET Journal
 
BRAIN TUMOUR DETECTION AND CLASSIFICATION
IRJET Journal
 
The Project Manager as an ambassador of the contract. The case of NEC4 ECC co...
IRJET Journal
 
"Enhanced Heat Transfer Performance in Shell and Tube Heat Exchangers: A CFD ...
IRJET Journal
 
Advancements in CFD Analysis of Shell and Tube Heat Exchangers with Nanofluid...
IRJET Journal
 
Breast Cancer Detection using Computer Vision
IRJET Journal
 
Auto-Charging E-Vehicle with its battery Management.
IRJET Journal
 
Analysis of high energy charge particle in the Heliosphere
IRJET Journal
 
A Novel System for Recommending Agricultural Crops Using Machine Learning App...
IRJET Journal
 
Auto-Charging E-Vehicle with its battery Management.
IRJET Journal
 
Analysis of high energy charge particle in the Heliosphere
IRJET Journal
 
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...
IRJET Journal
 
Ad

Recently uploaded (20)

PPTX
Biosensors, BioDevices, Biomediccal.pptx
AsimovRiyaz
 
PPT
Testing and final inspection of a solar PV system
MuhammadSanni2
 
PPTX
MODULE 05 - CLOUD COMPUTING AND SECURITY.pptx
Alvas Institute of Engineering and technology, Moodabidri
 
PPTX
原版一样(EC Lille毕业证书)法国里尔中央理工学院毕业证补办
Taqyea
 
PDF
aAn_Introduction_to_Arcadia_20150115.pdf
henriqueltorres1
 
PPTX
Numerical-Solutions-of-Ordinary-Differential-Equations.pptx
SAMUKTHAARM
 
PPTX
Distribution reservoir and service storage pptx
dhanashree78
 
PPTX
澳洲电子毕业证澳大利亚圣母大学水印成绩单UNDA学生证网上可查学历
Taqyea
 
PDF
Bachelor of information technology syll
SudarsanAssistantPro
 
PDF
mbse_An_Introduction_to_Arcadia_20150115.pdf
henriqueltorres1
 
PDF
methodology-driven-mbse-murphy-july-hsv-huntsville6680038572db67488e78ff00003...
henriqueltorres1
 
PDF
Digital water marking system project report
Kamal Acharya
 
PDF
Data structures notes for unit 2 in computer science.pdf
sshubhamsingh265
 
PDF
Basic_Concepts_in_Clinical_Biochemistry_2018كيمياء_عملي.pdf
AdelLoin
 
PPTX
MODULE 04 - CLOUD COMPUTING AND SECURITY.pptx
Alvas Institute of Engineering and technology, Moodabidri
 
PDF
methodology-driven-mbse-murphy-july-hsv-huntsville6680038572db67488e78ff00003...
henriqueltorres1
 
PDF
Water Industry Process Automation & Control Monthly July 2025
Water Industry Process Automation & Control
 
PPTX
darshai cross section and river section analysis
muk7971
 
PPTX
How Industrial Project Management Differs From Construction.pptx
jamespit799
 
PDF
Reasons for the succes of MENARD PRESSUREMETER.pdf
majdiamz
 
Biosensors, BioDevices, Biomediccal.pptx
AsimovRiyaz
 
Testing and final inspection of a solar PV system
MuhammadSanni2
 
MODULE 05 - CLOUD COMPUTING AND SECURITY.pptx
Alvas Institute of Engineering and technology, Moodabidri
 
原版一样(EC Lille毕业证书)法国里尔中央理工学院毕业证补办
Taqyea
 
aAn_Introduction_to_Arcadia_20150115.pdf
henriqueltorres1
 
Numerical-Solutions-of-Ordinary-Differential-Equations.pptx
SAMUKTHAARM
 
Distribution reservoir and service storage pptx
dhanashree78
 
澳洲电子毕业证澳大利亚圣母大学水印成绩单UNDA学生证网上可查学历
Taqyea
 
Bachelor of information technology syll
SudarsanAssistantPro
 
mbse_An_Introduction_to_Arcadia_20150115.pdf
henriqueltorres1
 
methodology-driven-mbse-murphy-july-hsv-huntsville6680038572db67488e78ff00003...
henriqueltorres1
 
Digital water marking system project report
Kamal Acharya
 
Data structures notes for unit 2 in computer science.pdf
sshubhamsingh265
 
Basic_Concepts_in_Clinical_Biochemistry_2018كيمياء_عملي.pdf
AdelLoin
 
MODULE 04 - CLOUD COMPUTING AND SECURITY.pptx
Alvas Institute of Engineering and technology, Moodabidri
 
methodology-driven-mbse-murphy-july-hsv-huntsville6680038572db67488e78ff00003...
henriqueltorres1
 
Water Industry Process Automation & Control Monthly July 2025
Water Industry Process Automation & Control
 
darshai cross section and river section analysis
muk7971
 
How Industrial Project Management Differs From Construction.pptx
jamespit799
 
Reasons for the succes of MENARD PRESSUREMETER.pdf
majdiamz
 

An Area Efficient Mixed Decimation MDF Architecture for Radix 22 Parallel FFT

  • 1. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056 Volume: 04 Issue: 06 | June-2017 www.irjet.net p-ISSN: 2395-0072 © 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 86 An Area Efficient Mixed Decimation MDF Architecture for Radix Parallel FFT Reshma K J1, Prof. Ebin M Manuel2 1M-Tech, Dept. of ECE Engineering, Government Engineering College, Idukki, Kerala, India 2Professor, Dept. of ECE Engineering, Government Engineering College, Idukki, Kerala, India ---------------------------------------------------------------------***--------------------------------------------------------------------- Abstract - Fast Fourier Transform (FFT) has got many applications in the field of digital signal processing. In this project, an area efficient Mixed decimation Multipath Delay Feedback ( DF) methodology have been presented for the radix FFT computation. The DF architecture can be employed by using the principle of folding transformation. Thereby the idle period of arithmetic units in Multipath Delay Feedback (MDF) architecture can bemobilized. Thisis done by the integration of Decimation-In-Time (DIT) operations into the Decimation-In-Frequency(DIF)operatedcomputing units. The DF architecture system design was modified by using Han Carlson adder which is efficient in area and fast in operation. Besides, the DF system is compared with proposed DF system using Han Carlson adder both theoretically and experimentally. From the obtained expressions and statistics, it can be concluded that the proposed system can be used as an area efficient system since it achieves improved efficiency in the consumption of arithmetic resources. The hardware simulation is done in Xilinx Virtex-6 Field-Programmable Gate Array (FPGA), using the programming software ISE 14.2 Vivado Design Suite. Key Words: Fast Fourier Transform (FFT), Multipath Delay Feedback (MDF), Decimation-In-Time (DIT), Decimation-In-Frequency (DIF), Pipelined architecture. 1. INTRODUCTION Fast Fourier Transform (FFT) is an efficient algorithm for Discrete Fourier Transform (DFT) computation. Therefore, an efficient implementation of FFT has attracted much consideration and various schemes have been put forward by the hardware designers to achieve reasonable tradeoffs between area and performance. When compared to other hardware structures, pipelined architectures [1]-[6] have a characteristic advantage over other efficient hardware structures in providing high throughputs. Single path Delay Commutator (SDC) structure [1] is one of the most conventional approaches to perform the pipelined FFT computation in the Serial Input Serial Output (SISO) scenario. Single path Delay Feedback (SDF) architecture [1] is proposed in order to decrease the memory banks in SDC pipelines. SDF architecture is composed of many feedback connections in the circuits. These architectures can be used with any algorithm such as radix-2, radix-4, and especially radix- algorithm in order to execute the DFT operation. The radix- pipeline is equipped with simpler butterfly units when compared to radix-4 approach while making a better utilization of complex multipliers than the typical radix-2 scheme. From the perspective of hardware design, radix- algorithm acts as an effective alternative to theconventional computation methods. Certainly, the extension of communication service has encouraged a dramatic rise of throughput requirements. The demand for high throughput can be achieved by using another upgraded structures [2]. These structures can be used to calculate the FFT when several samples of the same sequence are received in parallel. So Multipath Delay Commutator (MDC) [3] and Multipath Delay Feedback (MDF) [4] are proposed to improve the throughput rate. MDC and MDF work as the upgrade of SDC and SDF respectively. In general, multiple interconnected SDF paths are joined together to form MDF structure. Each SDF path is used for managing one of the parallel input streams. This design contributes to efficientutilizationofmemoryresourcesbutit has got only 50 % utilization of adders and provides only less throughput. By contrast, the hardware efficiency of arithmetic units (AUs) can be improved by using the MDC approach. But for either regrouping the samples or folding the streams, additional memories have tobeconsumed. This will again additionally leads to an increase of computing delay. The feedback structures such as SDF and MDF design afford possible solutions to make a balance between the consumption of hardware resources and the reachable performance [4]. As we move forward the discussion, the hardware resources can be further divided into two categories: arithmetic resources and memory resources. Arithmetic resources are associated with logical or arithmetic operations and memory resources are responsible for caching samples. The MDF scheme [4] has been a great success in a variety of applications due to the outstanding performance in the efficient use of memory resources. Beneath the triumph, the underutilization of arithmetic resources is still an important problem for feedback design and has not been determined satisfactorily.
  • 2. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056 Volume: 04 Issue: 06 | June-2017 www.irjet.net p-ISSN: 2395-0072 © 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 87 The objective of this project is to design a new area efficient mixed decimation MDF architecture which can achieve improved efficiency in utilization of arithmetic resources while maintaining the advantages of feedback structures. The theory of foldingtransformationis employed to derive the proposed scheme namely, the mixed decimation Multi path Delay Feedback ( DF)architecture. The integration of Decimation-In-Time(DIT)operationsinto the Decimation-In-Frequency(DIF)operatedbuildingblocks can activate the idle period of arithmetic units in MDF architectures. Thus significant decrease in the consumption of arithmetic resources can be achieved by DF architecture. 2. MIXED DECIMATION MDF SYSTEM DF architecture is derived from the theory of folding transformation. This will activates the idle period of arithmetic units in MDF architecture. The operations in SDF pipeline are rescheduled to reverse the underutilization of arithmetic modules whichisaccomplishedbyintegratingthe DIT operations into the DIF operated computing units. So there by better efficiency in the consumption of arithmetic resources [4]. 2.1 SDF DIF Scheme Using Folding Transformation The folding transformation offers a methodical procedure to derive many FFT architectures. In folding transformation, several algorithm operationsaretimemultiplexedonasingle computing device [5]. Thealgorithmcanbepresentedusinga data flow graph shown in Fig 1, where the nodes represent computations and the directed edges represent data paths. The data flow graph consists of four stages and eight operations are executed within each stage. When , i = 0, …, 15 arrives in serial, multiple operations in each stage can be time multiplexed to a single computing unit without any collisions and the control circuits are determined systematically by folding transformation.Inthisway,theFFT Flow graph shown in Fig 1 can be converted into a pipelined form as shown in Fig 2 , where operations A0, …, A7, B0, …, B7,C0, …, C7 and D0, …, D7 are performed in the computing unit , , and respectively [4]. The multiple operations included in a computing module are arranged by folding sets. A folding set is an ordered set of operations executed by the same computing unit. Each folding set contains many operations and some of which may be generally called as null operations, Ф. Fig -1: Data flow graph of 16 Point R 2^2 DIF FFT Fig -2: Data flow graph converted into a pipelined version through folding transformation The folding set corresponding to , , and can be written as : = {Ф Ф Ф Ф Ф Ф Ф Ф A0 A1 A2 A3 A4 A5 A6 A7} = {Ф Ф Ф Ф B0 B1 B2 B3 Ф Ф Ф Ф B4 B5 B6 B7} = {Ф Ф C0 C1 Ф Ф C2 C3 Ф Ф C4 C5 Ф Ф C6 C7 } = {Ф D0 Ф D1 Ф D2 Ф D3 Ф D4 Ф D5 Ф D6 Ф D7} When obtaining the folding sets associated with , , and , the supporting circuits used to fulfill the time multiplexing of computing units can be obtained through folding transformation [5]. By introducing the register minimization technique, the implementation scheme can be further optimized, which eventually generates the SDF hardware structure which is shown in Fig 3. So that hardware efficiency can be improved.
  • 3. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056 Volume: 04 Issue: 06 | June-2017 www.irjet.net p-ISSN: 2395-0072 © 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 88 Fig -3: 16 Point R 2^2 SDF DIF FFT structure Each stage in radix- SDF FFT [6] contains of butterfly I, butterfly II, complex multipliers with twiddle factors. Butterfly I operate on the input data, butterfly II operate on the output data from butterflyI,thenmultipliedwithtwiddle factors to get the result of the current stage. Shiftregisters or Random Access Memory (RAM) are used to store the output data from butterfly elements. Only one complexmultiplieris used, since, input data sequence followsa singleoutputpath. Butterfly 1 structure [6] is shown in Fig 4. The input for this butterfly comes from the previous component which is the twiddle factor multiplier. The output data from butterfly 1 goes to the next stage which is usually the butterfly II. The control signal C1 has two options. That is either C1=0 or C1=1. When C1=0, multiplexers direct the input data to the feedback registers until they get filled. The other option is C1=1, the multiplexers select the output of the adders and subtracters. Fig -4: Butterfly 1 structure Butterfly 2 structure is shown in Fig 3. The butterfly 2 structure [6] is same as that of butterfly 1. But in additional, there is a swap-mux provided for –j multiplication. The multiplication by –j contains swapping between real part and imaginary part and sign inversion. Swap-MUX is efficiently used for swapping real andimaginarypartand the sign inversion is handled by switching between the adding and the subtracting operations by mean of swap-MUX. The control signals C1 and C2 will be one when there is a need for multiplication by –j . Fig -5: Butterfly 2 structure 2.2 SDF DIT Scheme Using Folding Transformation The DIT algorithm can also be represented using a data flow graph shown in Fig 6. Similar processes as in DIF canbe applied to analyze DIT algorithm. As shown in the Fig 6, the flow graph contain four stages and eight operations are performed within each stage. When , i = 0, . . . ,15 arrives in serial, multiple operations in each stage can be time multiplexed to a singlecomputingunitwithoutanycollisions and the control circuits are determined systematically by folding transformation [5]. In this way, the FFT Flow graph shown in Fig 6 can be converted into a pipelined form through folding transformation as shown in Fig 7, where operations A0, . . . , A7, B0, . . . , B7,C0, . . . ,C7 and D0,……D7 are executed in the computing unit , , and respectively. The folding set corresponding to , , and can be written as : = {Ф A0 Ф A1 Ф A2 Ф A3 Ф A4 Ф A5 Ф A6 Ф A7} ={Ф Ф B0 B1 Ф Ф B2 B3 Ф Ф B4 B5 Ф Ф B6 B7 } = {Ф Ф Ф Ф C0 C1 C2 C3 Ф Ф Ф Ф C4 C5 C6 C7} ={Ф Ф Ф Ф Ф Ф Ф Ф D0 D1 D2 D3 D4 D5 D6 D7}
  • 4. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056 Volume: 04 Issue: 06 | June-2017 www.irjet.net p-ISSN: 2395-0072 © 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 89 Fig -6: Data flow graph of 16 Point R 2^2 DIT FFT Fig -7: Data flow graph converted into a pipelined version through folding transformation When obtaining the folding sets associated with , , and , the supporting circuits used to fulfill the time multiplexing of computing units can be obtained through folding transformation [5]. By using register minimization method implementation system can be further optimized which eventually generates the SDF hardware structure which is shown in Fig 8. There by hardware efficiencycanbe enhanced. The DIT SDF hardware structure is exactly the reverse of DIF hardware scheme. Fig -8: 16 Point R 2^2 SDF DIT FFT structure The folding sets conveys the fact that , whether the designers adopt the DIF approach or DIT scheme to construct the SDF circuit, the existence of null operations in folding sets will always reduce the efficiency of arithmetic components. This will lead to an approximate 50% utilization of complex adders merely. To address this issue, we rearrange the operations in SDF pipelines to activate the idle intervals of computing units [4]. From the SDF DIF and SDF DIT hardware structures wecanclearlyunderstandthat both DIF and DIT approach uses same hardware resources .But only difference is that DIT is the reverse of DIF scheme. So by integrating the DIT operations into DIF operated computing units, the standby time of arithmetic modules in feedback architectures can be eliminated. The integration of DIF and DIT pipelined processor to fully utilize the computing units is shown in Fig 9. Fig -9: Integration of DIF and DIT pipelined processor The SDF hardware structure can be upgraded by using another hardware structure called Type 1 [4] which is shown in Fig 11. The multiplexors andselectorsinthecircuit can be divided into two categories, white for category I and gray for category II .When arithmetic modulesareinservice, the components within each category share the identical logic signal. The control scheme, which is synchronizedwith the DIF computing stream is summarized as follows: For the first M samples of the stream, the components belonging to category I are controlled by logic 1, while others in category II are controlled by logic 0, where M is the length of shift register sets in the arithmetic module. During the next M samples, the control signals should be inverted.
  • 5. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056 Volume: 04 Issue: 06 | June-2017 www.irjet.net p-ISSN: 2395-0072 © 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 90 Fig -10: Type 1 structure Final system DF design can be constructed by using this Type 1 structure. Fig 12 shows the system design of two parallel DF design for 32 point FFT. During 32 point DFT processing, and execute DIF and DIT scheme respectively. represent first 16 input samples and represent next 16 input samples. In addition, as input samples required to be rearranged as bit reversal before participating in DIT computation. The DIT output samples are multiplied by necessary twiddle factors for 32 point FFT computation. Finally parallel streams are connected to simple radix 2 butterfly unit to get 32 output samples for FFT [4]. Fig -11: System design of two parallel DF architecture for 32 Point FFT 3. PROPOSED DF SYSTEM In any of FFT architectures, Butterfly unit is the important block to calculate data addition and substraction.Anadderis the main building block of butterfly element. The key requirement of adder is that the adder should be fast in operation and efficient in terms of chip area. So the performance of butterfly unit can be further enhanced by using high speed and an area efficient adder [7]. 3.1 DF Design by Using Han Carlson Adder The simplest method ofdoingbinaryadditionistoconnect the carry-out from the previous bit to the carry-inofthenext bit. Each bit takes carry-in as one of the inputsandgivessum and carry-out as output bit and therefore named as ripple carry adder. This type of adders is made by cascading 1 bit full adders. But ripple carry adder has high propagation delay. So DF system designed by usingripplecarryadder have only small performance [7]. The DF system design is modified by using highspeed and an area efficient Han Carlson adder. Han Carlson adder is a parallel prefix tree [8]. Parallel prefix adders are high performance adders. Han Carlson adder is shown in Fig 13 and it uses only even bits for performing carry-merge operations. Generate and propagate signals of odd bits are transferred down the prefix tree. The true carry bits are produced by the recombination of generate and propagate signals with even bit carry signals. So by using high speed and area efficient Han Carlson adder, system designcangive better performance. Fig -12: 16 Bit Han Carlson adder 4. RESULTS Simulation results of DF system are givenbelow.Coding was done using Verilog HDL (Hardware Description Language). The hardware simulationisdonein Xilinx Virtex- 6 Field-Programmable Gate Array (FPGA), XC6VLX240T- 3FF784 using the programming software ISE 14.2.
  • 6. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056 Volume: 04 Issue: 06 | June-2017 www.irjet.net p-ISSN: 2395-0072 © 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 91 4.1 DF System Fig 14 shows the simulation result of two parallel 32point R 2^2 DF FFT. a1 and a2 represent two 16 point parallel DIF and DIT input samples respectively. t_r and t_img are the 32 point output samples for given input samples and clock signal,clk. Fig -13: Simulation result of two parallel 32 point R 2^2 DF FFT 4.2 Proposed DF System DF architecture is modified by using highspeedand an area efficient Han Carlson adder. So the Proposed DF System is found to be area efficient. Fig 15 shows the simulation result of proposed two parallel 32 point R 2^2 DF FFT. The simulation result obtained is same for both system design and proposed scheme. Fig -14: Simulation result of two parallel 32 point R 2^2 DF FFT 4.3 Performance Analysis The performanceanalysiswasdoneonbothsystemdesign and proposed DF system design. The occupationofslices is evaluated for comparing hardware efficiency. Table 1 shows the design summary of system design and proposed DF system for 32 point FFT. Table -1: Device utilization summary of system design and proposed DF system Structures Slice LUTs Slice registers LUT FF pairs Total DF Architecture 1561 7722 1082 10365 Proposed DF Architecture 1527 7717 1078 10322 From the device utilization summary, we can understand that when DF system was modified byusingareaefficient Han Carlson adder, there is decrease in the utilization ofslice registers, slice LUTS and no of fully used LUT FF pairs than the DF system designed withripple carryadder.Thereby we obtained a new area efficient hardware for DF architecture. Chart 1 shows the graphical representation of above obtained test results in table 1. Chart -1: Device utilization summary of system design and proposed DF system 5. CONCLUSIONS A new area efficient DF architectureforradix parallel FFT has been designed. The inefficient use of adders and multipliers is an important problem in most of feedback architectures. By using this DF architecture, the standby time of arithmetic modules can be eliminated. The system
  • 7. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056 Volume: 04 Issue: 06 | June-2017 www.irjet.net p-ISSN: 2395-0072 © 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 92 was designed by using Verilog HDL in Xilinx ISE 14.2 Design Suite. The performance of adders can again be enhanced by using high speed adder. The DF architecture system was then modified by using high speed and an area efficient Han Carlson adder. Performance of both system design and proposed system was analyzed. Area and delay are taken as the parameters for performance analysis. Hardware is obtained to be efficient for proposed system by 0.4%. ACKNOWLEDGEMENT I would like to thank my project guide Prof. Ebin M Manuel for his constant support and timely advice throughout my work. He consistently steered me in the right direction especially during my research and writing which helped me to complete my work successfully. REFERENCES [1] A. Cortes, I. Velez, and J. F. Sevillano, “Radix r^k FFTs: Matrical representation and SDC/SDF pipeline implementation, ” IEEE Trans. Signal Process., vol. 57, no. 7, pp. 2824–2839, Jul. 2009. [2] C. Cheng and K. K. Parhi, “High-throughput VLSI architecture for FFT computation, ”IEEETrans.CircuitsSyst. II, Exp. Briefs, vol. 54, no. 10, pp. 863–867, Oct. 2007. [3] M. Garrido, J. Grajal, M. A. Sanchez, and O. Gustafsson, “Pipelined radix-2^k feedforward FFT architectures, ” IEEE Trans. Very Large Scale Integr. (VLSI) Syst., vol. 21, no. 1, pp. 23–32, Jan. 2013. [4] Jian Wang, Chunlin Xiong, Kangli Zhang and Jibo Wei, “A mixed decimation MDF architecture for radix 2^k parallel FFT, ” IEEE Trans.on very large scale integration, vol. 24, no. 1, pp. 414–426, Jan 2016. [5] M. Ayinala, M. Brown, and K. K. Parhi, “Pipelinedparallel FFT architectures via folding transformation, ” IEEE Trans. Very Large Scale Integr. (VLSI) Syst., vol. 20, no. 6, pp. 1068– 1081, Jun. 2012. [6] Yazan Samir Algnabi, Rozita Teymourzadeh, Masuri Othman, Md Shabiul Islam”FPGAImplementationofPipeline Multiplier-Less Radix 2^2 DIF SDF Butterfly for Fast Fourier Transform Structure”, MicroEngineering VLSI Design Department, June 2012 [7] Mallapu Santhosh Kumar1 and K.Dhanunjaya (Ph.d), “Comparison of Various 32-Bit Parallel Prefix Adders, “International Journal for Research in Applied Science & Engineering Technology (IJRASET), ISSN: 2321-9653, Volume 3, Issue VI, Jun. 2015. [8] Gayathri.G, Raju S.S and Suresh’s, “Parallel Prefix Speculative Han Carlson Adder “, IOSR Journal ofElectronics and Communication Engineering(IOSR-JECE),e-ISSN:2278- 2834,p-ISSN: 2278-8735, Volume 11, Issue 3, PP 38-43,May 2016.