SlideShare a Scribd company logo
ACEEE Int. J. on Information Technology, Vol. 01, No. 01, Mar 2011



                   FIR Filter Implementation by
            Systolization using DA-based Decomposition
                                            Shreeharsha K G, Rekha Bhandarkar
       Department of Electronics and Communication Engineering Alva’s Institute of Engineering and Technology,
                                        Mijar – 574 225, Moodbidri, INDIA
                                            shreeharshakg01@gmail.com
                                                  Rekha Bhandarkar
                              Department of Electronics and Communication Engineering
                                    NMAM Institute of Technology, Nitte – 574 110
                                                   Karkala, INDIA
                                          rekhabhandarkar@rediffmail.com

Abstract– In this paper we present 1D and 2D systolic                   architecture, array in particular, is replacing a single
Distributed Arithmetic (DA) based structures that are designed          Processing Element (PE) with an array of PEs or cells. Being
for the implementation of Finite Impulse Response (FIR) filters.        able to use each input data item a number of times (and thus
The paper compares the 1D DA based systolic structure with              achieving high computation throughput with only modest
1D systolic DA based decomposition method. The filters are              memory bandwidth) is one of the advantages of the systolic
implemented on a Xilinx Virtex II Pro (XC2VP30) FPGA using
                                                                        approach. They have several attractive features such as
HDL and system metrics like Area, Gate Count, Maximum
Usable Frequency and Power consumption are estimated for
                                                                        simplicity, regularity and modularity of structure [2]. In
different filter orders and address lengths. The 1D systolic            addition, they also possess significant potential to yield high-
decomposition structure is also compared with the existing              throughput rate by exploiting high-level of concurrency using
system generator implementation of DA FIR.. Results for an              pipelining or parallel processing or both.
exemplary implementation are presented.
                                                                                           II.DISTRIBUTED   ARITHMETIC
Keywords— Distributed arithmetic (DA), Field Programmable
Gate Arrays (FPGA), Finite Impulse Response (FIR) filter,                    Distributed Arithmetic (DA) is an efficient method for
systolic array.                                                         computing inner products when one of the input vectors is
                                                                        fixed[4]. It uses look-up tables and accumulators instead of
                         I.INTRODUCTION                                 multipliers for computing inner products.
                                                                            Let us consider the inner-product of two N-point vectors
    Finite Impulse Response (FIR) filters are one of the most
                                                                        A and B given by Eq. (1) as,
common components of Digital Signal Processing (DSP)
systems. FIR filtering is achieved by convolving the input
data samples with the desired unit response of the filter. Since
the complexity of implementation grows with the filter order
and the precision of computation, real-time realization of
these filters with desired level of accuracy is a challenging
task. Several attempts have, therefore, been made to develop
dedicated and reconfigurable architectures for realization of
FIR filters in Application Specific Integrated Circuits (ASIC)
and FPGA platforms. DA provides an approach for
multiplier-less implementation of FIR filters where the filter
coefficients are programmable. In other words, the same filter
structure can be used for a different set of coefficients.
    A systolic system consists of a set of interconnected cells,
each capable of performing some simple operation. Because
simple, regular communication and control structures have
substantial advantages over complicated ones in design and
implementation, cells in a systolic system are typically
interconnected to form a systolic array or a systolic tree.
Information in a systolic system flows between cells in a
pipelined fashion, and communication with the outside world
occurs only at the “boundary cells.” For example, in a systolic
array, only those cells on the array boundaries may be I/O
ports for the system[5]. The basic principle of a systolic

© 2011 ACEEE                                                       42
DOI: 01.IJIT.01.01.120
ACEEE Int. J. on Information Technology, Vol. 01, No. 01, Mar 2011


          where



for l = 0, 1, …. , L-1 and p = 0, 1, …. , P-1.
    The bit vector (bn)l,p in Eq. (4b) is used as address word
for the lookup table and F is the memory-read operation.

              III.1-D SYSTOLIC ARRAY FOR FIR FILTERS
    A linear array consisting of P number of PEs and an output
cell is shown in Fig. 1 and the function of the PEs is described
in Fig. 1(b)[3].




                                                                                            From the data presented in Table 1 and from the
                                                                                   Figures 2 to 5 it can be seen that for a given filter order N,
                                                                                   the case for M = 4 yields the most area-time efficient archi-
                                                                                   tecture when compared to the case for M = 2 and 8. This can
                                                                                   be explained by the fact that the increase in control logic
                                                                                   and number of delay elements outweighs the gains made by
                                                                                   reduction of LUT size for M = 2, while for M = 8, the memory
                                                                                   requirement of LUTs is too high [1].Frequency is also maxi-
                                                                                   mum for lower orders. Power consumption is the lowest.




Figure. 1. The 1-D array for DA-based implementation of FIR filter: (a)
Linear systolic array; (b) function of PE; and (c) function of output cell.
                       delta stands for a unit delay.

    The input sequence {x(n)} is fed to a serial-in parallel-
out input register where content of the register is serially
right shifted by one position and transferred in parallel to
the bit-serial word-parallel converter in every L cycles. The
function of the output cell is shown in Fig. 1(c). After L
cycles, it delivers a desired filter output. The structure will
yield its first filter output (L+P) cycles after the first input is
fed to the first PE, while the successive output becomes
available in every L cycles.

                           IV.IMPLEMENTATION
  This section is concerned with the description of the
implementation of the FIR filter based on conventional and
systolic decomposition of DA-based computation.




© 2011 ACEEE                                                                  43
DOI: 01.IJIT.01.01.120
ACEEE Int. J. on Information Technology, Vol. 01, No. 01, Mar 2011

                                                                                     The above shown Table 2 is the comparison results of
                                                                                 filters of different orders for 1-D systolic conventional
                                                                                 method and 1-D systolic decomposition method. The
                                                                                 decomposition method is better in all metrics for all values
                                                                                 of N, as seen from the graphs shown in Figure 6 to 9. The
                                                                                 synthesis tool used is Xilinx ISE 9.2i. The simulation tool
                                                                                 used is ModelSim XE 6.2c. The target device selected is
                                                                                 Virtex II Pro (XC2VP30).




       Figure 2. Plot of variation of Area with filter order for 1-D
                   Decomposition method for L = 8.




                                                                                 Figure 6. Plot of variation of Area with filter order for 1-D Conventional
                                                                                            method and 1-D Decomposition method for L = 8.



    Figure 3. Plot of variation of Frequency with filter order for 1-D
                   Decomposition method for L = 8.




                                                                                 Figure 7. Plot of variation of Power Consumption with filter order for 1-
                                                                                   D Conventional method and 1-D Decomposition method for L = 8.

Figure 4. Plot of variation of Power Consumption with filter order for 1-
                   D Decomposition method for L = 8.




   Figure 5. Plot of variation of Gate Count with filter order for 1-D              Figure 8. Plot of variation of Gate Count with filter order for 1-D
                   Decomposition method for L = 8.                                  Conventional method and 1-D Decomposition method for L = 8.




© 2011 ACEEE                                                                44
DOI: 01.IJIT.01.01.120
ACEEE Int. J. on Information Technology, Vol. 01, No. 01, Mar 2011




                                                                                 Figure 11. Plot of variation of Power Consumption with filter order for
                                                                                 existing system generator block and 1-D Decomposition method for L
                                                                                                                   = 8.
   Figure 9. Plot of variation of Frequency with filter order for 1-D
   Conventional method and 1-D Decomposition method for L = 8.




                                                                                   Figure 12. Hardware co-simulation of 1-D DA-based Decomposition
                                                                                                          method for L = 8.

                                                                                                          CONCLUSION
                                                                                     The project presents hardware-efficient designs for
                                                                                 computation of finite digital convolution by address
                                                                                 decomposition of DA-based inner-product computation. The
                                                                                 advantages of DA kind of implementation are its high usable
                                                                                 frequency and minimum gate count. The main advantage is
                                                                                 it overcomes the usage of multipliers. This method uses
                                                                                 adders, LUTs and shift registers.The systolic decomposition
The address length M is taken to be four for the proposed implementation.        scheme is found to offer a flexible choice of the address
          From the above shown Table 3 it is clear that the 1-                   length of the lookup tables (LUT) for DA-based computation.
D systolic decomposition method significantly outperforms                        The 1-D systolic array provides reduction in ROM size and
the existing implementations in terms of two important key                       the number of adders by several orders of magnitude
metrics, namely the frequency and power consumption for                          compared to the conventional method.
all the values of N.
                                                                                                           REFERENCES
                                                                                 [1] P. K. Meher, Shrutisagar Chandrasekaran and Abbes Amira,
                                                                                     “FPGA realization of FIR filters by efficient and flexible
                                                                                     systolization using distributed arithmetic”, IEEE Trans. signal
                                                                                     process., vol. 56, no. 7, pp. 3009-3017, July 2008.
                                                                                 [2] B.K.Mohanty, P.K.meher “Cost effective novel flexible cell-
                                                                                     level systolic architecture for high throughput implementation
                                                                                     of 2D FIR filters” IEE 1996. Technical note.
                                                                                 [3] P. K. Meher, “Hardware-efficient systolization of DA-based
                                                                                     calculation of finite digital convolution”, IEEE Trans. Circuits
                                                                                     Syst. II, Exp. Briefs, vol. 53, no. 8, pp. 707–711, Aug. 2006.
                                                                                 [4] S. A. White, “Applications of the distributed arithmetic to digital
                                                                                     signal processing: A tutorial review”, IEEE ASSP Mag., vol.
                                                                                     6, no. 3, pp. 5–19, Jul. 1989.
                                                                                 [5] H. T. Kung, “Why systolic architectures?”, IEEE Computer,
 Figure 10. Plot of variation of Frequency with filter order for existing
                                                                                     vol. 15, no. 1, pp. 37–45, Jan. 1982.
  system generator block and 1-D Decomposition method for L = 8.                 [6] R. Wyrzykowski and S. Ovramenko, “Flexible systolic
                                                                                     architecture for VLSI FIR filters”, IEE Proceedings-E,
                                                                                     vol.139,no.2,March1992

© 2011 ACEEE                                                                45
DOI: 01.IJIT.01.01.120
Ad

Recommended

“FIELD PROGRAMMABLE DSP ARRAYS” - A NOVEL RECONFIGURABLE ARCHITECTURE FOR EFF...
“FIELD PROGRAMMABLE DSP ARRAYS” - A NOVEL RECONFIGURABLE ARCHITECTURE FOR EFF...
sipij
 
Dynamic Texture Coding using Modified Haar Wavelet with CUDA
Dynamic Texture Coding using Modified Haar Wavelet with CUDA
IJERA Editor
 
A High Performance Modified SPIHT for Scalable Image Compression
A High Performance Modified SPIHT for Scalable Image Compression
CSCJournals
 
High Speed and Area Efficient 2D DWT Processor Based Image Compression
High Speed and Area Efficient 2D DWT Processor Based Image Compression
sipij
 
IRJET- Implementation of Reversible Radix-2 FFT VLSI Architecture using P...
IRJET- Implementation of Reversible Radix-2 FFT VLSI Architecture using P...
IRJET Journal
 
IRJET - Distributed Arithmetic Method for Complex Multiplication
IRJET - Distributed Arithmetic Method for Complex Multiplication
IRJET Journal
 
IRJET- Efficient Shift add Implementation of Fir Filter using Variable Pa...
IRJET- Efficient Shift add Implementation of Fir Filter using Variable Pa...
IRJET Journal
 
AN EFFICIENT DSP ARCHITECTURE DESIGN IN FPGA USING LOOP BACK ALGORITHM
AN EFFICIENT DSP ARCHITECTURE DESIGN IN FPGA USING LOOP BACK ALGORITHM
IJARIDEA Journal
 
IRJET- VLSI Architecture for Reversible Radix-2 FFT Algorithm using Programma...
IRJET- VLSI Architecture for Reversible Radix-2 FFT Algorithm using Programma...
IRJET Journal
 
A Novel Approach of Area-Efficient FIR Filter Design Using Distributed Arithm...
A Novel Approach of Area-Efficient FIR Filter Design Using Distributed Arithm...
IOSR Journals
 
Optimization of Cmos 0.18 µM Low Noise Amplifier Using Nsga-Ii for UWB Applic...
Optimization of Cmos 0.18 µM Low Noise Amplifier Using Nsga-Ii for UWB Applic...
VLSICS Design
 
Noise Immune and Area Optimized Serial Interface for FPGA based Industrial In...
Noise Immune and Area Optimized Serial Interface for FPGA based Industrial In...
International Journal of Science and Research (IJSR)
 
Determining the Efficient Subband Coefficients of Biorthogonal Wavelet for Gr...
Determining the Efficient Subband Coefficients of Biorthogonal Wavelet for Gr...
CSCJournals
 
Efficient video compression using EZWT
Efficient video compression using EZWT
IJERA Editor
 
Implementation Of Grigoryan FFT For Its Performance Case Study Over Cooley-Tu...
Implementation Of Grigoryan FFT For Its Performance Case Study Over Cooley-Tu...
ijma
 
Bz25454457
Bz25454457
IJERA Editor
 
Real-time DSP Implementation of Audio Crosstalk Cancellation using Mixed Unif...
Real-time DSP Implementation of Audio Crosstalk Cancellation using Mixed Unif...
CSCJournals
 
Design of Processing Element (PE3) for Implementing Pipeline FFT Processor
Design of Processing Element (PE3) for Implementing Pipeline FFT Processor
ijcisjournal
 
Optimization of latency of temporal key Integrity protocol (tkip) using graph...
Optimization of latency of temporal key Integrity protocol (tkip) using graph...
ijcseit
 
Gf3511031106
Gf3511031106
IJERA Editor
 
Qualitative Analysis of Optical Interleave Division Multiple Access using Spe...
Qualitative Analysis of Optical Interleave Division Multiple Access using Spe...
IRJET Journal
 
145 153
145 153
Editor IJARCET
 
Design and implementation of different audio restoration techniques for audio...
Design and implementation of different audio restoration techniques for audio...
eSAT Journals
 
B046050711
B046050711
IJERA Editor
 
The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)
theijes
 
International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)
ijceronline
 
40120140505005
40120140505005
IAEME Publication
 
FPGA based Efficient Interpolator design using DALUT Algorithm
FPGA based Efficient Interpolator design using DALUT Algorithm
cscpconf
 
FPGA based Efficient Interpolator design using DALUT Algorithm
FPGA based Efficient Interpolator design using DALUT Algorithm
cscpconf
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
IJERD Editor
 

More Related Content

What's hot (19)

IRJET- VLSI Architecture for Reversible Radix-2 FFT Algorithm using Programma...
IRJET- VLSI Architecture for Reversible Radix-2 FFT Algorithm using Programma...
IRJET Journal
 
A Novel Approach of Area-Efficient FIR Filter Design Using Distributed Arithm...
A Novel Approach of Area-Efficient FIR Filter Design Using Distributed Arithm...
IOSR Journals
 
Optimization of Cmos 0.18 µM Low Noise Amplifier Using Nsga-Ii for UWB Applic...
Optimization of Cmos 0.18 µM Low Noise Amplifier Using Nsga-Ii for UWB Applic...
VLSICS Design
 
Noise Immune and Area Optimized Serial Interface for FPGA based Industrial In...
Noise Immune and Area Optimized Serial Interface for FPGA based Industrial In...
International Journal of Science and Research (IJSR)
 
Determining the Efficient Subband Coefficients of Biorthogonal Wavelet for Gr...
Determining the Efficient Subband Coefficients of Biorthogonal Wavelet for Gr...
CSCJournals
 
Efficient video compression using EZWT
Efficient video compression using EZWT
IJERA Editor
 
Implementation Of Grigoryan FFT For Its Performance Case Study Over Cooley-Tu...
Implementation Of Grigoryan FFT For Its Performance Case Study Over Cooley-Tu...
ijma
 
Bz25454457
Bz25454457
IJERA Editor
 
Real-time DSP Implementation of Audio Crosstalk Cancellation using Mixed Unif...
Real-time DSP Implementation of Audio Crosstalk Cancellation using Mixed Unif...
CSCJournals
 
Design of Processing Element (PE3) for Implementing Pipeline FFT Processor
Design of Processing Element (PE3) for Implementing Pipeline FFT Processor
ijcisjournal
 
Optimization of latency of temporal key Integrity protocol (tkip) using graph...
Optimization of latency of temporal key Integrity protocol (tkip) using graph...
ijcseit
 
Gf3511031106
Gf3511031106
IJERA Editor
 
Qualitative Analysis of Optical Interleave Division Multiple Access using Spe...
Qualitative Analysis of Optical Interleave Division Multiple Access using Spe...
IRJET Journal
 
145 153
145 153
Editor IJARCET
 
Design and implementation of different audio restoration techniques for audio...
Design and implementation of different audio restoration techniques for audio...
eSAT Journals
 
B046050711
B046050711
IJERA Editor
 
The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)
theijes
 
International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)
ijceronline
 
40120140505005
40120140505005
IAEME Publication
 
IRJET- VLSI Architecture for Reversible Radix-2 FFT Algorithm using Programma...
IRJET- VLSI Architecture for Reversible Radix-2 FFT Algorithm using Programma...
IRJET Journal
 
A Novel Approach of Area-Efficient FIR Filter Design Using Distributed Arithm...
A Novel Approach of Area-Efficient FIR Filter Design Using Distributed Arithm...
IOSR Journals
 
Optimization of Cmos 0.18 µM Low Noise Amplifier Using Nsga-Ii for UWB Applic...
Optimization of Cmos 0.18 µM Low Noise Amplifier Using Nsga-Ii for UWB Applic...
VLSICS Design
 
Determining the Efficient Subband Coefficients of Biorthogonal Wavelet for Gr...
Determining the Efficient Subband Coefficients of Biorthogonal Wavelet for Gr...
CSCJournals
 
Efficient video compression using EZWT
Efficient video compression using EZWT
IJERA Editor
 
Implementation Of Grigoryan FFT For Its Performance Case Study Over Cooley-Tu...
Implementation Of Grigoryan FFT For Its Performance Case Study Over Cooley-Tu...
ijma
 
Real-time DSP Implementation of Audio Crosstalk Cancellation using Mixed Unif...
Real-time DSP Implementation of Audio Crosstalk Cancellation using Mixed Unif...
CSCJournals
 
Design of Processing Element (PE3) for Implementing Pipeline FFT Processor
Design of Processing Element (PE3) for Implementing Pipeline FFT Processor
ijcisjournal
 
Optimization of latency of temporal key Integrity protocol (tkip) using graph...
Optimization of latency of temporal key Integrity protocol (tkip) using graph...
ijcseit
 
Qualitative Analysis of Optical Interleave Division Multiple Access using Spe...
Qualitative Analysis of Optical Interleave Division Multiple Access using Spe...
IRJET Journal
 
Design and implementation of different audio restoration techniques for audio...
Design and implementation of different audio restoration techniques for audio...
eSAT Journals
 
The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)
theijes
 
International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)
ijceronline
 

Similar to FIR Filter Implementation by Systolization using DA-based Decomposition (20)

FPGA based Efficient Interpolator design using DALUT Algorithm
FPGA based Efficient Interpolator design using DALUT Algorithm
cscpconf
 
FPGA based Efficient Interpolator design using DALUT Algorithm
FPGA based Efficient Interpolator design using DALUT Algorithm
cscpconf
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
IJERD Editor
 
FPGA Implementation of High Speed FIR Filters and less power consumption stru...
FPGA Implementation of High Speed FIR Filters and less power consumption stru...
International Journal of Engineering Inventions www.ijeijournal.com
 
Analysis of different FIR Filter Design Method in terms of Resource Utilizati...
Analysis of different FIR Filter Design Method in terms of Resource Utilizati...
ijsrd.com
 
A Novel Architecture for Different DSP Applications Using Field Programmable ...
A Novel Architecture for Different DSP Applications Using Field Programmable ...
journal ijme
 
Design and Implementation of an Embedded System for Software Defined Radio
Design and Implementation of an Embedded System for Software Defined Radio
IJECEIAES
 
INDUSTRIAL TRAINING REPORT
INDUSTRIAL TRAINING REPORT
ABHISHEK DABRAL
 
IRJET- Review Paper on Radix-2 DIT Fast Fourier Transform using Reversible Gate
IRJET- Review Paper on Radix-2 DIT Fast Fourier Transform using Reversible Gate
IRJET Journal
 
Z4301132136
Z4301132136
IJERA Editor
 
Analysis of Microstrip Finger on Bandwidth of Interdigital Band Pass Filter u...
Analysis of Microstrip Finger on Bandwidth of Interdigital Band Pass Filter u...
IJREST
 
Hv2514131415
Hv2514131415
IJERA Editor
 
Hv2514131415
Hv2514131415
IJERA Editor
 
Design of Multiplier Less 32 Tap FIR Filter using VHDL
Design of Multiplier Less 32 Tap FIR Filter using VHDL
IJMER
 
241 250
241 250
Editor IJARCET
 
FPGA Based Design of High Performance Decimator using DALUT Algorithm
FPGA Based Design of High Performance Decimator using DALUT Algorithm
IDES Editor
 
J010325764
J010325764
IOSR Journals
 
An fpga implementation of the lms adaptive filter
An fpga implementation of the lms adaptive filter
eSAT Journals
 
An fpga implementation of the lms adaptive filter
An fpga implementation of the lms adaptive filter
eSAT Publishing House
 
International Journal of Computational Engineering Research (IJCER)
International Journal of Computational Engineering Research (IJCER)
ijceronline
 
FPGA based Efficient Interpolator design using DALUT Algorithm
FPGA based Efficient Interpolator design using DALUT Algorithm
cscpconf
 
FPGA based Efficient Interpolator design using DALUT Algorithm
FPGA based Efficient Interpolator design using DALUT Algorithm
cscpconf
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
IJERD Editor
 
Analysis of different FIR Filter Design Method in terms of Resource Utilizati...
Analysis of different FIR Filter Design Method in terms of Resource Utilizati...
ijsrd.com
 
A Novel Architecture for Different DSP Applications Using Field Programmable ...
A Novel Architecture for Different DSP Applications Using Field Programmable ...
journal ijme
 
Design and Implementation of an Embedded System for Software Defined Radio
Design and Implementation of an Embedded System for Software Defined Radio
IJECEIAES
 
INDUSTRIAL TRAINING REPORT
INDUSTRIAL TRAINING REPORT
ABHISHEK DABRAL
 
IRJET- Review Paper on Radix-2 DIT Fast Fourier Transform using Reversible Gate
IRJET- Review Paper on Radix-2 DIT Fast Fourier Transform using Reversible Gate
IRJET Journal
 
Analysis of Microstrip Finger on Bandwidth of Interdigital Band Pass Filter u...
Analysis of Microstrip Finger on Bandwidth of Interdigital Band Pass Filter u...
IJREST
 
Design of Multiplier Less 32 Tap FIR Filter using VHDL
Design of Multiplier Less 32 Tap FIR Filter using VHDL
IJMER
 
FPGA Based Design of High Performance Decimator using DALUT Algorithm
FPGA Based Design of High Performance Decimator using DALUT Algorithm
IDES Editor
 
An fpga implementation of the lms adaptive filter
An fpga implementation of the lms adaptive filter
eSAT Journals
 
An fpga implementation of the lms adaptive filter
An fpga implementation of the lms adaptive filter
eSAT Publishing House
 
International Journal of Computational Engineering Research (IJCER)
International Journal of Computational Engineering Research (IJCER)
ijceronline
 
Ad

More from IDES Editor (20)

Power System State Estimation - A Review
Power System State Estimation - A Review
IDES Editor
 
Artificial Intelligence Technique based Reactive Power Planning Incorporating...
Artificial Intelligence Technique based Reactive Power Planning Incorporating...
IDES Editor
 
Design and Performance Analysis of Genetic based PID-PSS with SVC in a Multi-...
Design and Performance Analysis of Genetic based PID-PSS with SVC in a Multi-...
IDES Editor
 
Optimal Placement of DG for Loss Reduction and Voltage Sag Mitigation in Radi...
Optimal Placement of DG for Loss Reduction and Voltage Sag Mitigation in Radi...
IDES Editor
 
Line Losses in the 14-Bus Power System Network using UPFC
Line Losses in the 14-Bus Power System Network using UPFC
IDES Editor
 
Study of Structural Behaviour of Gravity Dam with Various Features of Gallery...
Study of Structural Behaviour of Gravity Dam with Various Features of Gallery...
IDES Editor
 
Assessing Uncertainty of Pushover Analysis to Geometric Modeling
Assessing Uncertainty of Pushover Analysis to Geometric Modeling
IDES Editor
 
Secure Multi-Party Negotiation: An Analysis for Electronic Payments in Mobile...
Secure Multi-Party Negotiation: An Analysis for Electronic Payments in Mobile...
IDES Editor
 
Selfish Node Isolation & Incentivation using Progressive Thresholds
Selfish Node Isolation & Incentivation using Progressive Thresholds
IDES Editor
 
Various OSI Layer Attacks and Countermeasure to Enhance the Performance of WS...
Various OSI Layer Attacks and Countermeasure to Enhance the Performance of WS...
IDES Editor
 
Responsive Parameter based an AntiWorm Approach to Prevent Wormhole Attack in...
Responsive Parameter based an AntiWorm Approach to Prevent Wormhole Attack in...
IDES Editor
 
Cloud Security and Data Integrity with Client Accountability Framework
Cloud Security and Data Integrity with Client Accountability Framework
IDES Editor
 
Genetic Algorithm based Layered Detection and Defense of HTTP Botnet
Genetic Algorithm based Layered Detection and Defense of HTTP Botnet
IDES Editor
 
Enhancing Data Storage Security in Cloud Computing Through Steganography
Enhancing Data Storage Security in Cloud Computing Through Steganography
IDES Editor
 
Low Energy Routing for WSN’s
Low Energy Routing for WSN’s
IDES Editor
 
Permutation of Pixels within the Shares of Visual Cryptography using KBRP for...
Permutation of Pixels within the Shares of Visual Cryptography using KBRP for...
IDES Editor
 
Rotman Lens Performance Analysis
Rotman Lens Performance Analysis
IDES Editor
 
Band Clustering for the Lossless Compression of AVIRIS Hyperspectral Images
Band Clustering for the Lossless Compression of AVIRIS Hyperspectral Images
IDES Editor
 
Microelectronic Circuit Analogous to Hydrogen Bonding Network in Active Site ...
Microelectronic Circuit Analogous to Hydrogen Bonding Network in Active Site ...
IDES Editor
 
Texture Unit based Monocular Real-world Scene Classification using SOM and KN...
Texture Unit based Monocular Real-world Scene Classification using SOM and KN...
IDES Editor
 
Power System State Estimation - A Review
Power System State Estimation - A Review
IDES Editor
 
Artificial Intelligence Technique based Reactive Power Planning Incorporating...
Artificial Intelligence Technique based Reactive Power Planning Incorporating...
IDES Editor
 
Design and Performance Analysis of Genetic based PID-PSS with SVC in a Multi-...
Design and Performance Analysis of Genetic based PID-PSS with SVC in a Multi-...
IDES Editor
 
Optimal Placement of DG for Loss Reduction and Voltage Sag Mitigation in Radi...
Optimal Placement of DG for Loss Reduction and Voltage Sag Mitigation in Radi...
IDES Editor
 
Line Losses in the 14-Bus Power System Network using UPFC
Line Losses in the 14-Bus Power System Network using UPFC
IDES Editor
 
Study of Structural Behaviour of Gravity Dam with Various Features of Gallery...
Study of Structural Behaviour of Gravity Dam with Various Features of Gallery...
IDES Editor
 
Assessing Uncertainty of Pushover Analysis to Geometric Modeling
Assessing Uncertainty of Pushover Analysis to Geometric Modeling
IDES Editor
 
Secure Multi-Party Negotiation: An Analysis for Electronic Payments in Mobile...
Secure Multi-Party Negotiation: An Analysis for Electronic Payments in Mobile...
IDES Editor
 
Selfish Node Isolation & Incentivation using Progressive Thresholds
Selfish Node Isolation & Incentivation using Progressive Thresholds
IDES Editor
 
Various OSI Layer Attacks and Countermeasure to Enhance the Performance of WS...
Various OSI Layer Attacks and Countermeasure to Enhance the Performance of WS...
IDES Editor
 
Responsive Parameter based an AntiWorm Approach to Prevent Wormhole Attack in...
Responsive Parameter based an AntiWorm Approach to Prevent Wormhole Attack in...
IDES Editor
 
Cloud Security and Data Integrity with Client Accountability Framework
Cloud Security and Data Integrity with Client Accountability Framework
IDES Editor
 
Genetic Algorithm based Layered Detection and Defense of HTTP Botnet
Genetic Algorithm based Layered Detection and Defense of HTTP Botnet
IDES Editor
 
Enhancing Data Storage Security in Cloud Computing Through Steganography
Enhancing Data Storage Security in Cloud Computing Through Steganography
IDES Editor
 
Low Energy Routing for WSN’s
Low Energy Routing for WSN’s
IDES Editor
 
Permutation of Pixels within the Shares of Visual Cryptography using KBRP for...
Permutation of Pixels within the Shares of Visual Cryptography using KBRP for...
IDES Editor
 
Rotman Lens Performance Analysis
Rotman Lens Performance Analysis
IDES Editor
 
Band Clustering for the Lossless Compression of AVIRIS Hyperspectral Images
Band Clustering for the Lossless Compression of AVIRIS Hyperspectral Images
IDES Editor
 
Microelectronic Circuit Analogous to Hydrogen Bonding Network in Active Site ...
Microelectronic Circuit Analogous to Hydrogen Bonding Network in Active Site ...
IDES Editor
 
Texture Unit based Monocular Real-world Scene Classification using SOM and KN...
Texture Unit based Monocular Real-world Scene Classification using SOM and KN...
IDES Editor
 
Ad

Recently uploaded (20)

10 Key Challenges for AI within the EU Data Protection Framework.pdf
10 Key Challenges for AI within the EU Data Protection Framework.pdf
Priyanka Aash
 
A Constitutional Quagmire - Ethical Minefields of AI, Cyber, and Privacy.pdf
A Constitutional Quagmire - Ethical Minefields of AI, Cyber, and Privacy.pdf
Priyanka Aash
 
Quantum AI Discoveries: Fractal Patterns Consciousness and Cyclical Universes
Quantum AI Discoveries: Fractal Patterns Consciousness and Cyclical Universes
Saikat Basu
 
Curietech AI in action - Accelerate MuleSoft development
Curietech AI in action - Accelerate MuleSoft development
shyamraj55
 
ReSTIR [DI]: Spatiotemporal reservoir resampling for real-time ray tracing ...
ReSTIR [DI]: Spatiotemporal reservoir resampling for real-time ray tracing ...
revolcs10
 
Cracking the Code - Unveiling Synergies Between Open Source Security and AI.pdf
Cracking the Code - Unveiling Synergies Between Open Source Security and AI.pdf
Priyanka Aash
 
AI vs Human Writing: Can You Tell the Difference?
AI vs Human Writing: Can You Tell the Difference?
Shashi Sathyanarayana, Ph.D
 
You are not excused! How to avoid security blind spots on the way to production
You are not excused! How to avoid security blind spots on the way to production
Michele Leroux Bustamante
 
Oh, the Possibilities - Balancing Innovation and Risk with Generative AI.pdf
Oh, the Possibilities - Balancing Innovation and Risk with Generative AI.pdf
Priyanka Aash
 
Connecting Data and Intelligence: The Role of FME in Machine Learning
Connecting Data and Intelligence: The Role of FME in Machine Learning
Safe Software
 
Database Benchmarking for Performance Masterclass: Session 2 - Data Modeling ...
Database Benchmarking for Performance Masterclass: Session 2 - Data Modeling ...
ScyllaDB
 
UserCon Belgium: Honey, VMware increased my bill
UserCon Belgium: Honey, VMware increased my bill
stijn40
 
EIS-Webinar-Engineering-Retail-Infrastructure-06-16-2025.pdf
EIS-Webinar-Engineering-Retail-Infrastructure-06-16-2025.pdf
Earley Information Science
 
Cyber Defense Matrix Workshop - RSA Conference
Cyber Defense Matrix Workshop - RSA Conference
Priyanka Aash
 
Salesforce Summer '25 Release Frenchgathering.pptx.pdf
Salesforce Summer '25 Release Frenchgathering.pptx.pdf
yosra Saidani
 
Using the SQLExecutor for Data Quality Management: aka One man's love for the...
Using the SQLExecutor for Data Quality Management: aka One man's love for the...
Safe Software
 
9-1-1 Addressing: End-to-End Automation Using FME
9-1-1 Addressing: End-to-End Automation Using FME
Safe Software
 
GenAI Opportunities and Challenges - Where 370 Enterprises Are Focusing Now.pdf
GenAI Opportunities and Challenges - Where 370 Enterprises Are Focusing Now.pdf
Priyanka Aash
 
Tech-ASan: Two-stage check for Address Sanitizer - Yixuan Cao.pdf
Tech-ASan: Two-stage check for Address Sanitizer - Yixuan Cao.pdf
caoyixuan2019
 
Lessons Learned from Developing Secure AI Workflows.pdf
Lessons Learned from Developing Secure AI Workflows.pdf
Priyanka Aash
 
10 Key Challenges for AI within the EU Data Protection Framework.pdf
10 Key Challenges for AI within the EU Data Protection Framework.pdf
Priyanka Aash
 
A Constitutional Quagmire - Ethical Minefields of AI, Cyber, and Privacy.pdf
A Constitutional Quagmire - Ethical Minefields of AI, Cyber, and Privacy.pdf
Priyanka Aash
 
Quantum AI Discoveries: Fractal Patterns Consciousness and Cyclical Universes
Quantum AI Discoveries: Fractal Patterns Consciousness and Cyclical Universes
Saikat Basu
 
Curietech AI in action - Accelerate MuleSoft development
Curietech AI in action - Accelerate MuleSoft development
shyamraj55
 
ReSTIR [DI]: Spatiotemporal reservoir resampling for real-time ray tracing ...
ReSTIR [DI]: Spatiotemporal reservoir resampling for real-time ray tracing ...
revolcs10
 
Cracking the Code - Unveiling Synergies Between Open Source Security and AI.pdf
Cracking the Code - Unveiling Synergies Between Open Source Security and AI.pdf
Priyanka Aash
 
AI vs Human Writing: Can You Tell the Difference?
AI vs Human Writing: Can You Tell the Difference?
Shashi Sathyanarayana, Ph.D
 
You are not excused! How to avoid security blind spots on the way to production
You are not excused! How to avoid security blind spots on the way to production
Michele Leroux Bustamante
 
Oh, the Possibilities - Balancing Innovation and Risk with Generative AI.pdf
Oh, the Possibilities - Balancing Innovation and Risk with Generative AI.pdf
Priyanka Aash
 
Connecting Data and Intelligence: The Role of FME in Machine Learning
Connecting Data and Intelligence: The Role of FME in Machine Learning
Safe Software
 
Database Benchmarking for Performance Masterclass: Session 2 - Data Modeling ...
Database Benchmarking for Performance Masterclass: Session 2 - Data Modeling ...
ScyllaDB
 
UserCon Belgium: Honey, VMware increased my bill
UserCon Belgium: Honey, VMware increased my bill
stijn40
 
EIS-Webinar-Engineering-Retail-Infrastructure-06-16-2025.pdf
EIS-Webinar-Engineering-Retail-Infrastructure-06-16-2025.pdf
Earley Information Science
 
Cyber Defense Matrix Workshop - RSA Conference
Cyber Defense Matrix Workshop - RSA Conference
Priyanka Aash
 
Salesforce Summer '25 Release Frenchgathering.pptx.pdf
Salesforce Summer '25 Release Frenchgathering.pptx.pdf
yosra Saidani
 
Using the SQLExecutor for Data Quality Management: aka One man's love for the...
Using the SQLExecutor for Data Quality Management: aka One man's love for the...
Safe Software
 
9-1-1 Addressing: End-to-End Automation Using FME
9-1-1 Addressing: End-to-End Automation Using FME
Safe Software
 
GenAI Opportunities and Challenges - Where 370 Enterprises Are Focusing Now.pdf
GenAI Opportunities and Challenges - Where 370 Enterprises Are Focusing Now.pdf
Priyanka Aash
 
Tech-ASan: Two-stage check for Address Sanitizer - Yixuan Cao.pdf
Tech-ASan: Two-stage check for Address Sanitizer - Yixuan Cao.pdf
caoyixuan2019
 
Lessons Learned from Developing Secure AI Workflows.pdf
Lessons Learned from Developing Secure AI Workflows.pdf
Priyanka Aash
 

FIR Filter Implementation by Systolization using DA-based Decomposition

  • 1. ACEEE Int. J. on Information Technology, Vol. 01, No. 01, Mar 2011 FIR Filter Implementation by Systolization using DA-based Decomposition Shreeharsha K G, Rekha Bhandarkar Department of Electronics and Communication Engineering Alva’s Institute of Engineering and Technology, Mijar – 574 225, Moodbidri, INDIA [email protected] Rekha Bhandarkar Department of Electronics and Communication Engineering NMAM Institute of Technology, Nitte – 574 110 Karkala, INDIA [email protected] Abstract– In this paper we present 1D and 2D systolic architecture, array in particular, is replacing a single Distributed Arithmetic (DA) based structures that are designed Processing Element (PE) with an array of PEs or cells. Being for the implementation of Finite Impulse Response (FIR) filters. able to use each input data item a number of times (and thus The paper compares the 1D DA based systolic structure with achieving high computation throughput with only modest 1D systolic DA based decomposition method. The filters are memory bandwidth) is one of the advantages of the systolic implemented on a Xilinx Virtex II Pro (XC2VP30) FPGA using approach. They have several attractive features such as HDL and system metrics like Area, Gate Count, Maximum Usable Frequency and Power consumption are estimated for simplicity, regularity and modularity of structure [2]. In different filter orders and address lengths. The 1D systolic addition, they also possess significant potential to yield high- decomposition structure is also compared with the existing throughput rate by exploiting high-level of concurrency using system generator implementation of DA FIR.. Results for an pipelining or parallel processing or both. exemplary implementation are presented. II.DISTRIBUTED ARITHMETIC Keywords— Distributed arithmetic (DA), Field Programmable Gate Arrays (FPGA), Finite Impulse Response (FIR) filter, Distributed Arithmetic (DA) is an efficient method for systolic array. computing inner products when one of the input vectors is fixed[4]. It uses look-up tables and accumulators instead of I.INTRODUCTION multipliers for computing inner products. Let us consider the inner-product of two N-point vectors Finite Impulse Response (FIR) filters are one of the most A and B given by Eq. (1) as, common components of Digital Signal Processing (DSP) systems. FIR filtering is achieved by convolving the input data samples with the desired unit response of the filter. Since the complexity of implementation grows with the filter order and the precision of computation, real-time realization of these filters with desired level of accuracy is a challenging task. Several attempts have, therefore, been made to develop dedicated and reconfigurable architectures for realization of FIR filters in Application Specific Integrated Circuits (ASIC) and FPGA platforms. DA provides an approach for multiplier-less implementation of FIR filters where the filter coefficients are programmable. In other words, the same filter structure can be used for a different set of coefficients. A systolic system consists of a set of interconnected cells, each capable of performing some simple operation. Because simple, regular communication and control structures have substantial advantages over complicated ones in design and implementation, cells in a systolic system are typically interconnected to form a systolic array or a systolic tree. Information in a systolic system flows between cells in a pipelined fashion, and communication with the outside world occurs only at the “boundary cells.” For example, in a systolic array, only those cells on the array boundaries may be I/O ports for the system[5]. The basic principle of a systolic © 2011 ACEEE 42 DOI: 01.IJIT.01.01.120
  • 2. ACEEE Int. J. on Information Technology, Vol. 01, No. 01, Mar 2011 where for l = 0, 1, …. , L-1 and p = 0, 1, …. , P-1. The bit vector (bn)l,p in Eq. (4b) is used as address word for the lookup table and F is the memory-read operation. III.1-D SYSTOLIC ARRAY FOR FIR FILTERS A linear array consisting of P number of PEs and an output cell is shown in Fig. 1 and the function of the PEs is described in Fig. 1(b)[3]. From the data presented in Table 1 and from the Figures 2 to 5 it can be seen that for a given filter order N, the case for M = 4 yields the most area-time efficient archi- tecture when compared to the case for M = 2 and 8. This can be explained by the fact that the increase in control logic and number of delay elements outweighs the gains made by reduction of LUT size for M = 2, while for M = 8, the memory requirement of LUTs is too high [1].Frequency is also maxi- mum for lower orders. Power consumption is the lowest. Figure. 1. The 1-D array for DA-based implementation of FIR filter: (a) Linear systolic array; (b) function of PE; and (c) function of output cell. delta stands for a unit delay. The input sequence {x(n)} is fed to a serial-in parallel- out input register where content of the register is serially right shifted by one position and transferred in parallel to the bit-serial word-parallel converter in every L cycles. The function of the output cell is shown in Fig. 1(c). After L cycles, it delivers a desired filter output. The structure will yield its first filter output (L+P) cycles after the first input is fed to the first PE, while the successive output becomes available in every L cycles. IV.IMPLEMENTATION This section is concerned with the description of the implementation of the FIR filter based on conventional and systolic decomposition of DA-based computation. © 2011 ACEEE 43 DOI: 01.IJIT.01.01.120
  • 3. ACEEE Int. J. on Information Technology, Vol. 01, No. 01, Mar 2011 The above shown Table 2 is the comparison results of filters of different orders for 1-D systolic conventional method and 1-D systolic decomposition method. The decomposition method is better in all metrics for all values of N, as seen from the graphs shown in Figure 6 to 9. The synthesis tool used is Xilinx ISE 9.2i. The simulation tool used is ModelSim XE 6.2c. The target device selected is Virtex II Pro (XC2VP30). Figure 2. Plot of variation of Area with filter order for 1-D Decomposition method for L = 8. Figure 6. Plot of variation of Area with filter order for 1-D Conventional method and 1-D Decomposition method for L = 8. Figure 3. Plot of variation of Frequency with filter order for 1-D Decomposition method for L = 8. Figure 7. Plot of variation of Power Consumption with filter order for 1- D Conventional method and 1-D Decomposition method for L = 8. Figure 4. Plot of variation of Power Consumption with filter order for 1- D Decomposition method for L = 8. Figure 5. Plot of variation of Gate Count with filter order for 1-D Figure 8. Plot of variation of Gate Count with filter order for 1-D Decomposition method for L = 8. Conventional method and 1-D Decomposition method for L = 8. © 2011 ACEEE 44 DOI: 01.IJIT.01.01.120
  • 4. ACEEE Int. J. on Information Technology, Vol. 01, No. 01, Mar 2011 Figure 11. Plot of variation of Power Consumption with filter order for existing system generator block and 1-D Decomposition method for L = 8. Figure 9. Plot of variation of Frequency with filter order for 1-D Conventional method and 1-D Decomposition method for L = 8. Figure 12. Hardware co-simulation of 1-D DA-based Decomposition method for L = 8. CONCLUSION The project presents hardware-efficient designs for computation of finite digital convolution by address decomposition of DA-based inner-product computation. The advantages of DA kind of implementation are its high usable frequency and minimum gate count. The main advantage is it overcomes the usage of multipliers. This method uses adders, LUTs and shift registers.The systolic decomposition The address length M is taken to be four for the proposed implementation. scheme is found to offer a flexible choice of the address From the above shown Table 3 it is clear that the 1- length of the lookup tables (LUT) for DA-based computation. D systolic decomposition method significantly outperforms The 1-D systolic array provides reduction in ROM size and the existing implementations in terms of two important key the number of adders by several orders of magnitude metrics, namely the frequency and power consumption for compared to the conventional method. all the values of N. REFERENCES [1] P. K. Meher, Shrutisagar Chandrasekaran and Abbes Amira, “FPGA realization of FIR filters by efficient and flexible systolization using distributed arithmetic”, IEEE Trans. signal process., vol. 56, no. 7, pp. 3009-3017, July 2008. [2] B.K.Mohanty, P.K.meher “Cost effective novel flexible cell- level systolic architecture for high throughput implementation of 2D FIR filters” IEE 1996. Technical note. [3] P. K. Meher, “Hardware-efficient systolization of DA-based calculation of finite digital convolution”, IEEE Trans. Circuits Syst. II, Exp. Briefs, vol. 53, no. 8, pp. 707–711, Aug. 2006. [4] S. A. White, “Applications of the distributed arithmetic to digital signal processing: A tutorial review”, IEEE ASSP Mag., vol. 6, no. 3, pp. 5–19, Jul. 1989. [5] H. T. Kung, “Why systolic architectures?”, IEEE Computer, Figure 10. Plot of variation of Frequency with filter order for existing vol. 15, no. 1, pp. 37–45, Jan. 1982. system generator block and 1-D Decomposition method for L = 8. [6] R. Wyrzykowski and S. Ovramenko, “Flexible systolic architecture for VLSI FIR filters”, IEE Proceedings-E, vol.139,no.2,March1992 © 2011 ACEEE 45 DOI: 01.IJIT.01.01.120