10 Hymel Conger Abstract

The document evaluates using partial reconfiguration to perform remote updates on an FPGA. It describes three experimental architectures using different numbers and arrangements of partially reconfigurable regions. The results show reductions in bitstream size and frequency when using partial reconfiguration compared to a non-reconfigurable baseline.

Uploaded by

Tamiltamil Tamil

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

52 views

10 Hymel Conger Abstract

Uploaded by

Tamiltamil Tamil

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 2

1

Evaluating Partial Reconfiguration for Embedded FPGA Applications

Ross Hymel, Alan D. George, and Herman Lam {hymel, george, lam}@chrec.org NSF Center for High-Performance Reconfigurable Computing (CHREC), University of Florida applications has been limited due to their reduced flexibility after field-deployment and relative high cost. If an embedded FPGAs reconfigurable resources become static, the device turns into an expensive, power-hungry, low-performance ASIC. Thus, for FPGAs to become more practical as end-use devices, there needs to be a way to maintain true fieldreprogrammability once deployed, i.e., the use of remote updating. Remote updating for FPGAs is the equivalent of inapplication programming for microprocessors and is used to dynamically tailor the hardware to the applications needs in real-time. Traditionally, an external configuration controller, usually a separate FPGA or microprocessor, performs remote updating. In the most generic sense, bitstreams that define hardware modules are sent to this device from a local or remote storage, over some type of communication link (e.g. MIL-STD-1553). The external controller then proceeds to fully reconfigure the user FPGA. This baseline approach has distinct advantages, namely that it provides an extremely flexible development environment since 100% of the user FPGA logic/routing resources are available for processing with no reconfiguration overhead or performance degradation. Unfortunately, the need for an external controller presents undesirable drawbacks. Because the entire user FPGA is reconfigured, a full device bitstream must be transmitted over the communication link even if the designer only wishes to change a small portion of the design. This requirement results in a needlessly high data transfer, which is especially detrimental in bandwidth-limited applications, such as satellite payloads, where the update bitstreams may not be stored onboard. Furthermore, fully reconfiguring the user FPGA produces the longest possible reconfiguration period, translating into lost processing time. A second drawback is the increased component count and PCB requirements necessary to accommodate an external device. Besides increasing the cost of the design, the extra complexity allows more failure points to exist in all phases of the systems lifetime (fabrication, assembly, testing, deployment, etc.). Most DOD designs are particularly affected since they must be qualified to strict environmental standards with regard to shock, vibration, ESD, etc. In this paper, we describe an approach for configuration control in which we embed the controller within the user FPGA. By using the Internal Configuration Access Port (ICAP) to perform partial reconfiguration, the remote update is performed in-situ, eliminating the need for an external device. In addition to mitigating many of the disadvantages previously mentioned, there are many advantages inherent in this approach. Most importantly, unrelated processing can

AbstractRecent advances in Xilinxs FPGA hardware and commercial software design tools, spurred in large part by the DODs Joint Tactical Radio System initiative, offer the possibility of incorporating dynamic partial reconfiguration (PR) into highperformance, embedded systems outside of academic research laboratories. PR can provide the flexibility and run-time reconfigurability that no pure hardware or software solution can offer. By multiplexing the hardware resources of a single programmable device with time-independent tasks, a common architecture in DOD systems, one FPGA can handle the same processing workload as a multi-device equivalent. This paper analyzes the performance impact of using PR to perform remote updating, an important capability often used in embedded applications.

I. INTRODUCTION

an SRAM-based FPGA is a multiprocessing device in that multiple, user-defined hardware modules can operate in parallel and independently within the same chip. One of the great advantages of such a device is the ability to modify its configuration memory easily and at any time. PR enhances this paradigm by reconfiguring only a portion of the chips configuration memory, allowing the user to load and unload these functional hardware modules without interrupting or resetting the rest of the device. Despite this advantage, commercial interest in PR has never materialized due mainly to a lack of supporting software tools and merciless design flows. Nevertheless, different academic approaches have been developed to incorporate PR into embedded systems using the Virtex-II FPGA [1-2]. Recently, however, the release of the Virtex-4 and Virtex-5 series of FPGAs, with their tile-based frame architectures, coupled with the lucrative softwaredefined radio market, has pushed Xilinx to engineer a workable PR design flow [3]. While still unreleased to the general public, the new design flow eliminates many of the burdensome requirements put in place by the previous flow [4] and now supports the Virtex-4 (though not yet the Virtex-5). Unfortunately, due to the relatively recent unveiling of this new design flow, as well as the still restricted nature of its release, there exists a vacuum in research and results exploring high-performance PR systems targeting these new devices. In response, we present a study of the performance impact (timing, resource utilization, and other metrics) of the new design flow when targeting Virtex-4 FPGAs, with remote updating, an important usage of PR, as a platform for analysis. II. TARGET APPLICATION

ENERICALLY,

Although commercial FPGAs have enjoyed great success as development and testing platforms, their use in embedded

2 continue uninterrupted during partial device reconfiguration, automatically maintaining state information. The remainder of this paper analyzes the performance impact of incorporating remote updating into three permutations of a generic PR architecture targeting an XC4VLX25 FPGA. III. EXPERIMENTAL ARCHITECTURES In order to facilitate PR in real hardware with a commercially-available design flow, key design issues and trade-offs must be addressed, including the number of partially reconfigurable regions (PRRs), the PRR shape, size, and placement, the PRRs access to the global clock network and I/O pads, and the communication interface amongst different PRRs and the static portion of the design. A complete description of each experimental study will appear in the full presentation, while a condensed version appears here. Each design permutation contains a static communication and configuration controller, as well as a different number of PRRs, ranging from one PRR of maximal size, to two side-byside PRRs, to four PRRs arranged in a 2x2 fashion. Each of the regions has a generic black-box, top-level interface. The advantage of such an approach is that a designer can use any high- or low-level tool to synthesize the PRR, so long as the top-level interfaces match. Then the designer need only run an existing script that automatically handles the details of the PR design flow to generate the partial bitstreams. We evaluated each design permutation using different highperformance computing cores, including Radix-4 FFT, AES, ARM7 soft-core processing, and others. We measured the minimum clock period at which each design could run twice, once when the design operated without any PR modifications and once after plugging into the experimental PR architecture. We also measured the size of the programming bitstream twice in the same fashion.
% Change from non-PR Baseline
40 35 30 25 20 15 10 5 0 Bitstream Reduction Overhead 1 PRR Max. Freq. Reduction 4 PRRs Max. Freq. Reduction (<100 MHz)

macros) but that do not contribute to processing. The clock frequency numbers are split into two categories, one for all designs and one for designs that originally operated at less than 100 MHz. The discrepancy is due to a single enable net in the static region whose purpose is to put the PRRs into a known state during reconfiguration. This net is most often the critical path for designs over 100 MHz due to its length and fanout. In absolute terms, the results averaged across all design permutations are -162 KB, +727 slices, -57.6 MHz, and -8.09 MHz, respectively. In addition, the relative percentages should remain constant across different device sizes. The full presentation will include a detailed breakdown of these results. IV. CONCLUSIONS The use of partial reconfiguration in conjunction with commercial FPGAs and software tools can provide a reliable, resource-saving, and flexible means for updating the processing load of a deployed programmable device. By timemultiplexing the device, the designer has, in effect, an FPGA that contains more resources than are actually physically present, providing multiprocessing across both time and space. This method not only reduces the reconfiguration time but also the amount of bitstream data. Furthermore, using a generic architecture simplifies the design flow at the hardware level to allow rapid system development by designers untrained in the nuances of PR. These factors are especially important in DOD systems, as the generic hardware can be qualified to the necessary environmental standards and then reused in other platforms without knowledge of the low-level details. Future directions for this work include exploring full partial reconfiguration. As Virtex-4 devices contain two separate ICAP primitives, we have the ability to reconfigure the reconfiguration engine itself by switching configuration control between different regions. Doing so would allow us to update the previously static controller, e.g., to change the encryption standard or the communication protocol it uses. V. ACKNOWLEDGEMENTS This work was supported in part by the I/UCRC Program of the National Science Foundation under Grant No. EEC0642422. The authors gratefully acknowledge tools and equipment provided by Sandia National Laboratories and Xilinx that helped make this work possible. VI. REFERENCES
[1] M. Ullmann, B. Grimm, M. Hbner, and J. Becker, An FPGA Run-Time System for Dynamical On-Demand Reconfiguration, Proc. IEEE Parallel and Distributed Processing Symposium, Santa Fe, NM, Apr. 26-30, 2004. [2] M. Hbner, J. Becker, Exploiting Dynamic and Partial Reconfiguration for FPGAs Toolflow, Architecture, and System Integration, Proc. 19th SBCCI Symp. on Integrated Circuits and Systems Design, Ouro Preot, Brazil, 2006. [3] Early Access Partial Reconfiguration User Guide, UG208 (v1.1), Xilinx Inc., Mar. 6, 2006. [4] Two Flows for Partial Reconfiguration: Module Based or Difference Based, XAPP290 (v1.2), Xilinx Inc., Sept. 9, 2004.

2 PRRs

Figure 1: Measured Effects of PR vs. non-PR Baseline Figure 1 displays a set of average measured PR performance effects, including the bitstream size reduction, the PR overhead of each design, and the decrease in maximum clock frequency due to PR. The PR overhead consists of resources that the FPGA uses to facilitate the design flow (e.g. bus

Accelerated Computing with HIP
From Everand
Accelerated Computing with HIP
Yifan Sun
4.5/5 (2)
Sambasiva Swarajati
No ratings yet
Sambasiva Swarajati
3 pages
Sambasiva Swarajati
No ratings yet
Sambasiva Swarajati
3 pages
All-In-One Oracle ASM Quick Reference Guide
75% (4)
All-In-One Oracle ASM Quick Reference Guide
14 pages
FPGA Dynamic and Partial Reconfiguration: A Survey of Architectures, Methods, and Applications
No ratings yet
FPGA Dynamic and Partial Reconfiguration: A Survey of Architectures, Methods, and Applications
39 pages
Efficient Reconfigurable On-Chip
No ratings yet
Efficient Reconfigurable On-Chip
11 pages
FPGAs Memory Synchronization and Performance Evaluation Using The Open Computing Language Framework
No ratings yet
FPGAs Memory Synchronization and Performance Evaluation Using The Open Computing Language Framework
8 pages
FPGA Paper PDF
No ratings yet
FPGA Paper PDF
18 pages
DML Dynamic Partial Reconfiguration With Scalable Task Scheduling for Multi-Applications on FPGAs
No ratings yet
DML Dynamic Partial Reconfiguration With Scalable Task Scheduling for Multi-Applications on FPGAs
15 pages
Earth and Atmospheric Sciences
No ratings yet
Earth and Atmospheric Sciences
10 pages
Efficient Implementation of Scan Register Insertion On Integer Arithmetic Cores For Fpgas
No ratings yet
Efficient Implementation of Scan Register Insertion On Integer Arithmetic Cores For Fpgas
6 pages
Dynamic Reconfigurability in Embedded System Design: Vincenzo Rana, Marco Santambrogio, Donatella Sciuto
No ratings yet
Dynamic Reconfigurability in Embedded System Design: Vincenzo Rana, Marco Santambrogio, Donatella Sciuto
4 pages
Implementing VHDL Code On Fpga
No ratings yet
Implementing VHDL Code On Fpga
1 page
Partial Reconfiguration On FPGAs
100% (1)
Partial Reconfiguration On FPGAs
306 pages
Lec7 - Partial Reconfiguration
No ratings yet
Lec7 - Partial Reconfiguration
37 pages
The Rise of SoC FPAA Devices
No ratings yet
The Rise of SoC FPAA Devices
8 pages
Dynamic Partial Reconfiguration in Fpgas
No ratings yet
Dynamic Partial Reconfiguration in Fpgas
4 pages
Elec2630 Embedded Systems Theory
No ratings yet
Elec2630 Embedded Systems Theory
14 pages
Reconfigurable Computing
No ratings yet
Reconfigurable Computing
38 pages
ES-MEL-AEL ZG554 - Lec6
No ratings yet
ES-MEL-AEL ZG554 - Lec6
43 pages
PLC: Programmable Logic Controller – Arktika.: EXPERIMENTAL PRODUCT BASED ON CPLD.
From Everand
PLC: Programmable Logic Controller – Arktika.: EXPERIMENTAL PRODUCT BASED ON CPLD.
MARIO FRANCO
No ratings yet
Prototyping Advanced Control Systems On FPGA
No ratings yet
Prototyping Advanced Control Systems On FPGA
14 pages
FPGA: Field Programmable Gate Array
No ratings yet
FPGA: Field Programmable Gate Array
5 pages
Hardware-Software Debugging Techniques For Reconfigurable Systems-on-Chip
No ratings yet
Hardware-Software Debugging Techniques For Reconfigurable Systems-on-Chip
6 pages
Frequently and Non Frequently Configurable Devices Applications of Reconfigurable Devices
No ratings yet
Frequently and Non Frequently Configurable Devices Applications of Reconfigurable Devices
14 pages
20,21
No ratings yet
20,21
21 pages
04_abstract (1)
No ratings yet
04_abstract (1)
40 pages
Training Report
No ratings yet
Training Report
30 pages
Ams 16th Smoi Martin 10.3
No ratings yet
Ams 16th Smoi Martin 10.3
10 pages
Programming and Synthesis For Software-Defined FPGA Acceleration - Status and Future Prospects
No ratings yet
Programming and Synthesis For Software-Defined FPGA Acceleration - Status and Future Prospects
39 pages
An On Chip Network Inside A FPGA For Run-Time Reconfigurable Low Latency Grid Communication
No ratings yet
An On Chip Network Inside A FPGA For Run-Time Reconfigurable Low Latency Grid Communication
8 pages
Fpga Da
No ratings yet
Fpga Da
137 pages
Partial Reconfiguration
No ratings yet
Partial Reconfiguration
18 pages
A Brief Study of Reconfigurable Computation Systems With A Focus of FPGA Based Devices
No ratings yet
A Brief Study of Reconfigurable Computation Systems With A Focus of FPGA Based Devices
11 pages
FPGA PROGRAMIN
No ratings yet
FPGA PROGRAMIN
24 pages
FPAA IEEEXPlore 2020
No ratings yet
FPAA IEEEXPlore 2020
20 pages
Fpga Viva Question
No ratings yet
Fpga Viva Question
4 pages
lec5-FPGA
No ratings yet
lec5-FPGA
46 pages
Wisniewski 18
No ratings yet
Wisniewski 18
16 pages
Final IEEE Penang
No ratings yet
Final IEEE Penang
74 pages
Ieee Fpga
No ratings yet
Ieee Fpga
3 pages
Lecture RC
No ratings yet
Lecture RC
81 pages
Introduction To Field Programmable Gate Arrays AND Its Applications
No ratings yet
Introduction To Field Programmable Gate Arrays AND Its Applications
13 pages
Implementation of Uart Using Systemc and Fpga Based Co-Design Methodology
No ratings yet
Implementation of Uart Using Systemc and Fpga Based Co-Design Methodology
7 pages
Asic Unit2 Lectfpga 2018 Nov15 MSC Electronic Science Semester 3
No ratings yet
Asic Unit2 Lectfpga 2018 Nov15 MSC Electronic Science Semester 3
26 pages
Hacking The Fabric: Targeting Partial Reconfiguration For Fault Injection in FPGA Fabrics
No ratings yet
Hacking The Fabric: Targeting Partial Reconfiguration For Fault Injection in FPGA Fabrics
6 pages
A Dynamic Instruction Set Computer
No ratings yet
A Dynamic Instruction Set Computer
9 pages
Lec 1
No ratings yet
Lec 1
25 pages
Architecture Design and FPGA Code Development For ADC Module
No ratings yet
Architecture Design and FPGA Code Development For ADC Module
6 pages
Adaptive Computing
No ratings yet
Adaptive Computing
15 pages
Fpgas Design Ebook Emea Emeaen
No ratings yet
Fpgas Design Ebook Emea Emeaen
19 pages
FPGA Frontiers - Digital Book PDF
No ratings yet
FPGA Frontiers - Digital Book PDF
87 pages
LAB NO 1
No ratings yet
LAB NO 1
6 pages
A Summary On FPGA
100% (1)
A Summary On FPGA
28 pages
100 Power Tips For FPGA Designers Stavinov Evgeni
No ratings yet
100 Power Tips For FPGA Designers Stavinov Evgeni
213 pages
Image Hardware PDF
No ratings yet
Image Hardware PDF
19 pages
Dynamic Reconfiguration
No ratings yet
Dynamic Reconfiguration
12 pages
Dean Seminar1
No ratings yet
Dean Seminar1
19 pages
FPGA Kitap BLM
No ratings yet
FPGA Kitap BLM
30 pages
RC
No ratings yet
RC
175 pages
ES-MEL-AEL ZG554 - Lec1
No ratings yet
ES-MEL-AEL ZG554 - Lec1
40 pages
BeagleBone Systems and Applications: Definitive Reference for Developers and Engineers
From Everand
BeagleBone Systems and Applications: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
DeepSeek vs. ChatGPT – Why DeepSeek is the Superior AI.
From Everand
DeepSeek vs. ChatGPT – Why DeepSeek is the Superior AI.
Gary Thatcher
No ratings yet
Tukaram A Bhang
No ratings yet
Tukaram A Bhang
4 pages
PDF Documentation Package How To Integrate CPP Code in Python
No ratings yet
PDF Documentation Package How To Integrate CPP Code in Python
4 pages
VHDL O: There Is NO Order of Precedence So Use Lots of Parentheses XNOR Was Not in Original VHDL (Added in 1993)
No ratings yet
VHDL O: There Is NO Order of Precedence So Use Lots of Parentheses XNOR Was Not in Original VHDL (Added in 1993)
2 pages
Jayadeva and The Gitagovinda
No ratings yet
Jayadeva and The Gitagovinda
11 pages
Tho Day A Mangalam
No ratings yet
Tho Day A Mangalam
7 pages
abhimAnamennaDu Kunjari
No ratings yet
abhimAnamennaDu Kunjari
6 pages
C - Ltwùu Æ Ro Auwwù°V ©U) O Ghpvù WM Læ - L©Wu V Tùwßvùo CVT V Cwelt Tùdls
No ratings yet
C - Ltwùu Æ Ro Auwwù°V ©U) O Ghpvù WM Læ - L©Wu V Tùwßvùo CVT V Cwelt Tùdls
3 pages
C - Ltwùu Æ Ro Auwwù°V ©U) O Ghpvù WM Læ - L©Wu V Tùwßvùo CVT V Cwelt Tùdls
No ratings yet
C - Ltwùu Æ Ro Auwwù°V ©U) O Ghpvù WM Læ - L©Wu V Tùwßvùo CVT V Cwelt Tùdls
3 pages
A Vat Harika S Lokas
No ratings yet
A Vat Harika S Lokas
7 pages
Ayyappa Saranam Endre English
No ratings yet
Ayyappa Saranam Endre English
1 page
FPGA Power Calculations
No ratings yet
FPGA Power Calculations
3 pages
B HamBG
No ratings yet
B HamBG
1 page
Thy Aga Raja Meaning
No ratings yet
Thy Aga Raja Meaning
15 pages
JanakSutha 11geetham5
100% (2)
JanakSutha 11geetham5
2 pages
Narada gAna-aThANA PDF
No ratings yet
Narada gAna-aThANA PDF
5 pages
Javagrm README
No ratings yet
Javagrm README
3 pages
Bhimpalasi
No ratings yet
Bhimpalasi
2 pages
Notes On SaptaTala Alankaras
No ratings yet
Notes On SaptaTala Alankaras
2 pages
Transliteration-Telugu: Kanugonu-Nayaki
No ratings yet
Transliteration-Telugu: Kanugonu-Nayaki
6 pages
Transliteration-Telugu: Nikevari Bodhana-Suddhasaveri
No ratings yet
Transliteration-Telugu: Nikevari Bodhana-Suddhasaveri
4 pages
BrocheVareVarura Page
100% (1)
BrocheVareVarura Page
8 pages
EE 552 (Logic Design and Switching Theory) Project: Quantitative Measurement of The Benefits of Reduction Techniques For Asynchronous Finite State Machines
No ratings yet
EE 552 (Logic Design and Switching Theory) Project: Quantitative Measurement of The Benefits of Reduction Techniques For Asynchronous Finite State Machines
9 pages
Do Not View The Solution Until You Have Completed The Above Quiz
No ratings yet
Do Not View The Solution Until You Have Completed The Above Quiz
1 page
Mamava satataM-jaganmOhini
No ratings yet
Mamava satataM-jaganmOhini
7 pages
Practical2 Final
No ratings yet
Practical2 Final
896 pages
Cracking Delphi Programs
No ratings yet
Cracking Delphi Programs
11 pages
Bash Shell Vulnerability (Shellshock) Patch For Avaya Aura® System Manager and WebLM Releases
No ratings yet
Bash Shell Vulnerability (Shellshock) Patch For Avaya Aura® System Manager and WebLM Releases
4 pages
Solaris/Unix Training
No ratings yet
Solaris/Unix Training
119 pages
O2T Selenium Webdriver Framework Introduction
No ratings yet
O2T Selenium Webdriver Framework Introduction
11 pages
"Life Insurance Management System": Bachelor of Computer Application
No ratings yet
"Life Insurance Management System": Bachelor of Computer Application
14 pages
SAP NetWeaver - Process Integration - Simple Use Cases
No ratings yet
SAP NetWeaver - Process Integration - Simple Use Cases
81 pages
Rownum
No ratings yet
Rownum
4 pages
Criticism
No ratings yet
Criticism
2 pages
SAlesforce Integration
0% (1)
SAlesforce Integration
19 pages
Python Exercises
No ratings yet
Python Exercises
1 page
Library Management System Presentation
0% (1)
Library Management System Presentation
32 pages
Image Interpolation
No ratings yet
Image Interpolation
51 pages
Pump It Up: Data Mining The Water Table
No ratings yet
Pump It Up: Data Mining The Water Table
5 pages
(Hua) (ST) Ne40ecx600me60ne20e v800r008 CC St-Security Target-V1 51
No ratings yet
(Hua) (ST) Ne40ecx600me60ne20e v800r008 CC St-Security Target-V1 51
65 pages
Process Maker Installation Esolutions
No ratings yet
Process Maker Installation Esolutions
5 pages
Procedural Extension To SQL Using Triggers - Lecture 2: DR Akhtar Ali
No ratings yet
Procedural Extension To SQL Using Triggers - Lecture 2: DR Akhtar Ali
28 pages
Fast, Accurate Data Management Across The Enterprise: Fact Sheet: File-Aid / Mvs
No ratings yet
Fast, Accurate Data Management Across The Enterprise: Fact Sheet: File-Aid / Mvs
4 pages
Split The Non-Key Columns To Separate Tables With Key Column in Both
No ratings yet
Split The Non-Key Columns To Separate Tables With Key Column in Both
25 pages
Email Policy by Cabinet Division (NTISB)
No ratings yet
Email Policy by Cabinet Division (NTISB)
25 pages
Export Text File To Excel File - Delphi
No ratings yet
Export Text File To Excel File - Delphi
2 pages
Codexx
No ratings yet
Codexx
3 pages
Recovery Instructions (UnBrick) PDF
No ratings yet
Recovery Instructions (UnBrick) PDF
4 pages
State Machine Entry and Debugging Tutorial
No ratings yet
State Machine Entry and Debugging Tutorial
32 pages
Student Information System
No ratings yet
Student Information System
11 pages
FPRB User Manual - Rev AB
75% (4)
FPRB User Manual - Rev AB
64 pages
Littles Law
No ratings yet
Littles Law
22 pages
A Course in Data Design For Relational Databases
No ratings yet
A Course in Data Design For Relational Databases
76 pages
Requirement Engineering
No ratings yet
Requirement Engineering
2 pages

10 Hymel Conger Abstract

Uploaded by

10 Hymel Conger Abstract

Uploaded by

1

Evaluating Partial Reconfiguration for Embedded FPGA Applications

You might also like