Xilinx Versal AI Compute Solution Brief
CHALLENGE
Applied machine learning techniques have now become pervasive
across a wide range of applications, with tremendous growth in vision
and video in particular. FPGA-based AI/ML acceleration has already
shown performance and latency advantages over GPU accelerators,
but next-generation CNN-based workloads demand compute density
beyond what traditional FPGA programmable logic and multipliers
can offer. Fabric-based DSP blocks offer flexible precision and are still
capable accelerators, but the bit-level interconnect and fine-grained
programmability come with overhead that limits scalability for the
most compute-intensive CNN-based workloads.
SOLUTION
Within the Versal platform is a unique architecture for AI inference: the AI Engines, an array of software-programmable vector processors with flexible interconnect and tightly coupled local memory. Ideal for CNN-based inference, they deliver 2.7X performance/watt over competing 10nm FPGAs.1 AI Engines deliver compute density, power efficiency, and low latency not possible with GPUs or traditional FPGA architectures, all while retaining hardware adaptability to evolve with AI algorithms.
Versal AI Core device for AI Accelerator Cards
Whole Application Acceleration
Machine learning is typically integrated into a larger application rather than a stand-alone
workload. As a complete heterogeneous compute platform, the Versal AI Core series leverages
its diverse engines to infuse deep learning as “an element” of a larger application that has other
pre/post-processing requirements, delivering end-to-end application acceleration.
PLATFORM HIGHLIGHTS
Adaptable Engines
> Custom memory hierarchy optimizes data movement and management for accelerator kernels
> Pre- and post-processing functions including neural network RT compression and image scaling
AI Engines
> Tiled array of vector processors, flexible interconnect, and local memory enabling massive parallelism
> Up to 133 INT8 TOPS with the Versal AI Core VC1902 device, scaling up to 405 INT4 TOPS across the portfolio
> Compiles models in minutes from TensorFlow, PyTorch, and Caffe using Python or C++ APIs
> Ideal for neural networks including CNNs, RNNs, and MLPs; hardware adaptable to optimize for evolving algorithms
Scalar Engines
> Arm processing subsystem for queue management and Kubernetes orchestration
> Platform management controller for security, power management, and bitstream management
Integrated Shell
> Comprises hardened host interface, programmable NoC, and Scalar Engines
> Ensures streamlined device bring-up and connectivity to off-chip interfaces, making the platform available at boot
> Delivers pre-engineered timing closure and logic resource savings, simplifying development of accelerator cards
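The INT8 and INT4 TOPS figures above rest on quantizing network weights and activations to low-precision integers before inference. As a minimal sketch of the idea (illustrative only: the `quantize_int8` helper and example values are not from this brief, and a production flow would use the Vitis AI quantizer rather than hand-rolled code), symmetric per-tensor INT8 quantization maps floats onto the [-128, 127] range via a single scale factor:

```python
import numpy as np

def quantize_int8(x, scale):
    # Symmetric per-tensor quantization: round to the nearest step of
    # `scale`, then saturate into the signed 8-bit range.
    return np.clip(np.round(x / scale), -128, 127).astype(np.int8)

def dequantize(q, scale):
    # Recover an approximation of the original floats.
    return q.astype(np.float32) * scale

weights = np.array([0.5, -1.2, 0.03, 2.0], dtype=np.float32)
scale = np.abs(weights).max() / 127.0   # map the largest magnitude to 127
q = quantize_int8(weights, scale)
recon = dequantize(q, scale)
```

The reconstruction error of each element is bounded by half a quantization step (scale / 2), which is why CNN inference typically tolerates INT8 with little accuracy loss while gaining the compute density the table above quotes.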
BENCHMARK
ResNet50 v1.5 Performance Comparison
Shown below is a comparison of measured results on Versal devices, as submitted to MLPerf Data Center v1.0, against the projected performance of competing 10nm Intel Agilex FPGAs.
[Bar chart: relative ResNet50 v1.5 performance/watt. Intel Agilex AGF027-2: 1X; Versal AI Core VC1902: 2.7X]
Peak INT8 TOPS: Intel Agilex AGF027-2, 61 TOPS2; Versal AI Core VC1902, 133 TOPS (a 2.2X advantage)
2: Assumes 30% compute efficiency for Intel Agilex FPGA 18x19 multipliers and 40% compute efficiency for AI Engines
3: Integrated shell reduces logic required for connectivity, 45K LUTs required for run-time SW & deep-learning processor support
4: Based on Quartus Power & Thermal Calculator 2021.2, assumes SmartVID and claimed static power savings
5: Device power estimates, based on Xilinx Power Estimator (XPE) available at https://www.xilinx.com/products/technology/power/xpe.html
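The efficiency assumptions in footnote 2 can be checked with a few lines of arithmetic. The sketch below uses only the peak TOPS and efficiency figures quoted above; note that the headline 2.7X is a measured/projected performance-per-watt figure, distinct from the raw effective-compute ratio these assumptions imply:

```python
# Peak INT8 TOPS from the comparison above.
agilex_peak_tops = 61    # Intel Agilex AGF027-2
versal_peak_tops = 133   # Versal AI Core VC1902

# Effective compute under footnote 2's efficiency assumptions:
# 30% for Agilex 18x19 multipliers, 40% for AI Engines.
agilex_effective = agilex_peak_tops * 0.30   # 18.3 effective TOPS
versal_effective = versal_peak_tops * 0.40   # 53.2 effective TOPS

peak_ratio = versal_peak_tops / agilex_peak_tops        # ~2.2X, as quoted
compute_ratio = versal_effective / agilex_effective     # ~2.9X effective
print(round(peak_ratio, 1), round(compute_ratio, 1))
```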
> To start designing for cloud acceleration and edge computing, visit www.xilinx.com/vck5000
> To contact your local AMD sales representative, visit Contact Sales
DISCLAIMERS
The information contained herein is for informational purposes only and is subject to change without notice. While every precaution has been taken in the preparation of this document, it may contain technical
inaccuracies, omissions and typographical errors, and AMD is under no obligation to update or otherwise correct this information. Advanced Micro Devices, Inc. makes no representations or warranties with respect
to the accuracy or completeness of the contents of this document, and assumes no liability of any kind, including the implied warranties of noninfringement, merchantability or fitness for purposes, with respect to the
operation or use of AMD hardware, software or other products described herein. No license, including implied or arising by estoppel, to any intellectual property rights is granted by this document. Terms and limitations
applicable to the purchase or use of AMD’s products are as set forth in a signed agreement between the parties or in AMD’s Standard Terms and Conditions of Sale.
COPYRIGHT NOTICE
© 2023 Advanced Micro Devices, Inc. All rights reserved. Xilinx, the Xilinx logo, AMD, the AMD Arrow logo, Alveo, Artix, Kintex, Kria, Spartan, Versal, Vitis, Virtex, Vivado, Zynq, and other designated brands included herein
are trademarks of Advanced Micro Devices, Inc. Other product names used in this publication are for identification purposes only and may be trademarks of their respective companies. AMBA, AMBA Designer, ARM,
ARM1176JZ-S, CoreSight, Cortex, and PrimeCell are trademarks of ARM in the EU and other countries. PCIe, and PCI Express are trademarks of PCI-SIG and used under license. PID# 231846771-B