0% found this document useful (0 votes)

12 views

arc22-ai-creating_optimized_ai_soc_architecture-virtual-prototyping-mojin-kottarathil

The document discusses the creation of optimized AI System-on-Chip (SoC) architectures using virtual prototyping, highlighting advancements in embedded AI applications and the challenges faced in design and verification. It outlines the use of Synopsys Virtual Prototyping for early architecture analysis and optimization, along with a case study of an AI SoC platform utilizing ARC Processor IP. The presentation concludes with insights on how to get started with faster development of AI SoCs using Synopsys tools and services.

Uploaded by

lapnd.english

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

12 views

arc22-ai-creating_optimized_ai_soc_architecture-virtual-prototyping-mojin-kottarathil

Uploaded by

lapnd.english

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 27

Creating Optimized AI SoC Architecture

Using Virtual Prototyping

Mojin Kottarathil, Staff Applications Engineer

Synopsys ARC® Processor Summit 2022
Agenda

• Recent advancements in embedded AI applications and architectures

• Challenges in the design and verification of AI SoCs
• Synopsys Virtual Prototyping for early architecture analysis and optimization
• AI SoC platform case-study with ARC Processor IP
• How to get started

Processor Summit © 2022 Synopsys, Inc. 2

AI SoCs: A New Golden Age for Computer Architecture
AI
• Applications becoming smart
enabled
– autonomous vehicles, smart IoT, robots, etc.
Applications
– AI moving to the client for better cost, latency, reliability

• Neural Networks are getting bigger Neural

– More accurate results, higher image size, complex NLP models Network

• Software is often the hardest part Neural

– Need optimizing compilers to map applications to custom chips Network
– ResNet-50 is easy, real workloads are hard Compiler

• Moore’s Law winds down - Domain-Specific Architectures gain AI

– Custom accelerators/data-paths/instructions, SIMD SoC
– Many startups, semiconductors, super-scalers build AI SoCs

Processor Summit © 2022 Synopsys, Inc. 3

AI SoC Design Challenges
Brute-force Processing of Huge Data Sets
• Choosing the right algorithm and architecture: CPU, vector DSP, ASIP, DNN accelerator
– DNN graphs are evolving fast, need short time to market and cannot optimize for one single graph
– Joint design of AI algorithm, compiler and SoC architecture
– Joint optimization of power, performance, accuracy, and cost

• Highly parallel compute drives memory requirements

– E.g. in computer vision: higher resolution, higher frame-rate, more cameras
– High on-chip and chip to chip bandwidth at low latency
– High memory bandwidth requirements for parameters and layer to layer communication

• Power & Performance analysis require realistic workloads to consider dynamic effects
– Scheduling of AI operators on parallel processing elements
– Unpredictable interconnect and memory access latencies

Large Design Space Drives Differentiation by AI Algorithm & Architecture

Processor Summit © 2022 Synopsys, Inc. 4
Shift Left Architecture Analysis of AI SoCs
Analytical Performance Model
Architecture spec

APM-based Workload Model Fast Performance Model RTL Emulation RTL Prototyping
• Partitioning and exploration • HW/SW co-optimization • HW/SW co-verification • HW/SW co-verification
• Interconnect/memory analysis • Performance/power analysis • Power characterization • KPI validation

Model-based Architecture Simulation RTL-based HW/SW co-verification

Processor Summit © 2022 Synopsys, Inc. 5

Use-cases for Architecture Analysis with Virtual Prototyping
Early
Early architecture
architecture partitioning
exploration and
and exploration
optimization
Performance
Performancevalidation
optimization
withwith
Software
Software
with workload
with
models,
workloadcalibrated
modelsfrom APM
• KPI capture and sensitivity analysis • KPI tracking and validation
• Traffic and application workload modeling • IP selection and benchmarking
• HW/SW partitioning, architecture specification • SoC performance validation
• power/performance analysis • L1/L2 cache & cache coherency optimization

execute

map

execute

Hardware resource Application Near cycle accurate

Workload model
model Software Hardware model

Processor Summit © 2022 Synopsys, Inc. 6

Platform Architect Power and Performance Analysis Flow
Application workload Workload trace and statistics
Application
specification

Software
and
traces
map

Model Hardware platform Root-cause analysis

libraries

Power models
and
characterization

Sensitivity analysis

Parallel
parameter
sweep
…
Design space exploration

Processor Summit © 2022 Synopsys, Inc. 7

Platform Architect Based Workload Modelling cycles:
rd_bytes:
2000
0
wr_bytes: 0

• Analytic Performance Model (APM)

cycles: 0 cycles: 0
– Used internally by Synopsys NPX System Architecture Team rd_bytes: 0x200 rd_bytes: 0
wr_bytes: 0 wr_bytes: 0x200
• Workload Model generated from APM
Coef inDMA
– Calibrated tasks for in-DMA, out-DMA, and processing
Proc outDMA
• SoC Platform Model inDMA
– Accurate SystemC Transaction Level Models (TLM)
of processing elements, interconnect and memory
• Map workload to NPX6 VPU (Virtual Processing Unit) model
NPX6 VPU Model
– Process VPUs has execution time of layer group
inDMA NPU
– DMA execution times are based on actual bus and memory delays
w L1 and
• Analyze performance metrics outDMA L2 mem

– End-to-end performance
– Workload activity
NoC / Bus
– Utilization of resources
– Interconnect metrics Host SRAM DDR

– Latency, Throughput
– Contention, Outstanding transactions

Processor Summit © 2022 Synopsys, Inc. 8

ARC Processor Simulation Models
Support for building virtual prototypes
• nSIM NCAM has
– SystemC wrapper
– Model Libraries for Platform Architect and Virtualizer
– For easy deployment in Synopsys Virtual Prototyping tools
– Instrumented for debug and analysis

• Allows for easy creation of your own Virtual Platform

• Integration of MetaWare Debugger (mdb) into PA and Virtualizer

– For debugging complete systems containing ARC IP models

• Accurate model of ARC STU with non-blocking FT-AXI interfaces

Processor Summit © 2022 Synopsys, Inc. 9

ARC AI Fast Performance Model (FPM) in Platform Architect
Whitepaper "Performance Analysis Using ARC EV7x Fast Performance Model"
Neural Platform Architect SoC performance model Platform Architect Analysis
Network
ARC
SW trace
DNN DNN
Graph shared task trace
Mapping library DNN
Tool address trace
DNN
utilization
Runtime, DDR
binary utilization
Libraries,
Compiler image bus
Computer throughput
Vision
DNN power

• Use MetaWare production build flow to compile DNN model and ARC Vector DSP binary image
• Use Platform Architect to execute application on cycle-approximate performance model in context of SoC platform
• Analyze AI application and SoC power and performance metrics,
– e.g. Arc function profile, DNN trace, utilization, and address pattern, SoC bus and memory throughput and latency

Processor Summit © 2022 Synopsys, Inc. 10

Accuracy of FPM with FT interfaces in Platform Architect
Interconnect & memory models are crucial to achieve high accuracy for multi-core systems

Processor Summit © 2022 Synopsys, Inc. 11

AI SoC platform case-study
with Fast Performance Model of ARC AI processor IP
• Capture an AI SoC platform with ARC AI processor IP,
a Network-on-Chip, and DDR and SRAM memory hierarchy
• Analysis and optimization of IP-level and SoC architecture configurations

Processor Summit © 2022 Synopsys, Inc. 12

AI SoC Platform Case-study with ARC AI Subsystem

Platform Architect MobileNet

AI SoC with ARC IP & LP-DDR5

Goals:
 4 ms latency for inference of 5 frames
 minimize DNN power and energy
Root-cause analysis Sensitivity analysis
Optimize Hardware configuration:
– IP configuration
– Speed of DDR memory
– Interconnect, buffers, transactions

Processor Summit © 2022 Synopsys, Inc. 13

Platform Architect with ARC AI sub-system and DWC LPDDR5

Model Library
Block diagram

Parameters

Components

Connections

Processor Summit © 2022 Synopsys, Inc. 14

Video 1: Platform creation and tracing

• Example Platform creation

• Software tracing
• Hardware tracing

Processor Summit © 2022 Synopsys, Inc. 15

Processor Summit © 2022 Synopsys, Inc. 16
What We Just Learned
Platform creation and tracing

We learned how to:

✓ Create demo platform with ARC
Fast Performance Model and
DesignWare LPDDR5 memory
controller

✓ Use ARC VPX Function Trace to

analyze Software activity

✓ Correlate Software trace with

Hardware traces from DNN
accelerator and interconnect

Video 2: Performance Analysis

• Performance analysis of initial result

• Change architecture configuration
• Compare results from different simulations

We learned how to:

✓ Analyze activity and stall cycles of ARC AI

accelerator, correlate DNN activity with
interconnect and LPDDR analysis views

✓ Change bus and LPDDR5 controller

configuration to increase memory bandwidth

✓ Compare results from multiple runs,

new results show diminishing returns from higher
memory bandwidth

AI SoC Block Diagram in Platform Architect
Scaling AI Sub-system and LPDRR5 memory controller

Single-core sub-system Dual-core sub-system Quad-core sub-system

- 1 ARC VPX cores - 2 ARC VPX cores - 4 ARC VPX cores
- 1 DNN slice - 2 DNN slices - 4 DNN slices

DesignWare LPDDR5 - Multi-port LPDDR5 Mctrl

Memory Controller - parallel AXI bus fabric

AI SoC Architecture Sweep
Goal: 4 ms inference latency, minimize power & energy
Sweep parameters
– AI configuration: 1, 2, 4 DNN slices
– Outstanding transactions: 16, 32, 64
– LPDDR5 memory speed: 3733, 4800, 6400
– Interconnect/LPDDR controller: single port, multi-port
– LPDDR controller scheduler queue: 32, 64
– LPDDR channels: 2, 4

Sensitivity

Root-Cause
Analysis

Analysis and Optimization of Architecture Configurations
Inference latency for 5 frames vs. DNN power and energy consumption
Power
Latency [us]
Energy

Outstanding
transactions

LPDDR
channels Sufficient
performance
LPDDR speed
1 DNN slice 2 DNN slices 4 DNN slices
Processor Summit © 2022 Synopsys, Inc. 23
Example Summary

Platform Architect MobileNet

Goals:
 4 ms latency for inference of 5 frames
 minimize DNN power and energy
Optimized Hardware configuration:
Root-cause analysis Sensitivity analysis
– AI configuration:1, 2, 4 DNN slices
– Outstanding transactions: 16, 32, 64
– LPDDR memory speed: 3733, 4800, 6400
– Interconnect/LPDDR controller: single port, multi-port
– LPDDR controller scheduler queue: 32, 64

How To Get Started?
Faster Development of AI SoCs with Synopsys IP, tools, and services
Deep Knowledge in: Platform Architect
– AI Frameworks, AI & CNN Graphs, Graph Compression, • Exploration and optimization flows
Architecture • Power and performance analysis
and Mapping Tools Exploration & • Tooling for model creation and
– Class leading CNN, State of the art Vector DSP, & ASIP Optimization platform assembly
capabilities • Rich model library
– Leading edge processor IP and SW (ARC)
– Mastery of key support IP (HBM, PCIe, DDR, MIPI) ZeBu/HAPS
– Foundry Process, Memory Compilers and Logic Libraries • SoC verification
Verification, • Software development & bring-up
Emulation & • Hybrid emulation
Prototyping • Power & performance analysis
• AI benchmarks

Services
• Architectural tradeoffs
• IP subsystems
Services • ASIP design
• System verification
• Early Software development

Thank You!
• Further resources
• Landing page: DesignWare IP for Artificial Intelligence
• Landing page: Platform Architect

• Further questions
• [email protected]

Thank You

04 AMD Edge AI TechDay_Singapore_2024_FrankWang
No ratings yet
04 AMD Edge AI TechDay_Singapore_2024_FrankWang
29 pages
Analysis On Twitter
100% (7)
Analysis On Twitter
26 pages
Business Plan SP Electronics
No ratings yet
Business Plan SP Electronics
23 pages
Embraer Legacy 650
No ratings yet
Embraer Legacy 650
17 pages
Building Better IP With RTL Architect NoC IP Physical Exploration by Arteris
No ratings yet
Building Better IP With RTL Architect NoC IP Physical Exploration by Arteris
30 pages
HC31 1.11 Huawei - Davinci.HengLiao v4.0 PDF
No ratings yet
HC31 1.11 Huawei - Davinci.HengLiao v4.0 PDF
44 pages
Google Cloud Platform for Data Engineering: From Beginner to Data Engineer using Google Cloud Platform
From Everand
Google Cloud Platform for Data Engineering: From Beginner to Data Engineer using Google Cloud Platform
alasdair gilchrist
5/5 (1)
SCM Create Fault Polygons and Map Petrel 2010
No ratings yet
SCM Create Fault Polygons and Map Petrel 2010
14 pages
Business Analysis
100% (2)
Business Analysis
37 pages
Platform Architect Ds
No ratings yet
Platform Architect Ds
5 pages
Hc2024.Amd.vpeng
No ratings yet
Hc2024.Amd.vpeng
36 pages
Client 2 - Synopsys - ATS Speaker Slide - Thomas Li (Synopsys)
No ratings yet
Client 2 - Synopsys - ATS Speaker Slide - Thomas Li (Synopsys)
29 pages
(SOC Intro) Introduction to ArmBased System on Chip Hashtag Design
No ratings yet
(SOC Intro) Introduction to ArmBased System on Chip Hashtag Design
210 pages
Ta2 1 Walston Pres Snps
No ratings yet
Ta2 1 Walston Pres Snps
66 pages
Gaysse Jerome A Comparison of In-Storage Processing Architectures and Technologies
No ratings yet
Gaysse Jerome A Comparison of In-Storage Processing Architectures and Technologies
50 pages
HC2021.C1.3 IBM Cristian Jacobi Final
No ratings yet
HC2021.C1.3 IBM Cristian Jacobi Final
22 pages
AI From The Data Center To The Edge An Optimized Path Using Intel Architecture PDF
No ratings yet
AI From The Data Center To The Edge An Optimized Path Using Intel Architecture PDF
149 pages
AI From The Data Center To The Edge An Optimized Path Using Intel Architecture PDF
No ratings yet
AI From The Data Center To The Edge An Optimized Path Using Intel Architecture PDF
149 pages
Backend Development
From Everand
Backend Development
Kai Turing
No ratings yet
ARM System On Chip Design
No ratings yet
ARM System On Chip Design
25 pages
Automated Instruction Stream Throughput Prediction For Intel and AMD Microarchitectures 11 - 2018 (1809.00912)
No ratings yet
Automated Instruction Stream Throughput Prediction For Intel and AMD Microarchitectures 11 - 2018 (1809.00912)
11 pages
Client 3 - Tech Symposia - Heterogeneous AI On Arm (Taiwan Version v2)
No ratings yet
Client 3 - Tech Symposia - Heterogeneous AI On Arm (Taiwan Version v2)
27 pages
Hackstorm_ Intel® AI PC Edition Sample Idea
No ratings yet
Hackstorm_ Intel® AI PC Edition Sample Idea
10 pages
Ug1703 Vitis Ai Developer Guide WTMKX
No ratings yet
Ug1703 Vitis Ai Developer Guide WTMKX
137 pages
IBM WebSphere Portal 8: Web Experience Factory and the Cloud
From Everand
IBM WebSphere Portal 8: Web Experience Factory and the Cloud
Chelis Camargo
No ratings yet
Google AI Infrastructure Supremacy_ Systems Matter More Than Microarchitecture – SemiAnalysis
No ratings yet
Google AI Infrastructure Supremacy_ Systems Matter More Than Microarchitecture – SemiAnalysis
22 pages
TB 01 Gibbons Pres Snps
No ratings yet
TB 01 Gibbons Pres Snps
43 pages
dell-ai-factory-with-nvidia-ebook
No ratings yet
dell-ai-factory-with-nvidia-ebook
12 pages
Mantovani Thesis PDF
No ratings yet
Mantovani Thesis PDF
230 pages
Giulio Corradi Presentation PDF
No ratings yet
Giulio Corradi Presentation PDF
64 pages
HPE Compute Certification Guide: 444 Practice Questions for the Advanced HPE1-H02 Exam
From Everand
HPE Compute Certification Guide: 444 Practice Questions for the Advanced HPE1-H02 Exam
Steve Brown
No ratings yet
FOSDEM14 HPC Devroom 12 Sniper
No ratings yet
FOSDEM14 HPC Devroom 12 Sniper
33 pages
Intel Architecture Day 2021 Presentation
No ratings yet
Intel Architecture Day 2021 Presentation
195 pages
HC2024.T2.Qualcomm.NaderNikfar.final-0824
No ratings yet
HC2024.T2.Qualcomm.NaderNikfar.final-0824
25 pages
Systems On Chip (SoC) - 01
No ratings yet
Systems On Chip (SoC) - 01
47 pages
Zhan Xu Huawei
No ratings yet
Zhan Xu Huawei
35 pages
RAPIDO 2023 Paper 2868
No ratings yet
RAPIDO 2023 Paper 2868
6 pages
Module 4 - Hardware Accelerators for Deep Learning
No ratings yet
Module 4 - Hardware Accelerators for Deep Learning
25 pages
Electronic System Level Power and Performance Analysis For Multi-Processor-System-on-Chip
No ratings yet
Electronic System Level Power and Performance Analysis For Multi-Processor-System-on-Chip
2 pages
Systems On Chip (SoC)
No ratings yet
Systems On Chip (SoC)
46 pages
Module 10 - Learners Guide
No ratings yet
Module 10 - Learners Guide
29 pages
Full Download Artificial Intelligence Hardware Design: Challenges and Solutions 1st Edition Albert Chun-Chen Liu PDF DOCX
100% (1)
Full Download Artificial Intelligence Hardware Design: Challenges and Solutions 1st Edition Albert Chun-Chen Liu PDF DOCX
50 pages
BRKFP292
No ratings yet
BRKFP292
15 pages
inference-whitepaper-mar23-update
No ratings yet
inference-whitepaper-mar23-update
42 pages
Software Architecture with Python
From Everand
Software Architecture with Python
Anand Balachandran Pillai
3/5 (1)
AI Accelerator
No ratings yet
AI Accelerator
5 pages
Download full Artificial Intelligence Hardware Design: Challenges and Solutions 1st Edition Albert Chun-Chen Liu ebook all chapters
100% (4)
Download full Artificial Intelligence Hardware Design: Challenges and Solutions 1st Edition Albert Chun-Chen Liu ebook all chapters
40 pages
Ten Lessons From Three Generations Shaped Google S Tpuv4i
No ratings yet
Ten Lessons From Three Generations Shaped Google S Tpuv4i
40 pages
AI Computing Trends - Challenges Innovations-Final
No ratings yet
AI Computing Trends - Challenges Innovations-Final
18 pages
06 From AMD Zynq US+ MPSoC_to_RFSoC_v02
No ratings yet
06 From AMD Zynq US+ MPSoC_to_RFSoC_v02
36 pages
Dell Networking
No ratings yet
Dell Networking
27 pages
Lecture01 IntroToArmBasedSoCDesign
No ratings yet
Lecture01 IntroToArmBasedSoCDesign
27 pages
Data Engineering with Scala and Spark: Build streaming and batch pipelines that process massive amounts of data using Scala
From Everand
Data Engineering with Scala and Spark: Build streaming and batch pipelines that process massive amounts of data using Scala
Eric Tome
No ratings yet
Product Overview
No ratings yet
Product Overview
17 pages
PUBLIC-ai-vsphere-vsan-with-xeon-amx-brief Final
No ratings yet
PUBLIC-ai-vsphere-vsan-with-xeon-amx-brief Final
5 pages
MPCL Brief Overview
No ratings yet
MPCL Brief Overview
5 pages
GDC AMD Ryzen Processor Software Optimization
No ratings yet
GDC AMD Ryzen Processor Software Optimization
63 pages
Oracle Modernization Solutions
From Everand
Oracle Modernization Solutions
Tom Laszewski
No ratings yet
Aula Ch1
No ratings yet
Aula Ch1
40 pages
Low Power UPF and VP
No ratings yet
Low Power UPF and VP
66 pages
Intel AI Everywhere
No ratings yet
Intel AI Everywhere
29 pages
SNUG Home Gateway Architecture Case Study
No ratings yet
SNUG Home Gateway Architecture Case Study
25 pages
Generative-AI-at-the-edge
100% (1)
Generative-AI-at-the-edge
37 pages
HC2023 Qualcomm Hexagon NPU
No ratings yet
HC2023 Qualcomm Hexagon NPU
19 pages
Administering ArcGIS for Server
From Everand
Administering ArcGIS for Server
Hussein Nasser
No ratings yet
S.R. Luthra Institute of Management
No ratings yet
S.R. Luthra Institute of Management
15 pages
Dotnet Core Interview Que Ans
100% (1)
Dotnet Core Interview Que Ans
4 pages
Nova Scotia Web
No ratings yet
Nova Scotia Web
2 pages
Historia Iglesia de Dios
No ratings yet
Historia Iglesia de Dios
36 pages
Shutdown Maintenance
No ratings yet
Shutdown Maintenance
5 pages
Open Source Forensics Tools
No ratings yet
Open Source Forensics Tools
131 pages
ACC166 Assessment 7
No ratings yet
ACC166 Assessment 7
5 pages
Senior 2 WH Lists REVISED RATE 22-2-2021
No ratings yet
Senior 2 WH Lists REVISED RATE 22-2-2021
5 pages
Características Técnicas Probador Hastings 6799
100% (1)
Características Técnicas Probador Hastings 6799
1 page
Informix HQ
No ratings yet
Informix HQ
68 pages
Curing Comparator Instability With Hysteresis: by Reza Moghimi
No ratings yet
Curing Comparator Instability With Hysteresis: by Reza Moghimi
3 pages
8051 Microcontroller Based Pick and Place Robot Major Projects in - YouTube
No ratings yet
8051 Microcontroller Based Pick and Place Robot Major Projects in - YouTube
3 pages
Gear Box
No ratings yet
Gear Box
70 pages
Cyber Crime - MIS Presentation
No ratings yet
Cyber Crime - MIS Presentation
27 pages
SEP-OPE-OHSF1-SC03-00001 Ohaji South EPF Heater Treater Installation Project - FEED SOW - A01 - Signed
No ratings yet
SEP-OPE-OHSF1-SC03-00001 Ohaji South EPF Heater Treater Installation Project - FEED SOW - A01 - Signed
36 pages
Robotic Shotcrete Applications For Mining and Tunneling: A History of Robotic Applicators
No ratings yet
Robotic Shotcrete Applications For Mining and Tunneling: A History of Robotic Applicators
6 pages
NT Unit 3
No ratings yet
NT Unit 3
11 pages
Mn-2a Cyber Defence Vidya Guess Paper Fyugp Sem-2 (23-27
No ratings yet
Mn-2a Cyber Defence Vidya Guess Paper Fyugp Sem-2 (23-27
70 pages
Access 30
No ratings yet
Access 30
198 pages
Actcut ENG
No ratings yet
Actcut ENG
4 pages
OneOcean PassageManager
100% (1)
OneOcean PassageManager
2 pages
DeepSea100 Data Sheet
No ratings yet
DeepSea100 Data Sheet
2 pages
FireFinder XLS System Basics (12!10!04) 1
0% (1)
FireFinder XLS System Basics (12!10!04) 1
28 pages
proventia_lfp-battery-systems_datasheet-2
No ratings yet
proventia_lfp-battery-systems_datasheet-2
2 pages
DKK2462 Chapter 7
No ratings yet
DKK2462 Chapter 7
15 pages

arc22-ai-creating_optimized_ai_soc_architecture-virtual-prototyping-mojin-kottarathil

Uploaded by

arc22-ai-creating_optimized_ai_soc_architecture-virtual-prototyping-mojin-kottarathil

Uploaded by

Creating Optimized AI SoC Architecture

Using Virtual Prototyping

Mojin Kottarathil, Staff Applications Engineer

• Recent advancements in embedded AI applications and architectures

Processor Summit © 2022 Synopsys, Inc. 2

• Neural Networks are getting bigger Neural

• Software is often the hardest part Neural

• Moore’s Law winds down - Domain-Specific Architectures gain AI

Processor Summit © 2022 Synopsys, Inc. 3

• Highly parallel compute drives memory requirements

Large Design Space Drives Differentiation by AI Algorithm & Architecture

Model-based Architecture Simulation RTL-based HW/SW co-verification

Processor Summit © 2022 Synopsys, Inc. 5

Hardware resource Application Near cycle accurate

Processor Summit © 2022 Synopsys, Inc. 6

Model Hardware platform Root-cause analysis

Processor Summit © 2022 Synopsys, Inc. 7

• Analytic Performance Model (APM)

Processor Summit © 2022 Synopsys, Inc. 8

• Allows for easy creation of your own Virtual Platform

• Integration of MetaWare Debugger (mdb) into PA and Virtualizer

• Accurate model of ARC STU with non-blocking FT-AXI interfaces

Processor Summit © 2022 Synopsys, Inc. 9

Processor Summit © 2022 Synopsys, Inc. 10

Processor Summit © 2022 Synopsys, Inc. 11

Processor Summit © 2022 Synopsys, Inc. 12

Platform Architect MobileNet

Processor Summit © 2022 Synopsys, Inc. 13

Processor Summit © 2022 Synopsys, Inc. 14

• Example Platform creation

Processor Summit © 2022 Synopsys, Inc. 15

We learned how to:

✓ Use ARC VPX Function Trace to

✓ Correlate Software trace with

Processor Summit © 2022 Synopsys, Inc. 17

• Performance analysis of initial result

Processor Summit © 2022 Synopsys, Inc. 18

We learned how to:

✓ Analyze activity and stall cycles of ARC AI

✓ Change bus and LPDDR5 controller

✓ Compare results from multiple runs,

Processor Summit © 2022 Synopsys, Inc. 20

Single-core sub-system Dual-core sub-system Quad-core sub-system

DesignWare LPDDR5 - Multi-port LPDDR5 Mctrl

Processor Summit © 2022 Synopsys, Inc. 21

Processor Summit © 2022 Synopsys, Inc. 22

Platform Architect MobileNet

Processor Summit © 2022 Synopsys, Inc. 24

Processor Summit © 2022 Synopsys, Inc. 25

Processor Summit © 2022 Synopsys, Inc. 26

You might also like