SlideShare a Scribd company logo
2
Most read
6
Most read
7
Most read
1
It’s Time To ROCm!
1
AMD INNOVATIONS AND SUPERCOMPUTING
AMD has been a leading innovator in high-performance systems
‒ First with 64-bit x86 architecture, HyperTransport™, integrated memory controllers, multi-core socket compatible
upgrades, SSE FMAC extensions
AMD processors have been in many leading DOE supercomputers
‒ In 2008, seven of the top ten supercomputers used AMD Opteron processors
AMD will lead again with systems incorporating advanced CPUs, GPUs, memory, and networks
2
>$50M funding
Memory,
Programming,
APUs, Network,
more
2012-2016
HBM
High Bandwidth
Memory
14 NM
Radeon RX480
June 2016
Driving Innovation Not Waiting For Miracles
Extreme
Perf/Watt
Efficacy
Radeon R9 Nano
46 GFLOP/Watt
Sept 2015
Open GPU
Computing
Platform
ROCm 1.0
April 2016
DOE Invests
in AMD
The first GPU devices to
use HBM and CoWoS
Radeon R9 Fury X,
June 2015
3
4
INTRODUCING ROCm SOFTWARE PLATFORM
Graphics Core Next Headless Linux® 64-bit
Driver
• Multi-GPU Shared Virtual Memory
• Large Memory Single Allocation
• Peer to Peer Multi-GPU
• Peer to Peer with RDMA
• Systems Management API and Tools
HSA drives rich capabilities into the ROCm
hardware and software
• User Mode Queues
• Architected Queuing Language
• Flat memory Addressing
• Atomic Memory Transactions
• Process Concurrency & Preemption
Rich Compiler Foundation for HPC Developer
• LLVM Native GCN ISA Code Generation
• Offline Compilation Support
• Standardized loader and Code Object
• GCN ISA Assembler and Disassembler
A New Fully Open Source Foundation for HPC Class GPU computing
Open Source Tools and Libraries
• Rich Set of Open Source Math Libraries
• Tuned Deep Learning Library
• Optimized Parallel Programing Frameworks
• CodeXL Profiler and GDB Debugging
5
HIP
Convert CUDA to portable C++
• Single-source Host+Kernel
• C++ Kernel Language
• C Runtime (CUDA-like)
• Platforms: AMD GPU, NVIDIA
(same perf as native CUDA)
When to use it?
• Port existing CUDA code
• Developers familiar with CUDA
• New project that needs
portability to AMD and NVIDIA
ROCm PROGRAMMING MODEL OPTIONS
HCC
True single-source C++
accelerator language
• Single-source Host+Kernel
• C++ Kernel Language
• C++ Runtime
• Platforms: AMD GPU
When to use it?
• New projects where true C++
language preferred
• Use features from latest ISO
C++ standards
OpenCL
Khronos Industry Standard
accelerator language
• Split Host/Kernel
• C99-based Kernel Language
• C Runtime
• Platforms: CPU, GPU, FPGA
When to use it?
• Port existing OpenCL code
• New project that needs
portability to CPU,GPU,FPGA
6
Where you get your “Song Sheets”
• Overview, Install, Docs, Code, Links: https://radeonopencompute.github.io/install.html
• Blogs: http://gpuopen.com/
7
WE ARE LOOKING TO BUILD OUT A WORLDWIDE
BAND
Get started today developing with ROCm - GPUOpen ROCm Getting Started http://bit.ly/1ZTlk82
Engage In the develop of ROCm @ GitHub RadeonOpenCompute
Show case your applications, libraries and tools on to ROCm via GPUOpen
How to Join
“The power of one, if fearless and focused, is formidable, but the power of many
working together is better.”
– Gloria Macapagal Arroyo
8
9
CUDA
PORTING WITH HIPIFY
hipify
Developer
Cleanup and
Tuning
95%+ Automatic Conversion
Portable
HIP C++
Developer maintains HIP port
Resulting C++ code runs on NVIDIA
(same performance) or AMD GPUs
10
HIP COMPILATION PROCESS
HIP->HC
Header
HCC
Portable HIP C++
(Kernels + HIP API)
HCC C++
(Kernels + HC)
HIP API implemented with lightweight
HIP runtime
Uses HCC’s hc::accelerator,
hc::accelerator_view,
hc::completion_future
Some calls directly into ROCR
Compute kernels mostly unchanged
Code compiled with HCC
Can use CodeXL
HIP->CUDA
Header
NVCC
CUDA
(Kernels + CUDA API)
HIP API implemented as
inlined calls to CUDA Runtime
Compute kernels mostly
unchanged
Code compiled with NVCC
(same as CUDA)
Can use nvprof, CUDA
debugger, other tools
AMDNVIDIA

More Related Content

PDF
Introduction to CUDA
Raymond Tay
 
PPTX
Nvidia (History, GPU Architecture and New Pascal Architecture)
Saksham Tanwar
 
PDF
Introduction to OpenCL
Unai Lopez-Novoa
 
PDF
Nvidia cuda tutorial_no_nda_apr08
Angela Mendoza M.
 
PDF
Introduction to GPU Programming
Chakkrit (Kla) Tantithamthavorn
 
PPTX
Linux Kernel Booting Process (1) - For NLKB
shimosawa
 
PDF
Virtualization Support in ARMv8+
Aananth C N
 
PDF
OpenCL Programming 101
Yoss Cohen
 
Introduction to CUDA
Raymond Tay
 
Nvidia (History, GPU Architecture and New Pascal Architecture)
Saksham Tanwar
 
Introduction to OpenCL
Unai Lopez-Novoa
 
Nvidia cuda tutorial_no_nda_apr08
Angela Mendoza M.
 
Introduction to GPU Programming
Chakkrit (Kla) Tantithamthavorn
 
Linux Kernel Booting Process (1) - For NLKB
shimosawa
 
Virtualization Support in ARMv8+
Aananth C N
 
OpenCL Programming 101
Yoss Cohen
 

What's hot (20)

PDF
Linux kernel modules
Eddy Reyes
 
PDF
Message Signaled Interrupts
Anshuman Biswal
 
PDF
AMD EPYC™ Microprocessor Architecture
AMD
 
PDF
Ceph Day Beijing - Ceph All-Flash Array Design Based on NUMA Architecture
Danielle Womboldt
 
PPTX
GPU Computing
Khan Mostafa
 
PPT
linux device driver
Rahul Batra
 
PDF
Storage tiering and erasure coding in Ceph (SCaLE13x)
Sage Weil
 
PDF
Cuda tutorial
Mahesh Khadatare
 
PDF
DPDK in Containers Hands-on Lab
Michelle Holley
 
PDF
LAS16-200: SCMI - System Management and Control Interface
Linaro
 
PPTX
Your 1st Ceph cluster
Mirantis
 
PPTX
NGINX: Basics & Best Practices - EMEA Broadcast
NGINX, Inc.
 
PPTX
Ceph and Openstack in a Nutshell
Karan Singh
 
PDF
Introduction to OpenCL, 2010
Tomasz Bednarz
 
ODP
An Introduction to PC-Lint
Ralf Holly
 
PPTX
Bootloaders (U-Boot)
Omkar Rane
 
PDF
Security Monitoring with eBPF
Alex Maestretti
 
PPTX
Hands on OpenCL
Vladimir Starostenkov
 
Linux kernel modules
Eddy Reyes
 
Message Signaled Interrupts
Anshuman Biswal
 
AMD EPYC™ Microprocessor Architecture
AMD
 
Ceph Day Beijing - Ceph All-Flash Array Design Based on NUMA Architecture
Danielle Womboldt
 
GPU Computing
Khan Mostafa
 
linux device driver
Rahul Batra
 
Storage tiering and erasure coding in Ceph (SCaLE13x)
Sage Weil
 
Cuda tutorial
Mahesh Khadatare
 
DPDK in Containers Hands-on Lab
Michelle Holley
 
LAS16-200: SCMI - System Management and Control Interface
Linaro
 
Your 1st Ceph cluster
Mirantis
 
NGINX: Basics & Best Practices - EMEA Broadcast
NGINX, Inc.
 
Ceph and Openstack in a Nutshell
Karan Singh
 
Introduction to OpenCL, 2010
Tomasz Bednarz
 
An Introduction to PC-Lint
Ralf Holly
 
Bootloaders (U-Boot)
Omkar Rane
 
Security Monitoring with eBPF
Alex Maestretti
 
Hands on OpenCL
Vladimir Starostenkov
 
Ad

Viewers also liked (20)

PDF
Modern Computing: Cloud, Distributed, & High Performance
inside-BigData.com
 
PDF
Introduction to GPUs in HPC
inside-BigData.com
 
PDF
AMD It's Time to ROC
inside-BigData.com
 
PPTX
SC16 Student Cluster Competition Configurations & Results
inside-BigData.com
 
PPTX
EMC in HPC – The Journey so far and the Road Ahead
inside-BigData.com
 
PPTX
Maximizing HPC Compute Resources with Minimal Cost
inside-BigData.com
 
PDF
Business Wizard Of The Year : Mr. AMAR BABU,M.D.-LENOVO INDIA
VARINDIA
 
PDF
Bringing HPC Algorithms to Big Data Platforms: Spark Summit East talk by Niko...
Spark Summit
 
PDF
Nvidia SC16: The Greatest Challenges Can't Wait
inside-BigData.com
 
PDF
Best Practices: Large Scale Multiphysics
inside-BigData.com
 
PDF
Content marketing in the B2B customer journey
Headline.nl
 
PDF
IDC España Predictions 2014
Lluis Altes
 
PDF
Intersect360 Top of All Things in HPC Snapshot Analysis
inside-BigData.com
 
PPTX
IDC Perspectives on Big Data Outside of HPC
inside-BigData.com
 
PDF
Idc predictions 2016
Laura Torrejón
 
PDF
Hot Technology Topics in 2017
inside-BigData.com
 
PDF
Towards Exascale Computing with Fortran 2015
inside-BigData.com
 
PPT
Conflictmanagement
amit singh
 
PDF
IDC HPC Market Update
inside-BigData.com
 
PDF
Don't Fall Into a Trap: How Business Continuity Management Can Help Data Brea...
IBM Services
 
Modern Computing: Cloud, Distributed, & High Performance
inside-BigData.com
 
Introduction to GPUs in HPC
inside-BigData.com
 
AMD It's Time to ROC
inside-BigData.com
 
SC16 Student Cluster Competition Configurations & Results
inside-BigData.com
 
EMC in HPC – The Journey so far and the Road Ahead
inside-BigData.com
 
Maximizing HPC Compute Resources with Minimal Cost
inside-BigData.com
 
Business Wizard Of The Year : Mr. AMAR BABU,M.D.-LENOVO INDIA
VARINDIA
 
Bringing HPC Algorithms to Big Data Platforms: Spark Summit East talk by Niko...
Spark Summit
 
Nvidia SC16: The Greatest Challenges Can't Wait
inside-BigData.com
 
Best Practices: Large Scale Multiphysics
inside-BigData.com
 
Content marketing in the B2B customer journey
Headline.nl
 
IDC España Predictions 2014
Lluis Altes
 
Intersect360 Top of All Things in HPC Snapshot Analysis
inside-BigData.com
 
IDC Perspectives on Big Data Outside of HPC
inside-BigData.com
 
Idc predictions 2016
Laura Torrejón
 
Hot Technology Topics in 2017
inside-BigData.com
 
Towards Exascale Computing with Fortran 2015
inside-BigData.com
 
Conflictmanagement
amit singh
 
IDC HPC Market Update
inside-BigData.com
 
Don't Fall Into a Trap: How Business Continuity Management Can Help Data Brea...
IBM Services
 
Ad

Similar to It's Time to ROCm! (20)

PPTX
Computação acelerada – a era das ap us roberto brandão, ciência
Campus Party Brasil
 
PDF
LCU13: GPGPU on ARM Experience Report
Linaro
 
PPTX
Ceph on 64-bit ARM with X-Gene
Ceph Community
 
PPTX
Heterogeneous computing
Rashid Ansari
 
PPTX
Amd accelerated computing -ufrj
Roberto Brandao
 
PDF
Exploring the Programming Models for the LUMI Supercomputer
George Markomanolis
 
PDF
HKG15: Opening Keynote - George Grey, Linaro CEO
Linaro
 
PPTX
Debugging Numerical Simulations on Accelerated Architectures - TotalView fo...
Rogue Wave Software
 
PDF
Arm - ceph on arm update
inwin stack
 
PDF
"Making Computer Vision Software Run Fast on Your Embedded Platform," a Prese...
Edge AI and Vision Alliance
 
PPTX
Debugging CUDA applications
Rogue Wave Software
 
PDF
Ken Liao, Senior Associate VP, Faraday
chiportal
 
ODP
Armbian linux
igorpecovnik
 
PPTX
Introduction to HPC & Supercomputing in AI
Tyrone Systems
 
PDF
OpenPOWER Acceleration of HPCC Systems
HPCC Systems
 
PDF
Demystify OpenPOWER
Anand Haridass
 
PDF
Utilizing AMD GPUs: Tuning, programming models, and roadmap
George Markomanolis
 
PDF
Power overview 2018 08-13b
Ganesan Narayanasamy
 
PDF
Accelerate Big Data Processing with High-Performance Computing Technologies
Intel® Software
 
PPTX
Modular by Design: Supermicro’s New Standards-Based Universal GPU Server
Rebekah Rodriguez
 
Computação acelerada – a era das ap us roberto brandão, ciência
Campus Party Brasil
 
LCU13: GPGPU on ARM Experience Report
Linaro
 
Ceph on 64-bit ARM with X-Gene
Ceph Community
 
Heterogeneous computing
Rashid Ansari
 
Amd accelerated computing -ufrj
Roberto Brandao
 
Exploring the Programming Models for the LUMI Supercomputer
George Markomanolis
 
HKG15: Opening Keynote - George Grey, Linaro CEO
Linaro
 
Debugging Numerical Simulations on Accelerated Architectures - TotalView fo...
Rogue Wave Software
 
Arm - ceph on arm update
inwin stack
 
"Making Computer Vision Software Run Fast on Your Embedded Platform," a Prese...
Edge AI and Vision Alliance
 
Debugging CUDA applications
Rogue Wave Software
 
Ken Liao, Senior Associate VP, Faraday
chiportal
 
Armbian linux
igorpecovnik
 
Introduction to HPC & Supercomputing in AI
Tyrone Systems
 
OpenPOWER Acceleration of HPCC Systems
HPCC Systems
 
Demystify OpenPOWER
Anand Haridass
 
Utilizing AMD GPUs: Tuning, programming models, and roadmap
George Markomanolis
 
Power overview 2018 08-13b
Ganesan Narayanasamy
 
Accelerate Big Data Processing with High-Performance Computing Technologies
Intel® Software
 
Modular by Design: Supermicro’s New Standards-Based Universal GPU Server
Rebekah Rodriguez
 

More from inside-BigData.com (20)

PDF
Major Market Shifts in IT
inside-BigData.com
 
PDF
Preparing to program Aurora at Exascale - Early experiences and future direct...
inside-BigData.com
 
PPTX
Transforming Private 5G Networks
inside-BigData.com
 
PDF
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
inside-BigData.com
 
PDF
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
inside-BigData.com
 
PDF
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
inside-BigData.com
 
PDF
HPC Impact: EDA Telemetry Neural Networks
inside-BigData.com
 
PDF
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
inside-BigData.com
 
PDF
Machine Learning for Weather Forecasts
inside-BigData.com
 
PPTX
HPC AI Advisory Council Update
inside-BigData.com
 
PDF
Fugaku Supercomputer joins fight against COVID-19
inside-BigData.com
 
PDF
Energy Efficient Computing using Dynamic Tuning
inside-BigData.com
 
PDF
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
inside-BigData.com
 
PDF
State of ARM-based HPC
inside-BigData.com
 
PDF
Versal Premium ACAP for Network and Cloud Acceleration
inside-BigData.com
 
PDF
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
inside-BigData.com
 
PDF
Scaling TCO in a Post Moore's Era
inside-BigData.com
 
PDF
CUDA-Python and RAPIDS for blazing fast scientific computing
inside-BigData.com
 
PDF
Introducing HPC with a Raspberry Pi Cluster
inside-BigData.com
 
PDF
Overview of HPC Interconnects
inside-BigData.com
 
Major Market Shifts in IT
inside-BigData.com
 
Preparing to program Aurora at Exascale - Early experiences and future direct...
inside-BigData.com
 
Transforming Private 5G Networks
inside-BigData.com
 
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
inside-BigData.com
 
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
inside-BigData.com
 
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
inside-BigData.com
 
HPC Impact: EDA Telemetry Neural Networks
inside-BigData.com
 
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
inside-BigData.com
 
Machine Learning for Weather Forecasts
inside-BigData.com
 
HPC AI Advisory Council Update
inside-BigData.com
 
Fugaku Supercomputer joins fight against COVID-19
inside-BigData.com
 
Energy Efficient Computing using Dynamic Tuning
inside-BigData.com
 
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
inside-BigData.com
 
State of ARM-based HPC
inside-BigData.com
 
Versal Premium ACAP for Network and Cloud Acceleration
inside-BigData.com
 
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
inside-BigData.com
 
Scaling TCO in a Post Moore's Era
inside-BigData.com
 
CUDA-Python and RAPIDS for blazing fast scientific computing
inside-BigData.com
 
Introducing HPC with a Raspberry Pi Cluster
inside-BigData.com
 
Overview of HPC Interconnects
inside-BigData.com
 

Recently uploaded (20)

PDF
Accelerating Oracle Database 23ai Troubleshooting with Oracle AHF Fleet Insig...
Sandesh Rao
 
PDF
Doc9.....................................
SofiaCollazos
 
PDF
A Strategic Analysis of the MVNO Wave in Emerging Markets.pdf
IPLOOK Networks
 
PDF
Advances in Ultra High Voltage (UHV) Transmission and Distribution Systems.pdf
Nabajyoti Banik
 
PDF
Trying to figure out MCP by actually building an app from scratch with open s...
Julien SIMON
 
PDF
NewMind AI Weekly Chronicles - July'25 - Week IV
NewMind AI
 
PDF
Architecture of the Future (09152021)
EdwardMeyman
 
PPTX
The-Ethical-Hackers-Imperative-Safeguarding-the-Digital-Frontier.pptx
sujalchauhan1305
 
PDF
The Evolution of KM Roles (Presented at Knowledge Summit Dublin 2025)
Enterprise Knowledge
 
PDF
Presentation about Hardware and Software in Computer
snehamodhawadiya
 
PPTX
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
PDF
Software Development Company | KodekX
KodekX
 
PDF
Using Anchore and DefectDojo to Stand Up Your DevSecOps Function
Anchore
 
PDF
CIFDAQ's Market Wrap : Bears Back in Control?
CIFDAQ
 
PPTX
ChatGPT's Deck on The Enduring Legacy of Fax Machines
Greg Swan
 
PDF
Software Development Methodologies in 2025
KodekX
 
PDF
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
Neo4j
 
PDF
Research-Fundamentals-and-Topic-Development.pdf
ayesha butalia
 
PDF
Google I/O Extended 2025 Baku - all ppts
HusseinMalikMammadli
 
PDF
AI Unleashed - Shaping the Future -Starting Today - AIOUG Yatra 2025 - For Co...
Sandesh Rao
 
Accelerating Oracle Database 23ai Troubleshooting with Oracle AHF Fleet Insig...
Sandesh Rao
 
Doc9.....................................
SofiaCollazos
 
A Strategic Analysis of the MVNO Wave in Emerging Markets.pdf
IPLOOK Networks
 
Advances in Ultra High Voltage (UHV) Transmission and Distribution Systems.pdf
Nabajyoti Banik
 
Trying to figure out MCP by actually building an app from scratch with open s...
Julien SIMON
 
NewMind AI Weekly Chronicles - July'25 - Week IV
NewMind AI
 
Architecture of the Future (09152021)
EdwardMeyman
 
The-Ethical-Hackers-Imperative-Safeguarding-the-Digital-Frontier.pptx
sujalchauhan1305
 
The Evolution of KM Roles (Presented at Knowledge Summit Dublin 2025)
Enterprise Knowledge
 
Presentation about Hardware and Software in Computer
snehamodhawadiya
 
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
Software Development Company | KodekX
KodekX
 
Using Anchore and DefectDojo to Stand Up Your DevSecOps Function
Anchore
 
CIFDAQ's Market Wrap : Bears Back in Control?
CIFDAQ
 
ChatGPT's Deck on The Enduring Legacy of Fax Machines
Greg Swan
 
Software Development Methodologies in 2025
KodekX
 
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
Neo4j
 
Research-Fundamentals-and-Topic-Development.pdf
ayesha butalia
 
Google I/O Extended 2025 Baku - all ppts
HusseinMalikMammadli
 
AI Unleashed - Shaping the Future -Starting Today - AIOUG Yatra 2025 - For Co...
Sandesh Rao
 

It's Time to ROCm!

  • 2. 1 AMD INNOVATIONS AND SUPERCOMPUTING AMD has been a leading innovator in high-performance systems ‒ First with 64-bit x86 architecture, HyperTransport™, integrated memory controllers, multi-core socket compatible upgrades, SSE FMAC extensions AMD processors have been in many leading DOE supercomputers ‒ In 2008, seven of the top ten supercomputers used AMD Opteron processors AMD will lead again with systems incorporating advanced CPUs, GPUs, memory, and networks
  • 3. 2 >$50M funding Memory, Programming, APUs, Network, more 2012-2016 HBM High Bandwidth Memory 14 NM Radeon RX480 June 2016 Driving Innovation Not Waiting For Miracles Extreme Perf/Watt Efficacy Radeon R9 Nano 46 GFLOP/Watt Sept 2015 Open GPU Computing Platform ROCm 1.0 April 2016 DOE Invests in AMD The first GPU devices to use HBM and CoWoS Radeon R9 Fury X, June 2015
  • 4. 3
  • 5. 4 INTRODUCING ROCm SOFTWARE PLATFORM Graphics Core Next Headless Linux® 64-bit Driver • Multi-GPU Shared Virtual Memory • Large Memory Single Allocation • Peer to Peer Multi-GPU • Peer to Peer with RDMA • Systems Management API and Tools HSA drives rich capabilities into the ROCm hardware and software • User Mode Queues • Architected Queuing Language • Flat memory Addressing • Atomic Memory Transactions • Process Concurrency & Preemption Rich Compiler Foundation for HPC Developer • LLVM Native GCN ISA Code Generation • Offline Compilation Support • Standardized loader and Code Object • GCN ISA Assembler and Disassembler A New Fully Open Source Foundation for HPC Class GPU computing Open Source Tools and Libraries • Rich Set of Open Source Math Libraries • Tuned Deep Learning Library • Optimized Parallel Programing Frameworks • CodeXL Profiler and GDB Debugging
  • 6. 5 HIP Convert CUDA to portable C++ • Single-source Host+Kernel • C++ Kernel Language • C Runtime (CUDA-like) • Platforms: AMD GPU, NVIDIA (same perf as native CUDA) When to use it? • Port existing CUDA code • Developers familiar with CUDA • New project that needs portability to AMD and NVIDIA ROCm PROGRAMMING MODEL OPTIONS HCC True single-source C++ accelerator language • Single-source Host+Kernel • C++ Kernel Language • C++ Runtime • Platforms: AMD GPU When to use it? • New projects where true C++ language preferred • Use features from latest ISO C++ standards OpenCL Khronos Industry Standard accelerator language • Split Host/Kernel • C99-based Kernel Language • C Runtime • Platforms: CPU, GPU, FPGA When to use it? • Port existing OpenCL code • New project that needs portability to CPU,GPU,FPGA
  • 7. 6 Where you get your “Song Sheets” • Overview, Install, Docs, Code, Links: https://radeonopencompute.github.io/install.html • Blogs: http://gpuopen.com/
  • 8. 7 WE ARE LOOKING TO BUILD OUT A WORLDWIDE BAND Get started today developing with ROCm - GPUOpen ROCm Getting Started http://bit.ly/1ZTlk82 Engage In the develop of ROCm @ GitHub RadeonOpenCompute Show case your applications, libraries and tools on to ROCm via GPUOpen How to Join “The power of one, if fearless and focused, is formidable, but the power of many working together is better.” – Gloria Macapagal Arroyo
  • 9. 8
  • 10. 9 CUDA PORTING WITH HIPIFY hipify Developer Cleanup and Tuning 95%+ Automatic Conversion Portable HIP C++ Developer maintains HIP port Resulting C++ code runs on NVIDIA (same performance) or AMD GPUs
  • 11. 10 HIP COMPILATION PROCESS HIP->HC Header HCC Portable HIP C++ (Kernels + HIP API) HCC C++ (Kernels + HC) HIP API implemented with lightweight HIP runtime Uses HCC’s hc::accelerator, hc::accelerator_view, hc::completion_future Some calls directly into ROCR Compute kernels mostly unchanged Code compiled with HCC Can use CodeXL HIP->CUDA Header NVCC CUDA (Kernels + CUDA API) HIP API implemented as inlined calls to CUDA Runtime Compute kernels mostly unchanged Code compiled with NVCC (same as CUDA) Can use nvprof, CUDA debugger, other tools AMDNVIDIA