
Introduction to HPC and Current Usage in HEP


Rui Wang
Argonne National Laboratory
Monday, 20 May, 2024
What is High Performance Computing (HPC)?

● High Performance Computing (HPC) refers to technology that combines computing power to process data and perform complex calculations at high speeds
– Parallel processing techniques and advanced algorithms are used to solve complex computational problems efficiently and rapidly
– A cluster of powerful computing nodes, e.g. a supercomputer
● HPC systems are designed to handle large-scale simulations, data analysis tasks, and modeling challenges

2
Basics of HPC Architecture
● Compute nodes, interconnects, storage systems, and a software stack
optimized for high performance

(Figure: node architecture with AMD Instinct MI250X accelerators, OLCF Frontier)

3
Cluster management and job scheduling
● Allocate computational resources and manage job execution on HPC clusters
– Slurm (Simple Linux Utility for Resource Management)
i. Highly scalable cluster management and job scheduling system. Allocates resources based on user-defined policies and manages job execution on the cluster
– PBS (Portable Batch System)
i. Cluster and resource management software suite used for managing and scheduling jobs on high-performance computing (HPC) clusters

A two-rank MPI job which utilizes 2 physical cores (and 4 hyperthreads) of a Perlmutter CPU node:

#!/bin/bash
#SBATCH --qos=shared
#SBATCH --constraint=cpu
#SBATCH --time=5
#SBATCH --nodes=1
#SBATCH --ntasks=2
#SBATCH --cpus-per-task=2
srun --cpu-bind=cores ./a.out

A 30 min, 1 node interactive job on Aurora:

qsub -l select=1 -l walltime=30:00 -A [your_ProjectName] -q EarlyAppAccess -I

Recommended PBSPro options:

#!/bin/sh
#PBS -A [your_ProjectName]
#PBS -N
#PBS -l walltime=[requested_walltime_value]
#PBS -k doe
#PBS -l place=scatter
#PBS -q EarlyAppAccess

4
Parallel Processing

● To solve a problem faster, the work is often split into pieces and executed in parallel
● Embarrassingly Parallel
– Many copies of the same task, each with different input parameters (see the sketch below)

https://researchcomputing.princeton.edu/support/knowledge-base/parallel-code
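A minimal sketch of the embarrassingly parallel pattern, assuming the copies are launched as a Slurm job array so that each copy reads its index from the SLURM_ARRAY_TASK_ID environment variable; the program, parameter values, and submission script name are illustrative, not taken from the slides.

// Embarrassingly parallel sketch: every array task runs the same executable
// and selects its own input parameter from SLURM_ARRAY_TASK_ID, so the
// copies never need to communicate with each other.
#include <cstdlib>
#include <iostream>
#include <vector>

int main() {
    // Hypothetical parameter scan: one value per array task
    const std::vector<double> couplings = {0.1, 0.2, 0.5, 1.0};

    const char* id = std::getenv("SLURM_ARRAY_TASK_ID");
    const std::size_t task = id ? std::strtoul(id, nullptr, 10) : 0;
    if (task >= couplings.size()) {
        std::cerr << "Task index " << task << " has no parameter assigned\n";
        return 1;
    }

    // Each copy processes its own parameter independently
    std::cout << "Task " << task << " running with coupling "
              << couplings[task] << "\n";
    return 0;
}

Submitted with, e.g., sbatch --array=0-3 run_scan.sh, where run_scan.sh is a hypothetical wrapper script that simply launches this executable.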

5
Parallel Processing
● Shared-Memory Parallelism (Multithreading)
– Tasks are run as threads on separate CPU-cores of the same computer
– Methods: OpenMP, POSIX Threads (pthreads), vector intrinsics (Intel MKL), C++ Parallel STL (Intel TBB), etc.; a C++ OpenMP version is also sketched below

program hello_world_multithreaded
use omp_lib

!$omp parallel
write(*,*) "Hello from process ", omp_get_thread_num(), &
           " of ", omp_get_num_threads()
!$omp end parallel

end program

https://researchcomputing.princeton.edu/support/knowledge-base/parallel-code
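For comparison, a minimal C++ version of the same multithreaded hello world, assuming OpenMP support is enabled at compile time (e.g. with -fopenmp); this is an illustrative sketch, not code from the slides.

// OpenMP hello world in C++: each thread in the parallel region prints its
// own thread ID and the total number of threads.
#include <cstdio>
#include <omp.h>

int main() {
    #pragma omp parallel
    {
        // printf emits each line in a single call, so lines are not interleaved mid-line
        std::printf("Hello from thread %d of %d\n",
                    omp_get_thread_num(), omp_get_num_threads());
    }
    return 0;
}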

6
Parallel Processing
● Distributed-Memory Parallelism (Multiprocessing)
– Running tasks as multiple processes that do not share the same memory space
– Methods: MPI (Message Passing Interface); Spark/Hadoop, Dask, general multiprocessing, etc. for Machine Learning

#include <iostream>
#include <mpi.h>

int main(int argc, char** argv) {
    using namespace std;

    MPI_Init(&argc, &argv);

    int world_size, world_rank;
    MPI_Comm_size(MPI_COMM_WORLD, &world_size);
    MPI_Comm_rank(MPI_COMM_WORLD, &world_rank);

    // Get the name of the processor
    char processor_name[MPI_MAX_PROCESSOR_NAME];
    int name_len;
    MPI_Get_processor_name(processor_name, &name_len);

    // Print off a hello world message
    cout << "Process " << world_rank << " of " << world_size
         << " says hello from " << processor_name << endl;

    // uncomment next line to make CPU-cores work (infinitely)
    // while (true) {};

    MPI_Finalize();
    return 0;
}

https://researchcomputing.princeton.edu/support/knowledge-base/parallel-code

7
NERSC Perlmutter @ LBNL (14th on the Top500)

CPU (3072 nodes)
● 2x AMD EPYC 7763 (Milan) CPU
● 64 cores/CPU
● Peak FP64: 7.7 PFLOPS
● 1x HPE Slingshot 11
● 200G (25 GB/s) bandwidth

GPU (1536 40GB + 256 80GB nodes)
● 4x NVIDIA A100 (Ampere) GPU
● Peak FP64: 59.9 PFLOPS, 119.8 PFLOPS (tensor)
● 1x AMD EPYC 7763 (Milan) CPU
● Peak FP64: 3.9 PFLOPS
● 1x HPE Slingshot 11
● 200G (25 GB/s) bandwidth

Floating point operations per second (FLOPS):
teraFLOPS (TFLOPS) = 10¹²
petaFLOPS (PFLOPS) = 10¹⁵
exaFLOPS (EFLOPS) = 10¹⁸

8
Vega @ IZUM, Slovenia
The first EuroHPC petascale system in Europe
Peak performance: 10.1 petaflops

9
TACC Frontera @ UTexas (33rd on the Top500)

CPU (8,008 nodes)

Intel Xeon Platinum 8280 ("Cascade Lake")


● 28 cores/socket, 56 cores/node

● Clock rate: 2.7 GHz

● Peak performance: 4.8 TFLOPS

GPU (90 nodes)

4 NVIDIA Quadro RTX 5000/node


● 3072 CUDA Parallel Processing Cores/card

● 384 NVIDIA Tensor Cores/card

2 Intel Xeon E5-2620 v4 (“Broadwell”)/node

Mellanox InfiniBand HDR: Full HDR (200 Gb/s); HDR100 (100 Gb/s)

10
ALCF Aurora @ Argonne National Lab (2nd on the Top500)

10,624 nodes

May 13, 2024: 1.012 ExaFLOPS

● 8 HPE Slingshot-11 NICs
● DAOS – high-performance storage system for storing checkpoints and analysis files

11
OLCF Frontier @ Oak Ridge National Lab (1st on the Top500)

America’s first exascale system

12
Computing Challenge in High Energy Physics

● HEP experiments, such as the experiments at the Large Hadron Collider (LHC) at CERN, produce an enormous amount of data with each collision event

(Figure: data volumes compared on the same scale; Albrecht et al., 2019)

13
Computing Challenge in High Energy Physics
● Large-scale Monte Carlo (MC) Simulations
● Data Analysis from Particle Detectors
● Complex Computational Models (e.g., Lattice QCD)

CMS Offline and Computing Public Results & ATLAS Software and Computing HL-LHC Roadmap

14
HPC usage – CMS & ATLAS (Perlmutter, TACC, Vega)

● CPU @ Perlmutter: MC Simulation
● GPU @ Perlmutter: R&D / ML (e.g. TrackML)

15
HPC usage – Cosmological N-Body simulation
● Evolution of matter distribution over cosmic time for a sub-volume of a HACC simulation; FarPoint: https://arxiv.org/abs/2109.01956
● Adiabatic hydro simulation on Sunspot (Aurora TDS)
● Summit (GPU) @ OLCF; Aurora @ ALCF

https://www.exascaleproject.org/event/hacc/

16
Challenges and Future Directions
● HEP Computing Resource Challenges
– HL-LHC: 10X data, 10X complexity
– In 2030, LHC experiments will need
i. > O(100) PFlops in sustained compute performance
ii. O(10) Exabyte/year data throughput
● HPC Challenges
– Ability to run multiple HEP workflows at O(10) PFlops sustained on multiple heterogeneous exascale systems
– Match HEP I/O requirements to HPC file systems and networking infrastructure
– Address realtime to near-realtime applications and resilience challenges
● HEP Long-Term Involvement in HPC
– Chosen platform for the Cosmic Frontier (CMB-S4, DESI, LSST DESC, LZ, etc.)
– LHC experiments among top users of NERSC
– DUNE (neutrino experiment) has identified HPC resource utilization as a major component of its computing strategy

(Figures: Distributed Heterogeneous Computing with PanDA in ATLAS, T. Maeno CHEP23 talk; Evolution of ASCR and HEP computing resources)

https://indico.fnal.gov/event/61971/contributions/281095/attachments/173755/235361/HEP-CCE_intro_dec23_AHM%5B1%5D.pdf
17
HEP-CCE project
(Diagram: HEP-CCE at the intersection of Advanced Scientific Computing Research (ASCR) and High Energy Physics (HEP) research and facilities, the Computational HEP program, and ASCR program cross-cuts)

HEP-CCE is a joint effort across the participating laboratories to bring large-scale computational and data management resources to bear on pressing HEP science problems
● Focus on enabling cross-cutting solutions leveraging ASCR and HEP expertise and resources
● Integrate individual strengths and resources

https://indico.fnal.gov/event/61971/contributions/281095/attachments/173755/235361/HEP-CCE_intro_dec23_AHM%5B1%5D.pdf
18
Tasks identified
● Ability for HEP workflows to exploit concurrency at the node level of HPC platforms
– Natural event-level parallelism is not enough
● Multiple compute platforms as a portability challenge
– Complex HEP code needs to run on grid/cloud/HPC systems in production
● I/O needs to scale to thousands of HPC nodes
– Not just for data but also for software libraries, databases, configurations (~100K files)
● Ability to run complex HEP workflows robustly on HPC systems
– Resilience challenge
● Mixed CPU/GPU pipelines will become more common as AI/ML methods are
increasingly incorporated
– HPC facilities offer unique large-scale AI/ML training opportunities

https://indico.fnal.gov/event/61971/contributions/281095/attachments/173755/235361/HEP-CCE_intro_dec23_AHM%5B1%5D.pdf

19
Portable Applications to Portable Workflows
● Address needs of large and small HEP workflows
● Hybrid CPU/GPU application support
● Current and promised future support of hardware
● Long-term sustainability, code stability, and performance

FastCaloSim: ATLAS parameterized LAr calorimeter simulation
Patatrack: CMS pixel detector reconstruction
Wirecell Toolkit: LArTPC signal simulation
p2r: CMS "propagate-to-R" track reconstruction

(Figures: performance comparisons from 2019 and 2023)

Charles Leggett's summary on PPS


20
Optimizing Data Storage
● HDF5 as intermediate event storage for HPC
– (Proto)DUNE (will) write its raw data in HDF5 format
● Identify I/O & storage bottlenecks of HEP applications
● Data management, data reduction/compression
● Optimized data delivery
– Parallel HDF5
– ROOT RNTuple & object storage (DAOS) with Darshan
– I/O activities related to ROOT files (ATLAS derivation)

(Figure: preliminary TTree vs RNTuple comparison)

Summary on PPS
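To illustrate the parallel HDF5 idea, here is a minimal sketch in which each MPI rank writes its own slice of one shared dataset through the MPI-IO driver; it assumes an MPI-enabled HDF5 build, and the file name, dataset name, and event counts are hypothetical rather than taken from DUNE or ATLAS code.

// Parallel HDF5 sketch: all ranks open one file collectively and each rank
// writes its own hyperslab of a shared 1-D dataset using collective I/O.
#include <hdf5.h>
#include <mpi.h>
#include <vector>

int main(int argc, char** argv) {
    MPI_Init(&argc, &argv);
    int rank, size;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);

    const hsize_t events_per_rank = 1000;                 // illustrative size
    std::vector<double> energies(events_per_rank, 1.0 * rank);

    // File access property list: use the MPI-IO driver
    hid_t fapl = H5Pcreate(H5P_FILE_ACCESS);
    H5Pset_fapl_mpio(fapl, MPI_COMM_WORLD, MPI_INFO_NULL);
    hid_t file = H5Fcreate("events.h5", H5F_ACC_TRUNC, H5P_DEFAULT, fapl);

    // One shared dataset covering the events of all ranks
    hsize_t dims[1] = {events_per_rank * static_cast<hsize_t>(size)};
    hid_t filespace = H5Screate_simple(1, dims, nullptr);
    hid_t dset = H5Dcreate2(file, "energies", H5T_NATIVE_DOUBLE, filespace,
                            H5P_DEFAULT, H5P_DEFAULT, H5P_DEFAULT);

    // Select this rank's hyperslab of the dataset
    hsize_t offset[1] = {events_per_rank * static_cast<hsize_t>(rank)};
    hsize_t count[1]  = {events_per_rank};
    H5Sselect_hyperslab(filespace, H5S_SELECT_SET, offset, nullptr, count, nullptr);
    hid_t memspace = H5Screate_simple(1, count, nullptr);

    // Collective write: all ranks take part in one I/O operation
    hid_t dxpl = H5Pcreate(H5P_DATASET_XFER);
    H5Pset_dxpl_mpio(dxpl, H5FD_MPIO_COLLECTIVE);
    H5Dwrite(dset, H5T_NATIVE_DOUBLE, memspace, filespace, dxpl, energies.data());

    // Release handles and shut down
    H5Pclose(dxpl); H5Sclose(memspace); H5Sclose(filespace);
    H5Dclose(dset); H5Fclose(file); H5Pclose(fapl);
    MPI_Finalize();
    return 0;
}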
21
Accelerating HEP Simulation
● Event generators for accelerated systems
– MadGraph (SYCL): portability and scalability (up to 500 nodes)
– Sherpa (CUDA, Kokkos): portability and scalability
● Celeritas: new MC particle transport code designed for high-performance simulation of complex HEP detectors on GPU-accelerated hardware
– Geometry modeling; CMS (CMSSW)
● Optical photon simulation
– Optical photon shower (JUNO neutrino detector); S.C. Blyth, "Opticks: GPU Optical Photon Simulation for Particle Physics with NVIDIA OptiX", CHEP 2018

Stefan Hoeche & Taylor Childers's summary on Event Generators


22
Scaling up HEP AI/ML Applications

● Hyperparameter scan (e.g. particle-flow HPO)
● Inference as a Service (IaaS)
● Large problems

ML@NERSC 2022 survey; Paolo Calafiura's talk on HEP-CCE ML scaling

23
Complex workflow
● Characterizes workflows across various dimensions
● Captures software and system use
● Identifies challenges for each experiment

Kyle Chard's summary on CW

24
Summary
● HPC plays a vital role in HEP research, enabling large-scale simulations, data analysis,
and theoretical calculations.
● Current applications include Monte Carlo simulations, data analysis from particle detectors, and event generation
– Providing excellent opportunities for utilizing large resources and acquiring advanced expertise
● Addressing challenges and leveraging advancements in HPC for groundbreaking discoveries in High Energy Physics

(Diagram: this intro covers the overlap of HPC and HEP)

25
Q&A

26
Backup

27
HPC system statistics

28
US HPC facilities
Two distinct types of HPC facilities in the US
▪ DOE Leadership-class Facilities (LCFs)
– Argonne National Lab, Oak Ridge National Lab
– Focused on FLOPS, GPU heavy
– Prefer jobs that can fill the entire machine
– Very locked down network - absolutely no WAN file transfer from the compute nodes

▪ User-focused facilities
– TACC, NERSC and so on
– Usually a mix of GPU-focused and CPU-focused machines
– WAN data transfer often allowed in jobs, but using data transfer nodes (DTNs) is preferred

29
HPC usage – Lattice QCD
Perlmutter benchmarking example

● Generation: lattices are propagated until they sample an equilibrium distribution of lattice configurations. Due to long equilibration and decorrelation times, lattice generation typically uses checkpoint-restart methods to split the generation stages into multiple jobs (see the sketch after this list).
● Spectrum: the generation stages continue after the lattice distribution has equilibrated, and a subset of the lattices (that have been written to disk) are periodically sampled to be analyzed by spectrum stages
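A minimal sketch of the checkpoint-restart pattern described above, assuming a generic Monte Carlo update loop; the state layout, file name, and sweep counts are illustrative and not taken from any real lattice QCD code.

// Checkpoint-restart sketch: the long generation run is split across many
// batch jobs, each one resuming from the most recently saved configuration.
#include <fstream>
#include <iostream>
#include <string>
#include <vector>

struct Lattice {
    std::vector<double> links;   // placeholder for the gauge configuration
    long sweep = 0;              // update sweeps performed so far
};

bool load_checkpoint(Lattice& lat, const std::string& path) {
    std::ifstream in(path, std::ios::binary);
    if (!in) return false;       // first job in the chain: nothing to resume
    in.read(reinterpret_cast<char*>(&lat.sweep), sizeof lat.sweep);
    in.read(reinterpret_cast<char*>(lat.links.data()),
            lat.links.size() * sizeof(double));
    return bool(in);
}

void save_checkpoint(const Lattice& lat, const std::string& path) {
    std::ofstream out(path, std::ios::binary | std::ios::trunc);
    out.write(reinterpret_cast<const char*>(&lat.sweep), sizeof lat.sweep);
    out.write(reinterpret_cast<const char*>(lat.links.data()),
              lat.links.size() * sizeof(double));
}

void update(Lattice& lat) { ++lat.sweep; /* one Monte Carlo sweep (stub) */ }

int main() {
    const std::string ckpt = "lattice.ckpt";   // hypothetical checkpoint file
    const long sweeps_per_job = 100;           // work that fits one job's walltime

    Lattice lat;
    lat.links.assign(1000, 0.0);               // illustrative lattice size
    if (load_checkpoint(lat, ckpt))
        std::cout << "Resuming from sweep " << lat.sweep << "\n";

    for (long i = 0; i < sweeps_per_job; ++i) update(lat);

    save_checkpoint(lat, ckpt);                // the next job picks up from here
    std::cout << "Stopped at sweep " << lat.sweep << "\n";
    return 0;
}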
30
Event Reconstruction – Tracking
▪ Traditionally, reconstruction software is very experiment-dependent
– detector geometry, calibrations, etc.
▪ Recent hardware evolution dictates common solutions to tackle the HL-LHC challenges
▪ A Common Tracking Software (ACTS) project develops an experiment-independent set of track reconstruction tools
– Provides high-level modules that can be used for any tracking detector
– Highly performant, yet largely customizable implementations
– Users from LHC, FCC, sPHENIX, EIC, …
▪ Key features:
– Tracking geometry description
– Simple event data model
– Common algorithms for:
• Track propagation and fitting, seed finding, vertexing

C. Rougier - ATLAS ITk Track Reconstruction with a GNN-based Pipeline
31
32
Programming models for Accelerators
● Kokkos: abstraction layer which supports CPUs, GPUs, etc., as well as various HPC architectures
● CUDA: parallel computing platform and programming model developed by NVIDIA
● SYCL: higher-level programming model for various hardware accelerators
● HIP: C++ runtime API and kernel language for creating portable applications for AMD and NVIDIA GPUs
● alpaka: header-only abstraction library for accelerator development
● std::par: parallel execution policies for C++ standard library algorithms (C++17); see the sketch below
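A minimal sketch of the std::par approach, assuming a C++17 standard library with parallel algorithm support (e.g. GCC/Clang linked against TBB, or nvc++ -stdpar for GPU offload); the data and the transformation are illustrative.

// Parallel STL sketch: std::execution::par asks the library to run the
// element-wise transform and the reduction across multiple threads
// (or on a GPU with offloading compilers).
#include <algorithm>
#include <cmath>
#include <execution>
#include <iostream>
#include <numeric>
#include <vector>

int main() {
    std::vector<double> energy(1000000);
    std::iota(energy.begin(), energy.end(), 0.0);   // fill with 0, 1, 2, ...

    // Apply an element-wise "calibration" in parallel
    std::transform(std::execution::par, energy.begin(), energy.end(),
                   energy.begin(),
                   [](double e) { return std::sqrt(e) * 1.02; });

    // Parallel reduction of the calibrated values
    const double total =
        std::reduce(std::execution::par, energy.begin(), energy.end(), 0.0);

    std::cout << "Sum of calibrated energies: " << total << "\n";
    return 0;
}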

33
