“The Pacific Research Platform
Two Years In”
Welcome and Overview Talk
to the Pacific Research Platform “PRPv2” Workshop 2017
University of California, San Diego
February 21, 2017
Dr. Larry Smarr
Director, California Institute for Telecommunications and Information Technology
Harry E. Gruber Professor,
Dept. of Computer Science and Engineering
Jacobs School of Engineering, UCSD
http://lsmarr.calit2.net
Initially Proposed PRP Multi-Campus Science Driver Teams
• Biomedical
– Cancer Genomics Hub/Browser (UCSC/SDSC project over, connecting PRP to U. Chicago)
– Microbiome and Integrative ‘Omics (UCSD, Caltech, UCSF, UCD)
– Integrative Structural Biology (UCSF, NERSC, SDSC)
• Earth Sciences
– Data Analysis and Simulation for Earthquakes and Natural Disasters (Phase II)
– Climate Modeling: NCAR/UCAR (UCSD, NCAR)
– California/Nevada Regional Climate Data Analysis (UCI, UCSD, NCAR)
– CO2 Subsurface Modeling (SDSC)
• Particle Physics (UCD, UCI, UCSC, UCSD, others soon)
• Astronomy and Astrophysics
– Telescope Surveys (NERSC connected to PRP) (Phase II)
– Galaxy Evolution (UCI, UCSC) (Phase II)
– Gravitational Wave Astronomy (Caltech, UCSD)
• Scalable Visualization, Virtual Reality, and Ultra-Res Video (UCB, UCLA, UCM, UCSD)
100 Gbps FIONA at UCSC Connects the UCSC Hyades Cluster
to the NERSC Supercomputer at LBNL
Supporting UCSC Remote Access
to Large Data Subsets
of the Dark Energy Spectroscopic Instrument (DESI)
and AGORA Galaxy Simulation Data
Produced at NERSC.
250 images per night
800GB per night
Shawfeng Dong, UCSC Cyberengineer
UCSC Feb 7, 2017
Global Scientific Instruments Will Produce Ultralarge Datasets Continuously
Requiring Dedicated Optic Fiber and Supercomputers
Square Kilometer Array
Large Synoptic Survey Telescope
https://tnc15.terena.org/getfile/1939
www.lsst.org/sites/default/files/documents/DM%20Introduction%20-%20Kantor.pdf
Tracks ~40B Objects,
Creates 10M Alerts/Night
Within 1 Minute of Observing
2x40Gb/s
40G FIONAs
20x40G PRP-connected
WAVE@UC San Diego
PRP Will Enable
Distributed Virtual Reality
PRP
MerWAVE @UC Merced
UC Merced’s VR CAVE:
Merced WAVE
• Transferring 5 CAVECam Images Over
10 Gbit/sec Fiber Path From UCSD to UC Merced:
– Total Data Size: 1.96 GBytes
– Transfer Took 2.17 seconds
– Transfer Rate: 924.49 MBytes/sec (~8Gbit/sec)
• This Transfer Would Have Taken:
– 21 Seconds Over 1Gbit/sec Connection
(Regular Ethernet)
– 5.35 Minutes Over 50Mbit/sec Connection
(Residential Internet)
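The figures on this slide can be checked with a few lines of arithmetic. The sketch below is an illustration, not part of the original talk: the function names are hypothetical, and it assumes 1 GByte = 1024 MBytes with decimal link rates (1 Gbit/sec = 1000 Mbit/sec), the convention that appears to reproduce the slide's 5.35-minute figure. It computes ideal line-rate times with no protocol overhead, so the 1 Gbit/sec result comes out near 16 seconds rather than the 21 seconds quoted, which presumably allows for real-world overhead.

```python
# Illustrative arithmetic only; names and unit conventions are assumptions.
MBYTES_PER_GBYTE = 1024  # assumed binary convention for the data size


def observed_rate_mbytes_per_s(size_gbytes: float, seconds: float) -> float:
    """Average transfer rate in MBytes/sec for a completed transfer."""
    return size_gbytes * MBYTES_PER_GBYTE / seconds


def ideal_transfer_seconds(size_gbytes: float, link_mbit_per_s: float) -> float:
    """Time to move the data at full line rate, ignoring protocol overhead."""
    size_mbit = size_gbytes * MBYTES_PER_GBYTE * 8
    return size_mbit / link_mbit_per_s


if __name__ == "__main__":
    size, elapsed = 1.96, 2.17  # values taken from the slide
    rate = observed_rate_mbytes_per_s(size, elapsed)
    print(f"Observed: {rate:.0f} MBytes/sec ({rate * 8 / 1000:.1f} Gbit/sec)")
    print(f"1 Gbit/sec link: {ideal_transfer_seconds(size, 1000):.0f} seconds")
    print(f"50 Mbit/sec link: {ideal_transfer_seconds(size, 50) / 60:.2f} minutes")
```

The observed rate works out to roughly 7.4 Gbit/sec under these assumptions, which the slide rounds to ~8 Gbit/sec.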
PRP Will Link the Laboratories of
the Pacific Earthquake Engineering Research Center
http://peer.berkeley.edu/
The Second FIONette was Deployed at the PEER Facility at UC Berkeley,
and its Performance is Being Monitored
John Graham Installing FIONette at PEER Feb 10, 2017
Cancer Genomics Hub (UCSC) is Housed in SDSC:
Large Data Flows to End Users at UCSC, UCB, UCSF, …
1G
8G
Data Source: David Haussler,
Brad Smith, UCSC
15G
Jan 2016
30,000 TB
Per Year
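To put the 30,000 TB per year figure in network terms, it can be converted to an average sustained rate. This is a rough sketch that assumes decimal terabytes (10^12 bytes) and a 365-day year; the function name is hypothetical. Under those assumptions the yearly volume averages out to several Gbit/sec, the same order of magnitude as the 8G and 15G labels on the chart, assuming those labels are in Gbit/sec.

```python
# Illustrative conversion only; units and names are assumptions.
SECONDS_PER_YEAR = 365 * 24 * 3600


def average_gbit_per_s(tbytes_per_year: float) -> float:
    """Average sustained rate in Gbit/sec for a yearly data volume in TB."""
    bits_per_year = tbytes_per_year * 1e12 * 8
    return bits_per_year / SECONDS_PER_YEAR / 1e9


if __name__ == "__main__":
    print(f"{average_gbit_per_s(30_000):.2f} Gbit/sec averaged over a year")
```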
Slide on Cancer Genomics
Newly Added PRP Multi-Campus Science Driver Teams
• Biomedical
– Cryo-Electron Microscopy (UCB/LLNL, UCD, UCLA, UCSD, UCSF)
– Bioinformatics (UCD)
– High-Resolution Microscopy (UCR, UCSD, NSCC)
• Computer Science and Engineering / Electrical and Computer Engineering, etc.
– JupyterHub (UCB, UCSD)
– Deep Learning (UCB, UCSD, UIC)
– Drones, Terrestrial Modeling/GIS (UCSD, UCM)
– Contextual Robotics (new)
• High Performance Wireless Research and Education Networks
– UCSD/SIO, UCI, UCR, UCM, CENIC, others tbd.
• Humanities and Social Sciences
– Preserving Cultural Heritage
PRP First Application: Distributed IPython/Jupyter Notebooks:
Cross-Platform, Browser-Based Application Interleaves Code, Text, & Images
IJulia
IHaskell
IFSharp
IRuby
IGo
IScala
IMathics
IAldor
LuaJIT/Torch
Lua Kernel
IRKernel (for the R language)
IErlang
IOCaml
IForth
IPerl
IPerl6
IOctave
Calico Project
• kernels implemented in Mono,
including Java, IronPython, Boo,
Logo, BASIC, and many others
IScilab
IMatlab
ICSharp
Bash
Clojure Kernel
Hy Kernel
Redis Kernel
jove, a kernel for io.js
IJavascript
Calysto Scheme
Calysto Processing
idl_kernel
Mochi Kernel
Lua (used in Splash)
Spark Kernel
Skulpt Python Kernel
MetaKernel Bash
MetaKernel Python
Brython Kernel
IVisual VPython Kernel
Source: John Graham, QI
GPU JupyterHub:
2 x 14-core CPUs
256GB RAM
1.2TB FLASH
3.8TB SSD
Nvidia K80 GPU
Dual 40GbE NICs
And a Trusted Platform
Module
GPU JupyterHub:
1 x 18-core CPU
128GB RAM
3.8TB SSD
Nvidia K80 GPU
Dual 40GbE NICs
And a Trusted Platform
Module
PRP UC-JupyterHub Backbone
UCB
Next Step: Deploy Across PRP
UCSD
Source: John Graham, Calit2
Cryo-electron Microscopy (cryo-EM)
Has Driven a “Resolution Revolution” in the Last Five Years
Exposure (every 60 seconds):
X & Y dimensions: 7420 x 7676 pixels
Frames per movie: 10 - 50
Size: 3 - 10 GB per movie
Every 24 hours:
Number of movies: ~1400
Data size: ~5 TB
Typical datasets:
Length of time: 2 - 6 days
Total size: 10 - 30 TB
Each Cryo-EM ‘Image’ is Actually a Movie
Source: Michael A. Cianfrocco,
Elizabeth Villa, & Andres Leschziner, UCSD
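The cryo-EM numbers above hang together, as a quick back-of-the-envelope check shows. The sketch below is illustrative only: the function names are hypothetical, and it assumes decimal units (1 TB = 10^12 bytes). One exposure per minute gives 1440 movies per day, close to the ~1400 quoted; ~5 TB per day corresponds to a sustained rate of a few hundred Mbit/sec per microscope; and 2 to 6 days at that rate yields the 10 to 30 TB dataset sizes.

```python
# Back-of-the-envelope check; names and decimal units are assumptions.
SECONDS_PER_DAY = 24 * 3600


def movies_per_day(exposure_interval_s: float) -> float:
    """Number of movies recorded in 24 hours at one exposure per interval."""
    return SECONDS_PER_DAY / exposure_interval_s


def sustained_mbit_per_s(tbytes_per_day: float) -> float:
    """Average network rate needed to stream a daily data volume as it is produced."""
    return tbytes_per_day * 1e12 * 8 / SECONDS_PER_DAY / 1e6


def dataset_tbytes(tbytes_per_day: float, days: float) -> float:
    """Total dataset size for a collection session of the given length."""
    return tbytes_per_day * days


if __name__ == "__main__":
    print(f"Movies per day: {movies_per_day(60):.0f}")
    print(f"Sustained rate at 5 TB/day: {sustained_mbit_per_s(5):.0f} Mbit/sec")
    print(f"Dataset size: {dataset_tbytes(5, 2):.0f} to {dataset_tbytes(5, 6):.0f} TB")
```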
~20 microscopes in CA:
UCLA
UC Davis
UC Santa Cruz
SF Bay: UC Berkeley, LBNL, UCSF, Stanford
San Diego: UCSD, TSRI, Salk
* SDSC
* NERSC
* Xstream
Using PRP to Connect Cryo-EM across California
With End Users and Computational Facilities
Long term:
‣ Partner with cryo-EM facilities to stream data
straight from microscopes (over PRP) to SDSC
‣ Perform all cryo-EM analysis (from micrographs
to 3D models) via web browser on SDSC
‣ Expand computing to other XSEDE resources
(e.g. Xstream)
Short term:
‣ Provide 2D and 3D analysis on particle stacks on
Comet at SDSC
Source: Michael A. Cianfrocco, UCSD
3 supercomputer centers
cosmic-cryoem.org
[Map of PRP sites and connections: UCD, UCSF, Stanford, NASA Ames/NREN, UCSC, UCSB, Caltech, USC, UCLA, UCI, UCSD, SDSU, UCR, ESnet DoE Labs, UW/PNWGP Seattle, Berkeley, UCM, Los Nettos, Internet2, Internet2 Seattle]
Note: This diagram represents a subset of sites and connections.
* Institutions with
Active Archaeology Programs
“In an ideal world –
Extremely high bandwidth to
move large cultural heritage
datasets around the PRP cloud for
processing & viewing in CAVEs
around PRP with Unlimited Storage
for permanent archiving.”
-Tom Levy, UCSD
PRP is NOT Just for Big Data Science and Engineering:
Linking Cultural Heritage and Archaeology Datasets
Building on CENIC’s Expansion
To Libraries, Museums,
and Cultural Sites
Linking Libraries at UCB, UCLA, UCM and UCSD with CAVE
Kiosks
48 Megapixel CAVE Kiosk for UCSD Library
UCSD Library Review, June
24 Megapixel UCM Library Installation, July
PRP Backbone Sets Stage for 2017 Expansion
of HPWREN, Connected to CENIC, into Orange and Riverside Counties
• PRP CENIC 100G Link
UCSD to SDSU
– DTN FIONAs Endpoints
– Data Redundancy
– Disaster Recovery
– High Availability
– Network Redundancy
• Anchor to CENIC at UCI
– PRP FIONA Connects to
CalREN-HPR Network
– Data Replication Site
• Potential Future UCR
CENIC Anchor
UCR
UCI
UCSD
SDSU
Source: Frank Vernon,
Greg Hidley, UCSD

Editor's Notes

  • #3: Campus Cyberinfrastructure – Network Infrastructure and Engineering (CC-NIE); Campus Cyberinfrastructure – Infrastructure, Innovation, and Engineering (CC-IIE); Campus Cyberinfrastructure – Data, Networking, and Innovation (CC-DNI). NSF 15-534 incorporates Data Infrastructure Building Blocks (CC-DNI-DIBBs) – Multi-Campus / Multi-Institution Model Implementation from Program Solicitation NSF 14-530
  • #12: Campus Cyberinfrastructure – Network Infrastructure and Engineering (CC-NIE); Campus Cyberinfrastructure – Infrastructure, Innovation, and Engineering (CC-IIE); Campus Cyberinfrastructure – Data, Networking, and Innovation (CC-DNI). NSF 15-534 incorporates Data Infrastructure Building Blocks (CC-DNI-DIBBs) – Multi-Campus / Multi-Institution Model Implementation from Program Solicitation NSF 14-530
  • #17: We already have 11 major research universities in California poised to partner.