SlideShare a Scribd company logo
Science and Cyberinfrastructure  in the Data-Dominated Era  Symposium #1610, How Computational Science Is Tackling the Grand Challenges Facing Science and Society  San Diego, CA February 22, 2010 Dr. Larry Smarr Director, California Institute for Telecommunications and Information Technology Harry E. Gruber Professor,  Dept. of Computer Science and Engineering Jacobs School of Engineering, UCSD
Abstract The NSF Supercomputer Centers program not only directly stimulated a hundred-fold increase in the number of U.S. university computational scientists and engineers, but it also facilitated the emergence of the Internet, Web, scientific visualization, and synchronous collaboration.  I will show how two NSF-funded grand challenges, one in basic scientific research (cosmological evolution) and one in computer science (super high bandwidth optical networks) are interweaving to enable new modes of discovery. Today we are living in a data-dominated world where supercomputers and increasingly distributed scientific instruments generate terabytes to petabytes of data. It was in response to this challenge that the NSF funded the OptIPuter project to research how user-controlled 10Gbps dedicated lightpaths (or “lambdas”) could provide direct access to global data repositories, scientific instruments, and computational resources from “OptIPortals,” PC clusters which provide scalable visualization, computing, and storage in the user's campus laboratory. The use of dedicated lightpaths over fiber optic cables enables individual researchers to experience “clear channel” 10,000 megabits/sec, 100-1000 times faster than over today’s shared Internet—a critical capability for data-intensive science. The seven-year OptIPuter computer science research project is now over, but it stimulated a national and global build-out of dedicated fiber optic networks. U.S. universities now have access to high bandwidth lambdas through the National LambdaRail, Internet2's Dynamic Circuit Services, and the Global Lambda Integrated Facility. A few pioneering campuses are now building on-campus lightpaths to connect the data-intensive researchers, data generators, and vast storage systems to each other on campus, as well as to the national network campus gateways.  I will show how this next generation cyberinfrastructure is being used to support cosmological simulations containing 64 billion zones on remote NSF-funded TeraGrid facilities coupled to the end-users laboratory by national fiber networks.  I will review how increasingly powerful NSF supercomputers have allowed for more and more realistic cosmological models over the last two decades. The 25 years of innovation in information infrastructure and scientific simulation that NSF has funded has steadily pushed out the frontier of knowledge while transforming our society and economy.
NCSA Telnet--“Hide the Cray” Paradigm That We Still Use Today NCSA Telnet -- Interactive Access  From Macintosh or PC Computer  To Telnet Hosts on TCP/IP Networks Allows for Simultaneous Connections  To Numerous Computers on The Net Standard File Transfer Server (FTP)  Lets You Transfer Files to and from Remote Machines and Other Users John Kogut Simulating  Quantum Chromodynamics He Uses a Mac—The Mac Uses the Cray Source: Larry Smarr 1985 Data  Generator Data  Portal Data  Transmission
Launching the Nation’s Information Infrastructure: NSFnet Supernetwork and the Six NSF Supercomputers NCSA NSFNET 56 Kb/s Backbone (1986-8) PSC NCAR CTC JVNC SDSC Supernetwork Backbone: 56kbps is 50 Times Faster than 1200 bps PC Modem!
Why Teraflop Supercomputers Matter  For Accurate Science & Engineering Simulations FLOating Point OperationS per Spatial Point Ten Variables Hundred Operations Per Updated Variable One Thousand FLOPS per Updated Spatial Point One Dimensional Dynamics For 1000 Spatial Points Need MEGAFLOP Two Dimensions For 1000x1000 Spatial Points Need GIGAFLOP Three Dimensions For 1000x1000x1000 Spatial Points Need TERAFLOP Three Dimensions + Adaptive Mesh Refinement Need PETAFLOP
Today Dedicated 10,000Mbps Supernetworks  Tie Together State and Regional Fiber Infrastructure NLR 40 x 10Gb Wavelengths  Expanding with Darkstrand to 80 Interconnects  Two Dozen  State and Regional Optical Networks Internet2 Dynamic Circuit Network  Is Now Available
NSF’s OptIPuter Project: Using Supernetworks  to Meet the Needs of Data-Intensive Researchers OptIPortal–  Termination Device  for the OptIPuter Global Backplane Calit2 (UCSD, UCI), SDSC, and UIC Leads—Larry Smarr PI Univ. Partners: NCSA, USC, SDSU, NW, TA&M, UvA, SARA, KISTI, AIST Industry: IBM, Sun, Telcordia, Chiaro, Calient, Glimmerglass, Lucent
Short History of Cosmological Supercomputing: Early Days -1993 Convex C3880 (8-way SMP) GigaFLOPs Simulation of X-ray clusters in a 3D cube 85 Mpc/h on a side and Cartesian grid of size 270 3 Bryan, Cen, Norman, Ostriker, Stone (1994), ApJ Source: Michael Norman, SDSC, UCSD
Great Leap Forward-1994 Thinking Machines CM5  (512-cpu MPP) Simulation of X-ray clusters in a 3D cube 170 Mpc/h on a side and Cartesian grid of size 512 3 Bryan & Norman (1998), ApJ Source: Michael Norman, SDSC, UCSD
The Power of Adaptive Mesh Refinement-2006 IBM Power4 cluster (64 node, 8-way SMP) Simulation of X-ray clusters in a 3D cube 512 Mpc/h on a side with 7-level AMR for an effective resolution of 65,562 3 Norman et al. (2007) Source: Michael Norman, SDSC, UCSD
Adaptive Grids Resolve Individual Galaxy Collisions  as Clusters Form in 15 Million Light Year Volume Source: Simulation: Mike Norman and Brian O’Shea; Animation: Donna Cox, Robert Patterson, Matthew Hall, Stuart Levy, Jeff Carpenter, Lorne Leonard-NCSA  SGI Altix DSM cluster (512 cpu)
Exploring Cosmology With Supercomputers, Supernetworks, and Supervisualization 4096 3  Particle/Cell Hydrodynamic Cosmology Simulation NICS Kraken (XT5) 16,384 cores Output 148 TB Movie Output (0.25 TB/file) 80 TB Diagnostic Dumps (8 TB/file) Science:  Norman, Harkness,Paschos SDSC Visualization:  Insley, ANL; Wagner SDSC ANL  *  Calit2  *  LBNL  *  NICS  *  ORNL  *   SDSC Intergalactic Medium on 2 GLyr Scale Source: Mike Norman, SDSC
Enormous Detail in Simulation: Full Simulation with Blowup of a 1/512 Subcube
Project StarGate Goals: Combining Supercomputers and Supernetworks Create an “End-to-End” 10Gbps Workflow Explore Use of OptIPortals as Petascale Supercomputer “Scalable Workstations” Exploit Dynamic 10Gbps Circuits on ESnet Connect Hardware Resources at ORNL, ANL, SDSC Show that Data Need Not be Trapped by the Network “Event Horizon” [email_address] Rick Wagner Mike Norman ANL  *  Calit2  *  LBNL  *  NICS  *  ORNL  *   SDSC Source: Michael Norman, SDSC, UCSD
Using Supernetworks to Couple End User’s OptIPortal  to Remote Supercomputers and Visualization Servers *ANL  *  Calit2  *  LBNL  *  NICS  *  ORNL  *   SDSC Source: Mike Norman, SDSC From 1985 to Project StarGate NICS ORNL NSF TeraGrid Kraken Cray XT5 8,256 Compute Nodes 99,072 Compute Cores 129 TB RAM simulation Argonne NL DOE Eureka 100 Dual Quad Core  Xeon Servers 200 NVIDIA Quadro FX GPUs in 50 Quadro Plex S4 1U enclosures 3.2 TB RAM rendering SDSC Calit2/SDSC OptIPortal1 20 30” (2560 x 1600 pixel) LCD panels 10 NVIDIA Quadro FX 4600 graphics cards > 80 megapixels 10 Gb/s network throughout visualization ESnet 10 Gb/s fiber optic network
Project StarGate Credits Lawrence Berkeley National Laboratory (ESnet) Eli Dart San Diego Supercomputer Center Science application Michael Norman Rick Wagner (coordinator) Network Tom Hutton Oak Ridge National Laboratory Susan Hicks National Institute for Computational Sciences Nathaniel Mendoza Argonne National Laboratory Network/Systems Linda Winkler  Loren Jan Wilson Visualization Joseph Insley Eric Olsen Mark Hereld Michael Papka [email_address] Larry Smarr (Overall Concept) Brian Dunne (Networking) Joe Keefe (OptIPortal) Kai Doerr, Falko Kuester (CGLX) ANL  *  Calit2  *  LBNL  *  NICS  *  ORNL  *   SDSC
Blue Waters is a Sustained PetaFLOPs Supercomputer One Million Times the Convex 3880 of 1993! Planned for 2011-2012 Science Self-consistent simulation of the formation of the first galaxies and cosmic ionization Scale of Simulations AMR: 1536 3  base grid, 10 levels of refinement Cartesian: 6400 3  with radiation transport Source: Michael Norman, SDSC, UCSD
Academic Research “OptIPlatform” Cyberinfrastructure: A 10Gbps “End-to-End” Lightpath Cloud National LambdaRail Campus Optical Switch Data Repositories & Clusters HPC HD/4k Video Images HD/4k Video Cams End User  OptIPortal 10G  Lightpath HD/4k Telepresence Instruments
High Definition Video Connected OptIPortals: Virtual Working Spaces for Data Intensive Research Source: Falko Kuester, Kai Doerr Calit2; Michael Sims, NASA NASA Ames Lunar Science Institute Mountain View, CA NASA Interest  in Supporting  Virtual Institutes LifeSize HD
You Can Download This Presentation  at lsmarr.calit2.net

More Related Content

PPT
LambdaGrids--Earth and Planetary Sciences Driving High Performance Networks a...
Larry Smarr
 
PPT
A Campus-Scale High Performance Cyberinfrastructure is Required for Data-Int...
Larry Smarr
 
PPT
The OptIPuter as a Prototype for CalREN-XD
Larry Smarr
 
PPT
The Academic and R&D Sectors' Current and Future Broadband and Fiber Access N...
Larry Smarr
 
PPT
Positioning University of California Information Technology for the Future: S...
Larry Smarr
 
PPT
Applying Photonics to User Needs: The Application Challenge
Larry Smarr
 
PPT
High Resolution Multimedia in a Ultra Bandwidth World
Larry Smarr
 
PPT
Calit2
Larry Smarr
 
LambdaGrids--Earth and Planetary Sciences Driving High Performance Networks a...
Larry Smarr
 
A Campus-Scale High Performance Cyberinfrastructure is Required for Data-Int...
Larry Smarr
 
The OptIPuter as a Prototype for CalREN-XD
Larry Smarr
 
The Academic and R&D Sectors' Current and Future Broadband and Fiber Access N...
Larry Smarr
 
Positioning University of California Information Technology for the Future: S...
Larry Smarr
 
Applying Photonics to User Needs: The Application Challenge
Larry Smarr
 
High Resolution Multimedia in a Ultra Bandwidth World
Larry Smarr
 
Calit2
Larry Smarr
 

What's hot (20)

PPT
The OptIPuter and Its Applications
Larry Smarr
 
PPT
Analyzing Large Earth Data Sets: New Tools from the OptiPuter and LOOKING Pro...
Larry Smarr
 
PPT
Blowing up the Box--the Emergence of the Planetary Computer
Larry Smarr
 
PPT
Creating High Performance Lambda Collaboratories
Larry Smarr
 
PPT
Using OptIPuter Innovations to Enable LambdaGrid Applications
Larry Smarr
 
PPT
OptIPuter Overview
Larry Smarr
 
PPT
Set My Data Free: High-Performance CI for Data-Intensive Research
Larry Smarr
 
PPT
Ceoa Nov 2005 Final Small
Larry Smarr
 
PPTX
The Pacific Research Platform
 Two Years In
Larry Smarr
 
PPT
Cyberinfrastructure to Support Ocean Observatories
Larry Smarr
 
PPT
Genomics at the Speed of Light: Understanding the Living Ocean
Larry Smarr
 
PPT
The Coming Revolution in Environmental Awareness
Larry Smarr
 
PPTX
Novel Techniques & Connections Between High-Pressure Mineral Physics, Microto...
EarthCube
 
PDF
Cognitive Engine: Boosting Scientific Discovery
diannepatricia
 
PPT
Calit2-a Persistent UCSD/UCI Framework for Collaboration
Larry Smarr
 
PPTX
High Performance Cyberinfrastructure Enables Data-Driven Science in the Glob...
Larry Smarr
 
PPT
Remote Telepresence for Exploring Virtual Worlds
Larry Smarr
 
PPT
Cal-(IT)2 Projects with Sun Microsystems
Larry Smarr
 
PPTX
Big Data, Big Computing, AI, and Environmental Science
Ian Foster
 
PDF
Deep Learning for Hidden Signals - Enabling Real-time Multimessenger Astrophy...
Daniel George
 
The OptIPuter and Its Applications
Larry Smarr
 
Analyzing Large Earth Data Sets: New Tools from the OptiPuter and LOOKING Pro...
Larry Smarr
 
Blowing up the Box--the Emergence of the Planetary Computer
Larry Smarr
 
Creating High Performance Lambda Collaboratories
Larry Smarr
 
Using OptIPuter Innovations to Enable LambdaGrid Applications
Larry Smarr
 
OptIPuter Overview
Larry Smarr
 
Set My Data Free: High-Performance CI for Data-Intensive Research
Larry Smarr
 
Ceoa Nov 2005 Final Small
Larry Smarr
 
The Pacific Research Platform
 Two Years In
Larry Smarr
 
Cyberinfrastructure to Support Ocean Observatories
Larry Smarr
 
Genomics at the Speed of Light: Understanding the Living Ocean
Larry Smarr
 
The Coming Revolution in Environmental Awareness
Larry Smarr
 
Novel Techniques & Connections Between High-Pressure Mineral Physics, Microto...
EarthCube
 
Cognitive Engine: Boosting Scientific Discovery
diannepatricia
 
Calit2-a Persistent UCSD/UCI Framework for Collaboration
Larry Smarr
 
High Performance Cyberinfrastructure Enables Data-Driven Science in the Glob...
Larry Smarr
 
Remote Telepresence for Exploring Virtual Worlds
Larry Smarr
 
Cal-(IT)2 Projects with Sun Microsystems
Larry Smarr
 
Big Data, Big Computing, AI, and Environmental Science
Ian Foster
 
Deep Learning for Hidden Signals - Enabling Real-time Multimessenger Astrophy...
Daniel George
 
Ad

Similar to Science and Cyberinfrastructure in the Data-Dominated Era (20)

PPT
The Jump to Light Speed - Data Intensive Earth Sciences are Leading the Way t...
Larry Smarr
 
PPT
Physics Research in an Era of Global Cyberinfrastructure
Larry Smarr
 
PPT
OptIPuter-A High Performance SOA LambdaGrid Enabling Scientific Applications
Larry Smarr
 
PPT
How Fiber Optics are Transforming our World
Larry Smarr
 
PPT
Toward a Global Interactive Earth Observing Cyberinfrastructure
Larry Smarr
 
PPT
Cyberinfrastructure for Ocean Cabled Observatories
Larry Smarr
 
PPT
Coupling Australia’s Researchers to the Global Innovation Economy
Larry Smarr
 
PPT
The Pacific Research Platform: a Science-Driven Big-Data Freeway System
Larry Smarr
 
PPT
Metacomputer Architecture of the Global LambdaGrid
Larry Smarr
 
PPT
A Mobile Internet Powered by a Planetary Computer
Larry Smarr
 
PPT
High Performance Collaboration – The Jump to Light Speed
Larry Smarr
 
PPT
Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analys...
Larry Smarr
 
PPT
The OptIPuter Project: From the Grid to the LambdaGrid
Larry Smarr
 
PPT
Why Researchers are Using Advanced Networks
Larry Smarr
 
PPT
Calit2: a View Into the Future of the Wired and Unwired Internet
Larry Smarr
 
PPT
The Future of the Internet and its Impact on Digitally Enabled Genomic Medicine
Larry Smarr
 
PPT
The OptiPuter, Quartzite, and Starlight Projects: A Campus to Global-Scale Te...
Larry Smarr
 
PPT
Building a Global Collaboration System for Data-Intensive Discovery
Larry Smarr
 
PPT
How Global-Scale Personal Lightwaves are Transforming Scientific Research
Larry Smarr
 
PPT
Riding the Light: How Dedicated Optical Circuits are Enabling New Science
Larry Smarr
 
The Jump to Light Speed - Data Intensive Earth Sciences are Leading the Way t...
Larry Smarr
 
Physics Research in an Era of Global Cyberinfrastructure
Larry Smarr
 
OptIPuter-A High Performance SOA LambdaGrid Enabling Scientific Applications
Larry Smarr
 
How Fiber Optics are Transforming our World
Larry Smarr
 
Toward a Global Interactive Earth Observing Cyberinfrastructure
Larry Smarr
 
Cyberinfrastructure for Ocean Cabled Observatories
Larry Smarr
 
Coupling Australia’s Researchers to the Global Innovation Economy
Larry Smarr
 
The Pacific Research Platform: a Science-Driven Big-Data Freeway System
Larry Smarr
 
Metacomputer Architecture of the Global LambdaGrid
Larry Smarr
 
A Mobile Internet Powered by a Planetary Computer
Larry Smarr
 
High Performance Collaboration – The Jump to Light Speed
Larry Smarr
 
Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analys...
Larry Smarr
 
The OptIPuter Project: From the Grid to the LambdaGrid
Larry Smarr
 
Why Researchers are Using Advanced Networks
Larry Smarr
 
Calit2: a View Into the Future of the Wired and Unwired Internet
Larry Smarr
 
The Future of the Internet and its Impact on Digitally Enabled Genomic Medicine
Larry Smarr
 
The OptiPuter, Quartzite, and Starlight Projects: A Campus to Global-Scale Te...
Larry Smarr
 
Building a Global Collaboration System for Data-Intensive Discovery
Larry Smarr
 
How Global-Scale Personal Lightwaves are Transforming Scientific Research
Larry Smarr
 
Riding the Light: How Dedicated Optical Circuits are Enabling New Science
Larry Smarr
 
Ad

More from Larry Smarr (20)

PPTX
Smart Patients, Big Data, NextGen Primary Care
Larry Smarr
 
PPTX
Internet2 and QUILT Initiatives with Regional Networks -6NRP Larry Smarr and ...
Larry Smarr
 
PPTX
Internet2 and QUILT Initiatives with Regional Networks -6NRP Larry Smarr and ...
Larry Smarr
 
PPTX
National Research Platform: Application Drivers
Larry Smarr
 
PPT
From Supercomputing to the Grid - Larry Smarr
Larry Smarr
 
PPTX
The CENIC-AI Resource - Los Angeles Community College District (LACCD)
Larry Smarr
 
PPT
Redefining Collaboration through Groupware - From Groupware to Societyware
Larry Smarr
 
PPT
The Coming of the Grid - September 8-10,1997
Larry Smarr
 
PPT
Supercomputers: Directions in Technology, Architecture, and Applications
Larry Smarr
 
PPT
High Performance Geographic Information Systems
Larry Smarr
 
PPT
Data Intensive Applications at UCSD: Driving a Campus Research Cyberinfrastru...
Larry Smarr
 
PPT
Enhanced Telepresence and Green IT — The Next Evolution in the Internet
Larry Smarr
 
PPTX
The CENIC AI Resource CENIC AIR - CENIC Retreat 2024
Larry Smarr
 
PPTX
The CENIC-AI Resource: The Right Connection
Larry Smarr
 
PPTX
The Pacific Research Platform: The First Six Years
Larry Smarr
 
PPTX
The NSF Grants Leading Up to CHASE-CI ENS
Larry Smarr
 
PPTX
Integrated Optical Fiber/Wireless Systems for Environmental Monitoring
Larry Smarr
 
PPTX
Toward a National Research Platform to Enable Data-Intensive Open-Source Sci...
Larry Smarr
 
PPTX
Toward a National Research Platform to Enable Data-Intensive Computing
Larry Smarr
 
PPTX
Digital Twins of Physical Reality - Future in Review
Larry Smarr
 
Smart Patients, Big Data, NextGen Primary Care
Larry Smarr
 
Internet2 and QUILT Initiatives with Regional Networks -6NRP Larry Smarr and ...
Larry Smarr
 
Internet2 and QUILT Initiatives with Regional Networks -6NRP Larry Smarr and ...
Larry Smarr
 
National Research Platform: Application Drivers
Larry Smarr
 
From Supercomputing to the Grid - Larry Smarr
Larry Smarr
 
The CENIC-AI Resource - Los Angeles Community College District (LACCD)
Larry Smarr
 
Redefining Collaboration through Groupware - From Groupware to Societyware
Larry Smarr
 
The Coming of the Grid - September 8-10,1997
Larry Smarr
 
Supercomputers: Directions in Technology, Architecture, and Applications
Larry Smarr
 
High Performance Geographic Information Systems
Larry Smarr
 
Data Intensive Applications at UCSD: Driving a Campus Research Cyberinfrastru...
Larry Smarr
 
Enhanced Telepresence and Green IT — The Next Evolution in the Internet
Larry Smarr
 
The CENIC AI Resource CENIC AIR - CENIC Retreat 2024
Larry Smarr
 
The CENIC-AI Resource: The Right Connection
Larry Smarr
 
The Pacific Research Platform: The First Six Years
Larry Smarr
 
The NSF Grants Leading Up to CHASE-CI ENS
Larry Smarr
 
Integrated Optical Fiber/Wireless Systems for Environmental Monitoring
Larry Smarr
 
Toward a National Research Platform to Enable Data-Intensive Open-Source Sci...
Larry Smarr
 
Toward a National Research Platform to Enable Data-Intensive Computing
Larry Smarr
 
Digital Twins of Physical Reality - Future in Review
Larry Smarr
 

Recently uploaded (20)

PDF
Beyond Automation: The Role of IoT Sensor Integration in Next-Gen Industries
Rejig Digital
 
PDF
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
Sandesh Rao
 
PDF
The Evolution of KM Roles (Presented at Knowledge Summit Dublin 2025)
Enterprise Knowledge
 
PDF
Orbitly Pitch Deck|A Mission-Driven Platform for Side Project Collaboration (...
zz41354899
 
PPTX
OA presentation.pptx OA presentation.pptx
pateldhruv002338
 
PDF
How-Cloud-Computing-Impacts-Businesses-in-2025-and-Beyond.pdf
Artjoker Software Development Company
 
PDF
Structs to JSON: How Go Powers REST APIs
Emily Achieng
 
PDF
Brief History of Internet - Early Days of Internet
sutharharshit158
 
PPTX
The-Ethical-Hackers-Imperative-Safeguarding-the-Digital-Frontier.pptx
sujalchauhan1305
 
PDF
AI Unleashed - Shaping the Future -Starting Today - AIOUG Yatra 2025 - For Co...
Sandesh Rao
 
PPTX
Applied-Statistics-Mastering-Data-Driven-Decisions.pptx
parmaryashparmaryash
 
PPTX
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
PDF
This slide provides an overview Technology
mineshkharadi333
 
PDF
SparkLabs Primer on Artificial Intelligence 2025
SparkLabs Group
 
PDF
Google I/O Extended 2025 Baku - all ppts
HusseinMalikMammadli
 
PDF
Research-Fundamentals-and-Topic-Development.pdf
ayesha butalia
 
PPTX
How to Build a Scalable Micro-Investing Platform in 2025 - A Founder’s Guide ...
Third Rock Techkno
 
PDF
BLW VOCATIONAL TRAINING SUMMER INTERNSHIP REPORT
codernjn73
 
PDF
Presentation about Hardware and Software in Computer
snehamodhawadiya
 
PDF
Cloud-Migration-Best-Practices-A-Practical-Guide-to-AWS-Azure-and-Google-Clou...
Artjoker Software Development Company
 
Beyond Automation: The Role of IoT Sensor Integration in Next-Gen Industries
Rejig Digital
 
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
Sandesh Rao
 
The Evolution of KM Roles (Presented at Knowledge Summit Dublin 2025)
Enterprise Knowledge
 
Orbitly Pitch Deck|A Mission-Driven Platform for Side Project Collaboration (...
zz41354899
 
OA presentation.pptx OA presentation.pptx
pateldhruv002338
 
How-Cloud-Computing-Impacts-Businesses-in-2025-and-Beyond.pdf
Artjoker Software Development Company
 
Structs to JSON: How Go Powers REST APIs
Emily Achieng
 
Brief History of Internet - Early Days of Internet
sutharharshit158
 
The-Ethical-Hackers-Imperative-Safeguarding-the-Digital-Frontier.pptx
sujalchauhan1305
 
AI Unleashed - Shaping the Future -Starting Today - AIOUG Yatra 2025 - For Co...
Sandesh Rao
 
Applied-Statistics-Mastering-Data-Driven-Decisions.pptx
parmaryashparmaryash
 
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
This slide provides an overview Technology
mineshkharadi333
 
SparkLabs Primer on Artificial Intelligence 2025
SparkLabs Group
 
Google I/O Extended 2025 Baku - all ppts
HusseinMalikMammadli
 
Research-Fundamentals-and-Topic-Development.pdf
ayesha butalia
 
How to Build a Scalable Micro-Investing Platform in 2025 - A Founder’s Guide ...
Third Rock Techkno
 
BLW VOCATIONAL TRAINING SUMMER INTERNSHIP REPORT
codernjn73
 
Presentation about Hardware and Software in Computer
snehamodhawadiya
 
Cloud-Migration-Best-Practices-A-Practical-Guide-to-AWS-Azure-and-Google-Clou...
Artjoker Software Development Company
 

Science and Cyberinfrastructure in the Data-Dominated Era

  • 1. Science and Cyberinfrastructure in the Data-Dominated Era Symposium #1610, How Computational Science Is Tackling the Grand Challenges Facing Science and Society San Diego, CA February 22, 2010 Dr. Larry Smarr Director, California Institute for Telecommunications and Information Technology Harry E. Gruber Professor, Dept. of Computer Science and Engineering Jacobs School of Engineering, UCSD
  • 2. Abstract The NSF Supercomputer Centers program not only directly stimulated a hundred-fold increase in the number of U.S. university computational scientists and engineers, but it also facilitated the emergence of the Internet, Web, scientific visualization, and synchronous collaboration. I will show how two NSF-funded grand challenges, one in basic scientific research (cosmological evolution) and one in computer science (super high bandwidth optical networks) are interweaving to enable new modes of discovery. Today we are living in a data-dominated world where supercomputers and increasingly distributed scientific instruments generate terabytes to petabytes of data. It was in response to this challenge that the NSF funded the OptIPuter project to research how user-controlled 10Gbps dedicated lightpaths (or “lambdas”) could provide direct access to global data repositories, scientific instruments, and computational resources from “OptIPortals,” PC clusters which provide scalable visualization, computing, and storage in the user's campus laboratory. The use of dedicated lightpaths over fiber optic cables enables individual researchers to experience “clear channel” 10,000 megabits/sec, 100-1000 times faster than over today’s shared Internet—a critical capability for data-intensive science. The seven-year OptIPuter computer science research project is now over, but it stimulated a national and global build-out of dedicated fiber optic networks. U.S. universities now have access to high bandwidth lambdas through the National LambdaRail, Internet2's Dynamic Circuit Services, and the Global Lambda Integrated Facility. A few pioneering campuses are now building on-campus lightpaths to connect the data-intensive researchers, data generators, and vast storage systems to each other on campus, as well as to the national network campus gateways. I will show how this next generation cyberinfrastructure is being used to support cosmological simulations containing 64 billion zones on remote NSF-funded TeraGrid facilities coupled to the end-users laboratory by national fiber networks. I will review how increasingly powerful NSF supercomputers have allowed for more and more realistic cosmological models over the last two decades. The 25 years of innovation in information infrastructure and scientific simulation that NSF has funded has steadily pushed out the frontier of knowledge while transforming our society and economy.
  • 3. NCSA Telnet--“Hide the Cray” Paradigm That We Still Use Today NCSA Telnet -- Interactive Access From Macintosh or PC Computer To Telnet Hosts on TCP/IP Networks Allows for Simultaneous Connections To Numerous Computers on The Net Standard File Transfer Server (FTP) Lets You Transfer Files to and from Remote Machines and Other Users John Kogut Simulating Quantum Chromodynamics He Uses a Mac—The Mac Uses the Cray Source: Larry Smarr 1985 Data Generator Data Portal Data Transmission
  • 4. Launching the Nation’s Information Infrastructure: NSFnet Supernetwork and the Six NSF Supercomputers NCSA NSFNET 56 Kb/s Backbone (1986-8) PSC NCAR CTC JVNC SDSC Supernetwork Backbone: 56kbps is 50 Times Faster than 1200 bps PC Modem!
  • 5. Why Teraflop Supercomputers Matter For Accurate Science & Engineering Simulations FLOating Point OperationS per Spatial Point Ten Variables Hundred Operations Per Updated Variable One Thousand FLOPS per Updated Spatial Point One Dimensional Dynamics For 1000 Spatial Points Need MEGAFLOP Two Dimensions For 1000x1000 Spatial Points Need GIGAFLOP Three Dimensions For 1000x1000x1000 Spatial Points Need TERAFLOP Three Dimensions + Adaptive Mesh Refinement Need PETAFLOP
  • 6. Today Dedicated 10,000Mbps Supernetworks Tie Together State and Regional Fiber Infrastructure NLR 40 x 10Gb Wavelengths Expanding with Darkstrand to 80 Interconnects Two Dozen State and Regional Optical Networks Internet2 Dynamic Circuit Network Is Now Available
  • 7. NSF’s OptIPuter Project: Using Supernetworks to Meet the Needs of Data-Intensive Researchers OptIPortal– Termination Device for the OptIPuter Global Backplane Calit2 (UCSD, UCI), SDSC, and UIC Leads—Larry Smarr PI Univ. Partners: NCSA, USC, SDSU, NW, TA&M, UvA, SARA, KISTI, AIST Industry: IBM, Sun, Telcordia, Chiaro, Calient, Glimmerglass, Lucent
  • 8. Short History of Cosmological Supercomputing: Early Days -1993 Convex C3880 (8-way SMP) GigaFLOPs Simulation of X-ray clusters in a 3D cube 85 Mpc/h on a side and Cartesian grid of size 270 3 Bryan, Cen, Norman, Ostriker, Stone (1994), ApJ Source: Michael Norman, SDSC, UCSD
  • 9. Great Leap Forward-1994 Thinking Machines CM5 (512-cpu MPP) Simulation of X-ray clusters in a 3D cube 170 Mpc/h on a side and Cartesian grid of size 512 3 Bryan & Norman (1998), ApJ Source: Michael Norman, SDSC, UCSD
  • 10. The Power of Adaptive Mesh Refinement-2006 IBM Power4 cluster (64 node, 8-way SMP) Simulation of X-ray clusters in a 3D cube 512 Mpc/h on a side with 7-level AMR for an effective resolution of 65,562 3 Norman et al. (2007) Source: Michael Norman, SDSC, UCSD
  • 11. Adaptive Grids Resolve Individual Galaxy Collisions as Clusters Form in 15 Million Light Year Volume Source: Simulation: Mike Norman and Brian O’Shea; Animation: Donna Cox, Robert Patterson, Matthew Hall, Stuart Levy, Jeff Carpenter, Lorne Leonard-NCSA SGI Altix DSM cluster (512 cpu)
  • 12. Exploring Cosmology With Supercomputers, Supernetworks, and Supervisualization 4096 3 Particle/Cell Hydrodynamic Cosmology Simulation NICS Kraken (XT5) 16,384 cores Output 148 TB Movie Output (0.25 TB/file) 80 TB Diagnostic Dumps (8 TB/file) Science: Norman, Harkness,Paschos SDSC Visualization: Insley, ANL; Wagner SDSC ANL * Calit2 * LBNL * NICS * ORNL * SDSC Intergalactic Medium on 2 GLyr Scale Source: Mike Norman, SDSC
  • 13. Enormous Detail in Simulation: Full Simulation with Blowup of a 1/512 Subcube
  • 14. Project StarGate Goals: Combining Supercomputers and Supernetworks Create an “End-to-End” 10Gbps Workflow Explore Use of OptIPortals as Petascale Supercomputer “Scalable Workstations” Exploit Dynamic 10Gbps Circuits on ESnet Connect Hardware Resources at ORNL, ANL, SDSC Show that Data Need Not be Trapped by the Network “Event Horizon” [email_address] Rick Wagner Mike Norman ANL * Calit2 * LBNL * NICS * ORNL * SDSC Source: Michael Norman, SDSC, UCSD
  • 15. Using Supernetworks to Couple End User’s OptIPortal to Remote Supercomputers and Visualization Servers *ANL * Calit2 * LBNL * NICS * ORNL * SDSC Source: Mike Norman, SDSC From 1985 to Project StarGate NICS ORNL NSF TeraGrid Kraken Cray XT5 8,256 Compute Nodes 99,072 Compute Cores 129 TB RAM simulation Argonne NL DOE Eureka 100 Dual Quad Core Xeon Servers 200 NVIDIA Quadro FX GPUs in 50 Quadro Plex S4 1U enclosures 3.2 TB RAM rendering SDSC Calit2/SDSC OptIPortal1 20 30” (2560 x 1600 pixel) LCD panels 10 NVIDIA Quadro FX 4600 graphics cards > 80 megapixels 10 Gb/s network throughout visualization ESnet 10 Gb/s fiber optic network
  • 16. Project StarGate Credits Lawrence Berkeley National Laboratory (ESnet) Eli Dart San Diego Supercomputer Center Science application Michael Norman Rick Wagner (coordinator) Network Tom Hutton Oak Ridge National Laboratory Susan Hicks National Institute for Computational Sciences Nathaniel Mendoza Argonne National Laboratory Network/Systems Linda Winkler Loren Jan Wilson Visualization Joseph Insley Eric Olsen Mark Hereld Michael Papka [email_address] Larry Smarr (Overall Concept) Brian Dunne (Networking) Joe Keefe (OptIPortal) Kai Doerr, Falko Kuester (CGLX) ANL * Calit2 * LBNL * NICS * ORNL * SDSC
  • 17. Blue Waters is a Sustained PetaFLOPs Supercomputer One Million Times the Convex 3880 of 1993! Planned for 2011-2012 Science Self-consistent simulation of the formation of the first galaxies and cosmic ionization Scale of Simulations AMR: 1536 3 base grid, 10 levels of refinement Cartesian: 6400 3 with radiation transport Source: Michael Norman, SDSC, UCSD
  • 18. Academic Research “OptIPlatform” Cyberinfrastructure: A 10Gbps “End-to-End” Lightpath Cloud National LambdaRail Campus Optical Switch Data Repositories & Clusters HPC HD/4k Video Images HD/4k Video Cams End User OptIPortal 10G Lightpath HD/4k Telepresence Instruments
  • 19. High Definition Video Connected OptIPortals: Virtual Working Spaces for Data Intensive Research Source: Falko Kuester, Kai Doerr Calit2; Michael Sims, NASA NASA Ames Lunar Science Institute Mountain View, CA NASA Interest in Supporting Virtual Institutes LifeSize HD
  • 20. You Can Download This Presentation at lsmarr.calit2.net