SlideShare a Scribd company logo
High Performance Computing

        Jawwad Shamsi
           Lecture #4
       25th January 2010
Recap
• Pipelining
• Super Scalar Execution
  – Dependencies
     • Branch
     • Data
     • Resource
• Effect of Latency and Memory
• Effect of Parallelism
Flynn’s taxanomy
It distinguishes multiprocessor computers according
   to data and instruction
• the dimensions of Instruction and Data
• SISD: Single Instruction Single data (Uniprocessor)
• SIMD: Single Instruction Multiple data (Vector
   Processing)
• MISD: Multiple Instruction Single date
• MIMD: Multiple Instruction Multiple data (SMP,
   cluster, NUMA)
MIMD
• MIMD
 – Shared Memory (tightly coupled)
    • SMP (Symmetric Multiprocessing)
    • Non-Uniform Memory access
 – Distributed Memory (loosely coupled)
    • Clusters
Taxonomy of Parallel Processor
       Architectures
Shared Address Space
• Shared Memory
• Distributed Memory
SMP
•   Two or more similar processors
•   Same main memory and I/O
•   Can perform similar operations
•   Share access to I/O devices
Multiprogramming and
  Multiprocessing
SMP Advantages
•   Performance
•   Availability
•   Incremental growth
•   Scaling
Block Diagram of Tightly Coupled
         Multiprocessor
Cache Coherence
• Multiple copies of cache can maintain
  different data
  – Protocols?
Processor Design: Modes of
               Parallelism
• Two ways to increase parallelism
  – Superscaling
     • Instruction level parallelism
  – Threading
     • Thread level parallelism
        – Concept of Multithreaded processors
            » May or may not be different than OS level mult-threading
     • Temporal Multi-threading (also called implicit)
        – Instructions from only one thread
     • Simultaneous Multi-threading (explicit)
        – Instructions from more than one thread can be executed
Scalar Processor Approaches
• Single-threaded scalar
   – Simple pipeline
   – No multithreading
• Interleaved multithreaded scalar
   –   Easiest multithreading to implement
   –   Switch threads at each clock cycle
   –   Pipeline stages kept close to fully occupied
   –   Hardware needs to switch thread context between cycles
• Blocked multithreaded scalar
   – Thread executed until latency event occurs
   – Would stop pipeline
   – Processor switches to another thread
Clusters
•   Alternative to SMP
•   High performance
•   High availability
•   Server applications

•   A group of interconnected whole computers
•   Working together as unified resource
•   Illusion of being one machine
•   Each computer called a node
Cluster Benefits
•   Absolute scalability
•   Incremental scalability
•   High availability
•   Superior price/performance
Cluster Configurations - Standby
     Server, No Shared Disk
Cluster v. SMP
• Both provide multiprocessor support to high demand
  applications.
• Both available commercially
   – SMP for longer
• SMP:
   – Easier to manage and control
   – Closer to single processor systems
      • Scheduling is main difference
      • Less physical space
      • Lower power consumption
• Clustering:
   – Superior incremental & absolute scalability
   – Superior availability
      • Redundancy

More Related Content

What's hot (20)

PPT
Introduction 1
Yasir Khan
 
PPTX
parallel processing
Sudarshan Mondal
 
PPTX
Introduction to parallel processing
Page Maker
 
PDF
Multithreaded processors ppt
Siddhartha Anand
 
PPT
Hardware multithreading
Fraboni Ec
 
PDF
Lecture 6.1
Mr SMAK
 
PPTX
Parallel architecture-programming
Shaveta Banda
 
PPTX
Multithreading computer architecture
Haris456
 
PPTX
Advanced processor principles
Dhaval Bagal
 
PPTX
Cache coherence problem and its solutions
Majid Saleem
 
PDF
What is simultaneous multithreading
Fraboni Ec
 
PPTX
Scope of parallelism
Syed Zaid Irshad
 
PPT
Introduction to parallel_computing
Mehul Patel
 
PPT
Parallel processing
rajshreemuthiah
 
PPTX
Multiprocessor
Neel Patel
 
PPTX
Superscalar & superpipeline processor
Muhammad Ishaq
 
PDF
Coherence and consistency models in multiprocessor architecture
University of Pisa
 
PPT
Lecture 6
Mr SMAK
 
PPTX
Parallel computing
virend111
 
PPTX
Dichotomy of parallel computing platforms
Syed Zaid Irshad
 
Introduction 1
Yasir Khan
 
parallel processing
Sudarshan Mondal
 
Introduction to parallel processing
Page Maker
 
Multithreaded processors ppt
Siddhartha Anand
 
Hardware multithreading
Fraboni Ec
 
Lecture 6.1
Mr SMAK
 
Parallel architecture-programming
Shaveta Banda
 
Multithreading computer architecture
Haris456
 
Advanced processor principles
Dhaval Bagal
 
Cache coherence problem and its solutions
Majid Saleem
 
What is simultaneous multithreading
Fraboni Ec
 
Scope of parallelism
Syed Zaid Irshad
 
Introduction to parallel_computing
Mehul Patel
 
Parallel processing
rajshreemuthiah
 
Multiprocessor
Neel Patel
 
Superscalar & superpipeline processor
Muhammad Ishaq
 
Coherence and consistency models in multiprocessor architecture
University of Pisa
 
Lecture 6
Mr SMAK
 
Parallel computing
virend111
 
Dichotomy of parallel computing platforms
Syed Zaid Irshad
 

Viewers also liked (20)

PPTX
Bus
markloresco
 
PPT
Motherboards (A+)
ajw
 
PPT
como llegar al corazon de una mujer
verododds
 
PPTX
A+ computer hardware slide
Rajendra Tete
 
PPTX
Transmission Media
MAK
 
POT
Computer motherboard
Ray Mkindo
 
PDF
Computer Troubleshooting
Lisa Hartman
 
PPTX
Computer virus 1
wargames12
 
PPT
Transmission media
sanjeev kumar
 
PPTX
Computer Literacy Lesson 9
cpashke
 
PPTX
Introduction to computer virus
YouQue ™
 
PPT
Chapter 6: Expansion Buses
askme
 
PDF
Transmission Media
Abhishek Wadhwa
 
PPT
Computer Virus
izzul
 
PPTX
Types Of Buses
Akhil Ahuja
 
PPT
Motherboard components and their functions
BESOR ACADEMY
 
PPT
Switch mode power supply
twilight28
 
PPTX
Computer hardware troubleshooting
Jerome Luison
 
Motherboards (A+)
ajw
 
como llegar al corazon de una mujer
verododds
 
A+ computer hardware slide
Rajendra Tete
 
Transmission Media
MAK
 
Computer motherboard
Ray Mkindo
 
Computer Troubleshooting
Lisa Hartman
 
Computer virus 1
wargames12
 
Transmission media
sanjeev kumar
 
Computer Literacy Lesson 9
cpashke
 
Introduction to computer virus
YouQue ™
 
Chapter 6: Expansion Buses
askme
 
Transmission Media
Abhishek Wadhwa
 
Computer Virus
izzul
 
Types Of Buses
Akhil Ahuja
 
Motherboard components and their functions
BESOR ACADEMY
 
Switch mode power supply
twilight28
 
Computer hardware troubleshooting
Jerome Luison
 
Ad

Similar to Lecture4 (20)

PPTX
Parallel Processing (Part 2)
Ajeng Savitri
 
PPTX
Week 13-14 Parrallel Processing-new.pptx
FaizanSaleem81
 
PPTX
parellelisum edited_jsdnsfnjdnjfnjdn.pptx
aravym456
 
PPT
parallel-processing.ppt
MohammedAbdelgader2
 
ODP
Distributed Computing
Sudarsun Santhiappan
 
PPT
18 parallel processing
dilip kumar
 
PPT
Parallel processing Concepts
Army Public School and College -Faisal
 
PPT
Multiprocessor_YChen.ppt
AberaZeleke1
 
PPTX
Multiprocessor.pptx
Muhammad54342
 
PDF
Advanced processor Principles
Vinit Raut
 
PPTX
High performance computing
punjab engineering college, chandigarh
 
PPT
Par com
tttoracle
 
PPTX
Lecture 04 chapter 2 - Parallel Programming Platforms
National College of Business Administration & Economics ( NCBA&E)
 
PPT
Parallel computing
Vinay Gupta
 
PPTX
CA UNIT IV.pptx
ssuser9dbd7e
 
PPTX
unit 4.pptx
SUBHAMSHARANRA211100
 
PPTX
unit 4.pptx
SUBHAMSHARANRA211100
 
PPT
Ceg4131 models
anandme07
 
DOC
Introduction to multi core
mukul bhardwaj
 
PDF
Intro to parallel computing
Piyush Mittal
 
Parallel Processing (Part 2)
Ajeng Savitri
 
Week 13-14 Parrallel Processing-new.pptx
FaizanSaleem81
 
parellelisum edited_jsdnsfnjdnjfnjdn.pptx
aravym456
 
parallel-processing.ppt
MohammedAbdelgader2
 
Distributed Computing
Sudarsun Santhiappan
 
18 parallel processing
dilip kumar
 
Parallel processing Concepts
Army Public School and College -Faisal
 
Multiprocessor_YChen.ppt
AberaZeleke1
 
Multiprocessor.pptx
Muhammad54342
 
Advanced processor Principles
Vinit Raut
 
High performance computing
punjab engineering college, chandigarh
 
Par com
tttoracle
 
Lecture 04 chapter 2 - Parallel Programming Platforms
National College of Business Administration & Economics ( NCBA&E)
 
Parallel computing
Vinay Gupta
 
CA UNIT IV.pptx
ssuser9dbd7e
 
Ceg4131 models
anandme07
 
Introduction to multi core
mukul bhardwaj
 
Intro to parallel computing
Piyush Mittal
 
Ad

Recently uploaded (20)

PPTX
IT Runs Better with ThousandEyes AI-driven Assurance
ThousandEyes
 
PPTX
cloud computing vai.pptx for the project
vaibhavdobariyal79
 
PDF
Trying to figure out MCP by actually building an app from scratch with open s...
Julien SIMON
 
PPTX
Farrell_Programming Logic and Design slides_10e_ch02_PowerPoint.pptx
bashnahara11
 
PDF
Make GenAI investments go further with the Dell AI Factory
Principled Technologies
 
PDF
Data_Analytics_vs_Data_Science_vs_BI_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
PPTX
AI Code Generation Risks (Ramkumar Dilli, CIO, Myridius)
Priyanka Aash
 
PPTX
Agile Chennai 18-19 July 2025 | Emerging patterns in Agentic AI by Bharani Su...
AgileNetwork
 
PPTX
AI in Daily Life: How Artificial Intelligence Helps Us Every Day
vanshrpatil7
 
PDF
Build with AI and GDG Cloud Bydgoszcz- ADK .pdf
jaroslawgajewski1
 
PDF
Research-Fundamentals-and-Topic-Development.pdf
ayesha butalia
 
PDF
State-Dependent Conformal Perception Bounds for Neuro-Symbolic Verification
Ivan Ruchkin
 
PDF
Presentation about Hardware and Software in Computer
snehamodhawadiya
 
PPTX
AVL ( audio, visuals or led ), technology.
Rajeshwri Panchal
 
PPTX
What-is-the-World-Wide-Web -- Introduction
tonifi9488
 
PDF
AI Unleashed - Shaping the Future -Starting Today - AIOUG Yatra 2025 - For Co...
Sandesh Rao
 
PPTX
Simple and concise overview about Quantum computing..pptx
mughal641
 
PPTX
Agentic AI in Healthcare Driving the Next Wave of Digital Transformation
danielle hunter
 
PDF
Structs to JSON: How Go Powers REST APIs
Emily Achieng
 
PPTX
Agile Chennai 18-19 July 2025 Ideathon | AI Powered Microfinance Literacy Gui...
AgileNetwork
 
IT Runs Better with ThousandEyes AI-driven Assurance
ThousandEyes
 
cloud computing vai.pptx for the project
vaibhavdobariyal79
 
Trying to figure out MCP by actually building an app from scratch with open s...
Julien SIMON
 
Farrell_Programming Logic and Design slides_10e_ch02_PowerPoint.pptx
bashnahara11
 
Make GenAI investments go further with the Dell AI Factory
Principled Technologies
 
Data_Analytics_vs_Data_Science_vs_BI_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
AI Code Generation Risks (Ramkumar Dilli, CIO, Myridius)
Priyanka Aash
 
Agile Chennai 18-19 July 2025 | Emerging patterns in Agentic AI by Bharani Su...
AgileNetwork
 
AI in Daily Life: How Artificial Intelligence Helps Us Every Day
vanshrpatil7
 
Build with AI and GDG Cloud Bydgoszcz- ADK .pdf
jaroslawgajewski1
 
Research-Fundamentals-and-Topic-Development.pdf
ayesha butalia
 
State-Dependent Conformal Perception Bounds for Neuro-Symbolic Verification
Ivan Ruchkin
 
Presentation about Hardware and Software in Computer
snehamodhawadiya
 
AVL ( audio, visuals or led ), technology.
Rajeshwri Panchal
 
What-is-the-World-Wide-Web -- Introduction
tonifi9488
 
AI Unleashed - Shaping the Future -Starting Today - AIOUG Yatra 2025 - For Co...
Sandesh Rao
 
Simple and concise overview about Quantum computing..pptx
mughal641
 
Agentic AI in Healthcare Driving the Next Wave of Digital Transformation
danielle hunter
 
Structs to JSON: How Go Powers REST APIs
Emily Achieng
 
Agile Chennai 18-19 July 2025 Ideathon | AI Powered Microfinance Literacy Gui...
AgileNetwork
 

Lecture4

  • 1. High Performance Computing Jawwad Shamsi Lecture #4 25th January 2010
  • 2. Recap • Pipelining • Super Scalar Execution – Dependencies • Branch • Data • Resource • Effect of Latency and Memory • Effect of Parallelism
  • 3. Flynn’s taxanomy It distinguishes multiprocessor computers according to data and instruction • the dimensions of Instruction and Data • SISD: Single Instruction Single data (Uniprocessor) • SIMD: Single Instruction Multiple data (Vector Processing) • MISD: Multiple Instruction Single date • MIMD: Multiple Instruction Multiple data (SMP, cluster, NUMA)
  • 4. MIMD • MIMD – Shared Memory (tightly coupled) • SMP (Symmetric Multiprocessing) • Non-Uniform Memory access – Distributed Memory (loosely coupled) • Clusters
  • 5. Taxonomy of Parallel Processor Architectures
  • 6. Shared Address Space • Shared Memory • Distributed Memory
  • 7. SMP • Two or more similar processors • Same main memory and I/O • Can perform similar operations • Share access to I/O devices
  • 8. Multiprogramming and Multiprocessing
  • 9. SMP Advantages • Performance • Availability • Incremental growth • Scaling
  • 10. Block Diagram of Tightly Coupled Multiprocessor
  • 11. Cache Coherence • Multiple copies of cache can maintain different data – Protocols?
  • 12. Processor Design: Modes of Parallelism • Two ways to increase parallelism – Superscaling • Instruction level parallelism – Threading • Thread level parallelism – Concept of Multithreaded processors » May or may not be different than OS level mult-threading • Temporal Multi-threading (also called implicit) – Instructions from only one thread • Simultaneous Multi-threading (explicit) – Instructions from more than one thread can be executed
  • 13. Scalar Processor Approaches • Single-threaded scalar – Simple pipeline – No multithreading • Interleaved multithreaded scalar – Easiest multithreading to implement – Switch threads at each clock cycle – Pipeline stages kept close to fully occupied – Hardware needs to switch thread context between cycles • Blocked multithreaded scalar – Thread executed until latency event occurs – Would stop pipeline – Processor switches to another thread
  • 14. Clusters • Alternative to SMP • High performance • High availability • Server applications • A group of interconnected whole computers • Working together as unified resource • Illusion of being one machine • Each computer called a node
  • 15. Cluster Benefits • Absolute scalability • Incremental scalability • High availability • Superior price/performance
  • 16. Cluster Configurations - Standby Server, No Shared Disk
  • 17. Cluster v. SMP • Both provide multiprocessor support to high demand applications. • Both available commercially – SMP for longer • SMP: – Easier to manage and control – Closer to single processor systems • Scheduling is main difference • Less physical space • Lower power consumption • Clustering: – Superior incremental & absolute scalability – Superior availability • Redundancy