
CSC580

Parallel Processing
Lecture 1: Introduction to Parallel Computing

PREPARED BY: SALIZA RAMLY


Topic Description
This topic introduces students to:
◦ Motivating Parallelism
◦ Scope of Parallel Computing
◦ Parallel Computing Terminology



Motivating Parallelism


Fundamentals of Parallel Computing – Von Neumann Architecture
For over 40 years, virtually all computers have followed a common machine model known as the von Neumann computer, named after the Hungarian mathematician John von Neumann.
A von Neumann computer uses the stored-program concept: the CPU executes a stored program that specifies a sequence of read and write operations on the memory.


von Neumann Architecture
Comprised of four main components:
o Memory
o Control Unit
o Arithmetic Logic Unit
o Input/Output

Parallel computers still follow this basic design, just multiplied in units. The basic, fundamental architecture remains the same.


Basic Design
Memory is used to store both program instructions and data.
Program instructions are coded data which tell the computer to do something.
Data is simply information to be used by the program.
A central processing unit (CPU) gets instructions and/or data from memory, decodes the instructions and then sequentially performs them.
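
To illustrate the stored-program concept, here is a toy sketch (added for illustration, not part of the original slides; the opcodes are invented): program and data share one memory array, and the CPU loop fetches, decodes, and executes one instruction at a time.

#include <stdio.h>

/* A toy stored-program machine: both the "program" and its data
 * live in the same memory array, and the CPU loop fetches,
 * decodes, and executes one instruction per step. */

enum { OP_LOAD, OP_ADD, OP_PRINT, OP_HALT };

int main(void) {
    /* memory holds (opcode, operand) pairs */
    int memory[] = {
        OP_LOAD, 5,    /* acc = 5   */
        OP_ADD,  7,    /* acc += 7  */
        OP_PRINT, 0,   /* print acc */
        OP_HALT, 0
    };
    int pc = 0;   /* program counter */
    int acc = 0;  /* accumulator */

    for (;;) {
        int op  = memory[pc];       /* fetch */
        int arg = memory[pc + 1];
        pc += 2;
        switch (op) {               /* decode and execute */
        case OP_LOAD:  acc = arg;            break;
        case OP_ADD:   acc += arg;           break;
        case OP_PRINT: printf("%d\n", acc);  break;
        case OP_HALT:  return 0;
        }
    }
}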



What is Parallel Computing?
IN ORDER TO UNDERSTAND PARALLEL COMPUTING, WE MUST FIRST UNDERSTAND THE MEANING OF SERIAL COMPUTING.


Serial Computing
Traditionally, software has been written for serial computation:
o A problem is broken into a discrete series of instructions
o Instructions are executed sequentially, one after another
o Executed on a single processor
o Only one instruction may execute at any moment in time
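
To make this concrete, here is a minimal serial computation in C (an illustration added here, not from the original slides): one processor executes one instruction stream, one statement at a time.

#include <stdio.h>

#define N 8

/* Serial computation: a single instruction stream sums the array
 * one element at a time; no two operations ever overlap in time. */
int main(void) {
    int data[N] = {1, 2, 3, 4, 5, 6, 7, 8};
    int sum = 0;

    for (int i = 0; i < N; i++)
        sum += data[i];   /* executed sequentially, one per step */

    printf("sum = %d\n", sum);
    return 0;
}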


Parallel Computing
In the simplest sense, parallel computing is the simultaneous use of multiple compute resources to solve a computational problem:
o A problem is broken into discrete parts that can be solved concurrently
o Each part is further broken down to a series of instructions
o Instructions from each part execute simultaneously on different processors
o An overall control/coordination mechanism is employed
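
A minimal sketch of the same array sum decomposed into parts (not from the original slides; POSIX threads are used here as one of many possible coordination mechanisms, and the thread count is illustrative). Compile with cc -pthread.

#include <pthread.h>
#include <stdio.h>

#define N        8
#define NTHREADS 2

/* Each thread is one "part" of the problem: it sums its own
 * chunk of the array, and the main thread coordinates and
 * combines the partial results. */
int data[N] = {1, 2, 3, 4, 5, 6, 7, 8};
int partial[NTHREADS];

static void *sum_part(void *arg) {
    int id = *(int *)arg;
    int chunk = N / NTHREADS;
    int start = id * chunk;

    partial[id] = 0;
    for (int i = start; i < start + chunk; i++)
        partial[id] += data[i];
    return NULL;
}

int main(void) {
    pthread_t threads[NTHREADS];
    int ids[NTHREADS];
    int sum = 0;

    /* overall control/coordination: start the workers... */
    for (int t = 0; t < NTHREADS; t++) {
        ids[t] = t;
        pthread_create(&threads[t], NULL, sum_part, &ids[t]);
    }
    /* ...wait for them, then combine the partial results */
    for (int t = 0; t < NTHREADS; t++) {
        pthread_join(threads[t], NULL);
        sum += partial[t];
    }
    printf("sum = %d\n", sum);
    return 0;
}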


Why Parallel Computing?
THIS IS A LEGITIMATE QUESTION! PARALLEL COMPUTING IS COMPLEX IN EVERY ASPECT!


Limitations of Serial Computing
Limits to serial computing: both physical and practical reasons pose significant constraints to simply building ever faster serial computers.
o Transmission speeds: the speed of a serial computer is directly dependent upon how fast data can move through hardware. Absolute limits are the speed of light (30 cm/nanosecond) and the transmission limit of copper wire (9 cm/nanosecond). Increasing speeds necessitate increasing proximity of processing elements.
o Limits to miniaturization: processor technology is allowing an increasing number of transistors to be placed on a chip. However, even with molecular or atomic-level components, a limit will be reached on how small components can be.
o Economic limitations: it is increasingly expensive to make a single processor faster. Using a larger number of moderately fast commodity processors to achieve the same (or better) performance is less expensive.


Why Use Parallel Computing?
Main Reasons:
• Save time - wall clock time
• Solve larger / more complex problems
• Provide concurrency - do multiple things at the same time


Why Use Parallel Computing?
Other Reasons:
• Taking advantage of non-local resources - using available compute resources on a wide area network, or even the Internet, when local compute resources are scarce.
• Cost savings - using multiple "cheap" computing resources instead of paying for time on a supercomputer.
• Overcoming memory constraints - single computers have very finite memory resources. For large problems, using the memories of multiple computers may overcome this obstacle.


Parallel Computing: what for?
Parallel computing is an evolution of serial computing that attempts to emulate what
has always been the state of affairs in the natural world: many complex, interrelated
events happening at the same time, yet within a sequence.
Some examples:
o Planetary and galactic orbits
o Weather and ocean patterns
o Tectonic plate drift
o Rush hour traffic in Paris
o Automobile assembly line
o Daily operations within a business
o Building a shopping mall
o Ordering a hamburger at the drive-through.



Parallel Computing: what for?
Traditionally, parallel computing has been considered to be "the high end of
computing" and has been motivated by numerical simulations of complex systems
and "Grand Challenge Problems" such as:
o weather and climate
o chemical and nuclear reactions
o biological, human genome
o geological, seismic activity
o mechanical devices - from prosthetics to spacecraft
o electronic circuits
o manufacturing processes


Parallel Computing: what for?
Today, commercial applications are providing an equal or greater driving force in the
development of faster computers. These applications require the processing of large
amounts of data in sophisticated ways. Example applications include:
o parallel databases, data mining
o oil exploration
o web search engines, web-based business services
o computer-aided diagnosis in medicine
o management of national and multi-national corporations
o advanced graphics and virtual reality, particularly in the entertainment industry
o networked video and multi-media technologies
o collaborative work environments


Parallel Computing: what for?
ULTIMATELY, PARALLEL COMPUTING IS AN ATTEMPT TO MAXIMIZE THE INFINITE BUT SEEMINGLY SCARCE COMMODITY CALLED TIME.


Scope of Parallel Computing Applications

Parallelism finds applications in very diverse application domains, for different motivating reasons. These range from improved application performance to cost considerations.

Who is Using Parallel Computing?
o Applications in Engineering and Design
o Scientific Applications
o Commercial Applications
o Applications in Computer Systems


(1) Applications in Engineering and Design
o Design of airfoils (optimizing lift, drag, stability), internal combustion engines
(optimizing charge distribution, burn), high-speed circuits (layouts for delays
and capacitive and inductive effects), and structures (optimizing structural
integrity, design parameters, cost, etc.).
o Design and simulation of micro- and nano-scale systems.
o Process optimization, operations research.



(2) Scientific Applications
o Functional and structural characterization of genes and proteins.
o Advances in computational physics and chemistry have explored new
materials, understanding of chemical pathways, and more efficient processes.
o Applications in astrophysics have explored the evolution of galaxies,
thermonuclear processes, and the analysis of extremely large datasets from
telescopes.
o Weather modeling, mineral prospecting, flood prediction, etc., are other
important applications.
o Bioinformatics and astrophysics also present some of the most challenging
problems with respect to analyzing extremely large datasets.



(3) Commercial Applications
o Some of the largest parallel computers power Wall Street!
o Data mining and analysis for optimizing business and marketing decisions.
o Large scale servers (mail and web servers) are often implemented using
parallel platforms.
o Applications such as information retrieval and search are typically powered by
large clusters.



(4) Applications in Computer Systems
o Network intrusion detection, cryptography, multiparty computations are some
of the core users of parallel computing techniques.
o Embedded systems increasingly rely on distributed control algorithms.
o A modern automobile consists of tens of processors communicating to
perform complex tasks for optimizing handling and performance.
o Conventional structured peer-to-peer networks impose overlay networks and
utilize algorithms directly from parallel computing.



Flynn's Classical Taxonomy

There are different ways to classify parallel computers.
o One of the more widely used classifications, in use since 1966, is called Flynn's Taxonomy.
o Flynn's taxonomy distinguishes multi-processor computer architectures according to how they can be classified along the two independent dimensions of Instruction Stream and Data Stream. Each of these dimensions can have only one of two possible states: Single or Multiple.
o Together, these two dimensions define the 4 possible classifications according to Flynn: SISD, SIMD, MISD, and MIMD, described below.


a) Single Instruction, Single Data (SISD)
o A serial (non-parallel) computer
o Single Instruction: Only one instruction stream is being acted on by the CPU during any one clock cycle
o Single Data: Only one data stream is being used as input during any one clock cycle
o Deterministic execution
o This is the oldest type of computer
o Examples: older generation mainframes, minicomputers, workstations and single processor/core PCs
b) Single Instruction, Multiple Data (SIMD)
o A type of parallel computer
o Single Instruction: All processing units execute the same instruction at any given clock cycle
o Multiple Data: Each processing unit can operate on a different data element
o Best suited for specialized problems characterized by a high degree of regularity, such as graphics/image processing
o Synchronous (lockstep) and deterministic execution
o Two varieties: Processor Arrays and Vector Pipelines
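
A small C sketch of the SIMD idea (added for illustration, not from the original slides), using x86 SSE intrinsics: a single instruction, _mm_add_ps, performs four float additions in lockstep. Assumes an x86 processor with SSE support.

#include <stdio.h>
#include <xmmintrin.h>   /* x86 SSE intrinsics */

/* SIMD: one instruction operates on multiple data elements. */
int main(void) {
    float a[4] = {1.0f, 2.0f, 3.0f, 4.0f};
    float b[4] = {10.0f, 20.0f, 30.0f, 40.0f};
    float c[4];

    __m128 va = _mm_loadu_ps(a);      /* load 4 floats at once */
    __m128 vb = _mm_loadu_ps(b);
    __m128 vc = _mm_add_ps(va, vb);   /* ONE instruction, 4 adds */
    _mm_storeu_ps(c, vc);

    for (int i = 0; i < 4; i++)
        printf("%.1f ", c[i]);        /* 11.0 22.0 33.0 44.0 */
    printf("\n");
    return 0;
}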
c) Multiple Instructions, Single Data (MISD)
o A type of parallel computer
o Multiple Instructions: Each processing unit operates on the data independently via separate instruction streams
o Single Data: A single data stream is fed into multiple processing units
o Few (if any) actual examples of this class of parallel computer have ever existed
o Some conceivable uses might be:
  o multiple frequency filters operating on a single signal stream
  o multiple cryptography algorithms attempting to crack a single coded message


d) Multiple Instructions, Multiple Data (MIMD)
o A type of parallel computer
o Multiple Instruction: Every processor may be executing a different instruction stream
o Multiple Data: Every processor may be working with a different data stream
o Execution can be synchronous or asynchronous, deterministic or non-deterministic
o Currently, the most common type of parallel computer - most modern supercomputers fall into this category
o Examples: most current supercomputers, networked parallel computer clusters and "grids", multi-processor SMP computers, multi-core PCs
o Note: many MIMD architectures also include SIMD execution sub-components
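
A hedged C sketch of the MIMD idea (added for illustration, not from the original slides), using POSIX threads: two threads run different instruction streams on different data, possibly at the same moment in time. Compile with cc -pthread.

#include <pthread.h>
#include <stdio.h>

/* MIMD: each thread runs a DIFFERENT instruction stream on
 * DIFFERENT data. */

static void *sum_stream(void *arg) {        /* instruction stream 1 */
    int *v = arg, s = 0;
    for (int i = 0; i < 4; i++) s += v[i];
    printf("sum = %d\n", s);
    return NULL;
}

static void *max_stream(void *arg) {        /* instruction stream 2 */
    int *v = arg, m = v[0];
    for (int i = 1; i < 4; i++) if (v[i] > m) m = v[i];
    printf("max = %d\n", m);
    return NULL;
}

int main(void) {
    int a[4] = {3, 1, 4, 1};   /* data stream 1 */
    int b[4] = {5, 9, 2, 6};   /* data stream 2 */
    pthread_t t1, t2;

    pthread_create(&t1, NULL, sum_stream, a);
    pthread_create(&t2, NULL, max_stream, b);
    pthread_join(t1, NULL);
    pthread_join(t2, NULL);
    return 0;
}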


Parallel Computing Terminology
Some General Parallel Terminology
Like everything else, parallel computing has its own "jargon". Some of the more commonly used terms associated with parallel computing are listed below. Most of these will be discussed in more detail later.


Task
• A logically discrete section of computational work. A task is typically a program or program-like set of instructions that is executed by a processor.

Parallel Task
• A task that can be executed by multiple processors safely (yields correct results).

Serial Execution
• Execution of a program sequentially, one statement at a time. In the simplest sense, this is what happens on a one-processor machine. However, virtually all parallel tasks will have sections of a parallel program that must be executed serially.


Parallel Execution
• Execution of a program by more than one task, with each task being able to execute the same or different statement at the same moment in time.

Shared Memory
• From a strictly hardware point of view, describes a computer architecture where all processors have direct (usually bus-based) access to common physical memory. In a programming sense, it describes a model where parallel tasks all have the same "picture" of memory and can directly address and access the same logical memory locations regardless of where the physical memory actually exists.

Distributed Memory
• In hardware, refers to network-based memory access for physical memory that is not common. As a programming model, tasks can only logically "see" local machine memory and must use communications to access memory on other machines where other tasks are executing.
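
A minimal MPI sketch of the distributed-memory model (added for illustration, not from the original slides): each task sees only its own memory, so task 1 can obtain task 0's value only through communication. Assumes an MPI installation (compile with mpicc, run with mpirun -np 2).

#include <mpi.h>
#include <stdio.h>

/* Distributed memory: each task has its own private 'value';
 * sharing it requires an explicit message. */
int main(int argc, char **argv) {
    int rank, value;

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    if (rank == 0) {
        value = 42;                       /* lives only in task 0's memory */
        MPI_Send(&value, 1, MPI_INT, 1, 0, MPI_COMM_WORLD);
    } else if (rank == 1) {
        MPI_Recv(&value, 1, MPI_INT, 0, 0, MPI_COMM_WORLD,
                 MPI_STATUS_IGNORE);
        printf("task 1 received %d from task 0\n", value);
    }

    MPI_Finalize();
    return 0;
}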


Communications
• Parallel tasks typically need to exchange data. There are several ways this can be accomplished, such as through a shared memory bus or over a network; however, the actual event of data exchange is commonly referred to as communications regardless of the method employed.

Synchronization
• The coordination of parallel tasks in real time, very often associated with communications. Often implemented by establishing a synchronization point within an application where a task may not proceed further until another task(s) reaches the same or logically equivalent point.
• Synchronization usually involves waiting by at least one task, and can therefore cause a parallel application's wall clock execution time to increase.
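
A small POSIX-threads sketch of a synchronization point (added for illustration, not from the original slides), using a barrier: no thread proceeds past pthread_barrier_wait until all threads have reached it, so at least one thread waits. Assumes POSIX barriers are available (Linux); compile with cc -pthread.

#include <pthread.h>
#include <stdio.h>

#define NTHREADS 4

/* Synchronization point: no thread passes the barrier until all
 * NTHREADS threads have reached it. */
pthread_barrier_t barrier;

static void *worker(void *arg) {
    int id = *(int *)arg;

    printf("thread %d: working\n", id);
    pthread_barrier_wait(&barrier);        /* everyone waits here */
    printf("thread %d: past the barrier\n", id);
    return NULL;
}

int main(void) {
    pthread_t t[NTHREADS];
    int ids[NTHREADS];

    pthread_barrier_init(&barrier, NULL, NTHREADS);
    for (int i = 0; i < NTHREADS; i++) {
        ids[i] = i;
        pthread_create(&t[i], NULL, worker, &ids[i]);
    }
    for (int i = 0; i < NTHREADS; i++)
        pthread_join(t[i], NULL);
    pthread_barrier_destroy(&barrier);
    return 0;
}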


Granularity
• In parallel computing, granularity is a qualitative measure of the ratio of computation to communication.
• Coarse: relatively large amounts of computational work are done between communication events
• Fine: relatively small amounts of computational work are done between communication events

Observed Speedup
• Observed speedup of a code which has been parallelized, defined as:

  speedup = wall-clock time of serial execution / wall-clock time of parallel execution

• One of the simplest and most widely used indicators for a parallel program's performance.
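
A quick worked example (added here, not from the original slides): if the serial version of a program runs in 100 seconds of wall-clock time and the parallelized version runs in 20 seconds on 8 processors, the observed speedup is 100 / 20 = 5, short of the ideal speedup of 8 because of parallel overhead.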


Parallel Overhead
• The amount of time required to coordinate parallel tasks, as opposed to doing useful work. Parallel overhead can include factors such as:
  • Task start-up time
  • Synchronizations
  • Data communications
  • Software overhead imposed by parallel compilers, libraries, tools, operating system, etc.
  • Task termination time

Massively Parallel
• Refers to the hardware that comprises a given parallel system - having many processors. The meaning of "many" keeps increasing, but currently BG/L pushes this number to 6 digits.


Scalability
• Refers to a parallel system's (hardware and/or software) ability to demonstrate a proportionate increase in parallel speedup with the addition of more processors. Factors that contribute to scalability include:
  • Hardware - particularly memory-CPU bandwidths and network communications
  • Application algorithm
  • Parallel overhead related
  • Characteristics of your specific application and coding


NEXT! LECTURE 2: PARALLEL PLATFORMS (PART 1)
