0% found this document useful (0 votes)

32 views

Parallel Computation Models: Slide 1

The document discusses parallel computation models and parallel algorithms. It introduces several parallel computation models including PRAM (parallel RAM), networks with fixed interconnections like meshes and hypercubes, Boolean and combinatorial circuits, BSP and LOGP models. It then discusses parallel versus distributed systems and metrics for analyzing parallel algorithms like speedup. Key parallel computation models covered in more detail include PRAM (parallel RAM), networks and the network model of computation.

Uploaded by

jyoti

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

32 views

Parallel Computation Models: Slide 1

Uploaded by

jyoti

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 28

Parallel Computation Models

Lecture 3
Lecture 4

Slide 1

Parallel Computation Models

PRAM (parallel RAM)
Fixed Interconnection Network
bus, ring, mesh, hypercube, shuffle-exchange

Boolean Circuits
Combinatorial Circuits
BSP
LOGP
Slide 2

PARALLEL AND DISTRIBUTED

COMPUTATION
MANY INTERCONNECTED PROCESSORS WORKING CONCURRENTLY

INTERCONNECTION
NETWORK

P2
P1

....

CONNECTION MACHINE
INTERNET

Connects all the computers of the world

Slide 3

TYPES OF MULTIPROCESSING FRAMEWORKS

PARALLEL
DISTRIBUTED
TECHNICAL ASPECTS
PARALLEL COMPUTERS (USUALLY) WORK IN TIGHT SYNCRONY, SHARE MEMORY TO A LARGE
EXTENT AND HAVE A VERY FAST AND RELIABLE COMMUNICATION MECHANISM BETWEEN THEM.
DISTRIBUTED COMPUTERS ARE MORE INDEPENDENT, COMMUNICATION IS LESS
FREQUENT AND LESS SYNCRONOUS, AND THE COOPERATION IS LIMITED.
PURPOSES
PARALLEL COMPUTERS COOPERATE TO SOLVE MORE EFFICIENTLY (POSSIBLY)
DIFFICULT PROBLEMS
DISTRIBUTED COMPUTERS HAVE INDIVIDUAL GOALS AND PRIVATE ACTIVITIES.
SOMETIME COMMUNICATIONS WITH OTHER ONES ARE NEEDED. (E. G. DISTRIBUTED DATA
BASE OPERATIONS).

PARALLEL COMPUTERS: COOPERATION IN A POSITIVE SENSE

DISTRIBUTED COMPUTERS: COOPERATION IN A NEGATIVE SENSE,
ONLY WHEN IT IS NECESSARY

Slide 4

FOR PARALLEL SYSTEMS

WE ARE INTERESTED TO SOLVE ANY PROBLEM IN PARALLEL

FOR DISTRIBUTED SYSTEMS

WE ARE INTERESTED TO SOLVE IN PARALLEL
PARTICULAR PROBLEMS ONLY, TYPICAL EXAMPLES ARE:
COMMUNICATION SERVICES
ROUTING
BROADCASTING
MAINTENANCE OF CONTROL STUCTURE
SPANNING TREE CONSTRUCTION
TOPOLOGY UPDATE
LEADER ELECTION
RESOURCE CONTROL ACTIVITIES
LOAD BALANCING
MANAGING GLOBAL DIRECTORIES

Slide 5

PARALLEL ALGORITHMS
WHICH MODEL OF COMPUTATION IS THE BETTER TO USE?
HOW MUCH TIME WE EXPECT TO SAVE USING A PARALLEL ALGORITHM?
HOW TO CONSTRUCT EFFICIENT ALGORITHMS?
MANY CONCEPTS OF THE COMPLEXITY THEORY MUST BE REVISITED
IS THE PARALLELISM A SOLUTION FOR HARD PROBLEMS?
ARE THERE PROBLEMS NOT ADMITTING AN EFFICIENT PARALLEL SOLUTION,
THAT IS INHERENTLY SEQUENTIAL PROBLEMS?

Slide 6

We need a model of computation

NETWORK (VLSI) MODEL
The processors are connected by a network of bounded degree.
No shared memory is available.
Several interconnection topologies.
Synchronous way of operating.
MESH CONNECTED ARRAY

degree = 4 (N)

diameter = 2N

Slide 7

HYPERCUBE

0110

0111
1110

0100

degree = 4 (log2N)
0000

0101
0010

1111

1100

0011

1101
1010

diameter = 4

1011

0001
1000

1001

N = 24 PROCESSORS

Slide 8

Other important topologies

binary trees
mesh of trees
cube connected cycles
In the network model a PARALLEL MACHINE is a very complex
ensemble of small interconnected units, performing elementary
operations.
- Each processor has its own memory.
- Processors work synchronously.
LIMITS OF THE MODEL
different topologies require different algorithms to solve the same
problem
it is difficult to describe and analyse algorithms (the migration of
data have to be described)
A shared-memory model is more suitable by an algorithmic point of view
Slide 9

Model Equivalence
given two models M1 and M2, and a problem
of size n
if M1 and M2 are equivalent then solving
requires:
T(n) time and P(n) processors on M1
T(n)O(1) time and P(n)O(1) processors on M2
Slide 10

PRAM
Parallel Random Access Machine
Shared-memory multiprocessor
unlimited number of processors, each
has unlimited local memory
knows its ID
able to access the shared memory

unlimited shared memory

Slide 11

PRAM MODEL
1
2
3

P1
P2

.
Pi

Common Memory

.
.
.

Pn
m

PRAM n RAM processors connected to a common memory of m cells

ASSUMPTION: at each time unit each Pi can read a memory cell, make an internal
computation and write another memory cell.
CONSEQUENCE: any pair of processor Pi Pj can communicate in constant time!
Pi writes the message in cell x at time t
Pi reads the message in cell x at time t+1

Slide 12

PRAM
Inputs/Outputs are placed in the shared
memory (designated address)
Memory cell stores an arbitrarily large
integer
Each instruction takes unit time
Instructions are synchronized across the
processors
Slide 13

PRAM Instruction Set

accumulator architecture
memory cell R0 accumulates results

multiply/divide instructions take only

constant operands
prevents generating exponentially large
numbers in polynomial time
Slide 14

PRAM Complexity Measures

for each individual processor
time: number of instructions executed
space: number of memory cells accessed

PRAM machine
time: time taken by the longest running processor
hardware: maximum number of active processors

Slide 15

Two Technical Issues for PRAM

How processors are activated
How shared memory is accessed

Slide 16

Processor Activation
P0 places the number of processors (p) in the
designated shared-memory cell
each active Pi, where i < p, starts executing
O(1) time to activate
all processors halt when P0 halts

Active processors explicitly activate additional

processors via FORK instructions
tree-like activation
O(log p) time to activate
Slide 17

THE PRAM IS A THEORETICAL (UNFEASIBLE) MODEL

The interconnection network between processors and memory would require
a very large amount of area .
The message-routing on the interconnection network would require time
proportional to network size (i. e. the assumption of a constant access time
to the memory is not realistic).

WHY THE PRAM IS A REFERENCE MODEL?

Algorithms designers can forget the communication problems and focus their
attention on the parallel computation only.
There exist algorithms simulating any PRAM algorithm on bounded degree
networks.
E. G. A PRAM algorithm requiring time T(n), can be simulated in a mesh of tree
in time T(n)log2n/loglogn, that is each step can be simulated with a slow-down
of log2n/loglogn.
Instead of design ad hoc algorithms for bounded degree networks, design more
general algorithms for the PRAM model and simulate them on a feasible network.
Slide 18

For the PRAM model there exists a well developed body of techniques
and methods to handle different classes of computational problems.
The discussion on parallel model of computation is still HOT
The actual trend:
COARSE-GRAINED MODELS
The degree of parallelism allowed is independent from the number
of processors.
The computation is divided in supersteps, each one includes
local computation
communication phase
syncronization phase
the study is still at the beginning!
Slide 19

Metrics
A measure of relative performance between a multiprocessor
system and a single processor system is the speed-up S( p),
defined as follows:
S( p) =

Execution time using a single processor system

Execution time using a multiprocessor with p processors

S( p) =

T1
Tp

Efficiency =

Sp
p

Cost = p Tp
Slide 20

Metrics
Parallel algorithm is cost-optimal:
parallel cost = sequential time
C p = T1
Ep = 100%

Critical when down-scaling:

parallel implementation may
become slower than sequential
T1 = n 3
Tp = n2.5 when p = n2
Cp = n4.5
Slide 21

Amdahls Law
f = fraction of the problem thats
inherently sequential
(1 f) = fraction thats parallel

Parallel time Tp:

Tp f (1 f ) p

1
Speedup with p processors: S p
1 f
f
p
Slide 22

Amdahls Law
Upper bound on speedup (p = )
1
Sp
1 f
f
p

Converges to 0

1
S
f

Example:
f = 2%
S = 1 / 0.02 = 50
Slide 23

PRAM
Too many interconnections gives problems with synchronization
However it is the best conceptual model for designing efficient
parallel algorithms
due to simplicity and possibility of simulating efficiently PRAM
algorithms on more realistic parallel architectures

Slide 24

Shared-Memory Access
Concurrent (C) means, many processors can do the operation simultaneously in
the same memory
Exclusive (E) not concurent

EREW (Exclusive Read Exclusive Write)

CREW (Concurrent Read Exclusive Write)
Many processors can read simultaneously
the same location, but only one can attempt to write to a given location

ERCW
CRCW
Many processors can write/read at/from the same memory location

Slide 25

Example CRCW-PRAM
Initially
table A contains values 0 and 1
output contains value 0

The program computes the Boolean OR of

A[1], A[2], A[3], A[4], A[5]

Slide 26

Example CREW-PRAM
Assume initially table A contains [0,0,0,0,0,1] and we

have the parallel program

Slide 27

Pascal triangle

PRAM CREW

Slide 28

Parallel Algorithms Ws 20
No ratings yet
Parallel Algorithms Ws 20
353 pages
Parallel Algorithm Merged
No ratings yet
Parallel Algorithm Merged
76 pages
PRAM Algorithms
100% (1)
PRAM Algorithms
24 pages
Hacky Easter 2016 Solutions
No ratings yet
Hacky Easter 2016 Solutions
92 pages
Digital Marketing Agencies in Dubai
No ratings yet
Digital Marketing Agencies in Dubai
3 pages
Team Center Services
100% (1)
Team Center Services
70 pages
Rodnay Zaks-6502 Applications Book-Sybex Inc.,U.S. (1981) PDF
No ratings yet
Rodnay Zaks-6502 Applications Book-Sybex Inc.,U.S. (1981) PDF
281 pages
Market Research & Intelligence
100% (3)
Market Research & Intelligence
16 pages
L2 Parallel Computing Models
No ratings yet
L2 Parallel Computing Models
31 pages
PDA_3
No ratings yet
PDA_3
90 pages
Par Seq Algorithms
No ratings yet
Par Seq Algorithms
44 pages
Pram Algorithms: Parallel and Distributed Algorithms BY Debdeep Mukhopadhyay AND Abhishek Somani
No ratings yet
Pram Algorithms: Parallel and Distributed Algorithms BY Debdeep Mukhopadhyay AND Abhishek Somani
17 pages
Parallel Algorithm Main Single
No ratings yet
Parallel Algorithm Main Single
289 pages
Parallel Random Access Machine
No ratings yet
Parallel Random Access Machine
22 pages
Ram, Pram, and Logp Models
No ratings yet
Ram, Pram, and Logp Models
72 pages
1 Overview, Models of Computation, Brent's Theorem
No ratings yet
1 Overview, Models of Computation, Brent's Theorem
8 pages
unit1 2 and 3
No ratings yet
unit1 2 and 3
76 pages
Lecture 9 - Parallel Algorithms
No ratings yet
Lecture 9 - Parallel Algorithms
28 pages
05 - Lecture #5 - 6
No ratings yet
05 - Lecture #5 - 6
42 pages
What Is Parallel Computing 1 PDF
No ratings yet
What Is Parallel Computing 1 PDF
21 pages
Parallel and Distributed Algorithms
No ratings yet
Parallel and Distributed Algorithms
65 pages
hpc_parallel
No ratings yet
hpc_parallel
122 pages
PRAM and Distributed Computing Report
No ratings yet
PRAM and Distributed Computing Report
5 pages
Chapter 3
No ratings yet
Chapter 3
21 pages
Lecture 8 Miscellaneous Topics
No ratings yet
Lecture 8 Miscellaneous Topics
52 pages
Paralle Processing in Brief
No ratings yet
Paralle Processing in Brief
31 pages
PDA_1
No ratings yet
PDA_1
72 pages
Parallel Programming: Sathish S. Vadhiyar Course Web Page
No ratings yet
Parallel Programming: Sathish S. Vadhiyar Course Web Page
36 pages
PRAM Model
No ratings yet
PRAM Model
72 pages
Parallel Computing
No ratings yet
Parallel Computing
28 pages
Parallel Algorithms Presentation (1)
No ratings yet
Parallel Algorithms Presentation (1)
32 pages
Chapter 02
No ratings yet
Chapter 02
47 pages
PRAM COMP 633: Parallel Computing Algorithms: The PRAM Model of Computation
No ratings yet
PRAM COMP 633: Parallel Computing Algorithms: The PRAM Model of Computation
49 pages
Lect 1 Overview
No ratings yet
Lect 1 Overview
17 pages
Notes 03
No ratings yet
Notes 03
3 pages
2-INTRODUCTION TO PDC - MOTIVATION - KEY CONCEPTS-03-Dec-2019Material - I - 03-Dec-2019 - Module - 1 PDF
No ratings yet
2-INTRODUCTION TO PDC - MOTIVATION - KEY CONCEPTS-03-Dec-2019Material - I - 03-Dec-2019 - Module - 1 PDF
63 pages
Introduction To Parallel Programming: Linda Woodard CAC 19 May 2010
100% (1)
Introduction To Parallel Programming: Linda Woodard CAC 19 May 2010
38 pages
Chapter 14: Parallel Algorithms
No ratings yet
Chapter 14: Parallel Algorithms
23 pages
parallel and distributed algorithms
No ratings yet
parallel and distributed algorithms
21 pages
Models For Parallel And Distributed Computation Theory Algorithmic Techniques And Applications 1st Edition Michel Cosnard Auth pdf download
100% (1)
Models For Parallel And Distributed Computation Theory Algorithmic Techniques And Applications 1st Edition Michel Cosnard Auth pdf download
79 pages
Parallel Algorithms and Architectures 1
No ratings yet
Parallel Algorithms and Architectures 1
22 pages
Fundamental Algorithms: Chapter 3: Parallel Algorithms - The PRAM Model
No ratings yet
Fundamental Algorithms: Chapter 3: Parallel Algorithms - The PRAM Model
26 pages
PDC Complete Course File
No ratings yet
PDC Complete Course File
422 pages
PRAMs
No ratings yet
PRAMs
67 pages
HPC Lectures 1 5
No ratings yet
HPC Lectures 1 5
18 pages
The PRAM Model and Algorithms: Advanced Topics Spring 2008
No ratings yet
The PRAM Model and Algorithms: Advanced Topics Spring 2008
24 pages
Chapter 4
No ratings yet
Chapter 4
46 pages
HPC Note
No ratings yet
HPC Note
39 pages
Parallel Algorithms: Peter Harrison and William Knottenbelt
No ratings yet
Parallel Algorithms: Peter Harrison and William Knottenbelt
65 pages
Khaitan PSERC Webinar HPC Mar 2013 Slides
No ratings yet
Khaitan PSERC Webinar HPC Mar 2013 Slides
52 pages
Assignment of Algorithm
No ratings yet
Assignment of Algorithm
9 pages
Parallel Random Access Machine
No ratings yet
Parallel Random Access Machine
8 pages
Parallel Algorithms: Theory and Practice
No ratings yet
Parallel Algorithms: Theory and Practice
44 pages
CS621-CHEATSHEET.docx
No ratings yet
CS621-CHEATSHEET.docx
11 pages
Notes 02
No ratings yet
Notes 02
9 pages
Project - ParallelComputing BSR v2
No ratings yet
Project - ParallelComputing BSR v2
40 pages
Parallel Computing
No ratings yet
Parallel Computing
19 pages
PA midsem
No ratings yet
PA midsem
20 pages
EE664: Introduction To Parallel Computing: Dr. Gaurav Trivedi Lectures 5-14
No ratings yet
EE664: Introduction To Parallel Computing: Dr. Gaurav Trivedi Lectures 5-14
170 pages
Chapter 02 - Asynchronous and Parallel Programming in .NET
No ratings yet
Chapter 02 - Asynchronous and Parallel Programming in .NET
55 pages
Week1 - Parallel and Distributed Computing
100% (1)
Week1 - Parallel and Distributed Computing
46 pages
001__DDS-IIIT-Jan-10th
No ratings yet
001__DDS-IIIT-Jan-10th
34 pages
PC 1
No ratings yet
PC 1
53 pages
Quantum Computer Vs Traditional Computer
From Everand
Quantum Computer Vs Traditional Computer
Arief Muinnudin
No ratings yet
The Complete Future Trait Guide
From Everand
The Complete Future Trait Guide
Hamze Ghalebi
No ratings yet
Practical Workbook-1
No ratings yet
Practical Workbook-1
11 pages
Cisco ASA Access Lists Concepts and Configuration
No ratings yet
Cisco ASA Access Lists Concepts and Configuration
5 pages
Beginners Python Cheat Sheet PCC Files Exceptions PDF
No ratings yet
Beginners Python Cheat Sheet PCC Files Exceptions PDF
2 pages
Embedded System Interfacing: Introduction To Switch & Keypad
No ratings yet
Embedded System Interfacing: Introduction To Switch & Keypad
2 pages
Python Quetion and Answers
No ratings yet
Python Quetion and Answers
5 pages
Microsoft Iis 7.0 and Later: Enabling Fastcgi Support in Iis
No ratings yet
Microsoft Iis 7.0 and Later: Enabling Fastcgi Support in Iis
7 pages
Constructing OLAP Cubes Based On Queries: Tapio Niemi Jyrki Nummenmaa Peter Than&h
No ratings yet
Constructing OLAP Cubes Based On Queries: Tapio Niemi Jyrki Nummenmaa Peter Than&h
7 pages
Day Con Tang Dai Nhat: Tam Tat Ca To Hop Con
No ratings yet
Day Con Tang Dai Nhat: Tam Tat Ca To Hop Con
5 pages
Crypto 101: Laurens Van Houtven (LVH)
No ratings yet
Crypto 101: Laurens Van Houtven (LVH)
242 pages
How To Build Report On OBIEE
No ratings yet
How To Build Report On OBIEE
19 pages
Introduction To Arithmetic
No ratings yet
Introduction To Arithmetic
11 pages
Chapter 2 Quiz - CET501f13 Applied Networking I
100% (1)
Chapter 2 Quiz - CET501f13 Applied Networking I
7 pages
Group Assignment 1
No ratings yet
Group Assignment 1
2 pages
Tekla - Steel Detailing - Basic Training Drawing
100% (1)
Tekla - Steel Detailing - Basic Training Drawing
160 pages
Online Pre - Assessment Job Application System For PHIL-TEX Staffing Services International Inc
0% (1)
Online Pre - Assessment Job Application System For PHIL-TEX Staffing Services International Inc
8 pages
App Builder Messages PDF
No ratings yet
App Builder Messages PDF
209 pages
Agent-Based Modelling and Geographical Information Systems
No ratings yet
Agent-Based Modelling and Geographical Information Systems
23 pages
CDP 1802 Data Sheet 2
No ratings yet
CDP 1802 Data Sheet 2
7 pages
Lecture 2.3.1 Graph
No ratings yet
Lecture 2.3.1 Graph
23 pages
Permanent Magnet Linear Transverse Flux Motors
No ratings yet
Permanent Magnet Linear Transverse Flux Motors
6 pages
' Software Engineering, Super Important Questions - Review Team CSE/ISE
No ratings yet
' Software Engineering, Super Important Questions - Review Team CSE/ISE
3 pages
695-Article Text-3629-2-10-20180821
No ratings yet
695-Article Text-3629-2-10-20180821
23 pages
Handing Over & Close Out: Darshan Shukla Sajal Chatterjee
No ratings yet
Handing Over & Close Out: Darshan Shukla Sajal Chatterjee
11 pages
TE May 2022 Result Analysis
No ratings yet
TE May 2022 Result Analysis
9 pages
IMM5821E Canada
No ratings yet
IMM5821E Canada
10 pages

Parallel Computation Models: Slide 1

Uploaded by

Parallel Computation Models: Slide 1

Uploaded by

Parallel Computation Models

Parallel Computation Models

PARALLEL AND DISTRIBUTED

Connects all the computers of the world

TYPES OF MULTIPROCESSING FRAMEWORKS

PARALLEL COMPUTERS: COOPERATION IN A POSITIVE SENSE

FOR PARALLEL SYSTEMS

FOR DISTRIBUTED SYSTEMS

We need a model of computation

Other important topologies

unlimited shared memory

PRAM n RAM processors connected to a common memory of m cells

PRAM Instruction Set

multiply/divide instructions take only

PRAM Complexity Measures

Two Technical Issues for PRAM

Active processors explicitly activate additional

THE PRAM IS A THEORETICAL (UNFEASIBLE) MODEL

WHY THE PRAM IS A REFERENCE MODEL?

Execution time using a single processor system

Critical when down-scaling:

Parallel time Tp:

EREW (Exclusive Read Exclusive Write)

The program computes the Boolean OR of

have the parallel program

You might also like