
IT105 – Parallel Processing Midterm Lecture Part 1

Parallel Programming Model

In computer science, a parallel programming model is a model for writing parallel programs that can be compiled and
executed (en.wikipedia.org).
A parallel programming model is a set of software technologies used to express parallel algorithms and to match
applications with the underlying parallel systems.
Classifications of parallel programming models can be divided broadly into two areas: process interaction and problem
decomposition.

A. Process interaction relates to the mechanisms by which parallel processes are able to communicate with each
other. The most common forms of interaction are shared memory and message passing, but the interaction can also
be implicit (invisible to the programmer).

1. Shared memory is an efficient means of passing data between programs. Depending on context, programs
may run on a single processor or on multiple separate processors.
 In this model, parallel tasks share a global address space which they read and write to
asynchronously.
 This requires protection mechanisms such as locks, semaphores and monitors to control concurrent
access.
 An advantage of this model from the programmer's point of view is that the notion of data
"ownership" is lacking, so there is no need to specify explicitly the communication of data between
tasks. Program development can often be simplified.
 An important disadvantage in terms of performance is that it becomes more difficult to understand
and manage data locality:
o Keeping data local to the processor that works on it conserves memory accesses, cache
refreshes and bus traffic that occurs when multiple processors use the same data.
o Unfortunately, controlling data locality is hard to understand and may be beyond the
control of the average user.

 The Threads Model is a type of shared-memory programming.


o In the threads model of parallel programming, a single "heavy weight" process can have multiple
"light weight", concurrent execution paths.
o For example: the main program a.out is scheduled to run by the native operating system. a.out loads
and acquires all of the necessary system and user resources to run. This is the "heavy weight" process.
o a.out performs some serial work, and then creates a number of tasks (threads) that can be scheduled
and run by the operating system concurrently.
o Each thread has local data, but also shares the entire resources of a.out. This saves the overhead
associated with replicating a program's resources for each thread ("light weight"). Each thread also
benefits from a global memory view because it shares the memory space of a.out.
o A thread's work may best be described as a subroutine within the main program. Any thread can
execute any subroutine at the same time as other threads.

o Threads communicate with each other through global memory (updating address locations).
This requires synchronization constructs to ensure that no two threads update the same global
address at the same time.
o Threads can come and go, but a.out remains present to provide the necessary shared
resources until the application has completed.
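
A minimal sketch of the threads model in C using POSIX threads (an assumed choice; the lecture does not prescribe a
particular threads library). The main program plays the role of a.out: it creates several threads that all read and write one
global counter, and a mutex acts as the lock that protects concurrent access to that shared address. Compile with a
pthreads-aware toolchain, e.g. cc -pthread.

#include <pthread.h>
#include <stdio.h>

#define NUM_THREADS 4

/* Global data shared by all threads (the common address space). */
static long shared_counter = 0;
static pthread_mutex_t counter_lock = PTHREAD_MUTEX_INITIALIZER;

/* Each thread's work: update the shared counter under the lock. */
static void *worker(void *arg)
{
    long id = (long)arg;                      /* thread-local data */
    for (int i = 0; i < 1000; i++) {
        pthread_mutex_lock(&counter_lock);    /* protect the shared address */
        shared_counter++;
        pthread_mutex_unlock(&counter_lock);
    }
    printf("thread %ld finished\n", id);
    return NULL;
}

int main(void)
{
    pthread_t threads[NUM_THREADS];

    /* The "heavy weight" process creates "light weight" threads. */
    for (long t = 0; t < NUM_THREADS; t++)
        pthread_create(&threads[t], NULL, worker, (void *)t);

    /* The process stays alive to provide shared resources until all threads finish. */
    for (long t = 0; t < NUM_THREADS; t++)
        pthread_join(threads[t], NULL);

    printf("shared_counter = %ld\n", shared_counter);   /* expect 4000 */
    return 0;
}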

2. Message passing is a concept from computer science that is used extensively in the design and
implementation of modern software applications; it is key to some models of concurrency and object-
oriented programming.

 In a message passing model, parallel tasks exchange data through passing messages to one another.
These communications can be asynchronous or synchronous.
 This model demonstrates the following characteristics:
o A set of tasks that use their own local memory during computation. Multiple tasks can reside on the
same physical machine and/or across an arbitrary number of machines.
o Tasks exchange data through communications by sending and receiving messages.
o Data transfer usually requires cooperative operations to be performed by each process. For example,
a send operation must have a matching receive operation.
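
A minimal sketch of the message passing model using MPI (an assumed library choice). Each rank keeps a value in its own
local memory; rank 0 issues a send and rank 1 posts the matching receive, the cooperative pair of operations described
above. Run with at least two processes, e.g. mpirun -np 2 ./a.out.

#include <mpi.h>
#include <stdio.h>

int main(int argc, char **argv)
{
    int rank, value;

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);    /* each task learns its identity */

    if (rank == 0) {
        value = 42;                          /* data in rank 0's local memory */
        /* A send must be matched by a receive on the other side. */
        MPI_Send(&value, 1, MPI_INT, 1, 0, MPI_COMM_WORLD);
    } else if (rank == 1) {
        MPI_Recv(&value, 1, MPI_INT, 0, 0, MPI_COMM_WORLD, MPI_STATUS_IGNORE);
        printf("rank 1 received %d\n", value);
    }

    MPI_Finalize();
    return 0;
}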

3. Implicit Model - In an implicit model, no process interaction is visible to the programmer, instead the
compiler and/or runtime is responsible for performing it. This is most common with domain-specific
languages where the concurrency within a problem can be more prescribed.
 Advantages
o A programmer who writes implicitly parallel code does not need to worry about task
division or process communication, focusing instead on the problem that his or her program
is intended to solve.
o Implicit parallelism generally facilitates the design of parallel programs and therefore results
in a substantial improvement of programmer productivity.
o Many of the constructs necessary to support this also add simplicity or clarity even in the
absence of actual parallelism. For example, a list comprehension that applies sin() to every
element of a list is a useful feature in and of itself.
o To offer implicit parallelism, languages effectively have to provide such useful constructs to
users simply to support required functionality (a language without a decent for() loop, for
example, is one few programmers will use).
 Disadvantages
o Languages with implicit parallelism reduce the control that the programmer has over the
parallel execution of the program, resulting sometimes in less-than-optimal parallel
efficiency.
o A larger issue is that every program has some parallel and some serial logic. Binary I/O, for
example, requires support for such serial operations as Write() and Seek(). If implicit
parallelism is desired, this creates a new requirement for constructs and keywords to
support code that cannot be threaded or distributed.
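
C has no truly implicit parallel construct, but a rough, hedged analogue is OpenMP's directive-based loop parallelism:
thread creation, work division and synchronization are left to the compiler and runtime, and the programmer only marks
the loop. The sketch below (assuming an OpenMP-capable compiler, e.g. with the -fopenmp flag) echoes the sin() example
mentioned above; without the flag the pragma is simply ignored and the loop runs serially.

#include <math.h>
#include <stdio.h>

int main(void)
{
    double data[1000];

    /* The directive asks the compiler/runtime to parallelize the loop;
       task division and communication are not written by hand. */
    #pragma omp parallel for
    for (int i = 0; i < 1000; i++)
        data[i] = sin((double)i);

    printf("data[999] = %f\n", data[999]);
    return 0;
}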

B. Problem Decomposition relates to the way in which these processes are formulated. This classification may also
be referred to as algorithmic skeletons or parallel programming paradigms.
1. Task-Parallel Model focuses on processes, or threads of execution. These processes will often be
behaviorally distinct, which emphasizes the need for communication. Task parallelism is a natural way to
express message-passing communication. It is usually classified as MIMD/MPMD or MISD.

2. Data-Parallel Model focuses on performing operations on a data set, which is usually regularly
structured in an array. A set of tasks will operate on this data, but independently on separate partitions.
In a shared-memory system the data will be accessible to all, but in a distributed-memory system it will
be divided between memories and worked on locally.

 Data parallelism is usually classified as SIMD/SPMD.


 May also be referred to as the Partitioned Global Address Space (PGAS) model.
 The data parallel model demonstrates the following characteristics:
o Address space is treated globally.
o Most of the parallel work focuses on performing operations on a data set. The data set is typically
organized into a common structure, such as an array or cube.
o A set of tasks work collectively on the same data structure; however, each task works on a different
partition of the same data structure.
o Tasks perform the same operation on their partition of work, for example, "add 4 to every array element".
 On shared memory architectures, all tasks may have access to the data structure through global
memory
 On distributed memory architectures the data structure is split up and resides as "chunks" in the
local memory of each task.
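
A minimal sketch of the data-parallel model in C with POSIX threads (an assumed choice; the model itself is language-
neutral). The array is the common data structure, each task is handed a different contiguous partition of it, and every task
performs the same operation, "add 4 to every array element", on its own chunk, so no locking is needed.

#include <pthread.h>
#include <stdio.h>

#define N 1000
#define NUM_TASKS 4

static int array[N];                        /* the shared data structure */

struct partition { int start, end; };       /* one task's chunk [start, end) */

/* Every task runs the same operation on its own partition. */
static void *add_four(void *arg)
{
    struct partition *p = arg;
    for (int i = p->start; i < p->end; i++)
        array[i] += 4;
    return NULL;
}

int main(void)
{
    pthread_t tasks[NUM_TASKS];
    struct partition parts[NUM_TASKS];
    int chunk = N / NUM_TASKS;

    for (int t = 0; t < NUM_TASKS; t++) {
        parts[t].start = t * chunk;
        parts[t].end   = (t == NUM_TASKS - 1) ? N : (t + 1) * chunk;
        pthread_create(&tasks[t], NULL, add_four, &parts[t]);
    }
    for (int t = 0; t < NUM_TASKS; t++)
        pthread_join(tasks[t], NULL);

    printf("array[0] = %d, array[N-1] = %d\n", array[0], array[N - 1]);   /* both 4 */
    return 0;
}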

Other Programming Models

Hybrid Model

 A hybrid model combines more than one of the previously described programming models.
 Currently, a common example of a hybrid model is the combination of the message passing model (MPI) with the
threads model (OpenMP).
o Threads perform computationally intensive kernels using local, on-node data.
o Communications between processes on different nodes occur over the network using MPI.
 This hybrid model lends itself well to the increasingly common hardware environment of clustered multi/many-core
machines.
 Another similar and increasingly popular example of a hybrid model is using MPI with GPU (Graphics Processing
Unit) programming.
o GPUs perform computationally intensive kernels using local, on-node data.
o Communications between processes on different nodes occur over the network using MPI.
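
A minimal sketch of the MPI + OpenMP hybrid described above (assuming both an MPI library and an OpenMP-capable
compiler are available). OpenMP threads do the computationally intensive work on local, on-node data, and MPI carries the
per-process results between nodes.

#include <mpi.h>
#include <stdio.h>

#define LOCAL_N 1000

int main(int argc, char **argv)
{
    int rank, size;
    double local[LOCAL_N], local_sum = 0.0, global_sum = 0.0;

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);

    /* OpenMP threads work on local, on-node data. */
    #pragma omp parallel for reduction(+:local_sum)
    for (int i = 0; i < LOCAL_N; i++) {
        local[i] = rank + i * 0.001;   /* stand-in for a compute-intensive kernel */
        local_sum += local[i];
    }

    /* Communication between processes on different nodes goes over MPI. */
    MPI_Reduce(&local_sum, &global_sum, 1, MPI_DOUBLE, MPI_SUM, 0, MPI_COMM_WORLD);

    if (rank == 0)
        printf("global sum = %f (from %d processes)\n", global_sum, size);

    MPI_Finalize();
    return 0;
}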

Tasks and Channels

A simple parallel programming model. The figure shows both the instantaneous state of a computation and a detailed
picture of a single task. A computation consists of a set of tasks (represented by circles) connected by channels (arrows).
A task encapsulates a program and local memory and defines a set of ports that define its interface to its environment. A
channel is a message queue into which a sender can place messages and from which a receiver can remove messages,
"blocking" if messages are not available.

We consider next the question of which abstractions are appropriate and useful in a parallel programming model. Clearly,
mechanisms are needed that allow explicit discussion about concurrency and locality and that facilitate development of
scalable and modular programs. Also needed are abstractions that are simple to work with and that match the
architectural model, the multicomputer. While numerous possible abstractions could be considered for this purpose, two
fit these requirements particularly well: the task and channel. These are illustrated below and can be summarized as
follows:

The four basic task actions. In addition to reading and writing local memory, a task can send a message, receive a
message, create new tasks (suspending until they terminate), and terminate.

1. A parallel computation consists of one or more tasks. Tasks execute concurrently. The number of tasks can vary
during program execution.
2. A task encapsulates a sequential program and local memory. (In effect, it is a virtual von Neumann machine.) In
addition, a set of inports and outports define its interface to its environment.
3. A task can perform four basic actions in addition to reading and writing its local memory: send messages on its
outports, receive messages on its inports, create new tasks, and terminate.
4. A send operation is asynchronous: it completes immediately. A receive operation is synchronous: it causes
execution of the task to block until a message is available.
5. Outport/inport pairs can be connected by message queues called channels. Channels can be created and
deleted, and references to channels (ports) can be included in messages, so connectivity can vary dynamically.

6. Tasks can be mapped to physical processors in various ways; the mapping employed does not affect the
semantics of a program. In particular, multiple tasks can be mapped to a single processor.
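
A minimal sketch of a task/channel pair in C using POSIX threads (an assumed implementation; the model itself does not
prescribe one). The channel is a message queue protected by a mutex: send is asynchronous and returns immediately, while
receive blocks on a condition variable until a message is available, matching points 4 and 5 above. The two tasks anticipate
the foundry/bridge example that follows.

#include <pthread.h>
#include <stdio.h>

#define QUEUE_CAP 64

/* A channel: a message queue plus the synchronization that makes
   receive block while no messages are available. */
struct channel {
    int msgs[QUEUE_CAP];
    int head, tail, count;
    pthread_mutex_t lock;
    pthread_cond_t not_empty;
};

static void channel_init(struct channel *c)
{
    c->head = c->tail = c->count = 0;
    pthread_mutex_init(&c->lock, NULL);
    pthread_cond_init(&c->not_empty, NULL);
}

/* Asynchronous send: place the message and return immediately.
   (For simplicity, the fixed-size queue is assumed never to fill.) */
static void channel_send(struct channel *c, int msg)
{
    pthread_mutex_lock(&c->lock);
    c->msgs[c->tail] = msg;
    c->tail = (c->tail + 1) % QUEUE_CAP;
    c->count++;
    pthread_cond_signal(&c->not_empty);
    pthread_mutex_unlock(&c->lock);
}

/* Synchronous receive: block until a message is available. */
static int channel_recv(struct channel *c)
{
    pthread_mutex_lock(&c->lock);
    while (c->count == 0)
        pthread_cond_wait(&c->not_empty, &c->lock);
    int msg = c->msgs[c->head];
    c->head = (c->head + 1) % QUEUE_CAP;
    c->count--;
    pthread_mutex_unlock(&c->lock);
    return msg;
}

static struct channel girders;          /* the channel between the two tasks */

static void *foundry(void *arg)         /* producer task */
{
    (void)arg;
    for (int g = 1; g <= 5; g++)
        channel_send(&girders, g);      /* put girders "on trucks" */
    return NULL;
}

static void *bridge(void *arg)          /* consumer task */
{
    (void)arg;
    for (int g = 0; g < 5; g++)
        printf("assembled girder %d\n", channel_recv(&girders));
    return NULL;
}

int main(void)
{
    pthread_t t1, t2;
    channel_init(&girders);
    pthread_create(&t1, NULL, foundry, NULL);
    pthread_create(&t2, NULL, bridge, NULL);
    pthread_join(t1, NULL);
    pthread_join(t2, NULL);
    return 0;
}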

Example: Bridge Construction

Consider the following real-world problem. A bridge is to be assembled from girders being constructed at a foundry.
These two activities are organized by providing trucks to transport girders from the foundry to the bridge site. This
situation is illustrated below (a) with the foundry and bridge represented as tasks and the stream of trucks as a channel.
Notice that this approach allows assembly of the bridge and construction of girders to proceed in parallel without any
explicit coordination: the foundry crew puts girders on trucks as they are produced, and the assembly crew adds girders
to the bridge as and when they arrive.

Two solutions to the bridge construction problem. Both represent the foundry and the bridge assembly site as separate
tasks, foundry and bridge. The first uses a single channel on which girders generated by foundry are transported as fast as
they are generated. If foundry generates girders faster than they are consumed by bridge, then girders accumulate at the
construction site. The second solution uses a second channel to pass flow control messages from bridge to foundry so as
to avoid overflow.

A disadvantage of this scheme is that the foundry may produce girders much faster than the assembly crew can use them.
To prevent the bridge site from overflowing with girders, the assembly crew instead can explicitly request more girders
when stocks run low.
