PDC - CO1: Basic Operations & Cost Analysis
BASIC COMMUNICATION OPERATIONS
• Proper implementation of these basic communication operations on various parallel architectures is key to the efficient execution of the parallel algorithms that use them.
• The following basic communication operations are commonly used on various parallel architectures:
• One-to-all broadcast and all-to-one reduction
• All-to-all broadcast and reduction
• All-reduce operations
• Prefix-sum operations
• Scatter and gather
• All-to-all personalized communication
ONE-TO-ALL BROADCAST AND ALL-TO-ONE REDUCTION
• In one-to-all broadcast, one processor has a piece of data (of size m) that it needs to send to every other processor.
• The dual of one-to-all broadcast is all-to-one reduction.
• In all-to-one reduction, each processor has m units of data. These data items must be combined piece-wise (using some associative operator, such as addition or min), and the result made available at a target processor (see the sketch below).
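In MPI, these two dual operations map directly onto the library calls MPI_Bcast and MPI_Reduce. A minimal sketch (the payload 42, the sum operator, and root 0 are arbitrary illustrative choices):

```c
#include <mpi.h>
#include <stdio.h>

int main(int argc, char **argv) {
    MPI_Init(&argc, &argv);
    int rank, p;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &p);

    /* One-to-all broadcast: node 0 sends its value to every node. */
    int data = (rank == 0) ? 42 : 0;    /* 42 is an arbitrary payload */
    MPI_Bcast(&data, 1, MPI_INT, 0, MPI_COMM_WORLD);

    /* All-to-one reduction (the dual): every node contributes one value;
       the piece-wise sums accumulate at target node 0. */
    int sum = 0;
    MPI_Reduce(&data, &sum, 1, MPI_INT, MPI_SUM, 0, MPI_COMM_WORLD);

    if (rank == 0)
        printf("reduced sum = %d\n", sum);   /* 42 * p */
    MPI_Finalize();
    return 0;
}
```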
ONE-TO-ALL BROADCAST AND ALL-TO-ONE REDUCTION ON RINGS
• The simplest way is to send p-1 messages from the source to the other p-1 processors, but this is not very efficient.
• Use recursive doubling: the source sends the message to a selected processor, leaving two independent subproblems defined over the two halves of the machine (see the sketch after this list).
• Reduction can be performed in an identical fashion by inverting the process.
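A minimal MPI sketch of recursive doubling using point-to-point sends; the payload 99 is an arbitrary illustrative choice, and in practice library broadcasts such as MPI_Bcast implement this (and further optimizations) internally:

```c
#include <mpi.h>
#include <stdio.h>

/* One-to-all broadcast by recursive doubling (source = node 0).
   After step i, 2^(i+1) nodes hold the message; the guard on
   rank + mask handles process counts that are not powers of two. */
int main(int argc, char **argv) {
    MPI_Init(&argc, &argv);
    int rank, p;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &p);

    int data = (rank == 0) ? 99 : -1;   /* only the source holds the message */

    for (int mask = 1; mask < p; mask <<= 1) {
        if (rank < mask) {
            /* I already have the data: pass it to my partner, which
               splits the remaining nodes into two independent halves. */
            if (rank + mask < p)
                MPI_Send(&data, 1, MPI_INT, rank + mask, 0, MPI_COMM_WORLD);
        } else if (rank < 2 * mask) {
            MPI_Recv(&data, 1, MPI_INT, rank - mask, 0,
                     MPI_COMM_WORLD, MPI_STATUS_IGNORE);
        }
    }
    printf("node %d has %d\n", rank, data);
    MPI_Finalize();
    return 0;
}
```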
ONE-TO-ALL BROADCAST
ALL-TO-ONE REDUCTION
Reduction on an eight-node ring with node 0 as the destination of the reduction.
ALL-TO-ALL BROADCAST ON A MESH
All-to-all broadcast on a 3 x 3 mesh. The groups of nodes communicating with each other in each phase are enclosed by dotted boundaries. By the end of the second phase, all nodes get (0,1,2,3,4,5,6,7,8) (that is, a message from each of the nine nodes).
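MPI does not expose the mesh phases directly, but MPI_Allgather delivers the same end result: every node ends up with a message from every node. A minimal sketch, where each node's "message" is simply its own rank:

```c
#include <mpi.h>
#include <stdio.h>

int main(int argc, char **argv) {
    MPI_Init(&argc, &argv);
    int rank, p;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &p);

    int mine = rank;        /* each node's "message" is its own rank */
    int all[64];            /* assumes p <= 64 for this sketch */

    /* All-to-all broadcast: afterwards every node's all[] holds
       (0, 1, ..., p-1), i.e. one message from each node. */
    MPI_Allgather(&mine, 1, MPI_INT, all, 1, MPI_INT, MPI_COMM_WORLD);

    if (rank == 0) {
        for (int i = 0; i < p; i++) printf("%d ", all[i]);
        printf("\n");
    }
    MPI_Finalize();
    return 0;
}
```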
ALL-TO-ALL REDUCTION
ALL-TO-ALL BROADCAST AND REDUCTION ON A RING
BROADCAST AND REDUCTION ON A MESH
• We can view each row and column of a square mesh of p nodes as a linear array of √p nodes.
• Broadcast and reduction operations can be performed in two steps: the first step does the operation along a row, and the second step does it along each column concurrently (see the sketch below).
• This process generalizes to higher dimensions as well.
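A minimal MPI sketch of the two-step mesh broadcast, assuming p is a perfect square and node 0 is the source; MPI_Comm_split builds one communicator per row and one per column:

```c
#include <mpi.h>
#include <stdio.h>

/* Two-step broadcast on a sqrt(p) x sqrt(p) mesh: along the source's
   row first, then down every column concurrently. A sketch assuming
   p is a perfect square and node 0 is the source. */
int main(int argc, char **argv) {
    MPI_Init(&argc, &argv);
    int rank, p;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &p);

    int side = 1;
    while (side * side < p) side++;      /* side = sqrt(p) */
    int row = rank / side, col = rank % side;

    MPI_Comm row_comm, col_comm;
    MPI_Comm_split(MPI_COMM_WORLD, row, col, &row_comm);  /* one comm per row */
    MPI_Comm_split(MPI_COMM_WORLD, col, row, &col_comm);  /* one comm per column */

    int data = (rank == 0) ? 7 : 0;      /* arbitrary payload at the source */
    if (row == 0)
        MPI_Bcast(&data, 1, MPI_INT, 0, row_comm);   /* step 1: along row 0 */
    MPI_Bcast(&data, 1, MPI_INT, 0, col_comm);       /* step 2: down each column */

    printf("node %d has %d\n", rank, data);
    MPI_Comm_free(&row_comm);
    MPI_Comm_free(&col_comm);
    MPI_Finalize();
    return 0;
}
```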
BROADCAST AND REDUCTION ON A BALANCED BINARY TREE
• Send/Receive:
• A process sends a message directly to another process, which posts a matching receive.
• Used for direct communication between two processes.
• Example: MPI_Send and MPI_Recv in MPI (Message Passing Interface); see the sketch below.
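A minimal MPI_Send/MPI_Recv sketch; it assumes at least two processes, and the payload 123 and tag 0 are arbitrary:

```c
#include <mpi.h>
#include <stdio.h>

int main(int argc, char **argv) {
    MPI_Init(&argc, &argv);
    int rank;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    int msg;    /* run with at least two processes */
    if (rank == 0) {
        msg = 123;  /* arbitrary payload */
        MPI_Send(&msg, 1, MPI_INT, 1, 0, MPI_COMM_WORLD);   /* to rank 1, tag 0 */
    } else if (rank == 1) {
        MPI_Recv(&msg, 1, MPI_INT, 0, 0, MPI_COMM_WORLD, MPI_STATUS_IGNORE);
        printf("rank 1 received %d\n", msg);
    }
    MPI_Finalize();
    return 0;
}
```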
BROADCAST
SCATTER & GATHER
• Scatter
• A process divides data into chunks and sends each chunk to a different process.
• Used when a large dataset needs to be distributed across processes.
• Example: MPI MPI_Scatter.
• Gather
• A process collects data from all other processes into a single process.
• Example: MPI MPI_Gather (both calls appear in the sketch below).
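A minimal MPI_Scatter/MPI_Gather sketch: root 0 splits an array across processes, each process doubles its chunk, and root 0 collects the results (the data values are arbitrary):

```c
#include <mpi.h>
#include <stdio.h>

int main(int argc, char **argv) {
    MPI_Init(&argc, &argv);
    int rank, p;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &p);

    int chunks[64], mine, results[64];   /* assumes p <= 64 for this sketch */
    if (rank == 0)
        for (int i = 0; i < p; i++) chunks[i] = i * 10;   /* arbitrary data */

    /* Scatter: root 0 splits chunks[] and sends one element to each process. */
    MPI_Scatter(chunks, 1, MPI_INT, &mine, 1, MPI_INT, 0, MPI_COMM_WORLD);

    mine *= 2;   /* each process works on its own chunk */

    /* Gather: root 0 collects the processed chunks into results[]. */
    MPI_Gather(&mine, 1, MPI_INT, results, 1, MPI_INT, 0, MPI_COMM_WORLD);

    if (rank == 0) {
        for (int i = 0; i < p; i++) printf("%d ", results[i]);
        printf("\n");
    }
    MPI_Finalize();
    return 0;
}
```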
ALL-TO-ALL COMMUNICATION
REDUCE
COST ANALYSIS IN COMMUNICATION
• Parallel cost analysis
• This static cost analysis method uses three phases to estimate the cost of parallel execution:
• Block-level analysis: estimates the serial costs of the blocks between synchronization points
• Distributed flow graph (DFG) construction: captures the parallelism, waiting, and idle times in the distributed system
• Parallel cost calculation: the parallel cost is the path with the highest cost in the DFG
COST ANALYSIS IN COMMUNICATION
• Optimizing cost
WORKING PROCESS OF COST ANALYSIS
• Parallel cost analysis works in three phases:
• (1) it performs a block-level analysis to estimate the serial costs of the blocks between synchronization points in the program;
• (2) it then constructs a distributed flow graph (DFG) to capture the parallelism, the waiting, and the idle times at the locations of the distributed system;
• (3) the parallel cost is finally obtained as the path of maximal cost in the DFG (see the sketch below). The correctness of this analysis has been proved, and a prototype implementation supports an experimental evaluation of its accuracy and feasibility.
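To make phase (3) concrete, here is a toy sketch (not the analysis's actual implementation): once block costs and synchronization edges are known, the parallel cost is the maximum-cost path through the DFG, computable in a single pass over a topologically ordered DAG. The graph and costs below are invented for illustration:

```c
#include <stdio.h>

#define N 5

int main(void) {
    /* Hypothetical DFG: node i has cost[i] (from the block-level analysis);
       edge[j][i] = 1 means block j must finish before block i (a
       synchronization edge). Nodes are listed in topological order. */
    int cost[N] = {3, 5, 2, 4, 1};
    int edge[N][N] = {
        {0, 1, 1, 0, 0},   /* 0 -> 1, 0 -> 2 : fork onto two locations */
        {0, 0, 0, 1, 0},   /* 1 -> 3 */
        {0, 0, 0, 1, 0},   /* 2 -> 3 : join (waiting time shows up here) */
        {0, 0, 0, 0, 1},   /* 3 -> 4 */
        {0, 0, 0, 0, 0}
    };

    int finish[N];                 /* cost of the max-cost path ending at i */
    int parallel_cost = 0;
    for (int i = 0; i < N; i++) {
        finish[i] = cost[i];
        for (int j = 0; j < i; j++)
            if (edge[j][i] && finish[j] + cost[i] > finish[i])
                finish[i] = finish[j] + cost[i];
        if (finish[i] > parallel_cost)
            parallel_cost = finish[i];
    }
    printf("parallel cost = %d\n", parallel_cost);   /* 3+5+4+1 = 13 */
    return 0;
}
```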
THANK YOU