MPI Lecture
Message Passing Interface (MPI)
Irish Centre for High-End Computing
(ICHEC)
www.ichec.ie
Acknowledgments
This course is based on the MPI course developed by Rolf Rabenseifner at the High-Performance Computing Center Stuttgart (HLRS), University of Stuttgart, in collaboration with the EPCC Training and Education Centre, Edinburgh Parallel Computing Centre, University of Edinburgh.
DAY 1
1.-3. MPI overview; process model and language bindings (MPI_Init(), MPI_Comm_rank()); messages and point-to-point communication

DAY 2
4. Non-blocking communication
5. Collective communication
Coffee/Tea break
12:30 Lunch
DAY 3
6. Virtual topologies
7. Derived datatypes
8. Case study
Coffee/Tea break
http://www.epcc.ed.ac.uk/computing/training/document_archive/mpicourse/mpi-course.pdf
(Standard MPI-2)
~course00/MPI-I/examples
Note: The examples of a chapter are only readable after the end of the practical of that
chapter.
[Figure: a processor may run many processes; each process executes a program on its own data in its own memory]
[Figure: distributed-memory parallel processors, each running a program on its own data, connected by a communication network]
[Figure: message-passing model - processes with myrank = 0 ... (size-1), each running the program on its own data and communicating over the network]
What is SPMD?
Single Program, Multiple Data
Same (sub-)program runs on each processor
MPI also allows MPMD, i.e., Multiple Program, Multiple Data
  but some vendors may be restricted to SPMD
  MPMD can be emulated with SPMD
Emulation of MPMD
C:
   int main(int argc, char **argv){
     if (myrank < .... /* process should run the ocean model */){
       ocean( /* arguments */ );
     }else{
       weather( /* arguments */ );
     }
   }

Fortran:
   PROGRAM
     IF (myrank < ... ) THEN !! process should run the ocean model
       CALL ocean ( some arguments )
     ELSE
       CALL weather ( some arguments )
     ENDIF
   END
Message passing
Messages are packets of data moving between sub-programs
Necessary information for the message passing system:
  sending process and receiving process, i.e., the ranks
  source location, source data type, source data size
  destination location, destination data type, destination buffer size
Access
A sub-program needs to be connected to a
message passing system
A message passing system is similar to:
phone line
mail box
fax machine
etc.
MPI:
program must be linked with an MPI library
program must be started with the MPI startup tool
Point-to-Point
Communication
Simplest form of message passing.
One process sends a message to
another.
Different types of point-to-point
communication:
synchronous send
buffered = asynchronous send
Synchronous Sends
The sender gets confirmation that the message has been received.
Analogous to the beep or okay-sheet of a fax.
Buffered = Asynchronous
Sends
The sender only knows when the message has left (not whether it has been received).
Blocking Operations
Some sends/receives may block
until another process acts:
synchronous send operation blocks
until receive is issued;
receive operation blocks until
message is sent.
Non-Blocking
Operations
Non-blocking operations return immediately; the communication proceeds in the background and its completion is checked later (e.g., with MPI_Wait).
Collective
Communications
Collective communication routines are
higher level routines.
Several processes are involved at a
time.
May allow optimized internal implementations, e.g., tree-based algorithms.
Broadcast
A one-to-many
communication.
Reduction Operations
Combine data from several processes
to produce a single result.
[Figure: values 200, 15, 10, 300, 30 held by different processes are combined into a single sum]
Barriers
Synchronize processes.
[Figure: processes wait at the barrier until all have arrived ("all here?")]
MPI Forum
The MPI standard is defined by the MPI Forum (MPI-1 Forum, later the MPI-2 Forum).
MPI also offers:
  A great deal of functionality.
  Support for heterogeneous parallel architectures.
With MPI-2:
  Important additional functionality.
  No changes to MPI-1.
Course outline:
1. MPI Overview
2. Process model and language bindings (MPI_Init(), MPI_Comm_rank())
3. Messages and point-to-point communication
4. Non-blocking communication
5. Collective communication
6. Virtual topologies
7. Derived datatypes
8. Case study
Header files
C
#include <mpi.h>
Fortran
include 'mpif.h'
Fortran:
  CALL MPI_XXXXXX( parameter, ..., IERROR )
  Never forget IERROR!
MPI routines are defined in the standard in a language-independent form, with bindings in several programming languages (C, Fortran, C++ [in MPI-2]).
Output arguments in C: the definition in the standard lists a pointer parameter; the usage in your code passes the address of a variable (e.g., &rank).
Initializing MPI
C:       int MPI_Init( int *argc, char ***argv)
         #include <mpi.h>
         int main(int argc, char **argv)
         {
           MPI_Init(&argc, &argv);
           ....

Fortran: MPI_INIT( IERROR )
         INTEGER IERROR
         program xxxxx
         implicit none
         include 'mpif.h'
         integer ierror
         call MPI_Init(ierror)
         ....

Must be the first MPI routine that is called.
Communicator MPI_COMM_WORLD
All processes of an MPI program are members of
the default communicator MPI_COMM_WORLD.
MPI_COMM_WORLD is a predefined handle in
mpi.h and mpif.h.
Each process has its own rank in a communicator:
starting with 0
ending with (size-1)
[Figure: the communicator MPI_COMM_WORLD containing all processes, with ranks 0, 1, 2, 3, 4, ...]
Handles
Handles identify MPI objects.
For the programmer, handles are
predefined constants in mpi.h or mpif.h
example: MPI_COMM_WORLD
predefined values exist only after MPI_Init
was called
or values returned by some MPI routines, to be stored in variables that are declared as
in Fortran: INTEGER
in C: special MPI typedefs
Handles refer to internal MPI data structures
Rank
The rank identifies different processes within a
communicator
The rank is the basis for any work and data distribution.
C: int MPI_Comm_rank( MPI_Comm comm, int *rank)
Fortran:  MPI_COMM_RANK( comm, rank, ierror)
          INTEGER comm, rank, ierror
[Figure: processes with myrank = 0, 1, 2, ..., (size-1)]
Size
How many processes are contained
within a communicator?
C:        int MPI_Comm_size( MPI_Comm comm, int *size)
Fortran:  MPI_COMM_SIZE( COMM, SIZE, IERROR)
          INTEGER COMM, SIZE, IERROR
Exiting MPI
C: int MPI_Finalize()
Fortran: MPI_FINALIZE( ierror )
INTEGER ierror
Must be called last by all processes.
After MPI_Finalize, no other MPI routines may be called.
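Putting the routines of this chapter together, a minimal sketch of a complete SPMD program (an assumed example, not one of the course examples) could look like this:

  #include <stdio.h>
  #include <mpi.h>

  int main(int argc, char **argv)
  {
      int my_rank, size;

      MPI_Init(&argc, &argv);                   /* must be called first  */
      MPI_Comm_rank(MPI_COMM_WORLD, &my_rank);  /* rank within the comm. */
      MPI_Comm_size(MPI_COMM_WORLD, &size);     /* number of processes   */

      printf("Hello from rank %d of %d\n", my_rank, size);

      MPI_Finalize();                           /* must be called last   */
      return 0;
  }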
1. MPI Overview
2. Process model and language bindings (MPI_Init(), MPI_Comm_rank())
3. Messages and point-to-point communication
4. Non-blocking communication
5. Collective communication
6. Virtual topologies
7. Derived datatypes
8. Case study
Messages
A message contains a number of elements of
some particular datatype.
MPI datatypes:
Basic datatypes
Derived datatypes
[Figure: a message as a sequence of elements, e.g., 654, 96574, -12, 7676]
MPI Datatype          C datatype
MPI_CHAR              signed char
MPI_SHORT             signed short int
MPI_INT               signed int
MPI_LONG              signed long int
MPI_UNSIGNED_CHAR     unsigned char
MPI_UNSIGNED_SHORT    unsigned short int
MPI_UNSIGNED          unsigned int
MPI_UNSIGNED_LONG     unsigned long int
MPI_FLOAT             float
MPI_DOUBLE            double
MPI_LONG_DOUBLE       long double
MPI_BYTE
MPI_PACKED
MPI Datatype            Fortran datatype
MPI_INTEGER             INTEGER
MPI_REAL                REAL
MPI_DOUBLE_PRECISION    DOUBLE PRECISION
MPI_COMPLEX             COMPLEX
MPI_LOGICAL             LOGICAL
MPI_CHARACTER           CHARACTER(1)
MPI_BYTE
MPI_PACKED

Example:  INTEGER arr(5)
          count=5, datatype=MPI_INTEGER
[Figure: a message of 5 integers, e.g., 2345, 654, 96574, -12, 7676]
Point-to-Point Communication
Communication between two processes.
Source process sends message to destination process.
Communication takes place within a communicator,
e.g., MPI_COMM_WORLD.
Processes are identified by their ranks in the
communicator.
[Figure: within a communicator, the source process sends a message to the destination process]
Sending a Message
C:  int MPI_Send(void *buf, int count, MPI_Datatype datatype,
                 int dest, int tag, MPI_Comm comm)
Fortran:  MPI_SEND(BUF, COUNT, DATATYPE, DEST, TAG, COMM, IERROR)
          <type> BUF(*)
          INTEGER COUNT, DATATYPE, DEST, TAG, COMM, IERROR
buf is the starting point of the message with count elements,
each described with datatype.
dest is the rank of the destination process within the
communicator comm.
tag is an additional nonnegative integer piggybacked onto the message.
The tag can be used by the program to distinguish different types of messages.
Receiving a Message
C:  int MPI_Recv(void *buf, int count, MPI_Datatype datatype,
                 int source, int tag, MPI_Comm comm,
                 MPI_Status *status)
Fortran:  MPI_RECV(BUF, COUNT, DATATYPE, SOURCE, TAG,
                   COMM, STATUS, IERROR)
          <type> BUF(*)
          INTEGER COUNT, DATATYPE, SOURCE, TAG, COMM
          INTEGER STATUS(MPI_STATUS_SIZE), IERROR
buf/count/datatype describe the receive buffer.
Receiving the message sent by process with rank source in
comm.
Envelope information is returned in status.
Only messages with matching tag are received.
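A minimal sketch (an assumed example, not from the course) combining the two calls: rank 0 sends one integer to rank 1.

  #include <stdio.h>
  #include <mpi.h>

  int main(int argc, char **argv)
  {
      int my_rank, value;
      MPI_Status status;

      MPI_Init(&argc, &argv);
      MPI_Comm_rank(MPI_COMM_WORLD, &my_rank);

      if (my_rank == 0) {
          value = 42;
          MPI_Send(&value, 1, MPI_INT, 1, 17, MPI_COMM_WORLD);
      } else if (my_rank == 1) {
          MPI_Recv(&value, 1, MPI_INT, 0, 17, MPI_COMM_WORLD, &status);
          printf("rank 1 received %d\n", value);
      }

      MPI_Finalize();
      return 0;
  }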
Wildcards
Receiver can wildcard.
To receive from any source:  source = MPI_ANY_SOURCE
To receive with any tag:     tag = MPI_ANY_TAG
The actual source and tag are returned in the status argument.
Communication
Envelope
Envelope information is
returned from MPI_RECV in
status.
C:       status.MPI_SOURCE
         status.MPI_TAG
         count via MPI_Get_count()
Fortran: status(MPI_SOURCE)
         status(MPI_TAG)
         count via MPI_GET_COUNT()
[Figure: message envelope - the destination rank ("To:"), and the message body item-1 ... item-n with count elements]
Communication Modes
Send communication modes:
  synchronous send               MPI_SSEND
  buffered (asynchronous) send   MPI_BSEND
  standard send                  MPI_SEND
  ready send                     MPI_RSEND
Communication Modes
Sender mode         Definition                                     Notes
Synchronous send    Completes only when the matching receive       risk of deadlock, serialization and
MPI_SSEND           has started                                    idle waiting; high latency / best bandwidth
Buffered send       Always completes (unless an error occurs),
MPI_BSEND           irrespective of receiver
Standard send       Either synchronous or buffered,
MPI_SEND            depending on the implementation
Ready send          May be started only if the matching            highly dangerous!
MPI_RSEND           receive is already posted
Receive             Completes when a message has arrived
MPI_RECV
Message Order
Preservation
Rule for messages on the same connection, i.e., same communicator, same source, same destination:
  messages do not overtake each other.
[Figure: messages between the same pair of processes arrive in the order they were sent]
[Figure: ping-pong between P0 and P1 over time - P0 sends "ping" to P1, P1 answers with "pong"]
[Figure: rank 0 sends with dest=1, tag=17 and receives with source=1, tag=23; rank 1 receives with source=0 and sends back with dest=0]

if (my_rank==0)    /* i.e., emulated multiple program */
    MPI_Send( ... dest=1 ...)
    MPI_Recv( ... source=1 ...)
else
    MPI_Recv( ... source=0 ...)
    MPI_Send( ... dest=0 ...)
fi
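A minimal C sketch of this ping-pong pattern (assumed example; the tags 17 and 23 follow the figure, everything else is illustrative):

  #include <stdio.h>
  #include <mpi.h>

  int main(int argc, char **argv)
  {
      int my_rank, msg = 0;
      MPI_Status status;

      MPI_Init(&argc, &argv);
      MPI_Comm_rank(MPI_COMM_WORLD, &my_rank);

      if (my_rank == 0) {
          MPI_Send(&msg, 1, MPI_INT, 1, 17, MPI_COMM_WORLD);          /* ping */
          MPI_Recv(&msg, 1, MPI_INT, 1, 23, MPI_COMM_WORLD, &status); /* pong */
          printf("rank 0 got the pong back\n");
      } else if (my_rank == 1) {
          MPI_Recv(&msg, 1, MPI_INT, 0, 17, MPI_COMM_WORLD, &status);
          MPI_Send(&msg, 1, MPI_INT, 0, 23, MPI_COMM_WORLD);
      }

      MPI_Finalize();
      return 0;
  }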
Chap.4 Non-blocking Communication

1. MPI Overview
2. Process model and language bindings (MPI_Init(), MPI_Comm_rank())
3. Messages and point-to-point communication
4. Non-blocking communication
5. Collective communication
6. Virtual topologies
7. Derived datatypes
8. Case study
Deadlock
Code in each MPI process:
  MPI_Ssend( ..., right_rank, ... )
  MPI_Recv( ..., left_rank, ... )
Every process waits in MPI_Ssend until its right neighbor posts the receive: deadlock.
[Figure: processes arranged in a ring, each sending to its right neighbor]
Non-Blocking
Communications
Initiate the communication, do some work in the meantime (latency hiding), then wait for completion.
Non-Blocking Examples
Non-blocking send
MPI_Isend(...)
doing some other work
MPI_Wait(...)
Non-blocking receive
MPI_Irecv(...)
doing some other work
MPI_Wait(...)
Non-Blocking Send
Initiate non-blocking send
in the ring example: Initiate non-blocking send to the right
neighbor
Do some work:
in the ring example: Receiving the message from left
neighbor
[Figure: ring of processes - each posts a non-blocking send to the right and then receives from the left]
Non-Blocking Receive
Initiate non-blocking receive
  in the ring example: initiate a non-blocking receive from the left neighbor
Do some work:
  in the ring example: sending the message to the right neighbor
Then wait for the non-blocking receive to complete.
[Figure: ring of processes]
Request Handles
Request handles
are used for non-blocking communication
must be stored in local variables
C: MPI_Request
Fortran: INTEGER
the value
is generated by a non-blocking
communication routine
is used (and freed) in the MPI_WAIT routine
Non-blocking Synchronous
Send
C:
  MPI_Issend(buf, count, datatype, dest, tag, comm, OUT &request_handle);
  MPI_Wait(INOUT &request_handle, &status);
Fortran:
  CALL MPI_ISSEND(buf, count, datatype, dest, tag, comm, OUT request_handle, ierror)
  CALL MPI_WAIT(INOUT request_handle, status, ierror)
buf must not be used between Issend and Wait (in all programming languages)
  [MPI 1.1, page 40, lines 44-45]
Non-blocking Receive
C:
  MPI_Irecv(buf, count, datatype, source, tag, comm, OUT &request_handle);
  MPI_Wait(INOUT &request_handle, &status);
Fortran:
  CALL MPI_IRECV(buf, count, datatype, source, tag, comm, OUT request_handle, ierror)
  CALL MPI_WAIT(INOUT request_handle, status, ierror)
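A minimal sketch (assumed example) that overlaps a non-blocking receive from the left neighbor with a non-blocking synchronous send to the right neighbor, as in the ring exercise:

  #include <mpi.h>

  void ring_step(int my_rank, int size, int snd_buf, int *rcv_buf)
  {
      int right = (my_rank + 1) % size;
      int left  = (my_rank - 1 + size) % size;
      MPI_Request rq_recv, rq_send;
      MPI_Status  status;

      MPI_Irecv(rcv_buf, 1, MPI_INT, left, 0, MPI_COMM_WORLD, &rq_recv);
      MPI_Issend(&snd_buf, 1, MPI_INT, right, 0, MPI_COMM_WORLD, &rq_send);

      /* ... other work could be done here (latency hiding) ... */

      MPI_Wait(&rq_send, &status);   /* snd_buf may be reused afterwards  */
      MPI_Wait(&rq_recv, &status);   /* rcv_buf now contains the message  */
  }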
Blocking and non-blocking:
Send and receive can each be blocking or non-blocking.
A blocking send can be used with a non-blocking receive, and vice-versa.
Non-blocking sends can use any mode:
  standard      MPI_ISEND
  synchronous   MPI_ISSEND
  buffered      MPI_IBSEND
  ready         MPI_IRSEND
Completion
C:
  int MPI_Wait(MPI_Request *request, MPI_Status *status)
  int MPI_Test(MPI_Request *request, int *flag, MPI_Status *status)
Fortran:
  CALL MPI_WAIT( request_handle, status, ierror)
  CALL MPI_TEST( request_handle, flag, status, ierror)
One must WAIT, or loop with TEST until the request is completed, i.e., flag == 1 or .TRUE.
Multiple Non-Blocking
Communications
You have several request handles:
Wait or test for completion of one message:   MPI_Waitany / MPI_Testany
Wait or test for completion of all messages:  MPI_Waitall / MPI_Testall
Wait or test for completion of as many messages as possible:  MPI_Waitsome / MPI_Testsome
[Figure: ring exercise, initialization - each process stores its my_rank in snd_buf and sets sum = 0]
Use the non-blocking synchronous send to avoid deadlocks and to verify the correctness of the program, because a blocking synchronous send would cause a deadlock.
Initialization: each process puts its rank into snd_buf; sum = 0.
Each iteration: pass snd_buf around the ring and add the received value to sum.
Fortran:  dest   = mod(my_rank+1, size)
          source = mod(my_rank-1+size, size)
C:        dest   = (my_rank+1) % size;
          source = (my_rank-1+size) % size;
Single Program !!!
[Figure: per-iteration flow of snd_buf, rcv_buf and sum around the ring]
Advanced Exercises
Irecv instead of Issend
Substitute the Issend-Recv-Wait scheme with the Irecv-Ssend-Wait scheme in your ring program.
Or
Substitute the Issend-Recv-Wait scheme with the Irecv-Issend-Waitall scheme in your ring program.
Chap.5 Collective
Communication
1. MPI Overview
2. Process model and language bindings (MPI_Init(), MPI_Comm_rank())
3. Messages and point-to-point communication
4. Non-blocking communication
5. Collective communication, e.g., broadcast
6. Virtual topologies
7. Derived datatypes
8. Case study
Collective
Communication
Characteristics of Collective
Communication
Optimised Communication routines
involving a group of processes
Collective action over a communicator,
i.e. all processes must call the collective
routine.
Synchronization may or may not occur.
All collective operations are blocking.
No tags.
Receive buffers must have exactly the
same size as send buffers.
Barrier Synchronization
C: int MPI_Barrier(MPI_Comm comm)
Fortran:
MPI_BARRIER(COMM, IERROR)
INTEGER COMM, IERROR
MPI_Barrier is normally never needed: all synchronization is done automatically by the data communication (a process cannot continue before it has received the data it needs).
Broadcast
C:  int MPI_Bcast(void *buf, int count, MPI_Datatype datatype,
                  int root, MPI_Comm comm)
Fortran:  MPI_BCAST(BUF, COUNT, DATATYPE, ROOT, COMM, IERROR)
          <type> BUF(*)
          INTEGER COUNT, DATATYPE, ROOT
          INTEGER COMM, IERROR
[Figure: before the bcast only the root (e.g., root=1) holds the value "red"; after the bcast every process holds "red"]
root is the rank of the sending process (i.e., the root process);
it must be given identically by all processes.
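A minimal sketch (assumed example, run with at least 2 processes): the root reads a value and broadcasts it to all processes in MPI_COMM_WORLD.

  #include <stdio.h>
  #include <mpi.h>

  int main(int argc, char **argv)
  {
      int my_rank, value = 0;

      MPI_Init(&argc, &argv);
      MPI_Comm_rank(MPI_COMM_WORLD, &my_rank);

      if (my_rank == 1)        /* root = 1, as in the slide's figure */
          value = 12345;

      MPI_Bcast(&value, 1, MPI_INT, 1, MPI_COMM_WORLD);
      printf("rank %d now has value %d\n", my_rank, value);

      MPI_Finalize();
      return 0;
  }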
Scatter
[Figure: before the scatter the root (e.g., root=1) holds A B C D E; after the scatter process 0 holds A, process 1 holds B, ..., process 4 holds E]
C:  int MPI_Scatter(void *sendbuf, int sendcount, MPI_Datatype sendtype,
                    void *recvbuf, int recvcount, MPI_Datatype recvtype,
                    int root, MPI_Comm comm)
Fortran:  MPI_SCATTER(SENDBUF, SENDCOUNT, SENDTYPE, RECVBUF,
                      RECVCOUNT, RECVTYPE, ROOT, COMM, IERROR)
          <type> SENDBUF(*), RECVBUF(*)
          INTEGER SENDCOUNT, SENDTYPE, RECVCOUNT, RECVTYPE
          INTEGER ROOT, COMM, IERROR
Gather
[Figure: before the gather each process holds one item (A, B, C, D, E); after the gather the root (e.g., root=1) holds A B C D E]
C:  int MPI_Gather(void *sendbuf, int sendcount, MPI_Datatype sendtype,
                   void *recvbuf, int recvcount, MPI_Datatype recvtype,
                   int root, MPI_Comm comm)
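A minimal sketch (assumed example) combining the two calls: root 0 scatters one integer to each process, each process doubles its value, and the results are gathered back.

  #include <stdio.h>
  #include <mpi.h>

  int main(int argc, char **argv)
  {
      int my_rank, size, item, i;
      int sendbuf[64], recvbuf[64];          /* assumes size <= 64 */

      MPI_Init(&argc, &argv);
      MPI_Comm_rank(MPI_COMM_WORLD, &my_rank);
      MPI_Comm_size(MPI_COMM_WORLD, &size);

      if (my_rank == 0)
          for (i = 0; i < size; i++) sendbuf[i] = i;   /* data to distribute */

      MPI_Scatter(sendbuf, 1, MPI_INT, &item, 1, MPI_INT, 0, MPI_COMM_WORLD);
      item *= 2;                                       /* local work */
      MPI_Gather(&item, 1, MPI_INT, recvbuf, 1, MPI_INT, 0, MPI_COMM_WORLD);

      if (my_rank == 0)
          for (i = 0; i < size; i++) printf("recvbuf[%d] = %d\n", i, recvbuf[i]);

      MPI_Finalize();
      return 0;
  }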
Global Reduction
Operations
To perform a global reduce operation across all members of a
group.
d0 o d1 o d2 o ... o d(s-2) o d(s-1)
  di = data in the process with rank i (a single variable, or a vector)
  o  = associative operation
Example:
global sum or product
global maximum or minimum
global user-defined operation
Example of Global
Reduction
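A minimal sketch (assumed example) of such a global reduction: every process contributes its rank and the root (rank 0) receives the global sum via MPI_Reduce.

  #include <stdio.h>
  #include <mpi.h>

  int main(int argc, char **argv)
  {
      int my_rank, sum;

      MPI_Init(&argc, &argv);
      MPI_Comm_rank(MPI_COMM_WORLD, &my_rank);

      MPI_Reduce(&my_rank, &sum, 1, MPI_INT, MPI_SUM, 0, MPI_COMM_WORLD);

      if (my_rank == 0)
          printf("sum of all ranks = %d\n", sum);

      MPI_Finalize();
      return 0;
  }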
Predefined reduction operation handles:
MPI_MAX       Maximum
MPI_MIN       Minimum
MPI_SUM       Sum
MPI_PROD      Product
MPI_LAND      Logical AND
MPI_BAND      Bitwise AND
MPI_LOR       Logical OR
MPI_BOR       Bitwise OR
MPI_LXOR      Logical exclusive OR
MPI_BXOR      Bitwise exclusive OR
MPI_MAXLOC    Maximum and location of the maximum
MPI_MINLOC    Minimum and location of the minimum
MPI_REDUCE
[Figure: before MPI_REDUCE each process holds its inbuf, e.g., (A B C), (D E F), (G H I), (J K L), (M N O); after the call only the root (root=1) holds the result, whose first element is AoDoGoJoM]
User-Defined Reduction
Operations
Operator handles can be predefined (see table above) or user-defined.
A user-defined operation must be associative.
The user-defined function must perform the operation vector_A o vector_B, storing the result in vector_B (the inout argument).
The syntax of the user-defined function is defined in the MPI-1 standard.
Example
  typedef struct {
      double real, imag;
  } Complex;

  /* the user-defined function: element-wise complex product,
     result stored in inout */
  void myProd( Complex *in, Complex *inout, int *len, MPI_Datatype *dptr )
  {
      int i;
      Complex c;
      for (i=0; i< *len; ++i) {
          c.real = inout->real*in->real - inout->imag*in->imag;
          c.imag = inout->real*in->imag + inout->imag*in->real;
          *inout = c;
          in++; inout++;
      }
  }

  /* creating the datatype and operation handles */
  MPI_Datatype ctype;
  MPI_Op myOp;
  MPI_Type_contiguous( 2, MPI_DOUBLE, &ctype );
  MPI_Type_commit( &ctype );
  MPI_Op_create( myProd, 1 /* commute = true */, &myOp );
Variants of Reduction
Operations
MPI_ALLREDUCE
  no root; the result is returned in all processes
MPI_REDUCE_SCATTER
  the result vector of the reduction operation is scattered across the processes into their result buffers
MPI_SCAN
  prefix reduction; the result at the process with rank i is the reduction of the inbuf values from rank 0 to rank i
MPI_ALLREDUCE
[Figure: before MPI_ALLREDUCE each process holds its inbuf (A B C), (D E F), (G H I), (J K L), (M N O); after the call every process holds the result, whose first element is AoDoGoJoM]
MPI_SCAN
[Figure: before MPI_SCAN each process holds its inbuf; after the call the first result element is A on rank 0, AoD on rank 1, AoDoG on rank 2, AoDoGoJ on rank 3, AoDoGoJoM on rank 4 - done in parallel]
Exercise Global
reduction
Rewrite the pass-around-the-ring program so that it uses an MPI global reduction to carry out its global sum.
Starting points:
  ~course00/MPI-I/examples/fortran/ring.f
  ~course00/MPI-I/examples/c/ring.c
[Figure: prefix sums on ranks 0-4: sum=0, sum=1, sum=3, sum=6, sum=10]
Chap.6 Virtual Topologies

1. MPI Overview
2. Process model and language bindings (MPI_Init(), MPI_Comm_rank())
3. Messages and point-to-point communication
4. Non-blocking communication
5. Collective communication
6. Virtual topologies
7. Derived datatypes
8. Case study
Motivations
[Figure: the processes 0 ... 5 of a communicator split into an Odd_group and an Even_group]
MPI_Comm_group(MPI_COMM_WORLD, &Old_group)
MPI_Group_incl(Old_group, 3, Odd_ranks, &Odd_group)
MPI_Group_incl(Old_group, 3, Even_ranks, &Even_group)
Alternatively:
  color = modulo(myrank, 2)
  MPI_Comm_split(MPI_COMM_WORLD, color, key, &newcomm)
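A minimal sketch (assumed example) of the MPI_Comm_split alternative: split MPI_COMM_WORLD into an "even" and an "odd" communicator and query the new rank in each.

  #include <stdio.h>
  #include <mpi.h>

  int main(int argc, char **argv)
  {
      int world_rank, color, new_rank;
      MPI_Comm newcomm;

      MPI_Init(&argc, &argv);
      MPI_Comm_rank(MPI_COMM_WORLD, &world_rank);

      color = world_rank % 2;                /* 0 = even group, 1 = odd group */
      MPI_Comm_split(MPI_COMM_WORLD, color, world_rank /* key */, &newcomm);
      MPI_Comm_rank(newcomm, &new_rank);

      printf("world rank %d has rank %d in the %s communicator\n",
             world_rank, new_rank, color ? "odd" : "even");

      MPI_Comm_free(&newcomm);
      MPI_Finalize();
      return 0;
  }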
Group Management
Group Accessors
MPI_Group_size()
MPI_Group_rank()
Group Constructors
MPI_COMM_GROUP()
MPI_GROUP_INCL()
MPI_GROUP_EXCL()
Group Destructors
MPI_GROUP_FREE(group)
Communicator Management
Communicator Accessors
MPI_COMM_SIZE()
MPI_COMM_RANK()
Communicator Constructors
MPI_COMM_CREATE()
MPI_COMM_SPLIT()
Communicator Destructors
MPI_COMM_FREE(comm)
Virtual topology
For more complex mappings, MPI routines are available.
Global array A(1:3000, 1:4000, 1:500) = 6*10^9 words
  distributed on 3 x 4 x 5 = 60 processors
  process coordinates 0..2, 0..3, 0..4
  example: on the process with coordinates ic0=2, ic1=0, ic2=3 (rank=43)
    the decomposition is, e.g., A(2001:3000, 1:1000, 301:400) = 0.1*10^9 words
Process coordinates: handled with virtual Cartesian topologies
Array decomposition: handled by the application program directly
Graphical representation
[Figure: distribution of the 60 processes over the 3 x 4 x 5 grid and the corresponding distribution of the global array]
Coordinate (2, 0, 3) represents process number 43; it is assigned the cube A(2001:3000, 1:1000, 301:400).
Virtual Topologies
Convenient process naming.
Simplifies writing of code.
Can allow MPI to optimize
communications.
Topology Types
Cartesian Topologies
each process is connected to its neighbor in a virtual
grid,
boundaries can be cyclic, or not,
processes are identified by Cartesian coordinates,
of course,
communication between any two processes is still
allowed.
Graph Topologies
general graphs,
not covered here.
Fortran:  MPI_CART_CREATE(COMM_OLD, NDIMS, DIMS, PERIODS,
                          REORDER, COMM_CART, IERROR)
          INTEGER COMM_OLD, NDIMS, DIMS(*)
          LOGICAL PERIODS(*), REORDER
          INTEGER COMM_CART, IERROR
Example:
  comm_old = MPI_COMM_WORLD
  ndims    = 2
  dims     = ( 4, 3 )
  periods  = ( 1/.true., 0/.false. )
  reorder  = see next slide
[Figure: the resulting 4 x 3 grid; ranks 0..11 with Cartesian coordinates (0,0) ... (3,2)]
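A minimal C sketch (assumed example, needs at least 12 processes): create the 4 x 3 cylinder from the slides, cyclic in the first dimension only, and query the own coordinates.

  #include <stdio.h>
  #include <mpi.h>

  int main(int argc, char **argv)
  {
      int dims[2]    = {4, 3};
      int periods[2] = {1, 0};      /* cyclic in dim 0, not in dim 1 */
      int reorder    = 1;
      int my_rank, coords[2];
      MPI_Comm comm_cart;

      MPI_Init(&argc, &argv);
      MPI_Cart_create(MPI_COMM_WORLD, 2, dims, periods, reorder, &comm_cart);

      if (comm_cart != MPI_COMM_NULL) {   /* surplus processes get MPI_COMM_NULL */
          MPI_Comm_rank(comm_cart, &my_rank);
          MPI_Cart_coords(comm_cart, my_rank, 2, coords);
          printf("rank %d has coordinates (%d,%d)\n", my_rank, coords[0], coords[1]);
          MPI_Comm_free(&comm_cart);
      }

      MPI_Finalize();
      return 0;
  }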
Example: A 2-dimensional Cylinder
Ranks and Cartesian process coordinates in comm_cart:
[Figure: the 4 x 3 cylinder - 0 (0,0), 1 (0,1), 2 (0,2), 3 (1,0), 4 (1,1), 5 (1,2), 6 (2,0), 7 (2,1), 8 (2,2), 9 (3,0), 10 (3,1), 11 (3,2); cyclic in the first dimension]
Cartesian Mapping Functions
Mapping ranks to process grid coordinates:
C:  int MPI_Cart_coords(MPI_Comm comm_cart, int rank, int maxdims,
                        int *coords)
Fortran:  MPI_CART_COORDS(COMM_CART, RANK, MAXDIMS, COORDS, IERROR)
          INTEGER COMM_CART, RANK, MAXDIMS, COORDS(*), IERROR
[Figure: e.g., rank 7 maps to coordinates (2,1)]
Cartesian Mapping Functions
Mapping process grid coordinates to ranks:
C:  int MPI_Cart_rank(MPI_Comm comm_cart, int *coords, int *rank)
Fortran:  MPI_CART_RANK(COMM_CART, COORDS, RANK, IERROR)
          INTEGER COMM_CART, COORDS(*)
          INTEGER RANK, IERROR
[Figure: e.g., coordinates (2,1) map to rank 7]
Own coordinates
[Figure: the 4 x 3 grid of ranks and coordinates; MPI_Cart_rank maps coordinates to ranks, MPI_Cart_coords maps ranks to coordinates]
Each process can obtain its own coordinates by passing its own rank to MPI_Cart_coords.
Cartesian Mapping Functions
Computing the ranks of the neighboring processes:
C:  int MPI_Cart_shift(MPI_Comm comm_cart, int direction, int disp,
                       int *rank_source, int *rank_dest)
Fortran:  MPI_CART_SHIFT(COMM_CART, DIRECTION, DISP,
                         RANK_SOURCE, RANK_DEST, IERROR)
          INTEGER COMM_CART, DIRECTION, DISP
          INTEGER RANK_SOURCE, RANK_DEST, IERROR
MPI_Cart_shift Example
[Figure: on the 4 x 3 grid, e.g., a shift with direction=0 and disp=1 seen from process 7 (2,1) gives rank_source = 4 (1,1) and rank_dest = 10 (3,1)]
Cartesian Partitioning
Cut a grid up into slices.
A new communicator is produced for each slice.
Each slice can then perform its own collective
communications.
C:  int MPI_Cart_sub(MPI_Comm comm_cart, int *remain_dims,
                     MPI_Comm *comm_slice)
Fortran:  MPI_CART_SUB( COMM_CART, REMAIN_DIMS, COMM_SLICE, IERROR)
          INTEGER COMM_CART
          LOGICAL REMAIN_DIMS(*)
          INTEGER COMM_SLICE, IERROR
[Figure: the 4 x 3 grid of ranks and coordinates being cut into slices]
MPI_Cart_sub Example
[Figure: with remain_dims = (true, false) the 4 x 3 grid is cut into 3 column slices; e.g., processes 0 (0,0), 3 (1,0), 6 (2,0), 9 (3,0) form one slice and get new ranks 0 (0), 1 (1), 2 (2), 3 (3) in comm_slice]
Chap.7 Derived Datatypes

1. MPI Overview
2. Process model and language bindings (MPI_Init(), MPI_Comm_rank())
3. Messages and point-to-point communication
4. Non-blocking communication
5. Collective communication
6. Virtual topologies
7. Derived datatypes
8. Case study
MPI Datatypes
Description of the memory layout
of the buffer
for sending
for receiving
Basic types
Derived types
Vectors, structs, others
Built from existing datatypes
struct buff_layout {
    int    i_val[3];
    double d_val[5];
} buffer;

array_of_types[0]         = MPI_INT;
array_of_blocklengths[0]  = 3;
array_of_displacements[0] = 0;
array_of_types[1]         = MPI_DOUBLE;
array_of_blocklengths[1]  = 5;
array_of_displacements[1] = ...;   /* byte offset of d_val within the struct */
MPI_Type_struct(2, array_of_blocklengths, array_of_displacements,
                array_of_types, &buff_datatype);
MPI_Type_commit(&buff_datatype);

MPI_Send(&buffer, 1, buff_datatype, ...)
  &buffer = the start address of the data
[Figure: memory layout chosen by the compiler - 3 int followed by 5 double]
Derived Datatypes - Type Maps
  basic datatype 0      displacement of datatype 0
  basic datatype 1      displacement of datatype 1
  ...                   ...
  basic datatype n-1    displacement of datatype n-1
Example:
[Figure: a buffer containing a char, two ints, and a double at byte offsets 0, 4, 8, 16]
  basic datatype   MPI_CHAR   MPI_INT   MPI_INT   MPI_DOUBLE
  displacement     0          4         8         16
Contiguous Data
The simplest derived datatype
Consists of a number of contiguous items of the same datatype
[Figure: newtype = count contiguous copies of oldtype]
C:  int MPI_Type_contiguous(int count, MPI_Datatype oldtype,
                            MPI_Datatype *newtype)
Vector Datatype
[Figure: newtype built from oldtype - 2 blocks of 3 elements, with a stride of 5 elements between the block starts]
  count       = 2 blocks
  blocklength = 3 elements per block
  stride      = 5 (element stride between blocks)
C:  int MPI_Type_vector(int count, int blocklength, int stride,
                        MPI_Datatype oldtype, MPI_Datatype *newtype)
Fortran:  MPI_TYPE_VECTOR(COUNT, BLOCKLENGTH, STRIDE,
                          OLDTYPE, NEWTYPE, IERROR)
          INTEGER COUNT, BLOCKLENGTH, STRIDE
          INTEGER OLDTYPE, NEWTYPE, IERROR
MPI_TYPE_VECTOR: An Example
Sending the first row of an N x M matrix (here 4 x 5)
[Figure: memory layout of the matrix in C (row-major) and in Fortran (column-major)]
Fortran (the row is strided in memory):
  MPI_Type_vector(5, 1, 4, MPI_INT, MPI_ROW)
  MPI_Type_commit(MPI_ROW)
  MPI_Send(buf, ..., MPI_ROW, ...)
  MPI_Recv(buf, ..., MPI_ROW, ...)
Sending the first column of the matrix
C (the column is strided in memory):
  MPI_Type_vector(4, 1, 5, MPI_INT, MPI_COL)
Fortran (the column is contiguous in memory):
  MPI_Type_vector(1, 4, 1, MPI_INT, MPI_COL)
MPI_Type_commit(MPI_COL)
MPI_Send(buf, ..., MPI_COL, ...)
MPI_Recv(buf, ..., MPI_COL, ...)
Sending a sub-matrix (Fortran):
  MPI_Type_vector(3, 2, 4, MPI_INT, MPI_SUBMAT)
  MPI_Type_commit(MPI_SUBMAT)
  MPI_Send(buf, ..., MPI_SUBMAT, ...)
  MPI_Recv(buf, ..., MPI_SUBMAT, ...)
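A complete minimal sketch of the column case in C (assumed example): send column 2 of a 4 x 5 integer matrix from rank 0 to rank 1 with MPI_Type_vector.

  #include <stdio.h>
  #include <mpi.h>

  int main(int argc, char **argv)
  {
      int a[4][5], my_rank, i, j;
      MPI_Datatype col_type;
      MPI_Status status;

      MPI_Init(&argc, &argv);
      MPI_Comm_rank(MPI_COMM_WORLD, &my_rank);

      /* 4 blocks of 1 int, stride 5 ints: one column of the row-major matrix */
      MPI_Type_vector(4, 1, 5, MPI_INT, &col_type);
      MPI_Type_commit(&col_type);

      if (my_rank == 0) {
          for (i = 0; i < 4; i++)
              for (j = 0; j < 5; j++) a[i][j] = 10*i + j;
          MPI_Send(&a[0][2], 1, col_type, 1, 0, MPI_COMM_WORLD);  /* column 2 */
      } else if (my_rank == 1) {
          for (i = 0; i < 4; i++)
              for (j = 0; j < 5; j++) a[i][j] = -1;
          MPI_Recv(&a[0][2], 1, col_type, 0, 0, MPI_COMM_WORLD, &status);
          for (i = 0; i < 4; i++) printf("a[%d][2] = %d\n", i, a[i][2]);
      }

      MPI_Type_free(&col_type);
      MPI_Finalize();
      return 0;
  }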
Struct Datatype
[Figure: newtype = block 0 (3 x MPI_INT, starting at addr_0) followed by block 1 (5 x MPI_DOUBLE, starting at addr_1)]
C:  int MPI_Type_struct(int count, int *array_of_blocklengths,
                        MPI_Aint *array_of_displacements,
                        MPI_Datatype *array_of_types,
                        MPI_Datatype *newtype)
  count                  = 2
  array_of_blocklengths  = ( 3, 5 )
  array_of_displacements = ( 0, addr_1 - addr_0 )
  array_of_types         = ( MPI_INT, MPI_DOUBLE )
array_of_displacements[i] := address(block_i) - address(block_0)
MPI-1
C:
int MPI_Address(void* location,
MPI_Aint *address)
Fortran: MPI_ADDRESS(LOCATION,
ADDRESS, IERROR)
<type>
LOCATION(*)
INTEGER ADDRESS, IERROR
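A minimal sketch (assumed example, MPI-1 style as on these slides; newer MPI versions use MPI_Get_address and MPI_Type_create_struct) that builds the earlier buff_layout datatype and computes the displacement with MPI_Address:

  #include <mpi.h>

  struct buff_layout {
      int    i_val[3];
      double d_val[5];
  } buffer;

  MPI_Datatype make_buff_datatype(void)
  {
      int          array_of_blocklengths[2] = {3, 5};
      MPI_Datatype array_of_types[2] = {MPI_INT, MPI_DOUBLE};
      MPI_Aint     array_of_displacements[2], addr0, addr1;
      MPI_Datatype buff_datatype;

      MPI_Address(&buffer.i_val[0], &addr0);
      MPI_Address(&buffer.d_val[0], &addr1);
      array_of_displacements[0] = 0;
      array_of_displacements[1] = addr1 - addr0;   /* accounts for any padding */

      MPI_Type_struct(2, array_of_blocklengths, array_of_displacements,
                      array_of_types, &buff_datatype);
      MPI_Type_commit(&buff_datatype);
      return buff_datatype;     /* use with MPI_Send(&buffer, 1, ...) */
  }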
Committing a Datatype
Before a datatype handle is used in message passing communication,
it needs to be committed with MPI_TYPE_COMMIT.
This needs to be done only once (per datatype).
C: int MPI_Type_commit(MPI_Datatype *datatype);
Fortran: MPI_TYPE_COMMIT(DATATYPE, IERROR)
INTEGER DATATYPE, IERROR
IN-OUT argument
Size and extent of the vector example (count=2, blocklength=3, stride=5):
  size   := 6 * size(oldtype)
  extent := 8 * extent(oldtype)
[Figure: better visualization of newtype - two blocks of three elements with a two-element gap in between]
C:  int MPI_Type_size(MPI_Datatype datatype, int *size)
    int MPI_Type_extent(MPI_Datatype datatype, MPI_Aint *extent)
Fortran:  MPI_TYPE_SIZE(DATATYPE, SIZE, IERROR)
          INTEGER DATATYPE, SIZE, IERROR
          MPI_TYPE_EXTENT(DATATYPE, EXTENT, IERROR)
          INTEGER DATATYPE, EXTENT, IERROR