OpenMP Examples

This document provides an overview of OpenMP and examples of its use for parallel programming in C/C++. It discusses OpenMP directives such as #pragma omp parallel, which creates a parallel region, and clauses such as private and shared, which specify variable scoping. It also includes examples of work-sharing constructs, such as parallel loops and sections, that distribute work across threads. The examples demonstrate compiling and running simple OpenMP programs on Linux clusters to print output from each thread.

OpenMP Tutorial

https://computing.llnl.gov/tutorials/openMP/

C / C++ - General Code Structure

#include <omp.h>

main () {

  int var1, var2, var3;

  Serial code
        .
        .
        .

  Beginning of parallel section. Fork a team of threads.
  Specify variable scoping.

  #pragma omp parallel private(var1, var2) shared(var3)
  {

    Parallel section executed by all threads
        .
    Other OpenMP directives
        .
    Run-time Library calls
        .
    All threads join master thread and disband

  }

  Resume serial code
        .
        .
        .

}
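For reference, a minimal compilable version of this skeleton might look as follows (the variable names and printed values are illustrative, not part of the tutorial):

#include <omp.h>
#include <stdio.h>

int main () {

  int var1 = 0, var2 = 0, var3 = 42;    /* serial code */

  /* Fork a team of threads; var1 and var2 are per-thread copies,
     var3 is shared by the whole team */
  #pragma omp parallel private(var1, var2) shared(var3)
  {
    var1 = omp_get_thread_num();        /* each thread has its own var1 */
    var2 = var1 * 2;
    printf("Thread %d: var2 = %d, shared var3 = %d\n", var1, var2, var3);
  }  /* implicit barrier: threads join, master continues */

  return 0;                             /* resume serial code */
}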

Compiler Support

GNU C/C++ 4.4.7 supports OpenMP 3.0.

OpenMP 4.0 support: according to vendor documentation, OpenMP 4.0 is
supported beginning with the following compiler versions:

 • GNU: 4.9 for C/C++

Compiler / Platform     Compiler Flag
GNU gcc                 -fopenmp
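For example, to compile and run a program with GNU gcc (the file name myprog.c is just a placeholder):

gcc -fopenmp myprog.c -o myprog
./myprog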

C / C++ Directives Format

Format:

#pragma omp   directive-name   [clause, ...]   newline

 • #pragma omp: Required for all OpenMP C/C++ directives.
 • directive-name: A valid OpenMP directive. Must appear after the
   pragma and before any clauses.
 • [clause, ...]: Optional. Clauses can be in any order, and repeated
   as necessary unless otherwise restricted.
 • newline: Required. Precedes the structured block which is enclosed
   by this directive.

Example:

#pragma omp parallel default(shared) private(beta,pi)

#pragma omp parallel [clause ...] newline
    if (scalar_expression)
    private (list)
    shared (list)
    default (shared | none)
    firstprivate (list)
    reduction (operator: list)
    copyin (list)
    num_threads (integer-expression)

  structured_block
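As a sketch of how several of these clauses combine on one directive (the variables and thread count here are made up for illustration):

#include <omp.h>
#include <stdio.h>

int main () {
  int n = 100, sum = 0;
  float beta = 0.0f, pi = 3.14159f;

  /* Run in parallel only if n > 10, with at most 4 threads; beta is
     per-thread, pi is shared by default, and the per-thread values of
     sum are combined with + at the end of the region */
  #pragma omp parallel num_threads(4) if(n > 10) \
          default(shared) private(beta) reduction(+:sum)
  {
    beta = pi * omp_get_thread_num();   /* private copy of beta */
    sum += 1;                           /* each thread contributes 1 */
  }

  printf("Threads that executed the region: %d\n", sum);
  return 0;
}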

Notes:
 • When a thread reaches a PARALLEL directive, it creates a team of
   threads and becomes the master of the team. The master is a member
   of that team and has thread number 0 within that team.
 • Starting from the beginning of the parallel region, the code is
   duplicated and all threads will execute that code.
 • There is an implied barrier at the end of a parallel section. Only
   the master thread continues execution past this point.
 • If any thread terminates within a parallel region, all threads in
   the team will terminate, and the work done up until that point is
   undefined.

How Many Threads?

 • The number of threads in a parallel region is determined by the
   following factors, in order of precedence:
   1. Evaluation of the IF clause
   2. Setting of the NUM_THREADS clause
   3. Use of the omp_set_num_threads() library function
   4. Setting of the OMP_NUM_THREADS environment variable
   5. Implementation default - usually the number of CPUs on a node,
      though it could be dynamic.
 • Threads are numbered from 0 (master thread) to N-1.
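A small sketch of these precedence rules (assuming dynamic adjustment is off and the implementation grants the requested counts):

#include <omp.h>
#include <stdio.h>

int main () {
  omp_set_num_threads(8);       /* library call: request 8 threads */

  #pragma omp parallel
  {
    if (omp_get_thread_num() == 0)   /* master reports the team size */
      printf("region 1: %d threads\n", omp_get_num_threads());
  }

  /* The NUM_THREADS clause takes precedence over the library call */
  #pragma omp parallel num_threads(2)
  {
    if (omp_get_thread_num() == 0)
      printf("region 2: %d threads\n", omp_get_num_threads());
  }

  return 0;
}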

C / C++ - Parallel Region Example


#include <omp.h>
#include <stdio.h>

int main () {

int nthreads, tid;

/* Fork a team of threads with each thread having a private tid
   variable */
#pragma omp parallel private(tid)
  {

  /* Obtain and print thread id */
  tid = omp_get_thread_num();
  printf("Hello World from thread = %d\n", tid);

  /* Only master thread does this */
  if (tid == 0)
    {
    nthreads = omp_get_num_threads();
    printf("Number of threads = %d\n", nthreads);
    }

  }  /* All threads join master thread and terminate */

}
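Run with four threads, the output is interleaved in a nondeterministic order and might look like this:

Hello World from thread = 1
Hello World from thread = 3
Hello World from thread = 0
Number of threads = 4
Hello World from thread = 2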
OpenMP Exercise 1

Getting Started
Overview:

 • Login to the workshop cluster using your workshop username
   and OTP token
 • Copy the exercise files to your home directory
 • Familiarize yourself with LC's OpenMP environment
 • Write a simple "Hello World" OpenMP program
 • Successfully compile your program
 • Successfully run your program
 • Modify the number of threads used to run your program (see the tip
   below)
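One common way to do the last step (bash syntax; OMP_NUM_THREADS is the standard OpenMP environment variable, and the executable name hello matches the compile example later in this exercise) is:

export OMP_NUM_THREADS=8
./hello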

GO TO THE EXERCISE HERE

https://computing.llnl.gov/tutorials/openMP/exercise.html

OpenMP Exercise

https://computing.llnl.gov/tutorials/openMP/exercise.html

Exercise 1

1. Login to the workshop machine.

   Workshops differ in how this is done. The instructor will go over
   this beforehand.

2. Copy the example files.

   In your home directory, create a subdirectory for the example
   codes and then cd to it:

   mkdir openMP
   cd openMP

   Then, copy the C version of the parallel OpenMP exercise files
   to your openMP subdirectory:

   cp /usr/global/docs/training/blaise/openMP/C/* ~/openMP


EXAMPLE 1 - hello world

/**********************************************************************
* FILE: omp_hello.c
* DESCRIPTION:
*   OpenMP Example - Hello World - C/C++ Version
*   In this simple example, the master thread forks a parallel region.
*   All threads in the team obtain their unique thread number and
*   print it. The master thread only prints the total number of
*   threads. Two OpenMP library routines are used to obtain the number
*   of threads and each thread's number.
* AUTHOR: Blaise Barney 5/99
* LAST REVISED: 04/06/05
**********************************************************************/
#include <omp.h>
#include <stdio.h>
#include <stdlib.h>

int main (int argc, char *argv[])
{
int nthreads, tid;

/* Fork a team of threads giving them their own copies of variables */
#pragma omp parallel private(nthreads, tid)
  {

  /* Obtain thread number */
  tid = omp_get_thread_num();
  printf("Hello World from thread = %d\n", tid);

  /* Only master thread does this */
  if (tid == 0)
    {
    nthreads = omp_get_num_threads();
    printf("Number of threads = %d\n", nthreads);
    }

  }  /* All threads join master thread and disband */

}

Using your choice of compiler, compile your hello world OpenMP
program. This may take several attempts if there are any code errors.
For example:

gcc -fopenmp omp_hello.c -o hello

1. When you get a clean compile, proceed.
2. Run your hello executable and notice its output.
   o Is it what you expected? As a comparison, you can compile and
     run the provided omp_hello.c example program.
3. How many threads were created? By default, the GNU compilers will
   create one thread for each core.
4. Notes:
   o For the remainder of this exercise, you can use the compiler
     command of your choice unless indicated otherwise.
   o Compilers differ in which warnings they issue, but warnings can
     be ignored for this exercise. Errors are different, of course.

EXAMPLE 2 – workShare1

/*********************************************************************
* FILE: omp_workshare1.c
* DESCRIPTION:
*   OpenMP Example - Loop Work-sharing - C/C++ Version
*   In this example, the iterations of a loop are scheduled dynamically
*   across the team of threads. A thread will perform CHUNK iterations
*   at a time before being scheduled for the next CHUNK of work.
* AUTHOR: Blaise Barney 5/99
* LAST REVISED: 04/06/05
**********************************************************************/
#include <omp.h>
#include <stdio.h>
#include <stdlib.h>
#define CHUNKSIZE 10
#define N 100

int main (int argc, char *argv[])
{
int nthreads, tid, i, chunk;
float a[N], b[N], c[N];

/* Some initializations */
for (i=0; i < N; i++)
  a[i] = b[i] = i * 1.0;
chunk = CHUNKSIZE;

#pragma omp parallel shared(a,b,c,nthreads,chunk) private(i,tid)
  {
  tid = omp_get_thread_num();
  if (tid == 0)
    {
    nthreads = omp_get_num_threads();
    printf("Number of threads = %d\n", nthreads);
    }
  printf("Thread %d starting...\n",tid);

  #pragma omp for schedule(dynamic,chunk)
  for (i=0; i<N; i++)
    {
    c[i] = a[i] + b[i];
    printf("Thread %d: c[%d]= %f\n",tid,i,c[i]);
    }

  }  /* end of parallel section */

}
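Here schedule(dynamic,chunk) hands each idle thread the next CHUNK iterations on demand, which helps when iteration costs vary. For contrast, a static variant of the same loop (a sketch; the result is identical for this uniform loop) deals the chunks out round-robin before the loop starts:

#pragma omp for schedule(static,chunk)
for (i=0; i<N; i++)
  {
  c[i] = a[i] + b[i];   /* same work; chunks assigned up front */
  }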

EXAMPLE 3 - workShare2

/*********************************************************************
* FILE: omp_workshare2.c
* DESCRIPTION:
*   OpenMP Example - Sections Work-sharing - C Version
*   In this example, the OpenMP SECTION directive is used to assign
*   different array operations to each thread that executes a SECTION.
* AUTHOR: Blaise Barney 5/99
* LAST REVISED: 07/16/07
**********************************************************************/
#include <omp.h>
#include <stdio.h>
#include <stdlib.h>
#define N 50

int main (int argc, char *argv[])
{
int i, nthreads, tid;
float a[N], b[N], c[N], d[N];

/* Some initializations */
for (i=0; i<N; i++) {
  a[i] = i * 1.5;
  b[i] = i + 22.35;
  c[i] = d[i] = 0.0;
  }

#pragma omp parallel shared(a,b,c,d,nthreads) private(i,tid)
  {
  tid = omp_get_thread_num();
  if (tid == 0)
    {
    nthreads = omp_get_num_threads();
    printf("Number of threads = %d\n", nthreads);
    }
  printf("Thread %d starting...\n",tid);

  #pragma omp sections nowait
    {
    #pragma omp section
      {
      printf("Thread %d doing section 1\n",tid);
      for (i=0; i<N; i++)
        {
        c[i] = a[i] + b[i];
        printf("Thread %d: c[%d]= %f\n",tid,i,c[i]);
        }
      }

    #pragma omp section
      {
      printf("Thread %d doing section 2\n",tid);
      for (i=0; i<N; i++)
        {
        d[i] = a[i] * b[i];
        printf("Thread %d: d[%d]= %f\n",tid,i,d[i]);
        }
      }

    }  /* end of sections */

  printf("Thread %d done.\n",tid);

  }  /* end of parallel section */

}
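Note the nowait clause on the sections directive: it removes the implied barrier at the end of the sections construct, so a thread that finishes its section (or receives none) proceeds straight to the final printf without waiting for the rest of the team. Dropping nowait would make every thread wait there, so the "Thread %d done." lines would only appear after both sections complete.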

EXAMPLE 4 - matrix multiply

/*********************************************************************
* FILE: omp_mm.c
* DESCRIPTION:
*   OpenMP Example - Matrix Multiply - C Version
*   Demonstrates a matrix multiply using OpenMP. Threads share row
*   iterations according to a predefined chunk size.
* AUTHOR: Blaise Barney
* LAST REVISED: 06/28/05
**********************************************************************/
#include <omp.h>
#include <stdio.h>
#include <stdlib.h>

#define NRA 62   /* number of rows in matrix A */
#define NCA 15   /* number of columns in matrix A */
#define NCB 7    /* number of columns in matrix B */

int main (int argc, char *argv[])
{
int tid, nthreads, i, j, k, chunk;
double a[NRA][NCA],  /* matrix A to be multiplied */
       b[NCA][NCB],  /* matrix B to be multiplied */
       c[NRA][NCB];  /* result matrix C */

chunk = 10;          /* set loop iteration chunk size */

/*** Spawn a parallel region explicitly scoping all variables ***/
#pragma omp parallel shared(a,b,c,nthreads,chunk) private(tid,i,j,k)
  {
  tid = omp_get_thread_num();
  if (tid == 0)
    {
    nthreads = omp_get_num_threads();
    printf("Starting matrix multiply example with %d threads\n",nthreads);
    printf("Initializing matrices...\n");
    }

  /*** Initialize matrices ***/
  #pragma omp for schedule (static, chunk)
  for (i=0; i<NRA; i++)
    for (j=0; j<NCA; j++)
      a[i][j]= i+j;
  #pragma omp for schedule (static, chunk)
  for (i=0; i<NCA; i++)
    for (j=0; j<NCB; j++)
      b[i][j]= i*j;
  #pragma omp for schedule (static, chunk)
  for (i=0; i<NRA; i++)
    for (j=0; j<NCB; j++)
      c[i][j]= 0;

  /*** Do matrix multiply sharing iterations on outer loop ***/
  /*** Display who does which iterations for demonstration purposes ***/
  printf("Thread %d starting matrix multiply...\n",tid);
  #pragma omp for schedule (static, chunk)
  for (i=0; i<NRA; i++)
    {
    printf("Thread=%d did row=%d\n",tid,i);
    for (j=0; j<NCB; j++)
      for (k=0; k<NCA; k++)
        c[i][j] += a[i][k] * b[k][j];
    }
  }  /*** End of parallel region ***/

/*** Print results ***/
printf("******************************************************\n");
printf("Result Matrix:\n");
for (i=0; i<NRA; i++)
  {
  for (j=0; j<NCB; j++)
    printf("%6.2f ", c[i][j]);
  printf("\n");
  }
printf("******************************************************\n");
printf("Done.\n");

}

https://computing.llnl.gov/tutorials/parallel_comp/

Parallel Computing:

 • In the simplest sense, parallel computing is the simultaneous use
   of multiple compute resources to solve a computational problem:
   o A problem is broken into discrete parts that can be solved
     concurrently
   o Each part is further broken down to a series of instructions
   o Instructions from each part execute simultaneously on different
     processors
   o An overall control/coordination mechanism is employed
 • The computational problem should be able to:
   o Be broken apart into discrete pieces of work that can be solved
     simultaneously;
   o Execute multiple program instructions at any moment in time;
   o Be solved in less time with multiple compute resources than with
     a single compute resource.
 • The compute resources are typically:
   o A single computer with multiple processors/cores
   o An arbitrary number of such computers connected by a network.

OPENMP
C Examples of Parallel Programming with OpenMP

https://people.sc.fsu.edu/~jburkardt/c_src/openmp/openmp.html
OpenMP Exercise

https://computing.llnl.gov/tutorials/openMP/exercise.html

Learning to use the OpenMP framework with GCC

http://www.ibm.com/developerworks/br/aix/library/au-aix-openmp-framework/#list2
