
Experiment 2

Title: Parallel Computation with OpenMP: Summation Techniques and Matrix-Vector Multiplication
Aim/Objective:
Illustrate parallel computation techniques using OpenMP for two fundamental algorithms—
summation and matrix-vector multiplication—aiming to enhance performance through
concurrent processing.
Description:
Implement OpenMP-based programs for parallel summation and matrix-vector multiplication,
exploring various OpenMP directives and strategies to optimize computation speed and
scalability.

Pre-Requisites:
Proficiency in a language supporting OpenMP (e.g., C/C++), understanding of OpenMP
directives, familiarity with array and matrix manipulation, and a foundational grasp of parallel
programming concepts including thread management, synchronization, and parallelization
strategies.
Pre-Lab:
1. What is the primary objective of utilizing OpenMP in parallel summation techniques?
The primary objective is to improve computational efficiency and performance by distributing the workload of the summation across multiple threads. In practice this means:
1. Dividing the workload across threads
2. Leveraging multicore processors
3. Simplifying parallel programming
4. Reducing computational bottlenecks: a "computational bottleneck" is a point in an algorithm where the computational demand is high enough to slow down the overall process
5. Maintaining scalability

2. Briefly discuss one advantage of using OpenMP for parallel summation techniques
compared to sequential approaches.
Reduced execution time: splitting the array across threads lets several cores compute partial sums simultaneously, so wall-clock time drops roughly in proportion to the number of cores (up to memory-bandwidth limits).
3. In the context of matrix-vector multiplication, define the term "data parallelism" and explain
how OpenMP leverages this concept for parallel computation.
Data parallelism refers to the simultaneous execution of the same operation on different pieces of data. In matrix-vector multiplication, this means distributing the computation of individual rows of the matrix (each row contributing a single element of the result vector) across multiple threads or processing units.
How OpenMP leverages data parallelism (see the sketch after this list):
Dividing the rows of the matrix among threads
Parallelizing the loop using #pragma omp parallel for
Managing workload distribution
Reducing overhead
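The In-Lab programs below cover summation only, so the following is a minimal sketch of row-wise data parallelism in matrix-vector multiplication; the matrix A, vector x, and their sizes are illustrative assumptions, not part of the original experiment.

#include <stdio.h>
#include <omp.h>

#define ROWS 4
#define COLS 4

int main() {
    // Illustrative data: A is ROWS x COLS, x has COLS entries
    int A[ROWS][COLS] = {{ 1,  2,  3,  4},
                         { 5,  6,  7,  8},
                         { 9, 10, 11, 12},
                         {13, 14, 15, 16}};
    int x[COLS] = {1, 1, 1, 1};
    int y[ROWS]; // Result vector

    // Data parallelism: each thread computes whole rows of y,
    // so no two threads ever write the same output element
    #pragma omp parallel for
    for (int i = 0; i < ROWS; i++) {
        y[i] = 0;
        for (int j = 0; j < COLS; j++) {
            y[i] += A[i][j] * x[j];
        }
    }

    for (int i = 0; i < ROWS; i++) {
        printf("y[%d] = %d\n", i, y[i]);
    }
    return 0;
}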

4. Explain the role of load balancing in the context of parallel matrix-vector multiplication
using OpenMP, and why it is crucial for optimizing performance.
Load balancing refers to the even distribution of computational tasks among the available threads or processing units in a parallel system; if one thread receives far more work than the others, the entire computation waits on that straggler.
Why load balancing is crucial for optimizing performance (see the scheduling sketch below):
Minimizing idle time
Reducing execution time
Improving scalability
Mitigating (reducing) overhead
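As a sketch only (reusing the A, x, y, ROWS, and COLS names assumed in the matrix-vector example above, with an illustrative chunk size), OpenMP's schedule clause is one way to tune load balance. schedule(static) splits iterations into equal blocks up front, while schedule(dynamic) lets threads grab chunks as they finish, which helps when rows have uneven cost (e.g., a sparse matrix):

// Same row-wise loop, but threads claim 2 rows at a time as they
// become free instead of receiving one fixed block up front
#pragma omp parallel for schedule(dynamic, 2)
for (int i = 0; i < ROWS; i++) {
    y[i] = 0;
    for (int j = 0; j < COLS; j++) {
        y[i] += A[i][j] * x[j];
    }
}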

5. Name one key OpenMP directive used in the parallelization of computations, and provide a
brief description of its purpose.
One key OpenMP directive is #pragma omp parallel. It creates a team of threads, each of which executes the enclosed structured block; work-sharing directives such as for are then used inside it to divide work among those threads.
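A minimal sketch of the directive in isolation (how many threads appear depends on the machine and the OMP_NUM_THREADS environment variable):

#include <stdio.h>
#include <omp.h>

int main() {
    // Every thread in the team executes this block once
    #pragma omp parallel
    {
        printf("Hello from thread %d of %d\n",
               omp_get_thread_num(), omp_get_num_threads());
    }
    return 0;
}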

In-Lab:
1. Implement Parallel Summation using OMP - The Array Element Sum Problem, Tree
structure global sum - Parallel-tree-sum.c
o Program:
Aim: Implement Parallel Summation using OMP - The Array Element Sum Problem

Source code:
#include <stdio.h>
#include <omp.h>

int main() {
    int n = 10;                                  // Number of elements in the array
    int arr[] = {1, 2, 3, 4, 5, 6, 7, 8, 9, 10}; // Array elements
    int sum = 0;                                 // Shared variable to store the result

    // Parallel region with reduction to compute the sum
    #pragma omp parallel for reduction(+:sum)
    for (int i = 0; i < n; i++) {
        sum += arr[i]; // Add each element to the shared sum
    }

    printf("The sum of the array elements is: %d\n", sum);

    return 0;
}

Output:
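The sum of the array elements is: 55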

Program Overview
1. Array initialization: the array arr has 10 elements: {1, 2, 3, 4, 5, 6, 7, 8, 9, 10}.
2. Parallelization: the #pragma omp parallel for directive parallelizes the for loop, and the reduction(+:sum) clause ensures the sum is computed correctly in parallel without race conditions (each thread accumulates into a private copy that is combined at the end).
3. Goal: compute the sum of the elements in the array, i.e., 1 + 2 + 3 + ... + 10 = 55.
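To compile and run the programs in this experiment (assuming gcc with OpenMP support; the file name is illustrative):

gcc -fopenmp parallel_sum.c -o parallel_sum
./parallel_sum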
Tracing:
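Assuming 4 threads and a static split of the 10 iterations (e.g., indices 0-2, 3-5, 6-7, 8-9), the private partial sums are 6, 15, 15, and 19, and the reduction combines them: 6 + 15 + 15 + 19 = 55. The exact split depends on the implementation, but the reduction guarantees the same final sum.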
------------------------------------------------------------------------------------
Aim: Tree structure global sum
Source code:
#include <stdio.h>
#include <omp.h>

int main() {
    int n = 16; // Number of elements in the array
    int arr[] = {1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16};
    int sum = 0; // Variable to store the global sum

    omp_set_num_threads(4); // Set the number of threads to 4

    int num_threads;
    int local_sums[4] = {0}; // Array to hold partial sums for each thread

    // Parallel region with thread-local partial sums
    #pragma omp parallel
    {
        int tid = omp_get_thread_num();              // Thread ID
        int chunk_size = n / omp_get_num_threads();  // Elements per thread
        int start = tid * chunk_size;                // Start index for this thread
        int end = start + chunk_size;                // End index for this thread

        // Compute the partial sum for this thread
        for (int i = start; i < end; i++) {
            local_sums[tid] += arr[i];
        }

        // Synchronize threads before the final reduction
        #pragma omp barrier

        // Combine partial sums in a tree-like fashion
        #pragma omp single
        {
            num_threads = omp_get_num_threads();
            for (int step = 1; step < num_threads; step *= 2) {
                for (int i = 0; i + step < num_threads; i += 2 * step) {
                    local_sums[i] += local_sums[i + step];
                }
            }
        }
    }

    // The final sum is stored in local_sums[0]
    sum = local_sums[0];
    printf("The sum of the array elements is: %d\n", sum);

    return 0;
}

Output:
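The sum of the array elements is: 136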

Explanation
1. Initialization:
   a. The array has 16 elements.
   b. The number of threads is set to 4, so each thread handles 4 elements.
2. Thread Work:
   a. Each thread computes the partial sum of its assigned chunk of the array:
      i. Thread 0: arr[0..3]
      ii. Thread 1: arr[4..7]
      iii. Thread 2: arr[8..11]
      iv. Thread 3: arr[12..15]
3. Tree-Structured Reduction:
   a. After all threads compute their local sums, the results are combined in a tree-like fashion:
      i. Step 1: combine sums from adjacent threads (0+1, 2+3).
      ii. Step 2: combine the results from the previous step (0+2).
   b. This reduction process minimizes contention and is efficient for large arrays.

Tracing
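Partial sums: thread 0 → 1+2+3+4 = 10, thread 1 → 5+6+7+8 = 26, thread 2 → 9+10+11+12 = 42, thread 3 → 13+14+15+16 = 58. Step 1: local_sums[0] = 10 + 26 = 36 and local_sums[2] = 42 + 58 = 100. Step 2: local_sums[0] = 36 + 100 = 136.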
-------------------------------------------------------------------------------------
Aim: Parallel-Tree-Sum.c
Source code:
#include <omp.h>
#include <stdio.h>
#include <stdlib.h>

#define SIZE 16 // Size of the array (power of 2 for simplicity)

int main() {
    int array[SIZE];
    int i;

    // Initialize the array with values 1 to SIZE
    for (i = 0; i < SIZE; i++) {
        array[i] = i + 1;
    }

    printf("Array elements: ");
    for (i = 0; i < SIZE; i++) {
        printf("%d ", array[i]);
    }
    printf("\n");

    int sum = 0; // Final global sum

    // Perform the tree-based parallel sum. The step counter is kept
    // private to each thread (a shared counter updated by every thread
    // would race); since all threads step through the same sequence
    // 1, 2, 4, ..., they all reach each omp for construct together.
    #pragma omp parallel
    {
        for (int step = 1; step < SIZE; step *= 2) {
            #pragma omp for
            for (int j = 0; j < SIZE; j += 2 * step) {
                array[j] += array[j + step]; // Combine one pair at this level
            }
            // Implicit barrier at the end of the omp for: every pair at
            // this level is done before any thread starts the next level
        }
    }

    // The total sum is stored in the first element
    sum = array[0];

    printf("Total Sum: %d\n", sum);

    return 0;
}

Output
For SIZE = 16, the array elements are [1, 2, 3, ..., 16]. The output would look like this:
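Array elements: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16
Total Sum: 136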

Explanation of the Code
The program uses a binary tree structure to progressively reduce pairs of elements in place: at each level, element i absorbs element i + step, so after log2(SIZE) levels the sum of the entire array is left in array[0]. Keeping step private to each thread avoids a data race on the level counter, and the implicit barrier at the end of each omp for keeps all threads on the same level of the tree.
Tracing
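Level by level (showing only the surviving elements at even strides): step = 1 → {3, 7, 11, 15, 19, 23, 27, 31}; step = 2 → {10, 26, 42, 58}; step = 4 → {36, 100}; step = 8 → array[0] = 136.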
2. Implement a program to demonstrate Parallel prefix sum using OpenMP - OMP Parallel
prefix sumfinal.c
The prefix sum (or scan) is a common operation in parallel computing, where each element in an array
is replaced by the sum of all the previous elements in the array, including the element itself.
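For example, the inclusive prefix sum of [1, 2, 3, 4] is [1, 3, 6, 10]: each output element is the sum of the input values up to and including that position.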
• Program:
#include <stdio.h>
#include <omp.h>

#define SIZE 16 // Size of the array (power of 2 for simplicity)

int main() {
    int arr[SIZE];
    int prefix_sum[SIZE]; // Array to hold the prefix sum
    int temp[SIZE];       // Scratch buffer for the double-buffered scan
    int i;

    // Initialize the array with values 1 to SIZE
    for (i = 0; i < SIZE; i++) {
        arr[i] = i + 1;
    }

    // Print the original array
    printf("Original array: ");
    for (i = 0; i < SIZE; i++) {
        printf("%d ", arr[i]);
    }
    printf("\n");

    // Prefix sum computation using a parallel (Hillis-Steele) scan.
    // Updating prefix_sum in place would race: element i reads
    // prefix_sum[i - step] while another thread may be writing it in
    // the same step. Each step therefore writes into a scratch buffer
    // and copies back before the next step.
    #pragma omp parallel
    {
        int step;

        // Copy the input array to the prefix sum array
        #pragma omp for
        for (i = 0; i < SIZE; i++) {
            prefix_sum[i] = arr[i];
        }

        // Perform the parallel prefix sum in a tree-like fashion
        for (step = 1; step < SIZE; step *= 2) {
            #pragma omp for
            for (i = step; i < SIZE; i++) {
                temp[i] = prefix_sum[i] + prefix_sum[i - step];
            }
            #pragma omp for
            for (i = step; i < SIZE; i++) {
                prefix_sum[i] = temp[i]; // Copy back before the next step
            }
            // Implicit barriers after each omp for keep the steps in order
        }
    }

    // Print the prefix sum array
    printf("Prefix sum: ");
    for (i = 0; i < SIZE; i++) {
        printf("%d ", prefix_sum[i]);
    }
    printf("\n");

    return 0;
}
Final Output
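Original array: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16
Prefix sum: 1 3 6 10 15 21 28 36 45 55 66 78 91 105 120 136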

Explanation of Output
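Each position i of the prefix sum holds arr[0] + ... + arr[i], so for this input the values are the triangular numbers k(k+1)/2 (taking k as the 1-based position); the final element, 136, equals the total sum of the array.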
