OpenMP Programming: Dot Product of Two Vectors using Parallel Communication, Slides of Parallel Computing and Programming

Ankit Institute of Technology and Science Parallel Computing and Programming

Information on writing an OpenMP-based program to compute the dot product of two vectors with 1200 elements each on a 6-core SMP system. It covers OpenMP directives and library functions, group communication operations, cost analysis, and examples of broadcast and reduction in matrix-vector multiplication. The document also discusses communication patterns in different topologies.

Typology: Slides

2011/2012

Uploaded on 07/23/2012

paramita 🇮🇳


Quiz#05

Marks10Time10minutes

WriteanOpenMPbasedprogramtocomputethe

DotproductoftwovectorsA&B(with1200

elementseach)usinga6‐CoreSMPsystem.

Referdirective/libfunctionlist


Some OpenMP directives & library functions

void omp_set_num_threads(int num_threads);

int omp_get_num_threads();

int omp_get_max_threads();

int omp_get_thread_num();

int omp_get_num_procs();

int omp_in_parallel();

#pragma omp parallel [clause list]

reduction(operator: variable list)

#pragma omp for [clause list]

#pragma omp sections

#pragma omp section

void omp_set_lock(omp_lock_t *lock);

void omp_unset_lock(omp_lock_t *lock);

int omp_test_lock(omp_lock_t *lock);
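To show how the listed library functions fit together, here is a hedged sketch that splits the dot product by hand inside a plain `#pragma omp parallel` region instead of using `reduction`. The function name `dot_product_manual` and the bound `MAX_THREADS` are assumptions made for this example. Each thread uses `omp_get_thread_num()` and `omp_get_num_threads()` to pick its own block of the vectors (200 elements each for 1200 elements and 6 threads) and writes one partial sum, which is added up serially afterwards.

```c
#ifdef _OPENMP
#include <omp.h>
#endif

#define MAX_THREADS 64  /* hypothetical upper bound used only by this sketch */

/* Dot product with an explicit block partition of the index range.
   Thread tid handles indices [tid*chunk, min((tid+1)*chunk, n)). */
double dot_product_manual(const double *a, const double *b, int n,
                          int num_threads)
{
    double partial[MAX_THREADS] = {0.0};  /* one slot per thread */
    double sum = 0.0;
    int t;

    if (num_threads < 1) num_threads = 1;
    if (num_threads > MAX_THREADS) num_threads = MAX_THREADS;

#ifdef _OPENMP
    omp_set_num_threads(num_threads);
#endif

    #pragma omp parallel
    {
        int tid = 0;       /* serial fallback: a single "thread" 0 */
        int nthreads = 1;
        int chunk, lo, hi, i;
        double local = 0.0;

#ifdef _OPENMP
        tid = omp_get_thread_num();
        nthreads = omp_get_num_threads();
#endif
        chunk = (n + nthreads - 1) / nthreads;   /* ceiling division */
        lo = tid * chunk;
        hi = (lo + chunk < n) ? lo + chunk : n;

        for (i = lo; i < hi; i++)
            local += a[i] * b[i];

        partial[tid] = local;  /* each thread writes only its own slot */
    }

    /* Serial combination after the implicit barrier at the end of the region. */
    for (t = 0; t < num_threads; t++)
        sum += partial[t];

    return sum;
}
```

The `reduction` clause is usually the shorter way to express this; the manual version is shown only to illustrate the thread-query functions.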

Summary: Group Communication Operations

• Group communication operations are built using point-to-point messaging primitives.

• Communicating a message of size $m$ over an uncongested network takes time $t_s + t_w m$.

• Where necessary, we take congestion into account explicitly by scaling the $t_w$ term.

• We assume that the network is bidirectional and that communication is single-ported.

Cost Analysis: One-to-All Broadcast & Reduction

Using the recursive doubling approach, the broadcast or reduction procedure on all the topologies (array, mesh, tree, hypercube) involves $\log p$ simple point-to-point message transfers, each at a time cost of $t_s + t_w m$.

The total time (assuming no congestion) is therefore given by:

$$T = (t_s + t_w m) \log p$$
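The cost reasoning for recursive doubling can be checked with a small helper. This is a sketch under stated assumptions: the function names are hypothetical, the symbols follow the usual convention ($t_s$ startup time, $t_w$ per-word transfer time, $m$ message size in words, $p$ nodes), and for a $p$ that is not a power of two the step count is rounded up, which is a choice made here rather than something the slides state.

```c
/* Number of recursive-doubling steps for p nodes. The set of nodes
   holding the message doubles in every step, so the count is
   ceil(log2(p)); this equals log2(p) when p is a power of two. */
int broadcast_steps(int p)
{
    int steps = 0;
    int reached;

    for (reached = 1; reached < p; reached *= 2)
        steps++;

    return steps;
}

/* Total one-to-all broadcast (or all-to-one reduction) time with no
   congestion: each step is one transfer costing ts + tw * m. */
double broadcast_time(double ts, double tw, double m, int p)
{
    return (ts + tw * m) * broadcast_steps(p);
}
```

For instance, with $t_s = 10$, $t_w = 1$, $m = 100$ and $p = 8$, each transfer costs 110 time units and three steps are needed, so the total is 330.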

Broadcast and Reduction: Matrix-Vector Multiplication

One-to-all broadcast and all-to-one reduction in the multiplication of a 4 x 4 matrix with a 4 x 1 vector.
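The data movement in this example can be imitated serially. The sketch below assumes a 4 x 4 grid of logical nodes in which node (i, j) owns matrix element A[i][j]. Element x[j] is broadcast to every node in column j, each node multiplies locally, and the products in row i are reduced to y[i]. The function name and the grid layout are assumptions made for illustration; real nodes would exchange messages instead of writing into shared arrays.

```c
#define DIM 4  /* matrix is DIM x DIM, vector is DIM x 1 */

/* Computes y = A * x by imitating a column-wise one-to-all broadcast
   of x followed by a row-wise all-to-one reduction of the products. */
void matvec_broadcast_reduce(double A[DIM][DIM], const double x[DIM],
                             double y[DIM])
{
    double x_local[DIM][DIM];  /* copy of x[j] held by node (i, j) */
    double prod[DIM][DIM];     /* local product at node (i, j) */
    int i, j;

    /* One-to-all broadcast: x[j] reaches every node in column j. */
    for (j = 0; j < DIM; j++)
        for (i = 0; i < DIM; i++)
            x_local[i][j] = x[j];

    /* Local computation at each node. */
    for (i = 0; i < DIM; i++)
        for (j = 0; j < DIM; j++)
            prod[i][j] = A[i][j] * x_local[i][j];

    /* All-to-one reduction: the products in row i are summed into y[i]. */
    for (i = 0; i < DIM; i++) {
        y[i] = 0.0;
        for (j = 0; j < DIM; j++)
            y[i] += prod[i][j];
    }
}
```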

Communication Patterns in Different Topologies

• One-to-All Broadcast and All-to-One Reduction

• All-to-All Broadcast and Reduction & All-Reduce Operations

• Scatter and Gather (one-to-all & all-to-one personalized communication)

All-to-All Broadcast & Reduction on a Ring

• Simplest approach: perform $p$ one-to-all broadcasts. This is not the most efficient way, though.

• Each node first sends to one of its neighbors the data it needs to broadcast.

• In subsequent steps, it forwards the data received from one of its neighbors to its other neighbor.

• The algorithm terminates in $p - 1$ steps.

All-to-All Broadcast and Reduction on a Ring

On a ring, the time is given by:

$$T = (t_s + t_w m)(p - 1)$$
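The ring procedure can be simulated to confirm the step count. In this sketch every node starts with its own message, sends it to its right neighbor, and in each later step forwards whatever it received in the previous step. The names `ring_all_to_all`, `ring_all_to_all_time` and `MAX_P` are hypothetical, and the choice of "right" as the forwarding direction is an assumption made for this example.

```c
#define MAX_P 16  /* hypothetical cap on ring size for this sketch */

/* Simulates all-to-all broadcast on a ring of p nodes.
   On return, got[i][k] is 1 if node i holds the message that
   originated at node k. Returns the number of communication steps,
   or -1 if p is outside the range 1..MAX_P. */
int ring_all_to_all(int p, int got[MAX_P][MAX_P])
{
    int carry[MAX_P];  /* origin of the message node i forwards next */
    int next[MAX_P];   /* origin of the message arriving at node i */
    int steps = 0;
    int i, k, step;

    if (p < 1 || p > MAX_P)
        return -1;

    for (i = 0; i < p; i++) {
        for (k = 0; k < p; k++)
            got[i][k] = (i == k);  /* each node starts with its own message */
        carry[i] = i;
    }

    for (step = 1; step < p; step++) {
        /* All nodes send simultaneously to their right neighbor. */
        for (i = 0; i < p; i++)
            next[(i + 1) % p] = carry[i];

        /* Each node stores what it received and forwards it next step. */
        for (i = 0; i < p; i++) {
            carry[i] = next[i];
            got[i][carry[i]] = 1;
        }
        steps++;
    }

    return steps;
}

/* Ring all-to-all broadcast time: p - 1 steps of cost ts + tw * m each. */
double ring_all_to_all_time(double ts, double tw, double m, int p)
{
    return (ts + tw * m) * (p - 1);
}
```

For a ring of 8 nodes the simulation takes 7 steps, matching $p - 1$, and every node ends up holding all 8 messages.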

All-to-all broadcast on a 3 x 3 mesh. The groups of nodes communicating with each other in each phase are enclosed by dotted boundaries. By the end of the second phase, all nodes get (0,1,2,3,4,5,6,7,8), that is, a message from each of the nine nodes.