Prepare for your exams
Get points
Guidelines and tips
Sell on Docsity
Docsity AI

Prepare for your exams

Study with the several resources on Docsity

Earn points to download

Earn points by helping other students or get them with a premium plan

Guidelines and tips

Sell on Docsity

Docsity AI

Prepare for your exams

Study with the several resources on Docsity

Find documents

Prepare for your exams with the study notes shared by other students like you on Docsity

Search for your university

Find the specific documents for your university's exams

Docsity AINEW

Summarize your documents, ask them questions, convert them into quizzes and concept maps

Explore questions

Clear up your doubts by reading the answers to questions asked by your fellow students

Earn points to download

Earn points by helping other students or get them with a premium plan

Share documents

20 Points

For each uploaded document

Answer questions

5 Points

For each given answer (max 1 per day)

All the ways to get free points

Get points immediately

Choose a premium plan with all the points you need

Study Opportunities

Choose your next study program

Get in touch with the best universities in the world. Search through thousands of universities and official partners

Community

Ask the community

Ask the community for help and clear up your study doubts

Free resources

Our save-the-student-ebooks!

Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors

Performance Metrics for Parallel Programs-Parallel Computing-Lecture Slides, Slides of Parallel Computing and Programming

Pakistan Institute of Engineering and Applied Sciences, Islamabad (PIEAS)Parallel Computing and Programming

This lecture was delivered by Dr. Hanif Durad at Pakistan Institute of Engineering and Applied Sciences, Islamabad (PIEAS) for Parallel Computing course. it includes: Performance, Metrics, Parallel, Programs, Timing, Wall, User, CPU, Runtime, MPI, Platform, Independent

Typology: Slides

2011/2012

Uploaded on 07/19/2012

adnaan 🇵🇰

(1)

13 documents

1 / 34

This page cannot be seen from the preview

Don't miss anything!

Dr. Hanif Durad 2

Lecture Outline-Part1

Timing

wall time

user time

system time

Measuring time

Using gprof program

PC-2.pdf

Discover Slides of Parallel Computing and Programming Pakistan Institute of Engineering and Applied Sciences, Islamabad (PIEAS)

Partial preview of the text

Download Performance Metrics for Parallel Programs-Parallel Computing-Lecture Slides and more Slides Parallel Computing and Programming in PDF only on Docsity!

Dr. Hanif Durad

Lecture Outline-Part

 Timing  wall time  user time  system time  Measuring time  Using gprof program PC-2.pdf

Timing

 In order to parallelize a program/algorithm, weneed to know which parts of a program need themost computation time.  Three different time spans to be considered:  wall time  user time  system time Dr. Hanif Durad

User Time

 The actual runtime used by the program.  User time << the wall time  the program has to wait a lot, for example forcomputation time allocation or data from the RAMor from the hard-disk.  These are indications for necessary optimizations.  When using more than one CPU, the user timeshould be higher than the wall time, indicating thatthe CPUs work in parallel.

System Time

 Time used not by the program itself, but by theoperating system, e.g. for allocating memory orhard disk access.  System time should stay low. Dr. Hanif Durad

Measuring time (2/3)

 For the performance analysis, we want to know the runtime required by individual parts of a program.  There are several programming language and operatingsystem dependent methods for measuring time inside aprogram.  MPI & OpenMP have their own, platform independentfunctions for time measurement.  MPI_Wtime() & omp_get_wtime() return the wall time in secs, the difference between the results of two such function callsyields the runtime elapsed between the two function calls.

Measuring time (3/3)

 advanced method of performance analysis: profiling  the program has to be built with information for theprofiler.  Example:  done with the switch -p for Intel Fortran  at run, the program creates the file gmon.out required by the profiler gprof  gprof program > prof.txt creates a text file with the profilinginformation.  flat profile lists all function/subroutine calls, time used for them,percentage of the total time, no. of calls etc  call tree, a listing of all routines call by the subroutines of the program

Analytical modelingof parallel programs

Dr. Hanif Durad

Lecture Outline- -Part 2Modeling of Parallel Programs

 Parallel Execution Time  Parallel Cost  Overheads, Sources Of Overhead  Speedup, Efficiency  Amdahl’s Law, Scalability  Granularity, Coupling Analysis.ppt

Overhead (T

 Overhead: T o

=C-T

s  Where does it come from?  idling^  not enough parallelism  load imbalance  communication  additional and/or repeated calculations Dr. Hanif Durad Gramma, P-

Other Measures

 Speedup: 

S=T

/T

p , where T s is the best sequential time  Efficiency:  E = S/p = T s /pT p

= T

/C = T

/ (T

+T

Dr. Hanif Durad

Scalability of Parallel Systems(2/2)

 Consequence of Amdahl’s law:  for a given instance, adding additional processors gives diminishing returns  only relatively few processors can be efficiently used  Way around:  increase the problem size  sequential part tends to grow slower then the parallel part  A system is scalable if efficiency can be maintained byincreasing problem size Dr. Hanif Durad

Granularity

Dr. Hanif Durad The size of the computation segments between communication. fine grained coarse grained ILP loop parallelism task parallelism

Fine Grain Parallelism

 Typified by long computations consisting of large numbers ofinstructions between communication synchronization points  High computation to communication ratio  Lower communication overhead  Harder to load balance efficiently P0 P computation P 2 commmunication P 3 P

Granularity

 The most efficient granularity is dependent on thealgorithm and the hardware environment inwhich it runs  In most cases overhead associated withcommunications and synchronization is highrelative to execution speed so it is advantageousto have coarse granularity. Dr. Hanif Durad

Performance Metrics for Parallel Programs-Parallel Computing-Lecture Slides, Slides of Parallel Computing and Programming

Related documents

Partial preview of the text

Download Performance Metrics for Parallel Programs-Parallel Computing-Lecture Slides and more Slides Parallel Computing and Programming in PDF only on Docsity!

Lecture Outline-Part

Timing

User Time

System Time

Measuring time (2/3)

Measuring time (3/3)

Analytical modelingof parallel programs

Lecture Outline- -Part 2Modeling of Parallel Programs

Overhead (T

=C-T

Other Measures

S=T

/T

= T

/C = T

/ (T

+T

Scalability of Parallel Systems(2/2)

Granularity

Fine Grain Parallelism

Granularity