Prepare for your exams
Get points
Guidelines and tips
Sell on Docsity
Docsity AI

Prepare for your exams

Study with the several resources on Docsity

Earn points to download

Earn points by helping other students or get them with a premium plan

Guidelines and tips

Sell on Docsity

Docsity AI

Prepare for your exams

Study with the several resources on Docsity

Find documents

Prepare for your exams with the study notes shared by other students like you on Docsity

Search for your university

Find the specific documents for your university's exams

Docsity AINEW

Summarize your documents, ask them questions, convert them into quizzes and concept maps

Explore questions

Clear up your doubts by reading the answers to questions asked by your fellow students

Earn points to download

Earn points by helping other students or get them with a premium plan

Share documents

20 Points

For each uploaded document

Answer questions

5 Points

For each given answer (max 1 per day)

All the ways to get free points

Get points immediately

Choose a premium plan with all the points you need

Study Opportunities

Choose your next study program

Get in touch with the best universities in the world. Search through thousands of universities and official partners

Community

Ask the community

Ask the community for help and clear up your study doubts

Free resources

Our save-the-student-ebooks!

Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors

Parallel Execution & Compilers: Dependence Analysis & Enhanced Parallelization, Slides of Assembly Language Programming

Massachusetts Institute of Technology (MIT)Assembly Language Programming

This document, presented by prof. Saman amarasinghe in the iap 2007 course at mit, outlines the concepts of parallel execution, parallelizing compilers, dependence analysis, and increasing parallelization opportunities. It covers types of parallelism, iteration space, dependence definition, data dependence analysis, and techniques for increasing parallelization opportunities such as scalar privatization, reduction recognition, induction variable identification, array privatization, interprocedural parallelization, loop transformations, and granularity of parallelism.

Typology: Slides

2010/2011

Uploaded on 10/11/2011

lovefool 🇬🇧

4.5

(21)

292 documents

1 / 68

This page cannot be seen from the preview

Don't miss anything!

Prof. Saman Amarasinghe, MIT. 1 6.189 IAP 2007 MIT

6.189 IAP 2007

Lecture 11

Parallelizing Compilers

Discover Slides of Assembly Language Programming Massachusetts Institute of Technology (MIT)

Partial preview of the text

Download Parallel Execution & Compilers: Dependence Analysis & Enhanced Parallelization and more Slides Assembly Language Programming in PDF only on Docsity!

Prof. Saman Amarasinghe, MIT.

6.189 IAP 2007Lecture 11Parallelizing Compilers

Outline ●^ Parallel Execution ●^ Parallelizing Compilers ●^ Dependence Analysis ●^ Increasing Parallelization Opportunities ●^ Generation of Parallel Loops ●^ Communication Code GenerationProf. Saman Amarasinghe, MIT.

Why Loops? ●^ 90% of the execution time in 10% of the code^ ^ Mostly in loops ●^ If parallel, can get good performance^ ^ Load balancing ●^ Relatively easy to analyzeProf. Saman Amarasinghe, MIT.

Programmer Defined Parallel Loop ●^ FORALL^ ^ No “loop carrieddependences”^ ^ Fully parallel Prof. Saman Amarasinghe, MIT.

●^ FORACROSS^ ^ Some “loop carrieddependences”

Parallel Execution ●^ Example^ FORPAR I = 0 to NA[I] = A[I] + 1 ●^ Block Distribution: Program gets mapped into^ Iters = ceiling(N/NUMPROC);FOR P = 0 to NUMPROC-1FOR I = PIters to MIN((P+1)Iters, N)A[I] = A[I] + 1 ●^ Code that fork a function^ Iters = ceiling(N/NUMPROC);ParallelExecute(func1);^ …^ void func1(integer myPid){ FOR I = myPidIters to MIN((myPid+1)Iters, N)A[I] = A[I] + 1} Prof. Saman Amarasinghe, MIT.

Outline ●^ Parallel Execution ●^ Parallelizing Compilers ●^ Dependence Analysis ●^ Increasing Parallelization Opportunities ●^ Generation of Parallel Loops ●^ Communication Code GenerationProf. Saman Amarasinghe, MIT.

Iteration Space ●^ N deep loops^ Æ^ Prof. Saman Amarasinghe, MIT.

n-dimensional discretecartesian space Normalized loops: assume step size = 1 FOR^ I^ =^0 to^

(^6) FOR J = I to^7 ●^ Iterations are represented ascoordinates in iteration space^ ^ i̅^ = [i, i, i^1

,…, i] 3 n^

0 1 2^3

5 6 7 ^ J

(^012) I Æ 3 4 5 6

Iteration Space ●^ N deep loops^ Æ^ Prof. Saman Amarasinghe, MIT.

n-dimensional discretecartesian space Normalized loops: assume step size = 1 FOR^ I^ =^0 to^

(^6) FOR J = I to^7 ●^ Iterations are represented ascoordinates in iteration space ●^ Sequential execution order of iterations^ Î^ Lexicographic order[0,0], [0,1], [0,2], …, [0,6], [0,7],[1,1], [1,2], …, [1,6], [1,7],[2,2], …, [2,6], [2,7],

………[6,6], [6,7],

0 1 2^3

5 6 7 ^ J

(^012) I Æ 3 4 5 6

Iteration Space ●^ N deep loops^ Æ^ Prof. Saman Amarasinghe, MIT.

n-dimensional discretecartesian space Normalized loops: assume step size = 1 FOR^ I^ =^0 to^

(^6) FOR J = I to^7 ●^ An affine loop nest^ ^ Loop bounds are integer linear functions ofconstants, loop constant variables andouter loop indexes^ ^ Array accesses are integer linear functionsof constants, loop constant variables andloop indexes

0 1 2^3

5 6 7 ^ J

(^012) I Æ 3 4 5 6

Iteration Space ●^ N deep loops^ Æ^ Prof. Saman Amarasinghe, MIT.

n-dimensional discretecartesian space Normalized loops: assume step size = 1 FOR^ I^ =^0 to^

(^6) FOR J = I to^7 ●^ Affine loop nest

Æ^ Iteration space as aset of liner inequalities 0 ≤ I I ≤ (^6) I ≤ J J ≤ 7

0 1 2^3

5 6 7 ^ J

(^012) I Æ 3 4 5 6

Dependences ●^ True dependence^ a^ = =^ a ●^ Anti dependence^ =^ a a^ = ●^ Output dependence^ a^ = a^ = ●^ Definition:Data dependence exists for a dynamic instance i and j iff^ ^ either i or j is a write operation^ ^ i and j refer to the same variable^ ^ i executes before j ●^ How about array accesses within loops?Prof. Saman Amarasinghe, MIT.

Outline ●^ Parallel Execution ●^ Parallelizing Compilers ●^ Dependence Analysis ●^ Increasing Parallelization Opportunities ●^ Generation of Parallel Loops ●^ Communication Code GenerationProf. Saman Amarasinghe, MIT.

Array Accesses in a loop Prof. Saman Amarasinghe, MIT.

FOR I = 0 to 5A[I] = A[I] + 1 0 1 2^3 4

0 1 2^3

Iteration Space^5

Data Space

= A[I]A[I]= A[I]A[I]= A[I]A[I]= A[I]A[I]= A[I]A[I]= A[I]A[I]

Array Accesses in a loop Prof. Saman Amarasinghe, MIT.

FOR I = 0 to 5A[I+1] = A[I] + 1 0 1 2^3 4

0 1 2^3

Iteration Space^5

Data Space

= A[I]A[I+1]= A[I]A[I+1]= A[I]A[I+1]= A[I]A[I+1]= A[I]A[I+1]= A[I]A[I+1]

Parallel Execution & Compilers: Dependence Analysis & Enhanced Parallelization, Slides of Assembly Language Programming

Related documents

Partial preview of the text

Download Parallel Execution & Compilers: Dependence Analysis & Enhanced Parallelization and more Slides Assembly Language Programming in PDF only on Docsity!

6.189 IAP 2007Lecture 11Parallelizing Compilers

Outline ●^ Parallel Execution ●^ Parallelizing Compilers ●^ Dependence Analysis ●^ Increasing Parallelization Opportunities ●^ Generation of Parallel Loops ●^ Communication Code GenerationProf. Saman Amarasinghe, MIT.

Why Loops? ●^ 90% of the execution time in 10% of the code^ ^ Mostly in loops ●^ If parallel, can get good performance^ ^ Load balancing ●^ Relatively easy to analyzeProf. Saman Amarasinghe, MIT.

Outline ●^ Parallel Execution ●^ Parallelizing Compilers ●^ Dependence Analysis ●^ Increasing Parallelization Opportunities ●^ Generation of Parallel Loops ●^ Communication Code GenerationProf. Saman Amarasinghe, MIT.

Iteration Space ●^ N deep loops^ Æ^ Prof. Saman Amarasinghe, MIT.

0 1 2^3

5 6 7 ^ J

Iteration Space ●^ N deep loops^ Æ^ Prof. Saman Amarasinghe, MIT.

0 1 2^3

5 6 7 ^ J

Iteration Space ●^ N deep loops^ Æ^ Prof. Saman Amarasinghe, MIT.

0 1 2^3

5 6 7 ^ J

Iteration Space ●^ N deep loops^ Æ^ Prof. Saman Amarasinghe, MIT.

0 1 2^3

5 6 7 ^ J

Outline ●^ Parallel Execution ●^ Parallelizing Compilers ●^ Dependence Analysis ●^ Increasing Parallelization Opportunities ●^ Generation of Parallel Loops ●^ Communication Code GenerationProf. Saman Amarasinghe, MIT.

Array Accesses in a loop Prof. Saman Amarasinghe, MIT.

0 1 2^3

Array Accesses in a loop Prof. Saman Amarasinghe, MIT.

0 1 2^3

Why Loops? ●^ 90% of the execution time in 10% of the code^ ^ Mostly in loops ●^ If parallel, can get good performance^ ^ Load balancing ●^ Relatively easy to analyzeProf. Saman Amarasinghe, MIT.