Prepare for your exams
Get points
Guidelines and tips
Sell on Docsity
Docsity AI

Prepare for your exams

Study with the several resources on Docsity

Earn points to download

Earn points by helping other students or get them with a premium plan

Guidelines and tips

Sell on Docsity

Docsity AI

Prepare for your exams

Study with the several resources on Docsity

Find documents

Prepare for your exams with the study notes shared by other students like you on Docsity

Search for your university

Find the specific documents for your university's exams

Docsity AINEW

Summarize your documents, ask them questions, convert them into quizzes and concept maps

Explore questions

Clear up your doubts by reading the answers to questions asked by your fellow students

Earn points to download

Earn points by helping other students or get them with a premium plan

Share documents

20 Points

For each uploaded document

Answer questions

5 Points

For each given answer (max 1 per day)

All the ways to get free points

Get points immediately

Choose a premium plan with all the points you need

Study Opportunities

Choose your next study program

Get in touch with the best universities in the world. Search through thousands of universities and official partners

Community

Ask the community

Ask the community for help and clear up your study doubts

Free resources

Our save-the-student-ebooks!

Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors

Advanced Computer Architecture: Parallel Systems and Parallel Computers, Schemes and Mind Maps of Advanced Computer Architecture

Almuthanna University (AMU)Advanced Computer Architecture

An overview of parallel computers, their role in the future of computing, and the different types of parallelism. It covers the concepts of bit-level, instruction-level, process/thread-level, and job-level parallelism, as well as applications in scientific computing and commercial industries. The document also discusses programming models, communication abstractions, and taxonomy of parallel architecture. It includes examples of communication architectures and performance keys.

Typology: Schemes and Mind Maps

2021/2022

Uploaded on 09/07/2022

adnan_95 🇮🇶

4.3

(39)

918 documents

1 / 38

This page cannot be seen from the preview

Don't miss anything!

CMSC 611: Advanced

Computer Architecture

Parallel Systems

Discover Schemes and Mind Maps of Advanced Computer Architecture Almuthanna University (AMU)

Partial preview of the text

Download Advanced Computer Architecture: Parallel Systems and Parallel Computers and more Schemes and Mind Maps Advanced Computer Architecture in PDF only on Docsity!

CMSC 611: Advanced

Computer Architecture

Parallel Systems

Parallel Computers

Definition: “A parallel computer is a collection of

processing elements that cooperate and communicate to

solve large problems fast.”

Almasi and Gottlieb, Highly Parallel Computing ,

Parallel machines are expected to have a bigger role in

the future since:

Microprocessors are likely to remain dominant in the uniprocessor

arena and the logical way to extend the performance is by

connecting multiple microprocessors

It is not expected that the microprocessor technology will keep the

pace of performance improvement given the increased level of

complexity

There has been steady progress in software development for

parallel architectures in recent years

Slide is a courtesy of Dave Patterson

Level of Parallelism

Bit-level parallelism

• ALU parallelism: 1-bit, 4-bits, 8-bit, ...

Instruction-level parallelism (ILP)

• Pipelining, Superscalar, VLIW, Out-of-Order

execution

Process/Thread-level parallelism

• Divide job into parallel tasks

Job-level parallelism

• Independent jobs on one computer system

Applications

Scientific Computing

Nearly Unlimited Demand (Grand Challenge):
Successes in some real industries:
- Petroleum: reservoir modeling
- Automotive: crash simulation, drag analysis, engine
- Aeronautics: airflow analysis, engine, structural mechanics
- Pharmaceuticals: molecular modeling
  - Slide is a courtesy of Dave Patterson

App Perf (GFLOPS) Memory (GB)

48 hour weather 0.1 0.

72 hour weather 3 1

Pharmaceutical design 100 10

Global Change, Genome 1000 1000

Framework

Extend traditional computer architecture with a

communication architecture

abstractions (HW/SW interface)
organizational structure to realize abstraction efficiently

Programming Model:

Multiprogramming: lots of jobs, no communication
Shared address space: communicate via memory
Message passing: send and receive messages
Data Parallel: several agents operate on several data sets

simultaneously and then exchange information globally and

simultaneously (shared or message passing)

Communication Abstraction:

Shared address space: e.g., load, store, atomic swap
Message passing: e.g., send, receive library calls
Debate over this topic (ease of programming, scaling)

→ many hardware designs 1:1 programming model

Taxonomy of Parallel

Architecture

Flynn Categories

• SISD (Single Instruction Single Data)

• MISD (Multiple Instruction Single Data)

• SIMD (Single Instruction Multiple Data)

• MIMD (Multiple Instruction Multiple Data)

Slide is a courtesy of Dave Patterson

MISD

No commercial examples

Apply same operations to a set of data

• Find primes

• Crack passwords

SIMD

Vector/Array computers

Data Parallel Model

Operations performed in parallel on each element of a

large regular data structure, such as an array

One Control Processor broadcast to many processing elements

(PE) with condition flag per PE so that can skip

For distributed memory architecture data is distributed

among memories

Data parallel model requires fast global synchronization
Data parallel programming languages lay out data to processor
Vector processors have similar ISAs, but no data placement

restriction

Slide is a courtesy of Dave Patterson

SIMD Utilization

Conditional Execution

PE Enable
- if (f<.5) {...}
Global PE enable check
- while (t > 0) {...} Memory Program Data Controller

Single swizzle operation collects one word from each PE in block
- Designed for antialiasing
NO inter-block connections
NO global routing Memory Program Data Controller

Advanced Computer Architecture: Parallel Systems and Parallel Computers, Schemes and Mind Maps of Advanced Computer Architecture

Related documents

Partial preview of the text

Download Advanced Computer Architecture: Parallel Systems and Parallel Computers and more Schemes and Mind Maps Advanced Computer Architecture in PDF only on Docsity!

CMSC 611: Advanced

Computer Architecture

Parallel Systems

Parallel Computers

Definition: “A parallel computer is a collection of

processing elements that cooperate and communicate to

solve large problems fast.”

Parallel machines are expected to have a bigger role in

the future since:

arena and the logical way to extend the performance is by

connecting multiple microprocessors

pace of performance improvement given the increased level of

complexity

parallel architectures in recent years

Level of Parallelism

Bit-level parallelism

• ALU parallelism: 1-bit, 4-bits, 8-bit, ...

Instruction-level parallelism (ILP)

• Pipelining, Superscalar, VLIW, Out-of-Order

execution

Process/Thread-level parallelism

• Divide job into parallel tasks

Job-level parallelism

• Independent jobs on one computer system

Applications

Scientific Computing

App Perf (GFLOPS) Memory (GB)

48 hour weather 0.1 0.

72 hour weather 3 1

Pharmaceutical design 100 10

Global Change, Genome 1000 1000

Framework

Extend traditional computer architecture with a

communication architecture

Programming Model:

simultaneously and then exchange information globally and

simultaneously (shared or message passing)

Communication Abstraction:

→ many hardware designs 1:1 programming model

Taxonomy of Parallel

Architecture

Flynn Categories

• SISD (Single Instruction Single Data)

• MISD (Multiple Instruction Single Data)

• SIMD (Single Instruction Multiple Data)

• MIMD (Multiple Instruction Multiple Data)

MISD

No commercial examples

Apply same operations to a set of data

• Find primes

• Crack passwords

SIMD

Vector/Array computers

Data Parallel Model

Operations performed in parallel on each element of a

large regular data structure, such as an array

(PE) with condition flag per PE so that can skip

For distributed memory architecture data is distributed

among memories

restriction

SIMD Utilization

Conditional Execution

PE

PE

PE

PE

PE

PE

PE

PE

PE

PE

PE

PE

PE

PE