CSE 311 Computer Organization: Enhancing Performance with Pipelining, Slides of Computer Architecture and Organization

Alexandria University Computer Architecture and Organization

Computer Organization MIPS pipelining

Typology: Slides

2020/2021

Uploaded on 04/14/2021


CSE 311

Computer Organization

Mostafa I. Soliman

Professor of Computer Engineering

CSE Department

[email protected]

Lecture 10

Enhancing Performance

with Pipelining


Enhancing Performance with Pipelining

• Introduction to Pipelining
• Pipelined vs. Single-Cycle Instruction Execution
• Pipelining MIPS
• What Makes Pipelining Hard?
  - Structural Hazards
  - Control Hazards
  - Data Hazards
• Putting It All Together: Pipelined Datapath

"You can often find in rivers what you cannot find in oceans." (Indian proverb)

Pipelined vs. Single-Cycle Instruction Execution: the Plan

[Figure: execution timelines for lw $1, 100($0); lw $2, 200($0); lw $3, 300($0), shown in program execution order (in instructions) against time. Each instruction passes through Instruction fetch, Reg, ALU, Data access, Reg. Single-cycle: each instruction starts 8 ns after the previous one. Pipelined: each instruction starts 2 ns after the previous one.]

Assume 2 ns for a memory access or an ALU operation and 1 ns for a register access. An lw then needs 2 + 1 + 2 + 2 + 1 = 8 ns, so the single-cycle clock is 8 ns; the pipelined clock cycle is 2 ns, set by the slowest stage.
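The timing above can be checked with a few lines of Python. This is a minimal sketch: the stage latencies come from the slide's assumption, while the function names and the stage labels used as dictionary keys are chosen here for illustration.

```python
# Stage latencies in ns, from the assumption stated on the slide.
STAGE_NS = {"IF": 2, "ID": 1, "EX": 2, "MEM": 2, "WB": 1}


def single_cycle_total(n_instructions, stages=STAGE_NS):
    """Total time when every instruction takes one long clock cycle."""
    cycle = sum(stages.values())  # the clock must fit all stages back to back
    return n_instructions * cycle


def pipelined_total(n_instructions, stages=STAGE_NS):
    """Total time when instructions overlap, one stage per clock cycle."""
    cycle = max(stages.values())  # the clock is set by the slowest stage
    # The first instruction needs one cycle per stage; each later one
    # completes one cycle after its predecessor.
    return (len(stages) + n_instructions - 1) * cycle


print(single_cycle_total(3))  # 24
print(pipelined_total(3))     # 14
```

For the three lw instructions this gives 24 ns single-cycle against 14 ns pipelined.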

Pipelining: Keep in Mind

• Pipelining does not reduce the latency of a single task; it increases the throughput of the entire workload
• The pipeline rate is limited by the longest stage
  - potential speedup = number of pipe stages
  - unbalanced lengths of pipe stages reduce speedup
• Time to fill the pipeline and time to drain it (when there is slack in the pipeline) reduces speedup
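The points above can be made concrete with a small calculation. The sketch below assumes an ideal pipeline with no hazards; the function name and the two example stage lists are illustrative choices, not part of the lecture.

```python
def speedup(n_instructions, stage_ns):
    """Speedup of an ideal pipeline over unpipelined execution."""
    unpipelined = n_instructions * sum(stage_ns)
    cycle = max(stage_ns)  # rate limited by the longest stage
    # Filling and draining the pipeline costs len(stage_ns) - 1 extra cycles.
    pipelined = (len(stage_ns) + n_instructions - 1) * cycle
    return unpipelined / pipelined


balanced = [2, 2, 2, 2, 2]    # five equal stages
unbalanced = [2, 1, 2, 2, 1]  # the 2 ns / 1 ns stages assumed in this lecture

for n in (3, 1000, 1_000_000):
    print(n, round(speedup(n, balanced), 3), round(speedup(n, unbalanced), 3))
```

With balanced stages the speedup approaches the number of stages (5) as the instruction count grows; with the unbalanced stages it approaches 8 / 2 = 4, and for short runs the fill and drain time keeps it well below either limit.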

Pipelining MIPS

What makes it hard?

• structural hazards: different instructions, at different stages in the pipeline, want to use the same hardware resource
• control hazards: the next instruction to put into the pipeline depends on the outcome of a previous branch instruction that is already in the pipeline
• data hazards: an instruction in the pipeline requires data to be computed by a previous instruction that is still in the pipeline

Before actually building the pipelined datapath and control, we first briefly examine these potential hazards individually…

Structural Hazards

• Structural hazard: inadequate hardware to simultaneously support all instructions in the pipeline in the same clock cycle
• E.g., suppose the pipeline below has a single (not separate) instruction and data memory with one read port; then there is a structural hazard between the first and fourth lw instructions, because the fourth lw is fetched in the same cycle in which the first lw accesses its data
• MIPS was designed to be pipelined: structural hazards are easy to avoid!

[Figure: pipelined execution of lw $1, 100($0); lw $2, 200($0); lw $3, 300($0); lw $4, 400($0), each starting 2 ns after the previous one and passing through Instruction fetch, Reg, ALU, Data access, Reg. Annotation: "Hazard if single memory".]
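The conflict between the first and fourth lw can be found mechanically by recording which instruction needs memory in which cycle. The following is a sketch under simple assumptions: one instruction enters the pipeline per cycle, only lw and sw use the data-access stage, and all names (`memory_conflicts`, `unified_memory`) are made up for this illustration.

```python
from collections import defaultdict

STAGES = ["IF", "ID", "EX", "MEM", "WB"]


def memory_conflicts(ops, unified_memory=True):
    """Return [(cycle, [instruction indices])] where memory is needed twice."""
    users = defaultdict(list)
    for i, op in enumerate(ops):
        fetch_cycle = i + 1  # instruction i (0-based) is fetched in cycle i + 1
        users[("mem" if unified_memory else "imem", fetch_cycle)].append(i)
        if op in ("lw", "sw"):
            data_cycle = fetch_cycle + STAGES.index("MEM")
            users[("mem" if unified_memory else "dmem", data_cycle)].append(i)
    return sorted(
        (cycle, idxs) for (_, cycle), idxs in users.items() if len(idxs) > 1
    )


print(memory_conflicts(["lw"] * 4))                        # single memory
print(memory_conflicts(["lw"] * 4, unified_memory=False))  # separate memories
```

With a single memory the only clash is in cycle 4, between instruction 0 (data access) and instruction 3 (instruction fetch); with separate instruction and data memories the list is empty.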

Control Hazards

Solution 2: Predict the branch outcome, e.g., predict branch-not-taken:

[Figure: two pipelined timelines in program execution order (stages Instruction fetch, Reg, ALU, Data access, Reg; time axis from 2 to 14 ns, interval labels 2 ns and 4 ns).
Prediction success: beq $1, $2, 40; add $4, $5, $6; lw $3, 300($0) proceed without bubbles.
Prediction failure: beq $1, $2, 40; add $4, $5, $6; or $7, $8, $9, with bubbles inserted. Annotation: undo (= flush) lw.]
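The cost of predict-not-taken can be estimated with a one-line model. This is only a sketch: the branch fraction, taken fraction, and penalty used below are hypothetical inputs, not figures from the lecture, and the base CPI of 1 assumes an otherwise ideal pipeline.

```python
def average_cpi(branch_fraction, taken_fraction, penalty_cycles, base_cpi=1.0):
    """Average cycles per instruction under predict-not-taken."""
    # The prediction is wrong exactly when a branch is taken, and each wrong
    # prediction costs penalty_cycles of flushed (bubble) slots.
    return base_cpi + branch_fraction * taken_fraction * penalty_cycles


# Hypothetical workload: 20% branches, half of them taken, 1-cycle penalty.
cpi = average_cpi(branch_fraction=0.2, taken_fraction=0.5, penalty_cycles=1)
print(cpi, "cycles per instruction;", cpi * 2, "ns per instruction at 2 ns")
```

When no branch is taken the prediction always succeeds and the CPI stays at the base value.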

Control Hazards

Solution 3: Delayed branch: always execute the sequentially next statement, with the branch taking effect after a one-instruction delay. It is the compiler's job to find a statement that is independent of the branch outcome and can be put in the slot.

MIPS does this, but it is an option in SPIM (Simulator -> Settings)

[Figure: pipelined timeline in program execution order of beq $1, $2, 40; add $4, $5, $6 (delayed branch slot); lw $3, 300($0), each starting 2 ns after the previous one.]

Delayed branch: beq is followed by an add that is independent of the branch outcome
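The effect of the delay slot on the order of execution can be illustrated with a toy interpreter. This is a sketch, not a MIPS simulator: it knows only add and beq, branch targets are absolute instruction indices rather than PC-relative offsets, and the program and register values are invented for the example.

```python
def run(program, regs, delayed_branch=True, max_steps=100):
    """Execute a tiny MIPS-like program; return the executed instruction indices."""
    trace = []
    pc = 0
    pending_target = None  # target of a taken branch whose delay slot is running
    for _ in range(max_steps):
        if pc >= len(program):
            break
        op, *args = program[pc]
        trace.append(pc)
        next_pc = pc + 1
        if pending_target is not None:
            # This instruction is the delay slot; the branch takes effect after it.
            next_pc, pending_target = pending_target, None
        if op == "add":
            rd, rs, rt = args
            regs[rd] = regs[rs] + regs[rt]
        elif op == "beq":
            rs, rt, target = args
            if regs[rs] == regs[rt]:
                if delayed_branch:
                    pending_target = target
                else:
                    next_pc = target
        pc = next_pc
    return trace


program = [
    ("beq", "$1", "$2", 3),      # taken, because $1 == $2 below
    ("add", "$4", "$5", "$6"),   # delay slot: independent of the branch outcome
    ("add", "$7", "$4", "$4"),   # skipped when the branch is taken
    ("add", "$8", "$4", "$4"),   # branch target
]
regs = {"$1": 1, "$2": 1, "$4": 0, "$5": 2, "$6": 3, "$7": 0, "$8": 0}
print(run(program, dict(regs)))                        # [0, 1, 3]
print(run(program, dict(regs), delayed_branch=False))  # [0, 3]
```

With the delayed branch, the add in the slot runs even though the branch is taken, which is why the compiler must choose an instruction whose effect is wanted on both paths.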

Data Hazards

Forwarding may not be enough, e.g., if an R-type instruction following a load uses the result of the load. This is called a load-use data hazard.

[Figure: two pipeline diagrams (IF, ID, EX, MEM, WB) for lw $s0, 20($t1) followed by sub $t2, $s0, $t3, in program execution order against time.
Without a stall, it is impossible to provide input to the sub instruction in time.
With a one-stage stall (a bubble in each stage), forwarding can get the data to the sub instruction in time.]
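Why a load needs one stall while an R-type producer needs none follows from when each result exists. The sketch below assumes a five-stage pipeline with full forwarding, in which an R-type result is ready at the end of EX, a loaded value at the end of MEM, and the consumer needs its operand at the start of its own EX; the function name is illustrative.

```python
STAGES = ["IF", "ID", "EX", "MEM", "WB"]


def stalls_with_forwarding(producer_op, distance=1):
    """Stall cycles for a consumer issued `distance` instructions after the producer."""
    # Stage at whose end the producer's value can be forwarded.
    ready_stage = "MEM" if producer_op == "lw" else "EX"
    ready_cycle = 1 + STAGES.index(ready_stage)       # producer is fetched in cycle 1
    consumer_ex_cycle = 1 + distance + STAGES.index("EX")
    # The value must be ready by the end of the cycle before the consumer's EX.
    return max(0, ready_cycle - (consumer_ex_cycle - 1))


print(stalls_with_forwarding("lw"))              # lw followed directly by sub
print(stalls_with_forwarding("add"))             # R-type producer
print(stalls_with_forwarding("lw", distance=2))  # one unrelated instruction between
```

The first call reports one stall, matching the bubble in the diagram; the other two report none.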

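Reordering Code to Avoid Pipeline Stall (Software Solution)

Example:

lw $t0, 0($t1)
lw $t2, 4($t1)
sw $t2, 0($t1)    # data hazard: $t2 is used right after it is loaded
sw $t0, 4($t1)

Reordered code (the two sw instructions are interchanged; they write different addresses, so the result is the same):

lw $t0, 0($t1)
lw $t2, 4($t1)
sw $t0, 4($t1)
sw $t2, 0($t1)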
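The benefit of the reordering can be checked by counting load-use pairs in each sequence. This sketch follows the slide in treating any use of a just-loaded register by the very next instruction as a stall; the parsing is deliberately naive and handles only lw, sw, and three-register instructions.

```python
import re


def parse(line):
    """Split 'lw $t0, 0($t1)' into ('lw', ['$t0', '$t1'])."""
    op = line.split()[0]
    return op, re.findall(r"\$\w+", line)


def load_use_stalls(lines):
    """Count instructions that use a register loaded by the instruction before."""
    stalls = 0
    for prev, cur in zip(lines, lines[1:]):
        prev_op, prev_regs = parse(prev)
        cur_op, cur_regs = parse(cur)
        if prev_op != "lw":
            continue
        loaded = prev_regs[0]  # destination register of the load
        # sw reads every register it names; other instructions write the first.
        sources = cur_regs if cur_op == "sw" else cur_regs[1:]
        if loaded in sources:
            stalls += 1
    return stalls


original = ["lw $t0, 0($t1)", "lw $t2, 4($t1)", "sw $t2, 0($t1)", "sw $t0, 4($t1)"]
reordered = ["lw $t0, 0($t1)", "lw $t2, 4($t1)", "sw $t0, 4($t1)", "sw $t2, 0($t1)"]
print(load_use_stalls(original), load_use_stalls(reordered))  # 1 0
```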

Review - Single-Cycle Datapath “Steps”

[Figure: single-cycle datapath divided into five steps: Instruction Fetch (PC, ADD, Instruction Memory with ADDR and RD), Instruction Decode (Register File with 5-bit RN1, RN2, WN inputs, WD and RD; 16-to-32-bit EXTND), Execute/Address Calc (ALU with Zero output), Memory Access (Data Memory with ADDR, WD, RD), and Write Back.]

Pipelined Datapath – Key Idea

• What happens if we break the execution into multiple cycles, but keep the extra hardware?
  - Answer: We may be able to start executing a new instruction at each clock cycle: pipelining
• …but we shall need extra registers to hold data between cycles: pipeline registers
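The key idea can be mimicked in software by keeping one slot per stage and shifting the slots on every clock edge, which is the job the pipeline registers do in hardware. The sketch below tracks only which instruction occupies which stage, not the data carried along; the function name and the use of None for an empty slot are choices made for this example.

```python
STAGES = ["IF", "ID", "EX", "MEM", "WB"]


def simulate(instructions):
    """Return, per clock cycle, the instruction held in each stage (None = empty)."""
    slots = [None] * len(STAGES)  # slots[k] is the instruction now in stage k
    pending = list(instructions)
    history = []
    while pending or any(s is not None for s in slots):
        # Clock edge: every instruction advances one stage, a new one is fetched.
        slots = [pending.pop(0) if pending else None] + slots[:-1]
        if all(s is None for s in slots):
            break  # the last instruction has left WB
        history.append(tuple(slots))
    return history


for cycle, slots in enumerate(simulate(["lw $1", "lw $2", "lw $3"]), start=1):
    print(cycle, dict(zip(STAGES, slots)))
```

Three instructions occupy the pipeline for seven cycles: a new one starts in each of the first three cycles, and the last leaves WB in cycle 7.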
