


Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
The questions and answers for exam iii of the advanced computer architecture course (ece 4100/6100) held in summer 2004. The exam focuses on various aspects of cache systems and memory access, including cache sizes, cache miss penalties, and cache conflict resolution. Students are required to apply their knowledge of cache systems and memory access to solve problems and answer questions related to cache hit rates, average memory access times, and loop unrolling.
Typology: Exams
1 / 4
This page cannot be seen from the preview
Don't miss anything!



1_. (10 points)_ What limits the size and complexity of the L1 cache and why is an L2 (and even an L3) in common use today? The L1 cache is in the critical delay path for clock cycle time on most processors. If it gets too large (larger memory is slower) everything else would slow down. Adding larger L2 or L3 cache levels keeps L1 hits as fast as possible and reduces the miss penalty on an L1 miss.
Instruction producing result Instruction using result Latency in clock cycles FP. ALU Op FP. ALU Op 3 FP. ALU Op Store Double 2 Int. ALU Op Any 1 Load Double FP. ALU Op; 2 Load Double Store Double 0 LOOP: L.D F2, 0(R1) SUB.D F6, F4, F ADD.D F4, F6, F S.D F4, 0(R1) DADDIU R1, R1, # BNE R1, R3 LOOP Before loop unrolling a single execution requires ___15_____clocks (original code - do not reschedule)
After unrolling, a (single) execution of the original loop’s operations now requires ___4.5____clocks