Microprocessors-Computer Fundamentals and Programming-Assignment Solution, Exercises of Computer Engineering and Programming

Kapish Gupta assigned this task at Assam Don Bosco University. Submission of the task was not acceptable after due date. Assignment is related to Computer Fundamentals and Programming. It includes: Simd, Extensions, Instruction, Fpu, Produced, Cpu, Speeds, Fabrication, Size, Microprocessor, Progression, Microns

Typology: Exercises

2011/2012

Uploaded on 07/28/2012

dewansh
dewansh 🇮🇳

4.4

(10)

89 documents

1 / 3

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
SSE2 SIMD extensions One of the best ways to counter a low IPC is
to make each instruction do more work. To that end, Intel has added a set
of 144 new instructions, dubbed SSE2, to the mix. Like the original
Streaming SIMD Extensions, SSE2 involves executing a single instruction
on multiple data targets (hence SIMD) at once. Most importantly, SSE2
handles 128-bit, double-precision floating-point math. The ability to
handle more precise floating-point numbers makes SSE2 just the ticket
for accelerating a host of multimedia, 3D, engineering, and workstation-
type tasksonce software is properly optimized to take advantage of it.
A relatively simple FPU The Pentium 4's floating-point unit isn't as
capable as the FPU in the Pentium III, and it's quite a bit less capable
than the Athlon's hoss FPU. To put it simply, the P4's FPU can't do as
much work at once, and it has higher latencies in some cases. Programs
optimized for SSE2 will be able to bypass the P4's FPU in many cases, but
without special optimizations, the P4 will have a hard time keeping up.
Produced: Late 2000
Manufacturer: Intel
CPU Speeds: 1.5 GHz to 3 GHz
FSB: 200 Mhz to 500 Mhz
Fabrication Size: 0.25 µm to 0.13 µm
Architecture: x86
docsity.com
pf3

Partial preview of the text

Download Microprocessors-Computer Fundamentals and Programming-Assignment Solution and more Exercises Computer Engineering and Programming in PDF only on Docsity!

SSE2 SIMD extensions — One of the best ways to counter a low IPC is to make each instruction do more work. To that end, Intel has added a set of 144 new instructions, dubbed SSE2, to the mix. Like the original Streaming SIMD Extensions, SSE2 involves executing a single instruction on multiple data targets (hence SIMD) at once. Most importantly, SSE handles 128-bit, double-precision floating-point math. The ability to handle more precise floating-point numbers makes SSE2 just the ticket for accelerating a host of multimedia, 3D, engineering, and workstation- type tasks—once software is properly optimized to take advantage of it.  A relatively simple FPU — The Pentium 4's floating-point unit isn't as capable as the FPU in the Pentium III, and it's quite a bit less capable than the Athlon's hoss FPU. To put it simply, the P4's FPU can't do as much work at once, and it has higher latencies in some cases. Programs optimized for SSE2 will be able to bypass the P4's FPU in many cases, but without special optimizations, the P4 will have a hard time keeping up.  Produced: Late 2000  Manufacturer: Intel  CPU Speeds: 1.5 GHz to 3 GHz  FSB: 200 Mhz to 500 Mhz  Fabrication Size: 0.25 μm to 0.13 μm  Architecture: x

Microprocessor Progression: Intel

The following table helps to understand the differences between the different processors that Intel has introduced over the years.

Name Date Transistors Microns Clock Speed Data Width MIPS Pentium III 1999 9,500,000 0.25 450 MHz 32 bits 64-bit bus

Pentium IV 2000 42,000,000 0.18 1.5 GHz 32 bits 64-bit bus

 The date is the year that the processor was first introduced. Many processors are re-introduced at higher clock speeds for many years after the original release date.  Transistors are the number of transistors on the chip. You can see that the number of transistors on a single chip has risen steadily over the years.  Microns is the width, in microns, of the smallest wire on the chip. For comparison, a human hair is 100 microns thick. As the feature size on the chip goes down, the number of transistors rises.  Clock speed is the maximum rate that the chip can be clocked at. Clock speed will make more sense in the next section.  Data Width is the width of the ALU. An 8-bit ALU can add/subtract/multiply/etc. two 8-bit numbers, while a 32-bit ALU can manipulate 32-bit numbers. An 8-bit ALU would have to execute four instructions to add two 32-bit numbers, while a 32-bit ALU can do it in one instruction. In many cases, the external data bus is the same width as the ALU, but not always. The 8088 had a 16-bit ALU and an 8-bit bus, while the modern Pentiums fetch data 64 bits at a time for their 32-bit ALUs.  MIPS stands for "millions of instructions per second" and is a rough measure of the performance of a CPU. Modern CPUs can do so many different things that MIPS ratings lose a lot of their meaning, but you