



















Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Amdahl's Law, Character Performance Peak, MIPS, MPLOPS, Benchmarking, CPU Performance Measure
Typology: Slides
1 / 27
This page cannot be seen from the preview
Don't miss anything!




















Amdahl’s Law
Benchmarking (lying with numbers)
class23.ppt
class23.ppt
time NY->Paris
total trip time
speedup over 747
8.5 hours
16.5 hours
3.75 hours
11.75 hours
CS 213 F’
class23.ppt
enhanced
2
(fraction of original time that can be improved)
enhanced
2
2
(speedup of enhanced part)
1
’ + T
2
’ = T
1
2
enhanced
2
’
enhanced
2
enhanced
[by def of S
enhanced
enhanced
enhanced
enhanced
[by def of F
enhanced
enhanced
enhanced
enhanced
overall
enhanced
enhanced
enhanced
class23.ppt
time NY->Paris
total trip time
speedup over 747
8.5 hours
16.5 hours
3.75 hours
11.75 hours
rocket
0.25 hours
8.25 hours
rip
0.0 hours
8 hours
class23.ppt
lack of operating system and language standards
difficult to develop portable and realistic applications
era of meaningless rates (e.g, MIPS)
age of meaningless benchmarks (e.g., Whetstone)
dawn of semi-realistic benchmarks (e.g., SPEC CPU95)
class23.ppt
T secs = I inst x (C cycles/I inst) x 1/(MHz x 10^6) cycles/sec
MHz
System
SPECfp95 time (secs)
Pentium Pro
Alder
CS 213 F’
class23.ppt
floating point operations do not predict running time
even if the did, the ideal instruction stream is usually unrealistic
1d fft
sasum
saxpy
sdot
sgemm sgemv
spvma
%peak
class23.ppt
(1) Choose representative benchmark programs.
(2) Choose an individual performance measure (for each benchmark)
(3) Choose an aggregate performance measure (for all benchmarks)
class23.ppt
10-100 line
e.g.,: sieve, puzzle, quicksort
attempt to match average frequencies of real workloads
e.g., Whetstone, Dhrystone
Time critical excerpts of REAL programs
element models.compression, sparse matrix vector product from unstructured finitee.g., 8x8 Discrete Cosine Transform (DCT) from JPEG and MPEG
class23.ppt
www.specbench.org/osg/
“Egads! That is an 8 MIPS machine, but they claim 10 MIPS!”
Sun, MIPS, HP, Apollo, DEC
some real programs, includes OS calls, some I/O
SGI, SunCompaq, Cray, DEC, HP, Hitachi, IBM, Intel, Motorola, Netscape,
class23.ppt
benchmark
description
go
plays a game of go
m88ksim
Motorola 88k chip simulator
gcc
Gnu C compiler
compress
in-memory LZW file compression
li
Lisp interpreter
jpeg
spectral based image compression/decompression
perl
Perl program that manipulates strings and primes
vortex
database program
class23.ppt
benchmark
description
tomcatv
mesh generation program
swim
513x513 shallow water finite difference model
su2cor
Monte Carlo simulation
hydro2d
2D Navier-Stokes solver
mgrid
3D multigrid solver
applu
parabolic/elliptic PDE solver
turb3d
turbulence model
apu
air pollution model
fppp
quantum chemistry model
wave
electromagnetic particle model
CS 213 F’
class23.ppt
frames
sys A
sys B
sys C
prog 1
20 secs
10 secs
40 secs
prog 2
40 secs
80 secs
20 secs
total
60 secs
90 secs
60 secs
Total running time is the ultimate performance measure.
CS 213 F’
class23.ppt
seconds
frames
sys A
sys B
sys C
prog 1
prog 2
total
normalized to A
normalized to B
normalized to C
Normalized total running time is OK too. It tracks with total running time.