Professional Documents
Culture Documents
Module 1 Chapter3
Module 1 Chapter3
Performance measures
Speedup laws
Scalability principles
Scaling up vs. scaling down
Performance metrics and measures
2
Parallelism profiles
Asymptotic speedup factor
System efficiency, utilization and quality
Standard performance measures
Degree of parallelism
3
Algorithm structure
Program optimization
Resource utilization
Run-time conditions
Realistically limited by # of available processors,
memory, and other nonprocessor resources
Average parallelism variables
5
n – homogeneous processors
m – maximum parallelism in a profile
- computing capacity of a single processor
(execution rate only, no overhead)
DOP=i – # processors busy during an observation
period
Average parallelism
6
t2
W DOP (t )dt
t1
m
W i ti
i 1
Average parallelism
7
1 t2
A
t 2 t1 t 1
DOP (t )dt
m
m
A i ti / ti
i 1 i 1
Example: parallelism profile and average
parallelism
8
Asymptotic speedup
9
m
m
T (1) ti (1)
m
Wi
T (1) W i
i 1 i 1 S i 1
T ( ) m
m
T ( ) ti ( )
m
Wi W / i
i 1
i
m
Ra Ri / m Arithmetic mean execution rate
(assumes equal weighting)
i 1
m
R ( f i Ri )
* Weighted arithmetic mean
a execution rate
i 1
m
Rg R 1/ m
i
Geometric mean execution rate
i 1
m
R Ri
*
g
fi Weighted geometric mean
execution rate
i 1
-does not summarize the real performance since it does
not have the inverse relation with the total time
Harmonic mean performance
13
1 m 1 m 1
Ta Ti Arithmetic mean execution time
m i 1 m i 1 Ri per instruction
Harmonic mean performance
14
m
Rh 1 / Ta m Harmonic mean execution rate
(1 / R )
i 1
i
1
R
*
h m
Weighted harmonic mean execution rate
( f
i 1
i / Ri )
1
S T1 / T
*
n
i 1
f i / Ri
Harmonic Mean Speedup Performance
16
Amdahl’s Law
17
Assume Ri = i, w = (, 0, 0, …, 1- )
System is either sequential, with probability , or
fully parallel with prob. 1-
n
Sn
1 (n 1)
Implies S 1/ as n
Speedup Performance
18
System Efficiency
19
S ( N ) T (1) / T (n)
S (n) T (1)
E ( n)
n nT (n)
Redundancy and Utilization
20
O ( n)
U ( n) R ( n ) E ( n)
nT (n)
Quality of Parallelism
21
3
S (n) E (n) T (1)
Q(n) 2
R(n) nT (n)O(n)
Example of Performance
22
Dhrystone results
Measure of integer performance
Whestone results
Measure of floating-point performance
Drug design
High-speed civil transport
Ocean modeling
Ozone depletion research
Air pollution
Digital anatomy
Application Models for Parallel Computers
25
Fixed-load model
Constant workload
Fixed-time model
Demands constant program execution time
Fixed-memory model
Limited by the memory bound
26
Algorithm Characteristics
27
E
w( s ) h( s, n)
1 E
C = E/(1-E) is constant for fixed E
f E (n) C h(s, n)
Speedup Performance Laws
30
Amdahl’s law
for fixed workload or fixed problem size
Gustafson’s law
for scaled problems (problem size increases with increased
machine size)
Speedup model
for scaled problems bounded by memory capacity
Amdahl’s Law
31
Wi i
ti (i ) Wi
i n ti ( n) ti ( )
i
m
Wi i
T ( n)
i 1 i n m
T (1) W i
Sn i i
m
Wi i
T ( n)
n
i 1 i
Amdahl’s Law
33
Gustafson’s Law
34
i
m m '
Wi
i 1
Wi
i 1 i
n Q ( n )
m'
W i
'
W1 nWn
S ' i 1
W1 Wn
n m
W
i 1
i
Gustafson’s Scaled Speedup
37
Memory Bounded Speedup Model
38
m*
W i
*
W1 G (n)Wn
S
* i 1
W1 G (n)Wn / n
n m*
Wi* i
i 1 i
n Q ( n )
Fixed-Memory Speedup
40
T ( s,1)
S ( s , n)
T ( s , n ) h( s , n )
S ( s, n)
E ( s, n)
n
Scalable Systems
45
S ( s, n) TI ( s, n)
( s, n)
S I ( s, n) T ( s, n)