SPH With MPI - Matthew Anderson - sph2
? 29 November 2001 – p.1/15
Parallelization Issues
Review of Parallelization Issues
[Figure: schematic with legend entries "Fluid" and "Fluid mass element (particle)", alongside a plot of f(h) against h for h from 0.0 to 2.0, with f(h) between 0.0 and 0.6.]
Smoothing Length
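The slide text does not say which smoothing kernel was used. As a minimal sketch of how a smoothing length h limits each particle's influence, the function below assumes the cubic spline kernel that is common in SPH, with the 1D normalization 2/(3h). The function name and the choice of kernel are assumptions for illustration, not taken from the slides.

```python
def cubic_spline_kernel_1d(r, h):
    """Cubic spline SPH kernel in 1D (assumed kernel, not from the slides).

    r is the separation between two particles and h is the smoothing length.
    The kernel has compact support: it is exactly zero for |r| >= 2h.
    """
    q = abs(r) / h
    sigma = 2.0 / (3.0 * h)  # 1D normalization so the kernel integrates to 1
    if q < 1.0:
        return sigma * (1.0 - 1.5 * q**2 + 0.75 * q**3)
    if q < 2.0:
        return sigma * 0.25 * (2.0 - q) ** 3
    return 0.0
```

Because the kernel vanishes beyond 2h, only particles within that distance contribute to a sum, which is why a nearest neighbor search is needed at all.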
Parallel Nearest Neighbor Finder
Parallel Nearest Neighbor Finder
[Figure: diagram labeled hb]
Parallel Nearest Neighbor Finder
Nearest neighbors
Neighbors of a nearest neighbor
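One way to read the two categories above is as a candidate set: when particles move, the new neighbors of a particle are looked for only among its previous nearest neighbors and the neighbors of those neighbors. The sketch below illustrates that reading. The function name, the dictionary layout of `old_neighbors`, and the cutoff `radius` are all hypothetical, and the slides do not confirm that the search was implemented this way.

```python
import math

def refresh_neighbors(positions, old_neighbors, i, radius):
    """Return the sorted indices of particles within `radius` of particle i.

    Only the previous neighbors of i and the neighbors of those neighbors
    are tested (assumed scheme), so a particle outside that two-hop set is
    never found, however close it has moved.
    """
    candidates = set(old_neighbors[i])
    for j in old_neighbors[i]:
        candidates.update(old_neighbors[j])
    candidates.discard(i)  # a particle is not its own neighbor
    return sorted(
        j for j in candidates
        if math.dist(positions[i], positions[j]) <= radius
    )

# Hypothetical 1D example: five particles and their previous neighbor lists.
positions = [(0.0,), (1.0,), (1.5,), (3.0,), (10.0,)]
old_neighbors = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2], 4: []}
print(refresh_neighbors(positions, old_neighbors, 0, radius=1.6))
```

The cost per particle depends on the size of the neighbor lists rather than on the total particle count, at the price of missing particles that arrive from outside the two-hop set.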
Trade-Offs: Search by Index
Ghost zones = … particles to exchange (for an … simulation)
The index search works well when the initial data is decomposed so that neighboring particles have neighboring indices.
The index search performs well with random velocity distributions in the initial data.
The index search will not work for highly dynamic cases.
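The trade-off described above can be sketched with a search that tests only particles whose indices lie within a fixed window of the particle's own index. This is one plausible reading of "search by index"; the function name and the `window` and `radius` parameters are hypothetical and do not come from the slides.

```python
import math

def index_window_neighbors(positions, i, radius, window):
    """Return indices j with |j - i| <= window that lie within `radius` of i.

    Assumed scheme: particles with nearby indices are expected to be nearby
    in space, so only an index window around i is tested.
    """
    lo = max(0, i - window)
    hi = min(len(positions), i + window + 1)
    return [
        j for j in range(lo, hi)
        if j != i and math.dist(positions[i], positions[j]) <= radius
    ]

# Hypothetical 1D data in which index order matches spatial order.
positions = [(float(k),) for k in range(10)]
print(index_window_neighbors(positions, 5, radius=2.0, window=3))  # [3, 4, 6, 7]
```

If a particle with a distant index moves close to particle 5, for example index 9 moving to x = 5.5, it falls outside the window and is missed. That is the failure mode for highly dynamic cases.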
Trade-Offs: Variable H
… particles to exchange (for an … simulation)
Increases chance for a numerical instability
Will not work for highly dynamic cases
Trade-Offs: Search the Neighbors
Performance Results
Particle index search: test run
[Figure: Scalability with Fixed Problem Size. Execution time (axis from 0.0 to 300.0) against number of processors (1 to 7), with an "Ideal" curve for comparison.]
Performance Results
[Figure: Performance with 4 Processors. Computation time (axis from 0.0 to 500.0) against problem size N x N x N, for N from 20 to 120.]
Regression indicates computation time of
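The fitted expression on this slide was lost in extraction and is not reconstructed here. As a sketch of how such a regression could be carried out, the function below fits a power law t ≈ c·N^k by least squares in log-log space. The power-law form, the function name, and the sample timings are all assumptions; the timings are synthetic and are not the measurements behind the slide.

```python
import math

def fit_power_law(sizes, times):
    """Least-squares fit of times ~ c * sizes**k in log-log space.

    Returns (c, k). The power-law model is an assumption for illustration.
    """
    xs = [math.log(n) for n in sizes]
    ys = [math.log(t) for t in times]
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxx = sum((x - x_mean) ** 2 for x in xs)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    k = sxy / sxx                      # slope = exponent of the power law
    c = math.exp(y_mean - k * x_mean)  # intercept = log of the prefactor
    return c, k

# Synthetic timings generated from a made-up law, used only to exercise the fit.
sizes = [20, 40, 60, 80, 100, 120]
times = [2e-4 * n**3 for n in sizes]
c, k = fit_power_law(sizes, times)
print(f"c = {c:.3g}, k = {k:.3f}")
```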
Performance Results
[Figure: Efficiency for 51 x 51 x 51. Efficiency (axis from 0.80 to 1.00) against number of processors (1 to 7).]
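The slides do not define efficiency. The sketch below assumes the usual definition: speedup is the one-processor time divided by the p-processor time, and efficiency is speedup divided by p. The timings in the example are hypothetical and are not read from the plots.

```python
def speedup(t_serial, t_parallel):
    """Speedup of a parallel run relative to the one-processor run."""
    return t_serial / t_parallel

def efficiency(t_serial, t_parallel, n_procs):
    """Parallel efficiency: speedup divided by the number of processors."""
    return speedup(t_serial, t_parallel) / n_procs

# Hypothetical timings: 280 s on one processor, 80 s on four.
print(speedup(280.0, 80.0))        # 3.5
print(efficiency(280.0, 80.0, 4))  # 0.875
```

An efficiency of 1.00 corresponds to the ideal case in which execution time falls in exact proportion to the number of processors.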
Performance Comments
Work In Progress