Professional Documents
Culture Documents
1100-1130 Nathan Rutman MapReduce Lug 2011
1100-1130 Nathan Rutman MapReduce Lug 2011
2
Map Reduce overview
Using Lustre with Apache Hadoop, Sun Microsystems
3
Apache Hadoop disk usage
Using Lustre with Aparche Hadoop, Sun Microsystems
4
Other Studies: Hadoop with PVFS
Crossing the Chasm: Sneaking a Parallel File System Into Hadoop , Carnegie Mellon
250
200
150
100
50
0
Series1
6
A Critical Oversight
7 LUG 2011
Cluster Setup: HDFS vs Lustre
IB Switch
OSS OSS
80MB/s 1GB/s
Lustre Setup
IB Switch
OSS OSS
80MB/s 1GB/s
HDFS Setup
IB Switch
80MB/s 1GB/s
Theoretical Comparison: HDFS vs Lustre
13 LUG 2011
Striping on MR jobs
14 LUG 2011
Replication
15 LUG 2011
Data Locality
16 Xyratex Confidential
MR I/O Benchmark
17 LUG 2011
MR Sort Benchmark
18 LUG 2011
MR tuning
Data from Hadoop Performance Tuning: A case study Berkeley 6/09
19
Lustre Tuning: TestDFSIO
20 LUG 2011
Data Staging: Not a Fair Comparison
21 LUG 2011
Hypothetical Cost Comparison
22 LUG 2011
Cost Considerations
23 LUG 2011
Conclusions
24 LUG 2011
Fini
Thanks!
25 LUG 2011