What Is A Distributed File System?: Why Are There So Many?

You might also like

Download as pdf or txt
Download as pdf or txt
You are on page 1of 26

What is a Distributed File

System?:

Why are there so many?


After this video you will be able to..
• Describe what a file system is
• Explain the reasons to have distributed file
systems and how it helps big data analysis
• Visualize the organization of data in
physical compute and storage nodes for
big data architectures
Long-term information storage
Long-term information storage

Access result of a process later


Long-term information storage

Access result of a process later


Store large amounts of information
Long-term information storage

Access result of a process later


Store large amounts of information
Enable access of multiple processes
File
Accessing files
5TB

1TB
256GB
64GB
5TB

1TB
256GB
64GB

? What if you have more data?


? Buy a bigger disk?

OR

? Copy data to an external hard drive?


WORK PERSONAL
Rack
Distributed File System (DFS)
Rack
Data
1 2 3 4 5

2 5
Rack

1
3 4
Data
1 2 3 4 5

Analyze part 5 here!


2 5
Rack

1
3 4
Data
1 2 3 4 5

4 3 2 1 5
Rack

5 1 4 2 3

3 2 5 4 1

1 4 3 5 2

2 5 1 3 4
Data
1 2 3 4 5

4 3 2 1 5
Rack

5 1 4 2 3

3 2 5 4 1

1 4 3 5 2

2 5 1 3 4
Data High Concurrency
1 2 3 4 5
vs.
Low Consistency

5: Reader 1
4 3 2 1 5
Rack

5 1 4 2 3

3 2 5 4 1

1 4 3 5 2
5: Reader 2
2 5 1 3 4
5: Reader 3
Rack
Data partitioning
Distributed File System (DFS)
Data replication
Rack

Data scalability
Fault tolerance
High concurrency

You might also like