Professional Documents
Culture Documents
SQL Dbms
SQL Dbms
Image Journal
Name Node
Job Tracker Checkpoint
HDFS
Client
Job
Tracker Journal
Inode Image
Checkpoint
Inode - Files and directories are represented on the
NameNode, which record attributes like permissions,
modification and access times, namespace and disk
space quotas.
Name
Data Node Node
DATA NODE
Total
Fraction #Data Transfers
Storage
Storage In Progress
Capacity
Commands
HDFS CLIENT
IMAGE & JOURNAL
Multiple Reader
DATA WRITE OPERATION
client DN1 DN2 DN3
setup
Client Name Node
packet1
DN1 packet2
packet3
DN2 packet4
packet5
DN3
close
DN4
DATA WRITE/READ OPERATION
Client
Limit and Hard Limit)
Name Node
Pipelining, Buffering and
Hflush
DN1
Checksum for data
integrity
Choosing nodes for read
operation
BLOCK PLACEMENT
Name Node
Add(data)
Client Inode Image
Data Nodes for Replica
checkpoint
Journal
RACK
RACK1
3
DN1 DN2 DN3 DN4 DN5 D11 D12 D13 D14 D15
RACK2
Inode Image
/
Journal checkpoint
RACK1 RACK3
DN1 DN2 DN3 DN4 DN5 D11 D12 D13 D14 D15
Under Replicated
DN6 DN7 DN8 DN9 D10
BALANCER
Balancing the disk space utilization on individual
data nodes.
Based on utilization threshold.