
Full Name: EL AMINE MEHDI

Apache Hadoop is an open-source, scalable, and fault-tolerant framework written in Java. It efficiently processes large volumes of data on a cluster of commodity hardware. Hadoop is not only a storage system; it is a platform for both large-scale data storage and processing.
In this lecture, we take a look at how Apache Hadoop works under the hood. When Apache Hadoop is fed a huge file, the framework divides that chunk of big data into smaller pieces and stores them across multiple machines to be processed in parallel. That is why Hadoop interconnects an army of widely available and relatively inexpensive machines that form a Hadoop cluster. No matter what the size of the file the user feeds to Hadoop, every Hadoop cluster accommodates three functional layers: the Hadoop Distributed File System (HDFS) for data storage, Hadoop MapReduce for processing, and Hadoop YARN for resource management.
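To make this concrete, here is a minimal sketch in Java (Hadoop's own language) that copies a local file into HDFS through the standard FileSystem API; once the file is written, the framework transparently splits it into blocks and distributes them across the cluster. The cluster address hdfs://localhost:9000 and both file paths are illustrative assumptions, not values from the lecture.

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class HdfsUpload {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        // Assumed NameNode address; adjust to your cluster's fs.defaultFS.
        conf.set("fs.defaultFS", "hdfs://localhost:9000");

        try (FileSystem fs = FileSystem.get(conf)) {
            // Hadoop splits this file into blocks and spreads the blocks
            // (and their replicas) across the DataNodes of the cluster.
            fs.copyFromLocalFile(new Path("/tmp/bigdata.log"),
                                 new Path("/user/data/bigdata.log"));
        }
    }
}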
We then get a brief introduction to HDFS, a distributed file system that follows a master/slave architecture. It consists of a single NameNode and many DataNodes. In the HDFS architecture, a file is divided into one or more blocks of 128 MB (the size can be changed in the configuration) and stored in separate DataNodes. DataNodes are responsible for operations such as block creation, deletion, and replication according to NameNode instructions. Apart from that, they are responsible for performing read and write operations on behalf of file system clients.
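As an illustration of this block layout, the following sketch asks the NameNode where each block of a file lives. The getFileBlockLocations call is part of the stock FileSystem API; the path /user/data/bigdata.log is simply the assumed file from the previous example.

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.BlockLocation;
import org.apache.hadoop.fs.FileStatus;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class HdfsBlocks {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        conf.set("fs.defaultFS", "hdfs://localhost:9000"); // assumed cluster address

        try (FileSystem fs = FileSystem.get(conf)) {
            FileStatus status = fs.getFileStatus(new Path("/user/data/bigdata.log"));
            // One BlockLocation per 128 MB block; getHosts() lists the
            // DataNodes holding that block's replicas.
            for (BlockLocation block : fs.getFileBlockLocations(status, 0, status.getLen())) {
                System.out.println("offset=" + block.getOffset()
                        + " length=" + block.getLength()
                        + " hosts=" + String.join(",", block.getHosts()));
            }
        }
    }
}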
The NameNode acts as the master server and the central controller for HDFS. It holds the file system metadata and maintains the file system namespace. The NameNode monitors the condition of the DataNodes and coordinates access to data.
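Since the NameNode is the metadata authority, a client can query per-file details without contacting any DataNode. The sketch below reads a few such fields through the same FileStatus object; as before, the connection details and the file path are assumptions.

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileStatus;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class HdfsMetadata {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        conf.set("fs.defaultFS", "hdfs://localhost:9000"); // assumed NameNode address

        try (FileSystem fs = FileSystem.get(conf)) {
            // All of these answers come from the NameNode's namespace
            // metadata; no DataNode is contacted for this information.
            FileStatus st = fs.getFileStatus(new Path("/user/data/bigdata.log"));
            System.out.println("size (bytes): " + st.getLen());
            System.out.println("block size:   " + st.getBlockSize());
            System.out.println("replication:  " + st.getReplication());
            System.out.println("permissions:  " + st.getPermission());
        }
    }
}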
