Download as pptx, pdf, or txt
Download as pptx, pdf, or txt
You are on page 1of 8

Hadoop Tutorial

Certified Big Data & Hadoop Training – DataFlair


Agenda

 Introduction to Hadoop

 Hadoop nodes & daemons

 Hadoop Architecture
 Characteristics
 Hadoop Features

Certified Big Data & Hadoop Training – DataFlair


What is Hadoop?

An Open Source framework that


allows distributed processing of
large data-sets across the cluster of
commodity hardware

Certified Big Data & Hadoop Training – DataFlair


What is Hadoop?

An Open Source framework that Open Source


allows distributed processing of
large data-sets across the cluster of  Source code is freely available
commodity hardware  It may be redistributed and
modified

Certified Big Data & Hadoop Training – DataFlair


What is Hadoop?

An open source framework that Distributed Processing


allows Distributed Processing of
large data-sets across the cluster of  Data is processed distributedly
commodity hardware on multiple nodes / servers
 Multiple machines processes
the data independently

Certified Big Data & Hadoop Training – DataFlair


What is Hadoop?

An open source framework that Cluster


allows distributed processing of
large data-sets across the Cluster  Multiple machines connected
of commodity hardware together
 Nodes are connected via LAN

Certified Big Data & Hadoop Training – DataFlair


What is Hadoop?

An open source framework that Commodity Hardware


allows distributed processing of
large data-sets across the cluster of  Economic / affordable
Commodity Hardware machines
 Typically low performance
hardware

Certified Big Data & Hadoop Training – DataFlair


What is Hadoop?

• Open source framework written in Java


• Inspired by Google's Map-Reduce programming model as well as its file
system (GFS)

Certified Big Data & Hadoop Training – DataFlair

You might also like