Professional Documents
Culture Documents
Presented By-: Nisha Choudhary - Priya Kamti - Chandra Kanta Singha
Presented By-: Nisha Choudhary - Priya Kamti - Chandra Kanta Singha
-NISHA CHOUDHARY
-PRIYA KAMTI
-CHANDRA KANTA SINGHA
CONTENTS
Big Data
Hadoop history
Hadoop
HDFS
MapReduce
YARN
Why Hadoop
Hadoop ecosystem
Hive
Pig
Features of Hadoop
Youtube
Digitalmedia
Healthcare/lifescience
Finance services
Law enforcement
Retail(marketing)
HADOOP HISTORY
Hadoop was primarily driven by Doug Cutting and
Tom White in 2006.
Doug Cutting’s kid named Hadoop to one of his toy
that was a yellow elephant.
HADOOP
It is an open source distributed processing
framework.
It manages data processing and storage for big
data application.
It works on clustered system
1) HDFS
2) Map Reduce
3) Yarn
HDFS(HADOOP DISTRIBUTED FILE
SYSTEM)
Primary Data storage unit in hadoop.
Used in distributed data processing environment.
2) Data consistency
No delta iteration
Latency
Not easy to use
Security
No Abstraction
Vulnerable by nature
No caching
Uncertainty
THANK YOU…