Professional Documents
Culture Documents
BigData v.3 PDF
BigData v.3 PDF
● Introduction to Big data and Hadoop Ecosystem (Illustrate the problem due to
big-data)
● Need of big data ie 3 V’s
● Hadoop framework & concept
● HDFS
○ Architecture of HDFS
○ Default Configuration
○ Fault Tolerance
○ Rack Awareness
○ Read/Write in HDFS
○ HDFS Federat
○ ion
● Component of Hadoop Ecosystem
● Hadoop 1.0 VS Hadoop 2.0 (Architecture of both version)
● Map Reduce
○ Deamons of Map reduce
○ Architecture of Map Reduce
○ Java Example (practice)
○ Optimization technique (Combiner)
○ Job submission in cluster
● Quiz
Apache Kafka
● What is Apache Kafka
● Architecture- design
▪ Describe how Apache Kafka fits in the Hadoop ecosystem
● Concepts- Working of Kafka
▪ Advantages of Kafka
▪ Where/When to use Kafka
● Producer, Broker & Consumer – Explaining its components
Apache FLUME