Format Synopsis
On
Submitted in Partial Fulfilment of the requirements for the award of the degree of
BACHELOR OF TECHNOLOGY
(CSE)
Submitted by
2. Related work
Data analysis depends on databases, so the first task was to learn MySQL and common
database queries. The next step was to learn Hadoop itself, followed by the tools built
around it, including Flume (for data extraction), Hive (for database queries), Pig (for
analysis), and Sqoop (for transferring data).
3. Methodology/Proposed Methodology
4. Plan of Work
Week    Description
Week 1  Introduction to Big Data Hadoop
Week 2  Worked on MapReduce and Introduction to HDFS
Week 3  Worked on Flume and HBase
Week 4  Worked on Hive and Pig
Week 5  Worked on Sqoop and Introduction to NoSQL
Week 6  Worked on Kafka
5. References
1. https://hadoop.apache.org/docs/stable/
2. https://pig.apache.org/
3. https://flume.apache.org/