Welcome to Scribd!

Ccs334 - Big Data Analytics

Uploaded by

0% found this document useful (0 votes)

2K views2 pages

This document outlines the course objectives, units, outcomes, experiments, and requirements for CCS334 Big Data Analytics. The course aims to help students understand the usage of Hadoop related tools for big data analytics. The 5 units cover understanding big data, NoSQL data management, basics of Hadoop, MapReduce applications, and Hadoop related tools like HBase, Pig and Hive. The course outcomes are for students to describe big data use cases, explain NoSQL management, install and use Hadoop/HDFS, perform MapReduce analytics, and use tools like HBase, Cassandra, Pig and Hive for analytics. A list of 8 experiments and the software requirements of Cassandra, Hadoop, Java, Pig, Hive

Original Description:

Original Title

Ccs334 - Big Data Analytics (1)

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as pdf or txt

0% found this document useful (0 votes)

2K views2 pages

Ccs334 - Big Data Analytics

Uploaded by

silambarasan

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as pdf or txt

Jump to Page

You are on page 1of 2

Search inside document

CCS334 BIG DATA ANALYTICS LTPC

2023
COURSE OBJECTIVES:

understand the usage of Hadoop related tools for Big Data Analytics

UNIT I UNDERSTANDING BIG DATA 5

Introduction to big data – convergence of key trends – unstructured data – industry examples of
big data – web analytics – big data applications– big data technologies – introduction to Hadoop
– open source technologies – cloud and big data – mobile business intelligence – Crowd
sourcing analytics – inter and trans firewall analytics.

UNIT II NOSQL DATA MANAGEMENT 7

Introduction to NoSQL – aggregate data models – key-value and document data models –
relationships – graph databases – schemaless databases – materialized views – distribution
models – master-slave replication – consistency - Cassandra – Cassandra data model –
Cassandra examples – Cassandra clients

UNIT IV MAP REDUCE APPLICATIONS 6

MapReduce workflows – unit tests with MRUnit – test data and local tests – anatomy of
MapReduce job run – classic Map-reduce – YARN – failures in classic Map-reduce and YARN –
job scheduling – shuffle and sort – task execution – MapReduce types – input formats – output
formats.

UNIT III BASICS OF HADOOP 6

Data format – analyzing data with Hadoop – scaling out – Hadoop streaming – Hadoop pipes –
design of Hadoop distributed file system (HDFS) – HDFS concepts – Java interface – data flow
– Hadoop I/O – data integrity – compression – serialization – Avro – file-based data structures -
Cassandra – Hadoop integration.

UNIT V HADOOP RELATED TOOLS 6

Hbase – data model and implementations – Hbase clients – Hbase examples – praxis.
Pig – Grunt – pig data model – Pig Latin – developing and testing Pig Latin scripts.
Hive – data types and file formats – HiveQL data definition – HiveQL data manipulation –
HiveQL queries.

30 PERIODS
COURSE OUTCOMES:

After the completion of this course, students will be able to:

CO1:Describe big data and use cases from selected business domains.
CO2:Explain NoSQL big data management.
CO3:Install, configure, and run Hadoop and HDFS.
CO4:Perform map-reduce analytics using Hadoop.
CO5:Use Hadoop-related tools such as HBase, Cassandra, Pig, and Hive for big data analytics.

LIST OF EXPERIMENTS: 30 PERIODS

1. Downloading and installing Hadoop; Understanding different Hadoop modes. Startup scripts,
Configuration files.
2. Hadoop Implementation of file management tasks, such as Adding files and directories,
retrieving files and Deleting files
3. Implement of Matrix Multiplication with Hadoop Map Reduce
4. Run a basic Word Count Map Reduce program to understand Map Reduce Paradigm.
5. Installation of Hive along with practice examples.
7. Installation of HBase, Installing thrift along with Practice examples
8. Practice importing and exporting data from various databases.

Software Requirements:
Cassandra, Hadoop, Java, Pig, Hive and HBase.

TOTAL:60 PERIODS
TEXT BOOKS:

1. Michael Minelli, Michelle Chambers, and AmbigaDhiraj, "Big Data, Big Analytics: Emerging
Business Intelligence and Analytic Trends for Today's Businesses", Wiley, 2013.
2. Eric Sammer, "Hadoop Operations", O'Reilley, 2012.
3. Sadalage, Pramod J. “NoSQL distilled”, 2013

REFERENCES:

1. E. Capriolo, D. Wampler, and J. Rutherglen, "Programming Hive", O'Reilley, 2012.

2. Lars George, "HBase: The Definitive Guide", O'Reilley, 2011.
3. Eben Hewitt, "Cassandra: The Definitive Guide", O'Reilley, 2010. 87
4. Alan Gates, "Programming Pig", O'Reilley, 2011.

Ccs345-Eai Unit 2
Document18 pages
Ccs345-Eai Unit 2
sibiya varghese
No ratings yet
cp4252 Machine Learning
Document49 pages
cp4252 Machine Learning
Suganya C
100% (1)
CCS336 Cloud Services Management Lecture Notes 2
Document118 pages
CCS336 Cloud Services Management Lecture Notes 2
Gokul M
No ratings yet
Ccs341 Data Warehousing
Document2 pages
Ccs341 Data Warehousing
arul mamce
60% (5)
CSBS - AD3491 - FDSA - IA 1 - Answer Key
Document14 pages
CSBS - AD3491 - FDSA - IA 1 - Answer Key
R.Mohan Kumar
100% (9)
Unit3 BD
Document104 pages
Unit3 BD
Hirdesh Sharma
100% (1)
cp5293 Big Data Analytics Question Bank
Document13 pages
cp5293 Big Data Analytics Question Bank
Sanguine Shereen
0% (1)
MC5502 Bigdata Unit 2 Notes
Document20 pages
MC5502 Bigdata Unit 2 Notes
Sreehul
100% (2)
UNIT-3 Hadoop and MapReduce Programming
Document84 pages
UNIT-3 Hadoop and MapReduce Programming
Naru Naveen
100% (1)
Matrix Multiplication Using Hadoop Map-Reduce
Document10 pages
Matrix Multiplication Using Hadoop Map-Reduce
Niri
No ratings yet
Cp5293 Big Data Analytics Question Bank
Document13 pages
Cp5293 Big Data Analytics Question Bank
Sanguine Shereen
0% (1)
CP7019-Managing Big Data-Anna University - Question Paper
Document4 pages
CP7019-Managing Big Data-Anna University - Question Paper
bhuvangates
100% (3)
cp5293 Big Data Analytics Unit 5 PDF
Document28 pages
cp5293 Big Data Analytics Unit 5 PDF
Gnanendra Kotikam
No ratings yet
A Convergence of Key Trends: Kept Large Amounts of Information Information On Tape
Document14 pages
A Convergence of Key Trends: Kept Large Amounts of Information Information On Tape
Pratiksha Deshmukh
No ratings yet
Ad3301 - Data Exploration and Visualization
Document2 pages
Ad3301 - Data Exploration and Visualization
silambarasan
100% (2)
Unit5 BD
Document91 pages
Unit5 BD
Hirdesh Sharma
100% (2)
Ad3301 Data Exploration and Visualization
Document30 pages
Ad3301 Data Exploration and Visualization
Shamilie M
No ratings yet
Map Reduce Applications
Document94 pages
Map Reduce Applications
Hirdesh Sharma
No ratings yet
ME P4252-II Semester - MACHINE LEARNING
Document48 pages
ME P4252-II Semester - MACHINE LEARNING
Bibsy Adlin Kumari R
No ratings yet
Question Paper Code:: (10×2 20 Marks)
Document2 pages
Question Paper Code:: (10×2 20 Marks)
Ponraj Park
No ratings yet
r18 - Big Data Analytics - Cse (DS)
Document1 page
r18 - Big Data Analytics - Cse (DS)
aarthi dev
No ratings yet
4 UNIT-4 Introduction To Hadoop
Document154 pages
4 UNIT-4 Introduction To Hadoop
PrakashRameshGadekar
No ratings yet
Unit-Iii 3.1 Regression Modelling
Document7 pages
Unit-Iii 3.1 Regression Modelling
Sankar Jaikissan
100% (1)
Image and Video Analytics
Document3 pages
Image and Video Analytics
Jetlin C P
No ratings yet
CS8091 Important Questions BDA
Document1 page
CS8091 Important Questions BDA
vanitha
No ratings yet
Big Data & Analytics Lab Manual
Document51 pages
Big Data & Analytics Lab Manual
Sathish
No ratings yet
IR Question Bank
Document29 pages
IR Question Bank
Amaya Ema
100% (2)
AD3491 - FDSA - Unit I - Introduction - Part I
Document23 pages
AD3491 - FDSA - Unit I - Introduction - Part I
R.Mohan Kumar
100% (1)
BDA Final Lab Manual
Document56 pages
BDA Final Lab Manual
Public Tola
100% (1)
CS8791-Cloud Computing - Question Bank
Document10 pages
CS8791-Cloud Computing - Question Bank
Anandakumar Hadorai
No ratings yet
Unit 5 Notes
Document66 pages
Unit 5 Notes
Malathy S
100% (3)
Super Important Questions For BDA-18CS72: Module-1
Document2 pages
Super Important Questions For BDA-18CS72: Module-1
Samarth
No ratings yet
Irt 2 Marks With Answer
Document15 pages
Irt 2 Marks With Answer
Amaya Ema
No ratings yet
CS3461 Oslab
Document2 pages
CS3461 Oslab
Anurekha Prasad
No ratings yet
AI - Unit I QB
Document1 page
AI - Unit I QB
Narendran Muthusamy
No ratings yet
CS8581-Networks Lab - Manual PDF
Document68 pages
CS8581-Networks Lab - Manual PDF
Seekay Alais Karuppaiah C
0% (1)
BD - Unit - III - MapReduce
Document31 pages
BD - Unit - III - MapReduce
Prem Kumar
No ratings yet
CS8091-BIG DATA ANALYTICS UNIT V Notes
Document31 pages
CS8091-BIG DATA ANALYTICS UNIT V Notes
anu xerox
100% (4)
Data Warehouse 21reg
Document2 pages
Data Warehouse 21reg
Ponni S
100% (1)
Unit I - Part I Notes
Document33 pages
Unit I - Part I Notes
Manju Ancy John Immanuel
100% (6)
Unit-Vi Hive Hadoop & Big Data
Document24 pages
Unit-Vi Hive Hadoop & Big Data
Abhay Dabhade
100% (1)
Unit 4
Document33 pages
Unit 4
Sahana Shetty
100% (1)
Ccs354 Network Security Lab
Document63 pages
Ccs354 Network Security Lab
Rameshkumar M
No ratings yet
Big Data Analysis Lab Manual
Document39 pages
Big Data Analysis Lab Manual
ragulnagarajan896
No ratings yet
Big Data Analytics Unit 2 MINING DATA STREAMS
Document22 pages
Big Data Analytics Unit 2 MINING DATA STREAMS
Rathi Priya
100% (2)
Multi - Core Architectures and Programming - Lecture Notes, Study Material and Important Questions, Answers
Document49 pages
Multi - Core Architectures and Programming - Lecture Notes, Study Material and Important Questions, Answers
M.V. TV
0% (1)
Indexing: 1. Static and Dynamic Inverted Index
Document55 pages
Indexing: 1. Static and Dynamic Inverted Index
Vaibhav Garg
50% (2)
Lab Cs3591 Computer Networks Lab
Document38 pages
Lab Cs3591 Computer Networks Lab
Shamilie M
100% (2)
Iot Question Bank
Document8 pages
Iot Question Bank
BALACHANDRAN A
100% (1)
Data Mining and Visualization Question Bank
Document11 pages
Data Mining and Visualization Question Bank
ghost
100% (1)
Software Project Management: Nehru Institute of Engineering and Technology
Document39 pages
Software Project Management: Nehru Institute of Engineering and Technology
sakthisubi
No ratings yet
Question Bank-Big Data
Document1 page
Question Bank-Big Data
Hìtésh Rélwàñí
0% (3)
CCS336 CSM PART A AND B Question and Answers
Document83 pages
CCS336 CSM PART A AND B Question and Answers
NISHANTH M
100% (1)
Advanced Software Engg Notes Unit1-5
Document155 pages
Advanced Software Engg Notes Unit1-5
Bibsy Adlin Kumari R
75% (4)
Question Bank For Int - Data Science
Document5 pages
Question Bank For Int - Data Science
Priyansh Polra
No ratings yet
Big Data Analytics With Lab
Document3 pages
Big Data Analytics With Lab
Keerthana K
No ratings yet
DATA ANALYTICS Lab
Document3 pages
DATA ANALYTICS Lab
Boopathi kumar
No ratings yet
Big Data Analytics Syllabus
Document2 pages
Big Data Analytics Syllabus
Saiyed Faiayaz Waris
No ratings yet
3rd Sem Syllabus
Document13 pages
3rd Sem Syllabus
iShamirali
No ratings yet
Big Data Syllabus
Document1 page
Big Data Syllabus
Deepak Kumar
No ratings yet
DP 900T00A ENU TrainerHandbook
Document288 pages
DP 900T00A ENU TrainerHandbook
André baungatner
No ratings yet
m11 IF2132 2 IntroAnalisisData
Document45 pages
m11 IF2132 2 IntroAnalisisData
anisa tyaas
No ratings yet
Advanced Oracle PL SQL Course
Document8 pages
Advanced Oracle PL SQL Course
Happy Deal
No ratings yet
Database Assignment 1
Document16 pages
Database Assignment 1
Asif Syed
No ratings yet
2021 Part B National Summary Data File Code Range: 01) Anesthesia (00000 - 09999)
Document31 pages
2021 Part B National Summary Data File Code Range: 01) Anesthesia (00000 - 09999)
Destiny Omenka
No ratings yet
Logcat Home Fota Update Log
Document898 pages
Logcat Home Fota Update Log
Isamar Urvina
No ratings yet
ABAP On HANA
Document114 pages
ABAP On HANA
gvrahul
No ratings yet
Data Lakehouse As A Service For Vmware Sovereign Clouds Solution Brief
Document2 pages
Data Lakehouse As A Service For Vmware Sovereign Clouds Solution Brief
dicruz81
No ratings yet
6 Access Layer PDF
Document84 pages
6 Access Layer PDF
Allen Chandler
50% (2)
Active Directory Structure and Storage Technologies - Active Directory
Document5 pages
Active Directory Structure and Storage Technologies - Active Directory
Amal P M
No ratings yet
GDPR and Oracle Cloud Apps 4124561
Document1 page
GDPR and Oracle Cloud Apps 4124561
neaman_ahmed
No ratings yet
SP2016 SP2019 Enterprise Search Architecture Model PDF
Document1 page
SP2016 SP2019 Enterprise Search Architecture Model PDF
johnanticona
No ratings yet
Sec3 Program
Document41 pages
Sec3 Program
paku
No ratings yet
Migrate Access To SQL Server 2005
Document5 pages
Migrate Access To SQL Server 2005
Brenda Cox
No ratings yet
Big Data 4
Document2 pages
Big Data 4
Otra Vez
No ratings yet
Tibco - ADB
Document11 pages
Tibco - ADB
arunbharadwaj
No ratings yet
Practice 11
Document3 pages
Practice 11
Mark Manabat
No ratings yet
Mysql Helper
Document1 page
Mysql Helper
Guilherme Nogueira
No ratings yet
SF Dataloading Commands
Document4 pages
SF Dataloading Commands
Avinash Reddy
No ratings yet
Ang Data Structure, Ay Isang Pinadali Na Paraan para Mapabilis Ang Paghahanap or Pag Retrieve NG Ating Mga Files.
Document3 pages
Ang Data Structure, Ay Isang Pinadali Na Paraan para Mapabilis Ang Paghahanap or Pag Retrieve NG Ating Mga Files.
aragonkaycy
No ratings yet
Manage LOB
Document1,124 pages
Manage LOB
Vega Lin
No ratings yet
Install A Local Copy of Aumentum DB-623537
Document6 pages
Install A Local Copy of Aumentum DB-623537
Carlos Gongora
No ratings yet
ExtJS Fly Chart
Document3 pages
ExtJS Fly Chart
Steffe Arbini
No ratings yet
Data Warehousing Concepts
Document87 pages
Data Warehousing Concepts
pranjal vispute
No ratings yet
SQL Mid Term Part 1
Document17 pages
SQL Mid Term Part 1
Ioana Toader
0% (1)
Login Issues in Apps
Document5 pages
Login Issues in Apps
Naresh Babu
No ratings yet
Stack ADT Using Interface
Document4 pages
Stack ADT Using Interface
1035 PUNEETH RAM P
No ratings yet
MSTR Vs Looker
Document45 pages
MSTR Vs Looker
Victor Flores
No ratings yet
DFC20203 - LAB ACTIVITY 3 - Part 2
Document2 pages
DFC20203 - LAB ACTIVITY 3 - Part 2
nur syafiqah
No ratings yet
Lotus Notes Domino Administration Rakesh
Document35 pages
Lotus Notes Domino Administration Rakesh
Ravi Yalala
100% (1)