
Workshop on Big Data Analysis with Hadoop for Business Decisions
Transforming Data to Goal-Driven Insight

Big Data needs no introduction in today's world! In fact, it is becoming a crucial way for leading companies to outperform their competitors. Companies are taking strategic initiatives to leverage the big data revolution to innovate, compete, and capture value. Those who possess skills in these domains are making inroads into the future.

Velveer is happy to invite you to this 4-day workshop on Hadoop to prepare you for the future. This course has been designed to provide the expertise required to use new Big Data tools and to learn methods of storing data, both structured and unstructured, for efficient processing and analysis in support of the right business decisions.

Course Coverage

Overview on:
Business Introduction
Database and SQL
ETL

Introduction to Big Data Hadoop:
Hadoop Characteristics
Assumptions and Goals
HDFS Architecture
NameNode and DataNodes
Data Replication
Robustness
The Persistence of File System Metadata
Read Operation in HDFS
Write Operation in HDFS
HDFS Federation
HDFS High Availability

MapReduce:
MapReduce Analogy
MapReduce - Process
MapReduce: High Level
Fault Tolerance
Distributed Cache

Yarn:
Yarn Introduction
Architecture

Pig Programming:
Pig - Introduction
How Pig differs from MapReduce
Pig - Data Types and Schemas
Pig Programming - Input and Output
Pig Programming Relational Operations
o For each
o Filter
o Group
o Order by
o Distinct
o Join
o Limit
Pig Programming Demo
Pig Programming Lab Exercise

Hive:
Hive Introduction
Data Types
Data Definition Language (DDL)
Data Manipulation Language (DML)
Queries
Views
Demo
Lab Exercise
Introduction to JSON
Loading data in JSON format into Hive tables
Querying JSON data loaded into Hive tables

Case Study: Analysing Twitter data
Introduction to Flume
Introduction to Oozie
Describing Twitter data in JSON format
Loading Twitter data into Hive tables
Extracting required information from the data using HiveQL
Demo: Loading data into Hive and using HiveQL to retrieve useful information

Overview:
Sqoop Introduction
HBase Introduction
HBase & Hive Integration
Integration of Hadoop with DW & BI

Delivery Model
The workshop consists of class lectures by experts in Big Data analytics, individual and group lab exercises, and case studies.

For Further Details
Please call +91 99626 55322 or +91 99401 78478, or
email: successpoint@velveer.com

Velveer Corporate Solutions Pvt. Ltd


Chennai-India
www.velveer.com | bigdata@velveer.com
Ph No: 00 91 99626 55322 / 00 91 99401 78478
