Professional Documents
Culture Documents
Day 1 Big Data Concepts For Executives and Senior Management Objective
Day 1 Big Data Concepts For Executives and Senior Management Objective
Day 1 Big Data Concepts For Executives and Senior Management Objective
ta concepts Developing the business case for a big data solution Maintaining a technology ecosystem Examining how big data is influencing society and businesses The Emerging Role of a Data Scientist Social Media, the Quest for Real-Time and the Future Understand big data and how it can be applied to store, manage, process and analyze massive amounts of unstructured and poly structured data Explore the technologies underpinning big data including Hadoop and NoSQL Determine how big data systems can complement traditional data warehousing and business intelligence solutions and processes Utilize big data to differentiate your business and provide better service to your customers Examine case studies of how big data is influencing society and businesses
Hadoop Concepts for Executives, Business Leaders, IT Managers, Technical Staff, Developers & Administrators Objective Topics Why Hadoop? History & background Real-world use cases and case studies The Hadoop Platform Introduction to MapReduce and Hadoop File System (HDFS) Data warehousing with Hive Parallel processing with Pig Data mining with Mahout Data storage with HBase Common utilities - Sqoop, Flume, Hue, Scribe, Zookeeper, HCatalog Hadoop distributions - Apache Foundation, Cloudera, Hortonworks, MapR Understanding of the Hadoop technology stack, including MapReduce, HDFS, Hive, Pig, HBase, and provides an initial introduction to Mahout and other common utilities. What is Hadoop? The essential components of a Hadoop-based data management solution Pros and cons of implementing Hadoop How does Hadoop fit into our existing environment and architecture? The differences between various Hadoop distributions Examine case studies of how big data is influencing society and businesses
Day 2 Objective
Write a MapReduce program using Hadoop API Utilize HDFS for effective loading and processing of data with CLI and API.
Pulling contents from Social Web Sites Tweeter Streaming API's Facebook API's