BISP Developing Solutions Using Hadoop

You might also like

Download as pdf or txt
Download as pdf or txt
You are on page 1of 2

Developing Solutions Using Apache Hadoop

Course Description: Training course is designed for developers who want to better understand how to create Apache Hadoop solutions. This 30 Hours provides Java programmers the necessary training for creating enterprise solutions using Apache Hadoop. It consists of an prudent combination of interactive lecture and extensive hand-on lab exercises.

Course Highlights

Write a MapReduce program using Hadoop API. Utilize HDFS for effective loading and processing of data with CLI and API. Understand best practices for building, debugging and optimizing Hadoop solutions. Use Pig, Hive, HBase and HCatalog effectively.

Course Duration: 35 hours. Class Delivery: On-Line (Interactive Web Based) Big Data The problem space and example applications Why dont traditional approaches scale? Requirements Hadoop Background Hadoop History The ecosystem and stack: HDFS, MapReduce, Hive, Pig Cluster architecture overview Development Environment Hadoop distribution and basic commands Eclipse development HDFS Introduction The HDFS command line and web interfaces The HDFS Java API (lab) MapReduce Introduction Key philosophy: move computation, not data Core concepts: Mappers, reducers, drivers The MapReduce Java API (lab)

www.bispsolutions.com

www.bisptrainings.com

Page 1

Real-World MapReduce Optimizing with Combiners and Partitioners (lab) More common algorithms: sorting, indexing and searching (lab) Relational manipulation: map-side and reduce-side joins (lab) Chaining Jobs Testing with MRUnit Higher-level Tools Patterns to abstract thinking in MapReduce The Cascading library (lab) The Hive database (lab)

www.bispsolutions.com

www.bisptrainings.com

Page 2

You might also like