Hadoop-Bootcamp Full Premium Course Package

Duration : 5 weeks (5 full-day contact classes, 15 days of remote access to the cluster)

Fee : $1150

Week - 1 : Hadoop Introduction and HDFS

  • Big Data and Hadoop

  • HDFS Architecture

  • NameNode, Secondary NameNode and DataNode

  • HDFS Commands

  • Java HDFS API

  • Handling of large files

  • Lab Work

Week - 2 : MapReduce

  • Distributed Programming Framework

  • JobTracker, TaskTracker, Map Tasks and Reduce Tasks

  • InputFormat and OutputFormat classes

  • Multiple output format classes and Counters

  • Handling of Failed Tasks

  • Distributed Cache

  • Combiners and Partitioners

  • Map-side and reduce-side joins

  • Lab Work

Week - 3 : Hive and Pig

  • Hive Introduction

  • Hive Scripts

  • Pig Introduction

  • Pig constructs

  • Pig data flow scripts

  • Comparison of Java API, Hive and Pig

  • Lab Work

Week - 4 and 5 : Case Study

  • Analyze Requirements

  • Plan Hadoop cluster

  • Design system components using Hadoop

  • Evaluate Design Options

  • Each student will be given the opportunity to design the system end to end

  • This session puts all the Hadoop skills covered in the course to use in designing and implementing a Hadoop solution
