Hadoop-Bootcamp Full Package

Duration : 3 weeks ( 3 full day contact classes, 9 days remote access to cluster )

Fee : $750

Week - 1 : Hadoop Introduction and HDFS

  • Big Data and Hadoop

  • HDFS Architecture

  • Name Node, Secondary node and Data Node

  • HDFS Commands

  • Java HDFS API

  • Handling of large files

  • Lab Work

  • More Details

Week - 2 : MapReduce

  • Distributed Programming Framework

  • Job Tracker, Task Tacker, Map Tasks and Reduce Tasks

  • Input, Output Formatter classes.

  • Multiple Output formatter classes and Counters

  • Handling of Failed Tasks

  • Distributed Cache

  • Combiners and Partitioners

  • Map side and Reduce side joins

  • MapReduce Patterns

  • Lab Work

  • More details

Week - 3 : Hive and Pig

  • Hive Introduction

  • Hive Scripts

  • Pig Introduction

  • Pig constructs

  • Pig data flow scripts

  • Comparison of Java API, Hive and Pig

  • Lab Work

  • More Details