Big Data Hadoop Online Training
Keen IT Technologies Pvt. Ltd.
Course Summary
Hadoop is open source software for storing and processing large data sets on clusters of commodity hardware. It is an Apache top-level project used by a global community of users.
Course Description
Apache Hadoop is a software framework for distributed computing over large datasets. It offers a distributed filesystem (HDFS) and a MapReduce implementation. The computers in a Hadoop cluster are called nodes. One dedicated computer acts as the "name node" and keeps track of the available nodes and the files stored on them. The name node is currently a single point of failure; the Hadoop project is working on solutions to this.
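To give a feel for the MapReduce model covered in the course, here is a minimal single-machine sketch in plain Python. It is not Hadoop code and does not use any Hadoop API; the function names (map_phase, shuffle_phase, reduce_phase, word_count) are made up for illustration. It only mimics the three steps that a real cluster would distribute across nodes.

```python
from collections import defaultdict


def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle_phase(pairs):
    """Shuffle: group all emitted values by their key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce: combine all values for one key into a single count."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle and reduce in sequence on an in-memory list of lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    groups = shuffle_phase(pairs)
    return dict(reduce_phase(key, values) for key, values in groups.items())


if __name__ == "__main__":
    print(word_count(["Hadoop stores data", "Hadoop processes data"]))
```

On a real cluster the input lines would come from files in HDFS, and the map and reduce steps would run in parallel on different nodes.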
Course Syllabus
Course Outline
Introduction to Hadoop
High Availability, Federation, YARN and Security
- Distributed computing
- Parallel computing
- Concurrency
- Cloud Computing
- Data Past, Present and Future
- Computing Past, Present and Future
- Hadoop
- NoSQL
- Hadoop Streaming
- Distributing Debug Scripts
- Getting Started With Eclipse
Hadoop Stack
- MapReduce
NoSQL
- CAP Theorem
- Databases: Key Value, Document, Graph
- Hive and Pig
- HDFS
Lab 1: Hadoop Hands-on
- Installing Hadoop Single Node cluster (CDH4)
- Understanding Hadoop configuration files
MapReduce Introduction
- Functional Concept of Map
- Functional Concept of Reduce
- Functional Ordering, Concurrency, No Lock, Concurrency
- Functional Shuffling
- Functional Reducing, Key, Concurrency
- MapReduce Execution framework
- MapReduce Partitioners and Combiners
- MapReduce and role of distributed filesystem
- Role of Key and Pairs
- Hadoop Data Types
Lab 2: MapReduce Exercises
- Understanding Sample MapReduce code
- Executing MapReduce code
HDFS Introduction
- Architecture
- File System
- Data replication
- Name Node
- Data Node
Hive
- Architecture
- Data Model
- Physical Layout
- DDL, DML and SQL Operations
Lab 3: Hive Hands-on
- Installation
- Setup
- Exercises
Pig
- Rationale
- Pig Latin
- Input, Output and Relational Operators
- User Defined Functions
- Analyzing and designing using Pig Latin
Lab 4: Pig Hands-on
- Installation
- Setup
- Executing Pig Latin scripts on File system
- Executing Pig Latin scripts on HDFS
- Writing custom User Defined Functions
Flume
- What is Flume?
- How it works?
- An example
Sqoop
- What is Sqoop?
- How it works?
- An example
Oozie
- What is Oozie?
- How it works?
- An example
Introduction to ZooKeeper
- Cluster Planning and Cloud Manager Set-up
Hadoop Multi node Cluster Setup
- Installation and Configuration
- Running MapReduce Jobs on Multi Node cluster
Working with Large data sets
- Steps involved in analyzing large data
- Lab walk through