Big Data Hadoop Online Training

Keen IT Technologies Pvt. Ltd.

Course Summary

Hadoop is open source software for storing and processed the large data sets on clusters of commodity hardware. Hadoop is one of the Apache top level project used by global community of users.

+
Course Description

Apache Hadoop is a software solution for distributed computing of larger datasets. Hadoop offers a distributed filesystem (HDFS) and a Map Reduce implementation. A special computer acts as the "name node". This computer saves the information about the available clients and the files. The Hadoop clients (computers) are called nodes. The "name node" is currently a single point of failure.The Projects of Hadoop are working on these solutions

Course Description

Apache Hadoop is a software solution for distributed computing of larger datasets. Hadoop offers a distributed filesystem (HDFS) and a Map Reduce implementation. A special computer acts as the "name node". This computer saves the information about the available clients and the files. The Hadoop clients (computers) are called nodes. The "name node" is currently a single point of failure.The Projects of Hadoop are working on these solutions

+
Course Syllabus
Course Outline

Introduction to Hadoop

High Availability Fedration, Yarn and Security
- Distributed computing
- Parallel computing
- Concurrency
- Cloud Computing
- Data Past, Present and Future
- Computing Past, Present and Future
- Hadoop
- NoSQL
- Hadoop Streaming
- Distributing Debug Scripts
- Getting Started With Eclipse
- MapReduceNoSQL
- CAP Theorem
- Databases: Key Value, Document, Graph
- Hive and Pig HDFS
- Installing Hadoop Single Node cluster(CDH4)
- Understanding Hadoop configuration files
- Functional Concept of Map
- Functional Concept of Reduce
- Functional Ordering, Concurrency, No Lock, Concurrency
- Functional Shuffling
- Functional Reducing, Key, Concurrency
- MapReduce Execution framework
- MapReduce Partitioners and Combiners
- MapReduce and role of distributed filesystem
- Role of Key and Pairs
- Hadoop Data Types
- Understanding Sample MapReduce code
- Executing MapReduce code
- Architecture
- File System
- Data replication
- Name Node
- Data Node
- Architecture
- Data Model
- Physical Layout
- DDL DML SQL Operations
- Installation
- Setup
- Exercises
- Rationale
- Pig Latin
- Input, Output and Relational Operators
- User Defined Functions
- Analyzing and designing using Pig Latin
- Installation
- Setup
- Executing Pig Latin scripts on File system
- Executing Pig Latin scripts on HDFS
- Writing custom User Defined Functions
- What is Flume?
- How it works ?
- An example
- What is Sqoop?
- How it works ?
- An example
- What is Oozie?
- How it works?
- An example
- Cluster Planning and Cloud Manager Set-up
- Installation and Configuration
- Running MapReduce Jobs on Multi Node cluster
- Steps involved in analyzing large data
- Lab walk through

Course Fee:

Free

Course Type:	Instructor-Led
Course Status:	Active
Course Start Date:	2 Jul 14
Course End Date:	31 Dec 14

This course is listed under Open Source , Development & Implementations , Industry Specific Applications , Data & Information Management , Networks & IT Infrastructure and Server & Storage Management Community

Attended this course? Write a Review

Course Fee:

Free

Course Type:	Instructor-Led
Course Status:	Active
Course Start Date:	2 Jul 14
Course End Date:	31 Dec 14

IT Career Development Platform

Big Data Hadoop Online Training

Keen IT Technologies Pvt. Ltd.

Course Summary

Course Description

Course Description

Course Syllabus

Course Type:

Course Status:

Course Start Date:

Course End Date:

Hadoop

MapReduce

Pig

Hive

Big Data

Apache

SQL (Structured Query Language)

Open source

High Availability

Eclipse

Parallel Computing

Attended this course? Write a Review

Course Type:

Course Status:

Course Start Date:

Course End Date: