Have Queries? Ask us +91 72592 22234

Course Overview


Xpertised Offers Advanced and Personalized Instructor Led Online Classroom training on Hadoop which gives you the opportunity to interact with a Hadoop instructor and help you enhance yourself to meet the demands of the industry.

Learn from our instructors from the convenience of your home or office. Interact and learn live with trainers and other participants. In-depth knowledge of YARN and HDFS and MapReduce framework Learn how to work with resource management and Hadoop storage Perform ETL operations & data analytics using Pig and Hive Use MapReduce for implementing a complex business solution Implementing Partitioning and Indexing in Hive Implement best practices for Hadoop development Understand how to integrate HBase with Hive Understand Apache Spark and its Ecosystem

Course Content


Introduction to Big Data and Hadoop

  • Understanding Big Data and Big Data Challenges
  • Limitations and Solutions of Big Data Architecture
  • Core Components of Hadoop 2.x
  • Hadoop and its Features
  • Hadoop Ecosystem
  • Hadoop Storage
  • Hadoop Distributed File System (HDFS)
  • Hadoop Processing
  • MapReduce Framework
  • Different Hadoop Distributions

Hadoop Architecture and HDFS

  • Hadoop Cluster Architecture
  • Basic Hadoop Administration
  • Hadoop Cluster Modes
  • Federation and High Availability Architecture
  • Hadoop Configuration Files
  • Typical Production Hadoop Cluster
  • Common Hadoop Shell Commands
  • How to setup Single Node and Multi-Node Hadoop Cluster?

Hadoop MapReduce Framework

  • Difference between Traditional way and MapReduce way
  • Why MapReduce?
  • YARN Architecture
  • YARN Workflow
  • YARN Components
  • Anatomy of MapReduce Program
  • Advanced MapReduce concepts
  • Input Splits
  • Combiner and Partitioner
  • Relation between Input Splits and HDFS Blocks

Advanced Hadoop MapReduce

  • MRunit
  • Counters
  • Reduce Join
  • Distributed Cache
  • XML Parsing
  • Custom Input Format
  • Sequence Input Format

Apache Pig

  • Overview of Apache Pig
  • Pig Vs MapReduce
  • Pig Components
  • Pig Data Types
  • Pig Execution
  • Data Models in Pig
  • Pig Latin scripting
  • Shell and Utility Commands
  • Pig Streaming
  • Testing Pig Scripts
  • Pig UDF

Apache Hive

  • Overview of Apache Hive
  • Hive Architecture
  • Hive Components
  • Hive Vs Pig
  • Hive Metastore
  • Limitations of Hive
  • Hive Data Types
  • Hive Data Models
  • Hive Partition
  • Hive Tables
  • Comparison with Traditional Database
  • Hive Bucketing
  • Hive Tables
  • Managed Tables
  • External Tables
  • Hive Script
  • Hive UDF

Advanced Apache Hive and HBase

  • Hive QL
  • Hive Indexes and views
  • Hive UDF
  • Hive Query Optimizers
  • Dynamic Partitioning
  • Apache Base
  • HBase Architecture
  • HBase Components
  • HBase Configuration
  • HBase Cluster Deployment

Advanced Apache HBase

  • HBase Data Model
  • HBase Bulk Loading
  • HBase Shell
  • HBase Filters
  • Overview of Apache Zookeeper
  • Zookeeper Data Model
  • Zookeeper Service

Customer Reviews


Thanks to Xpertised and the tutor who walked me through all the topics with Practical exposure which is helping me in my current project.
-Waseem

Course was quite helpful in terms of understanding of concepts and practicality. Its really a very friendly environment to learn. The timing were mutually chosen, as we both are working professional. I am quite satisfied with the course.
-Tanmoy

...more
Share:

For Batch Details
Call us at: +91 7259222234

Not sure? Consult Our Experts

Looking for a Training for

Myself

My Team/Organization

I agree to be contacted over mail or phone

or
Call us at: +91 7259222234