Course Content
-
Introduction to Big Data & Hadoop
-
Data Explosion
-
What is Big Data?
-
Types of Data
-
Need for Big Data
-
Characteristics of Big Data
-
Big Data – Capabilities
-
Big Data – Use Cases
-
Traditional Data Warehouse – Definition
-
Traditional Data Warehouse – Limitations
-
Introduction to Hadoop
-
Hadoop Key Characteristics
-
History and Milestones of Hadoop
-
Hadoop Ecosystem
-
-
Hadoop Architecture
-
Hadoop Key Terms
-
Hadoop Cluster on Commodity Hardware
-
Hadoop Configuration
-
Hadoop Core Components & Core Services
-
Hadoop Server Roles
-
-
Hadoop Cluster
-
Planning a Hadoop Cluster
-
Installation & Configuration: Oracle VirtualBox – Introduction
-
Installing Oracle VirtualBox
-
Setting up the Virtual Environment
-
Open a VM
-
Hadoop Installation
-
Single Node Configuration
-
Multi-node Cluster setup
-
-
HDFS
-
HDFS Features
-
Difference – Regular File System & HDFS
-
HDFS Architecture
-
HDFS Operation Principle
-
NameNode Operation
-
Data Blocks & Replication Architecture
-
DataNode Failure & Recovery
-
Writing File to HDFS
-
Reading File from HDFS
-
-
MapReduce
-
Introduction to MapReduce & Components
-
JobTracker & TaskTracker
-
MapReduce Framework
-
Mapper & Reducer
-
Combiner & Partitioner
-
Shuffle & Sort
-
-
Overview of MapReduce & YARN
-
Setting up your MapReduce Environment
-
Building a MapReduce Application
-
Counters & Joins
-
Hadoop Data Types
-
Serialization & Writable Interface
-
Input Formats in MapReduce
-
Output Formats in MapReduce
-
YARN
-
-
Pig
-
Introduction to Pig
-
Pig Installation
-
Data Loading
-
Data Transformation
-
Pig – Syntax & Hands On
-
-
Hive
-
What is Hive?
-
Characteristics of Hive
-
System Architecture and Components of Hive
-
Hive Data Models
-
Hive Query Language
-
Installing, Running, and Programming Hive
-
Hive – Syntax & Hands On
-
-
HBase
-
Introduction to HBase
-
Characteristics of HBase
-
HBase Architecture
-
HBase Storage Model
-
HBase Data Model
-
Installation of HBase
-
HBase – Syntax & Hands On
-
-
Sqoop & Flume
-
Introduction to Sqoop & Flume
-
Importing & Exporting Data
-
Sqoop & Flume – Syntax & Hands On
-
-
Apache Spark
-
Introduction to Apache Spark
-
Spark Architecture & Internal Concepts
-
Using Spark for Analytics
-