Spark Specialization Training

 

COURSE ID GES-SP
DURATION 39 hrs
DELIVERY METHOD Classroom Instructor-led training (CILT)
Online Instructor-led training (OILT)

Intro to Hadoop
RDBMS vs Hadoop
Ecosystem tour (9 products)
Vendor comparison (Cloudera, Hortonworks, MapR, Amazon EMR)
Hardware Recommendations

HDFS: File System details
Heartbeats
Rack awareness

Hands-on practice on a Hadoop machine
Introduction to the Hadoop FS and processing environment's UIs
How to read and write files
Basic Unix commands for Hadoop
Hadoop FS shell
Hadoop releases practical
Hadoop daemons practical

Hive Introduction
Metadata storage and the metastore
Hive Data types
HQL
DDL, DML and sub languages of Hive
Internal, external, and temporary tables in Hive
Differences between a SQL-based data warehouse and Hive
Hive releases
Why Hive is not the best solution for OLTP
OLAP in Hive
Partitioning
Bucketing
Hive Architecture
Hue Interface for Hive
Complex Use cases in Hive
Hive Advanced Assignment
Real-time Hive scenarios
POC on Pig and Hive with real-time data sets and problem statements

End to End execution flow of Map Reduce job
Different tasks in Map Reduce job
Introduction to Combiner
Introduction to Partitioner
POC based on Pig, Hive, HDFS, MR

Introduction to NoSQL
Why NoSQL when SQL has been on the market for many years
NoSQL databases on the market
OLTP solutions with different capabilities
Which NoSQL solution can handle which specific requirements
Examples of companies such as Google, Facebook, and Amazon, and other clients that use NoSQL databases

HBase architecture and column families
Introduction to HBase
Introduction to other NoSQL data models
Drawbacks of Hadoop
Why Hadoop can't work for real-time processing
HBase table and column family structure
HBase versioning concept
HBase flexible schema
HBase Advanced

Introduction to Zookeeper
How ZooKeeper helps in the Hadoop ecosystem
How to load data from relational storage into Hadoop
Sqoop basics
Sqoop practical implementation
Sqoop alternative
Sqoop connector
Quick revision of previous classes to fill gaps in your understanding and correct misunderstandings
NiFi

YARN
YARN vs. Hadoop v1
Introduction to YARN
Significance of YARN
Introduction to Hue
Real-time Hadoop usage
Real-time cluster introduction
Hadoop real-time project
Real-time problems and frequently encountered errors, with solutions

Introduction to Spark
Why demand for Spark is increasing in the market
How to use Spark with the Hadoop ecosystem
Datasets for practice
Spark use cases with real-time scenarios
Apache Spark
Introduction to Scala
Spark Shell
Spark Context
RDD
Spark Architecture
Transformations
Actions
Caching
Spark SQL
Real-time project use-case examples based on Spark and Scala

