- Overview of Big Data
- Introduction to Apache Hadoop
- Hadoop Distributed File System
- Hadoop Map Reduce
- Introduction to Apache Spark
- Characteristics of Apache Spark
- Users and Use Cases of Apache Spark
- Job Execution Flow and Spark Execution
- Spark Unified Stack
- Complete Picture of Apache Spark
- Why Spark with Scala
- Apache spark Architecture
Quick Facts
particular | details | |||
---|---|---|---|---|
Medium of instructions
English
|
Mode of learning
Self study
|
Mode of Delivery
Video and Text Based
|
Course and certificate fees
Fees information
₹ 499 ₹1,499
certificate availability
Yes
certificate providing authority
Udemy
The syllabus
Module 1 - Introduction to Big Data, Spark and Hadoop
Introduction to Scala Programming Language
- Introduction to Scala
- Scala Basic Syntax
- Scala Class and Objects
- If else Statements in Scala
- Loops in Scala
Module 3 - Advanced Scala Programming
- Functions and Procedures in Scala
- 3.2 Access Modifiers
- 3.3 Strings and Arrays
- 3.4 Scala Collections
- 3.5 Scala Traits
- 3.6 Pattern Matching
- 3.7 Scala Extractors
- 3.8 Scala Exception Handling
- 3.9 Scala Files IO
Module 4 - Apache Spark RDDs
- Programming with RDDs
- Starting with Spark
- Creating RDDs
- RDD Operations
- Lifecycle of Spark
Module 5 - Apache Spark RDDs-II
- Spark Caching
- Common Transformations and Actions
- Spark Functions
- Some more Spark functions
Module 6 - Working with Key Value Pairs
- Key Value Pairs
- Aggregate Functions
- Working with Aggregate Functions
- Joins in Spark
- Practical on Word count example
Module 7 - Advanced Spark Programming
- Spark Shared Variables
- Spark and Fault Tolerance
- Broadcast variables
- Numeric RDD Operations
- Per-Partition Operations
Module 8 - Running Spark Jobs on Cluster
- Spark Runtime Architecture
- Spark Driver
- Executors
- Cluster Managers
- Cluster Managers II
Module 9 - Spark SQL
- Introduction to Spark SQL
- Starting Point-SQL Context
- Hive with Spark SQL
- Spark SQL Caching
Module 10 - Spark Streaming
- People.json, Employee.json
Module 11 - Machine Learning in Spark
- Machine learning with mllib
- MLib Data Types
- labeled point data types
- Local Matrices in mllib
- MLib Algorithms
- Classification and Regression
- Clustering
Module 12 - GraphX in Spark
- GraphX Introduction
- Creating Graphs
- Graph Operators
- Subgraph Transformation
- Computation with map reduce triplets
Articles
Popular Articles
prev
next
Latest Articles
prev
next