Apache Spark and Scala

BY
Udemy

Mode

Online

Fees

₹ 499 1499

Quick Facts

particular details
Medium of instructions English
Mode of learning Self study
Mode of Delivery Video and Text Based

Course and certificate fees

Fees information
₹ 499  ₹1,499
certificate availability

Yes

certificate providing authority

Udemy

The syllabus

Module 1 - Introduction to Big Data, Spark and Hadoop

  • Overview of Big Data
  • Introduction to Apache Hadoop
  • Hadoop Distributed File System
  • Hadoop Map Reduce
  • Introduction to Apache Spark
  • Characteristics of Apache Spark
  • Users and Use Cases of Apache Spark
  • Job Execution Flow and Spark Execution
  • Spark Unified Stack
  • Complete Picture of Apache Spark
  • Why Spark with Scala
  • Apache spark Architecture

Introduction to Scala Programming Language

  • Introduction to Scala
  • Scala Basic Syntax
  • Scala Class and Objects
  • If else Statements in Scala
  • Loops in Scala

Module 3 - Advanced Scala Programming

  • Functions and Procedures in Scala
  • 3.2 Access Modifiers
  • 3.3 Strings and Arrays
  • 3.4 Scala Collections
  • 3.5 Scala Traits
  • 3.6 Pattern Matching
  • 3.7 Scala Extractors
  • 3.8 Scala Exception Handling
  • 3.9 Scala Files IO

Module 4 - Apache Spark RDDs

  • Programming with RDDs
  • Starting with Spark
  • Creating RDDs
  • RDD Operations
  • Lifecycle of Spark

Module 5 - Apache Spark RDDs-II

  • Spark Caching
  • Common Transformations and Actions
  • Spark Functions
  • Some more Spark functions

Module 6 - Working with Key Value Pairs

  • Key Value Pairs
  • Aggregate Functions
  • Working with Aggregate Functions
  • Joins in Spark
  • Practical on Word count example

Module 7 - Advanced Spark Programming

  • Spark Shared Variables
  • Spark and Fault Tolerance
  • Broadcast variables
  • Numeric RDD Operations
  • Per-Partition Operations

Module 8 - Running Spark Jobs on Cluster

  • Spark Runtime Architecture
  • Spark Driver
  • Executors
  • Cluster Managers
  • Cluster Managers II

Module 9 - Spark SQL

  • Introduction to Spark SQL
  • Starting Point-SQL Context
  • Hive with Spark SQL
  • Spark SQL Caching

Module 10 - Spark Streaming

  • People.json, Employee.json

Module 11 - Machine Learning in Spark

  • Machine learning with mllib
  • MLib Data Types
  • labeled point data types
  • Local Matrices in mllib
  • MLib Algorithms
  • Classification and Regression
  • Clustering

Module 12 - GraphX in Spark

  • GraphX Introduction
  • Creating Graphs
  • Graph Operators
  • Subgraph Transformation
  • Computation with map reduce triplets

Trending Courses

Popular Courses

Popular Platforms

Learn more about the Courses