Building Big Data Pipelines with PySpark + MongoDB + Bokeh
Acquire a thorough understanding of the strategies involved in building big data pipelines with PySpark, MongoDB, and Bokeh.
Online
₹ 499 (listed price ₹ 2,299)
Quick Facts
| Particular | Details |
| --- | --- |
| Medium of instructions | English |
| Mode of learning | Self study |
| Mode of delivery | Video and text based |
Course overview
Big data pipelines are data pipelines built to handle one or more of the three defining characteristics of big data: volume, velocity, and variety. The velocity of big data makes streaming pipelines especially attractive, since data can be gathered and processed in real time, allowing action to be taken as events occur. The Building Big Data Pipelines with PySpark + MongoDB + Bokeh certification course was created by EBISYS R&D - Big Data Engineering and Consulting and is available on Udemy.
Building Big Data Pipelines with PySpark + MongoDB + Bokeh is a self-paced online course aimed at students who want to master the skills and strategies for creating data pipelines using the core functionalities of PySpark, Bokeh, and MongoDB. The classes cover data preprocessing, loading, extraction, manipulation, transformation, and visualization, and explain how to create machine learning scripts, PySpark ETL scripts, and a dashboard server.
The highlights
- Certificate of completion
- Self-paced course
- 5 hours of pre-recorded video content
- 1 article
- 1 downloadable resource
Program offerings
- Online course
- Learning resources
- 30-day money-back guarantee
- Unlimited access
- Accessible on mobile devices and TV
Course and certificate fees
Fees information
- Course fee: ₹ 499
- Certificate availability: Yes
- Certificate providing authority: Udemy
Who it is for
What you will learn
After completing the Building Big Data Pipelines with PySpark + MongoDB + Bokeh online certification, students will develop an understanding of big data and machine learning and be able to build big data pipelines using PySpark, MongoDB, Bokeh, and MLlib. Students will explore the methodologies associated with data processing, analysis, loading, transformation, extraction, visualization, and manipulation, as well as the strategies and concepts involved in geospatial machine learning and geo-mapping.
The syllabus
Introduction
Setup and Installations
- Python Installation
- Installing Third Party Libraries
- Installing Apache Spark
- Installing Java (Optional)
- Testing Apache Spark Installation
- Installing MongoDB
- Installing NoSQL Booster for MongoDB
Data Processing with PySpark and MongoDB
- Integrating PySpark with Jupyter Notebook
- Data Extraction
- Data Transformation
- Loading Data into MongoDB
Machine Learning with PySpark and MLlib
- Data Pre-processing
- Building the Predictive Model
- Creating the Prediction Dataset
Data Visualization
- Loading the Data Sources from MongoDB
- Creating a Map Plot
- Creating a Bar Chart
- Creating a Magnitude Plot
- Creating a Grid Plot
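The plot types listed above can be combined in Bokeh by building individual figures and arranging them with `gridplot`. A minimal sketch with illustrative data (the categories and magnitudes are made up, not from the course dataset):

```python
from bokeh.plotting import figure
from bokeh.layouts import gridplot

categories = ["low", "moderate", "strong"]
counts = [12, 7, 2]

# Bar chart: categorical x-range with vertical bars.
bar = figure(x_range=categories, title="Events by severity", height=300)
bar.vbar(x=categories, top=counts, width=0.8)

# Magnitude plot: a simple scatter over time steps.
mag = figure(title="Magnitude over time", height=300)
mag.scatter([1, 2, 3, 4], [4.2, 5.1, 4.8, 6.0], size=8)

# Grid plot: arrange both figures side by side.
grid = gridplot([[bar, mag]])
```

Calling `bokeh.io.show(grid)` (or `bokeh.io.output_file(...)` first) renders the grid in a browser; map plots additionally need a tile provider and web-mercator coordinates.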
Creating the Data Pipeline Scripts
- Installing Visual Studio Code
- Creating the PySpark ETL Script
- Creating the Machine Learning Script
- Creating the Dashboard Server
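Once the scripts exist, the pipeline is typically driven from the command line: the ETL and ML scripts run as Spark jobs, and the dashboard is served with Bokeh's built-in server. The file names below are placeholders; the connector package coordinate assumes mongo-spark connector v10 and would need to match your Spark/Scala versions.

```shell
# Run the ETL script as a Spark job, pulling in the MongoDB connector.
spark-submit --packages org.mongodb.spark:mongo-spark-connector_2.12:10.3.0 etl_script.py

# Run the machine learning script the same way.
spark-submit --packages org.mongodb.spark:mongo-spark-connector_2.12:10.3.0 ml_script.py

# Serve the Bokeh dashboard app and open it in a browser.
bokeh serve dashboard.py --show
```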
Source Code and Notebook
- Source Code and Notebook