Reinforcement Learning beginner to master AI in Python

BY
Udemy

Lavel

Beginner

Mode

Online

Fees

₹ 3099

Quick Facts

particular details
Medium of instructions English
Mode of learning Self study
Mode of Delivery Video and Text Based

Course and certificate fees

Fees information
₹ 3,099
certificate availability

Yes

certificate providing authority

Udemy

The syllabus

Welcome module

  • [IMPORTANT] English captions available for sections 1-4
  • Welcome
  • Course Structure
  • Environment setup [Important]
  • Setup - Mac

The Markov decision process (MDP)

  • The Markov decision process (MDP)
  • Types of Markov decision process
  • Trajectory vs episode
  • Reward vs Return
  • Discount factor
  • Policy
  • State values v(s) and action values q(s,a)
  • Bellman equations
  • Solving a Markov decision process
  • Setup - MDP in code
  • MDP in code - Part 1
  • MDP in code - Part 2

Dynamic Programming

  • Introduction to Dynamic Programming
  • Value iteration
  • Setup - Value iteration
  • Coding - Value iteration 1
  • Coding - Value iteration 2
  • Coding - Value iteration 3
  • Coding - Value iteration 4
  • Coding - Value iteration 5
  • Policy iteration
  • Setup - Policy iteration
  • Coding - Policy iteration 1
  • Policy evaluation
  • Coding - Policy iteration 2
  • Policy Improvement
  • Coding - Policy iteration 3
  • Coding - Policy iteration 4
  • Policy iteration in practice
  • Generalized Policy Iteration (GPI)

Monte Carlo methods

  • Monte Carlo methods
  • Solving control tasks with Monte Carlo methods
  • On-policy Monte Carlo control
  • Setup - On-policy Monte Carlo control
  • Coding - On-policy Monte Carlo control 1
  • Coding - On-policy Monte Carlo control 2
  • Coding - On-policy Monte Carlo control 3
  • Setup - Constant alpha Monte Carlo
  • Coding - Constant alpha Monte Carlo
  • Off-policy Monte Carlo control
  • Setup - Off-policy Monte Carlo control
  • Coding - Off-policy Monte Carlo 1
  • Coding - Off-policy Monte Carlo 2
  • Coding - Off-policy Monte Carlo 3

Temporal difference methods

  • Temporal difference methods
  • Solving control tasks with temporal difference methods
  • Monte Carlo vs temporal difference methods
  • SARSA
  • Setup - SARSA
  • Coding - SARSA 1
  • Coding - SARSA 2
  • Q-Learning
  • Setup - Q-Learning
  • Coding - Q-Learning 1
  • Coding - Q-Learning 2
  • Advantages of temporal difference methods

N-step bootstrapping

  • N-step temporal difference methods
  • Where do n-step methods fit?
  • Effect of changing n
  • N-step SARSA
  • N-step SARSA in action
  • Setup - n-step SARSA
  • Coding - n-step SARSA

Continuous state spaces

  • Setup - Classic control tasks
  • Coding - Classic control tasks
  • Working with continuous state spaces
  • State aggregation
  • Setup - Continuous state spaces
  • Coding - State aggregation 1
  • Coding - State aggregation 2
  • Coding - State aggregation 3
  • Tile coding
  • Coding - Tile coding 1
  • Coding - Tile coding 2
  • Coding - Tile coding 3

Brief introduction to neural networks

  • Function approximators
  • Artificial Neural Networks
  • Artificial Neurons
  • How to represent a Neural Network
  • Stochastic Gradient Descent
  • Neural Network optimization

Deep SARSA

  • Deep SARSA
  • Neural Network optimization (Deep Q-Network)
  • Experience Replay
  • Target Network
  • Coding - Deep SARSA 1
  • Coding - Deep SARSA 2
  • Coding - Deep SARSA 3
  • Coding - Deep SARSA 4
  • Coding - Deep SARSA 5
  • Coding - Deep SARSA 6
  • Coding - Deep SARSA 7
  • Coding - Deep SARSA 8
  • Coding - Deep SARSA 9
  • Coding -Deep SARSA 10

Deep Q-Learning

  • Deep Q-Learning
  • Setup - Deep Q-Learning
  • Coding - Deep Q-Learning 1
  • Coding - Deep Q-Learning 2
  • Coding - Deep Q-Learning 3

REINFORCE

  • Policy gradient methods
  • Representing policies using neural networks
  • Policy performance
  • The policy gradient theorem
  • REINFORCE
  • Parallel learning
  • Entropy regularization
  • REINFORCE 2
  • Coding - REINFORCE 1
  • Coding - REINFORCE 2
  • Coding - REINFORCE 3
  • Coding - REINFORCE 4
  • Coding - REINFORCE 5

Advantage Actor - Critic (A2C)

  • A2C
  • Setup - A2C
  • Coding - A2C 1
  • Coding - A2C 2
  • Coding - A2C 3
  • Coding - A2C 4

Outro

  • Looking back
  • Next steps

Articles

Popular Articles

Latest Articles

Similar Courses

Getting Started with Generative AI APIs

Codio via Coursera

3 Weeks Online
Beginner

Artificial Intelligence Projects

Great Learning

Online
Beginner
Free

Artificial Intelligence Chatbots Without Programmi...

IBM via Edx

2 Weeks Online
Beginner
Free

Google Artificial Intelligence for JavaScript Deve...

Google via Edx

7 Weeks Online
Beginner
Free

Contact Center Artificial Intelligence Conversatio...

Google via Coursera

2 Weeks Online
Beginner

Introduction to Intel Distribution of OpenVino Too...

Intel via Coursera

1 Week Online
Beginner
Free

Basic Certificate Course in Artificial Intelligenc...

CDAC Noida via FutureSkills

120 Hours Online
Beginner
₹ 3,390

Intelligence Tools for the Digital Age

IE Business School, Madrid via Coursera

3 Weeks Online
Beginner
Free

AI and the Illusion of Intelligence

Copenhagen Business School, Frederiksberg via Coursera

3 Weeks Online
Beginner
Free

Artificial Intelligence Empathy and Ethics

UC Santa Cruz via Coursera

3 Weeks Online
Beginner

Courses of your Interest

Certificate in Database Management using SQL and M...

Certificate in Database Management using SQL and M...

Amity Online

24 Hours Online
Beginner
₹27,000 ₹33,000
Certificate in Dashboarding and Storytelling using...

Certificate in Dashboarding and Storytelling using...

Amity Online

24 Hours Online
Beginner
₹27,000 ₹33,000
Certificate in Spreadsheet Modelling using Excel

Certificate in Spreadsheet Modelling using Excel

Amity Online

24 Hours Online
Beginner
₹27,000 ₹33,000
Certificate in Big Data Analytics

Certificate in Big Data Analytics

Amity Online

40 Hours Online
Beginner
₹42,000 ₹52,000
Certificate in Artificial Intelligence and Deep le...

Certificate in Artificial Intelligence and Deep le...

Amity Online

40 Hours Online
Beginner
₹42,000 ₹52,000
Certificate in Text Mining and NLP

Certificate in Text Mining and NLP

Amity Online

32 Hours Online
Beginner
₹32,000 ₹40,000
Certificate in Descriptive Analytics and Data Pre-...

Certificate in Descriptive Analytics and Data Pre-...

Amity Online

16 Hours Online
Beginner
₹17,000 ₹21,000
Certificate in Applied Data Engineering

Certificate in Applied Data Engineering

Amity Online

60 Hours Online
Beginner
₹75,000 ₹100,000
Certificate in Programming for Data Analytics Usin...

Certificate in Programming for Data Analytics Usin...

Amity Online

24 Hours Online
Beginner
₹27,000 ₹33,000
Certificate in Predictive Analytics Using Python

Certificate in Predictive Analytics Using Python

Amity Online

32 Hours Online
Beginner
₹32,000 ₹40,000

More Courses by Udemy

Microsoft Excel 2013 Course Beginners Intermediate...

Udemy

Online
Beginner
₹399 ₹2,699

Python for Beginners to Advance

Udemy

Online
Beginner
₹ 2,499

Learn Python Turtle Using Block Coding

Udemy

Online
Beginner
₹399 ₹799

Master Python Basics For Developer

Udemy

Online
Beginner
₹475 ₹3,499

Programming in Python for Beginners

Udemy

Online
Beginner
₹ 799

Learn Python 3 Programming from Scratch

Udemy

Online
Beginner
₹475 ₹1,299

Automate Your Life With Python

Udemy

Online
Beginner
₹ 2,899

Learn Python Python for Beginners

Udemy

Online
Beginner
₹ 1,799

Trending Courses

Popular Courses

Popular Platforms

Learn more about the Courses