Course Outline

Introduction

  • Learning through positive reinforcement

Elements of Reinforcement Learning

Important Terms (Actions, States, Rewards, Policy, Value, Q-Value, etc.)

Overview of Tabular Solution Methods

Creating a Software Agent

Understanding Value-based, Policy-based, and Model-based Approaches

Working with the Markov Decision Process (MDP)

How Policies Define an Agent's Way of Behaving

Using Monte Carlo Methods

Temporal-Difference Learning

n-step Bootstrapping

Approximate Solution Methods

On-policy Prediction with Approximation

On-policy Control with Approximation

Off-policy Methods with Approximation

Understanding Eligibility Traces

Using Policy Gradient Methods

Summary and Conclusion

Requirements

  • Experience with machine learning
  • Programming experience

Audience

  • Data scientists

21 Hours
