Course Outline
Introduction
- Learning through positive reinforcement
Elements of Reinforcement Learning
Important Terms (Actions, States, Rewards, Policy, Value, Q-Value, etc.)
Overview of Tabular Solution Methods
Creating a Software Agent
Understanding Value-based, Policy-based, and Model-based Approaches
Working with the Markov Decision Process (MDP)
How Policies Define an Agent's Way of Behaving
Using Monte Carlo Methods
Temporal-Difference Learning
n-step Bootstrapping
Approximate Solution Methods
On-policy Prediction with Approximation
On-policy Control with Approximation
Off-policy Methods with Approximation
Understanding Eligibility Traces
Using Policy Gradient Methods
Summary and Conclusion
Requirements
- Experience with machine learning
- Programming experience
Audience
- Data scientists
Duration
- 21 Hours