AI-Accelerated Product Development
Model-Free Policy Evaluation
Total time needed:
Basic understanding of model-free approaches for policy evaluation
Potential Use Cases
Strategic Games, Robotics
Who Is This For?
Anyone interested in learning the concepts and real-world applications of Reinforcement Learning
1. What are expected values, variance, and covariance?
What are expected values, and how do they relate to variance and covariance?
2. Monte-Carlo (MC) Policy Evaluation
What is the Monte-Carlo policy evaluation technique?
3. Temporal Difference (TD) Policy Evaluation
What is Temporal Difference (TD) policy evaluation?
4. MDP, MC, and TD sections from the Reinforcement Learning book
What is a Markov Decision Process (MDP)?
What is Monte-Carlo (MC) Learning?
What is Temporal Difference (TD) Learning?
5. Markov Decision Processes - Part 1
What is the definition of a Markov Decision Process?
What is Markov about MDPs?
What are the V-value and the Q-value?
6. Markov Decision Processes - Part 2
What are Bellman equations?
What is value iteration?
What is policy iteration?