#temporaldifference #reinforcementlearning
Here we introduce the idea of temporal difference learning for the prediction problem, which combines both MC and DP methods to give us a more powerful RL algorithm with advantages from both these methods.
Негізгі бет Reinforcement Learning - Lecture 15 (Temporal Difference Learning - Prediction)
Пікірлер: 3