
Patrick and Jason introduce reinforcement learning and place it alongside supervised and unsupervised learning. They cover Q-learning, SARSA, policy gradients, actor-critic methods, PPO, imitation learning, and why training and evaluating RL systems is so challenging.