
0:00
1:52:22
Patrick and Jason introduce reinforcement learning and place it alongside supervised and unsupervised learning. They cover Q-learning, SARSA, policy gradients, actor-critic methods, PPO, imitation learning, and why training and evaluating RL systems is so challenging.
Weitere Episoden von „Programming Throwdown“



Verpasse keine Episode von “Programming Throwdown” und abonniere ihn in der kostenlosen GetPodcast App.








