
0:00
1:52:22
Patrick and Jason introduce reinforcement learning and place it alongside supervised and unsupervised learning. They cover Q-learning, SARSA, policy gradients, actor-critic methods, PPO, imitation learning, and why training and evaluating RL systems is so challenging.
Fler avsnitt från "Programming Throwdown"



Missa inte ett avsnitt av “Programming Throwdown” och prenumerera på det i GetPodcast-appen.








