
0:00
1:52:22
Patrick and Jason introduce reinforcement learning and place it alongside supervised and unsupervised learning. They cover Q-learning, SARSA, policy gradients, actor-critic methods, PPO, imitation learning, and why training and evaluating RL systems is so challenging.
Otros episodios de "Programming Throwdown"



No te pierdas ningún episodio de “Programming Throwdown”. Síguelo en la aplicación gratuita de GetPodcast.








