
Patrick and Jason introduce reinforcement learning and place it alongside supervised and unsupervised learning. They cover Q-learning, SARSA, policy gradients, actor-critic methods, PPO, imitation learning, and why training and evaluating RL systems is so challenging.








