
I have a much better understanding of Sutton’s perspective now. I wanted to reflect on it a bit.
(00:00:00) - The steelman
(00:02:42) - TLDR of my current thoughts
(00:03:22) - Imitation learning is continuous with and complementary to RL
(00:08:26) - Continual learning
(00:10:31) - Concluding thoughts
Get full access to Dwarkesh Podcast at www.dwarkesh.com/subscribe