Dwarkesh Podcast podkast

Some thoughts on the Sutton interview

0:00
11:39
Do tyłu o 15 sekund
Do przodu o 15 sekund

I have a much better understanding of Sutton’s perspective now. I wanted to reflect on it a bit.

(00:00:00) - The steelman

(00:02:42) - TLDR of my current thoughts

(00:03:22) - Imitation learning is continuous with and complementary to RL

(00:08:26) - Continual learning

(00:10:31) - Concluding thoughts



Get full access to Dwarkesh Podcast at www.dwarkesh.com/subscribe

Więcej odcinków z kanału "Dwarkesh Podcast"