TalkRL: The Reinforcement Learning Podcast podkast

Abhishek Naik on Continuing RL & Average Reward

10.02.2025

TalkRL: The Reinforcement Learning Podcast

0:00

1:21:40

Abhishek Naik was a student at University of Alberta and Alberta Machine Intelligence Institute, and he just finished his PhD in reinforcement learning, working with Rich Sutton. Now he is a postdoc fellow at the National Research Council of Canada, where he does AI research on Space applications.

Featured References

Reinforcement Learning for Continuing Problems Using Average Reward
Abhishek Naik Ph.D. dissertation 2024

Reward Centering
Abhishek Naik, Yi Wan, Manan Tomar, Richard S. Sutton 2024

Learning and Planning in Average-Reward Markov Decision Processes
Yi Wan, Abhishek Naik, Richard S. Sutton 2020

Discounted Reinforcement Learning Is Not an Optimization Problem
Abhishek Naik, Roshan Shariff, Niko Yasui, Hengshuai Yao, Richard S. Sutton 2019

Additional References

Explaining dopamine through prediction errors and beyond, Gershman et al 2024 (proposes Differential-TD-like learning mechanism in the brain around Box 4)

Więcej odcinków z kanału "TalkRL: The Reinforcement Learning Podcast"

Więcej odcinków

Odkrywaj najlepsze podcasty dzięki bezpłatnej aplikacji GetPodcast.

Subskrybuj ulubione podcasty, słuchaj odcinków offline i sprawdzaj najlepsze polecane podcasty.

Abhishek Naik on Continuing RL & Average Reward

TalkRL: The Reinforcement Learning Podcast

Więcej odcinków z kanału "TalkRL: The Reinforcement Learning Podcast"

Satinder Singh: The Origin Story of RLDM @ RLDM 2025

NeurIPS 2024 - Posters and Hallways 3

NeurIPS 2024 - Posters and Hallways 2

NeurIPS 2024 - Posters and Hallways 1

Abhishek Naik on Continuing RL & Average Reward

Neurips 2024 RL meetup Hot takes: What sucks about RL?

RLC 2024 - Posters and Hallways 5

RLC 2024 - Posters and Hallways 4

RLC 2024 - Posters and Hallways 3

RLC 2024 - Posters and Hallways 2