Programming Throwdown podcast

180: Reinforcement Learning

17/03/2025

Programming Throwdown

0:00

1:52:22

Patrick and Jason introduce reinforcement learning and place it alongside supervised and unsupervised learning. They cover Q-learning, SARSA, policy gradients, actor-critic methods, PPO, imitation learning, and why training and evaluating RL systems is so challenging.

Mais episódios de "Programming Throwdown"

188: World Models
09/07/2026
1:36:01
187: Agentic Coding
02/05/2026
1:38:00
In this episode, Patrick and Jason cover Agentic Coding!
Não percas um episódio de “Programming Throwdown” e subscrevê-lo na aplicação GetPodcast.
186: Becoming a Manager
03/02/2026
1:27:30
Patrick and Jason discuss what it means to become a manager and how the role differs from individual engineering work. They cover hiring, coaching, performance management, team goals, and when moving into management is or is not the right choice.
185: Workflow Orchestrators
04/11/2025
1:32:02
Patrick and Jason break down workflow orchestrators and why they matter for batch jobs, long-running tasks, and resumable distributed systems. They compare tools such as Airflow, Dagster, Temporal, Ray, and Kubeflow while explaining the infrastructure patterns behind them.
184: Asynchronous Programming
23/09/2025
1:30:32
Patrick and Jason explain asynchronous programming and how it differs from traditional multithreading and multiprocessing. They cover coroutines, blocking versus non-blocking operations, promises, callbacks, async/await, and the tradeoffs behind each approach.
183: Landing a Software Job in 2025
31/07/2025
1:46:53
Patrick and Jason are joined by Mark Cunningham to discuss how software engineers can find strong job opportunities and perform well throughout the interview process. They cover sourcing strategies, reverse interviews, negotiation, hiring-manager expectations, and common mistakes candidates should avoid.
182: AI Assisted Coding
30/06/2025
1:37:36
Patrick and Jason discuss how AI-assisted coding tools can speed up development, answer questions about a codebase, and reduce boilerplate work. They compare common workflows and tools such as Copilot, Cursor, and command-line assistants while talking through where these systems help most.
181: Memory Management
12/05/2025
1:46:21
Patrick and Jason cover memory management from both the operating-system and language-runtime perspectives. They discuss heap management, virtual memory, garbage collection, ownership models, and practical techniques for diagnosing and reducing excessive memory use.
180: Reinforcement Learning
17/03/2025
1:52:22
Patrick and Jason introduce reinforcement learning and place it alongside supervised and unsupervised learning. They cover Q-learning, SARSA, policy gradients, actor-critic methods, PPO, imitation learning, and why training and evaluating RL systems is so challenging.
179: Project Planning
03/02/2025
1:43:00
Patrick and Jason discuss project planning and management for software teams. They cover why planning matters, how frameworks like SMART goals, Gantt charts, Scrum, Agile, and Kanban fit together, and how to deal with uncertainty, dependencies, and scheduling risk.

Mais episódios

Descobre o mundo dos podcasts com a app gratuita GetPodcast.

Subscreve os teus podcasts preferidos, ouve episódios offline e obtém recomendações fantásticas.

© radio.de GmbH 2026

MADSACK