Programming Throwdown podcast

180: Reinforcement Learning

3/17/2025

Programming Throwdown

0:00

1:52:22

Patrick and Jason introduce reinforcement learning and place it alongside supervised and unsupervised learning. They cover Q-learning, SARSA, policy gradients, actor-critic methods, PPO, imitation learning, and why training and evaluating RL systems is so challenging.

More episodes from "Programming Throwdown"

187: Agentic Coding
5/2/2026
1:38:00
In this episode, Patrick and Jason cover Agentic Coding!
186: Becoming a Manager
2/3/2026
1:27:30
Patrick and Jason discuss what it means to become a manager and how the role differs from individual engineering work. They cover hiring, coaching, performance management, team goals, and when moving into management is or is not the right choice.
Don't miss an episode of “Programming Throwdown” and subscribe to it in the GetPodcast app.
185: Workflow Orchestrators
11/4/2025
1:32:02
Patrick and Jason break down workflow orchestrators and why they matter for batch jobs, long-running tasks, and resumable distributed systems. They compare tools such as Airflow, Dagster, Temporal, Ray, and Kubeflow while explaining the infrastructure patterns behind them.
184: Asynchronous Programming
9/23/2025
1:30:32
Patrick and Jason explain asynchronous programming and how it differs from traditional multithreading and multiprocessing. They cover coroutines, blocking versus non-blocking operations, promises, callbacks, async/await, and the tradeoffs behind each approach.
183: Landing a Software Job in 2025
7/31/2025
1:46:53
Patrick and Jason are joined by Mark Cunningham to discuss how software engineers can find strong job opportunities and perform well throughout the interview process. They cover sourcing strategies, reverse interviews, negotiation, hiring-manager expectations, and common mistakes candidates should avoid.
182: AI Assisted Coding
6/30/2025
1:37:36
Patrick and Jason discuss how AI-assisted coding tools can speed up development, answer questions about a codebase, and reduce boilerplate work. They compare common workflows and tools such as Copilot, Cursor, and command-line assistants while talking through where these systems help most.
181: Memory Management
5/12/2025
1:46:21
Patrick and Jason cover memory management from both the operating-system and language-runtime perspectives. They discuss heap management, virtual memory, garbage collection, ownership models, and practical techniques for diagnosing and reducing excessive memory use.
180: Reinforcement Learning
3/17/2025
1:52:22
Patrick and Jason introduce reinforcement learning and place it alongside supervised and unsupervised learning. They cover Q-learning, SARSA, policy gradients, actor-critic methods, PPO, imitation learning, and why training and evaluating RL systems is so challenging.
179: Project Planning
2/3/2025
1:43:00
Patrick and Jason discuss project planning and management for software teams. They cover why planning matters, how frameworks like SMART goals, Gantt charts, Scrum, Agile, and Kanban fit together, and how to deal with uncertainty, dependencies, and scheduling risk.
178: Working from Home
12/3/2024
1:45:15
Patrick and Jason revisit working from home and the realities of remote engineering work. They cover communication, scheduling, home-office setup, motivation, distractions, and why remote work is not equally effective for every team or every person.

Get the whole world of podcasts with the free GetPodcast app.

Subscribe to your favorite podcasts, listen to episodes offline and get thrilling recommendations.

© radio.de GmbH 2026

MADSACK