Programming Throwdown podcast

180: Reinforcement Learning

17.3.2025

Programming Throwdown

0:00

1:52:22

Patrick and Jason introduce reinforcement learning and place it alongside supervised and unsupervised learning. They cover Q-learning, SARSA, policy gradients, actor-critic methods, PPO, imitation learning, and why training and evaluating RL systems is so challenging.

Flere episoder fra "Programming Throwdown"

187: Agentic Coding
2.5.2026
1:38:00
In this episode, Patrick and Jason cover Agentic Coding!
186: Becoming a Manager
3.2.2026
1:27:30
Patrick and Jason discuss what it means to become a manager and how the role differs from individual engineering work. They cover hiring, coaching, performance management, team goals, and when moving into management is or is not the right choice.
Gå ikke glip af nogen episoder af “Programming Throwdown” - abonnér på podcasten med gratisapp GetPodcast.
185: Workflow Orchestrators
4.11.2025
1:32:02
Patrick and Jason break down workflow orchestrators and why they matter for batch jobs, long-running tasks, and resumable distributed systems. They compare tools such as Airflow, Dagster, Temporal, Ray, and Kubeflow while explaining the infrastructure patterns behind them.
184: Asynchronous Programming
23.9.2025
1:30:32
Patrick and Jason explain asynchronous programming and how it differs from traditional multithreading and multiprocessing. They cover coroutines, blocking versus non-blocking operations, promises, callbacks, async/await, and the tradeoffs behind each approach.
183: Landing a Software Job in 2025
31.7.2025
1:46:53
Patrick and Jason are joined by Mark Cunningham to discuss how software engineers can find strong job opportunities and perform well throughout the interview process. They cover sourcing strategies, reverse interviews, negotiation, hiring-manager expectations, and common mistakes candidates should avoid.
182: AI Assisted Coding
30.6.2025
1:37:36
Patrick and Jason discuss how AI-assisted coding tools can speed up development, answer questions about a codebase, and reduce boilerplate work. They compare common workflows and tools such as Copilot, Cursor, and command-line assistants while talking through where these systems help most.
181: Memory Management
12.5.2025
1:46:21
Patrick and Jason cover memory management from both the operating-system and language-runtime perspectives. They discuss heap management, virtual memory, garbage collection, ownership models, and practical techniques for diagnosing and reducing excessive memory use.
180: Reinforcement Learning
17.3.2025
1:52:22
Patrick and Jason introduce reinforcement learning and place it alongside supervised and unsupervised learning. They cover Q-learning, SARSA, policy gradients, actor-critic methods, PPO, imitation learning, and why training and evaluating RL systems is so challenging.
179: Project Planning
3.2.2025
1:43:00
Patrick and Jason discuss project planning and management for software teams. They cover why planning matters, how frameworks like SMART goals, Gantt charts, Scrum, Agile, and Kanban fit together, and how to deal with uncertainty, dependencies, and scheduling risk.
178: Working from Home
3.12.2024
1:45:15
Patrick and Jason revisit working from home and the realities of remote engineering work. They cover communication, scheduling, home-office setup, motivation, distractions, and why remote work is not equally effective for every team or every person.

Få adgang til hele det store podcastunivers med gratisappen GetPodcast.

Abonnér på dine favoritpodcasts, lyt til episoder offline, og få spændende anbefalinger.

© radio.de GmbH 2026

En virksomhed fra

MADSACK