From Single-Player to Multi-Player: Operating AI Agents at Scale

9.06.2026

MLOps.community

0:00

55:54

James Everingham is the CEO and Co-founder of Guild.ai — the AI agent control plane for production teams. With roots at Netscape, Instagram (Head of Engineering), and Meta (Head of Dev Infra, leading a 1,000-person org), James brings rare, hard-won expertise to the challenge of operating AI agents at scale.

From Single-Player to Multi-Player: Operating AI Agents at Scale // MLOps Podcast #383 with James Everingham, CEO and Co-founder of Guild.ai

In this episode, James unpacks what actually breaks when you move from a single AI agent to a fleet of them — and what engineering leaders need to build before it's too late.

🎯 Single-Agent vs. Multi-Agent Systems — Why "single-player" AI workflows don't survive contact with production reality, and what the shift to multi-agent coordination actually demands from your infrastructure.

🔍 The Agent Control Plane — What it is, why every engineering org needs one in 2026, and how Guild.ai is building the neutral layer to deploy, govern, and share agents across any framework or model.

⚠️ Non-Determinism at Scale — Why AI agents behave like employees, not software, and why you need workforce-style governance — not just observability tooling — to manage them.

💸 Token Spend & Cost Visibility — How teams running agents in production are flying blind on cost, and what Guild shows you that your current stack doesn't.

🏗️ Lessons from Meta's DevMate — How Meta's AI coding agent went from experiment to submitting 50% of all diffs, and what that journey teaches every engineering leader about scaling agents safely.

🚦 Agent Identity & Governance — Why every agent needs an identity, what happens when they don't have one, and how agent sprawl becomes a governance crisis fast.

🔄 Sharing Agents as Infrastructure — Why Guild treats agents as shared production infrastructure rather than one-off scripts, and how that changes the economics of AI investment.

🛠️ Framework Agnosticism — Why betting on a single agent framework is a losing strategy, and how to build for a multi-model, multi-framework world from day one.

Essential viewing for engineering leaders, AI platform teams, and founders building production-grade agentic systems.

🔗 Guild.ai: https://guild.ai

🔗 James on X/Twitter: https://x.com/jevering

🔗 James on LinkedIn: https://www.linkedin.com/in/jameseveringham

🔗 Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/

⏱️ Timestamps

[00:00] Context Transfer Challenges

[00:51] Control Plane for Agents

[02:17] Effective Agent Policies

[09:23] Agent Governance Policies

[15:34] Developer Tool Adoption

[22:02] Knowledge Sharing and Open Source