
James Everingham is the CEO and Co-founder of Guild.ai — the AI agent control plane for production teams. With roots at Netscape, Instagram (Head of Engineering), and Meta (Head of Dev Infra, leading a 1,000-person org), James brings rare, hard-won expertise to the challenge of operating AI agents at scale.
From Single-Player to Multi-Player: Operating AI Agents at Scale // MLOps Podcast #383 with James Everingham, CEO and Co-founder of Guild.ai
In this episode, James unpacks what actually breaks when you move from a single AI agent to a fleet of them — and what engineering leaders need to build before it's too late.
🎯 Single-Agent vs. Multi-Agent Systems — Why "single-player" AI workflows don't survive contact with production reality, and what the shift to multi-agent coordination actually demands from your infrastructure.
🔍 The Agent Control Plane — What it is, why every engineering org needs one in 2026, and how Guild.ai is building the neutral layer to deploy, govern, and share agents across any framework or model.
⚠️ Non-Determinism at Scale — Why AI agents behave like employees, not software, and why you need workforce-style governance — not just observability tooling — to manage them.
💸 Token Spend & Cost Visibility — How teams running agents in production are flying blind on cost, and what Guild shows you that your current stack doesn't.
🏗️ Lessons from Meta's DevMate — How Meta's AI coding agent went from experiment to submitting 50% of all diffs, and what that journey teaches every engineering leader about scaling agents safely.
🚦 Agent Identity & Governance — Why every agent needs an identity, what happens when they don't have one, and how agent sprawl becomes a governance crisis fast.
🔄 Sharing Agents as Infrastructure — Why Guild treats agents as shared production infrastructure rather than one-off scripts, and how that changes the economics of AI investment.
🛠️ Framework Agnosticism — Why betting on a single agent framework is a losing strategy, and how to build for a multi-model, multi-framework world from day one.
Essential viewing for engineering leaders, AI platform teams, and founders building production-grade agentic systems.
🔗 Guild.ai: https://guild.ai
🔗 James on X/Twitter: https://x.com/jevering
🔗 James on LinkedIn: https://www.linkedin.com/in/jameseveringham
🔗 Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/
⏱️ Timestamps
[00:00] Context Transfer Challenges
[00:51] Control Plane for Agents
[02:17] Effective Agent Policies
[09:23] Agent Governance Policies
[15:34] Developer Tool Adoption
[22:02] Knowledge Sharing and Open Source
[24:59] Simulated Deployments and Confidence
[29:36] Agent Workloads vs Human Workloads
[39:55] AI as a Customer
[47:59] Agent Hub vs Autonomy
[53:21] Wrap up
#AgenticAI #AIAgents #AIEngineering
Więcej odcinków z kanału "MLOps.community"



Nie przegap odcinka z kanału “MLOps.community”! Subskrybuj bezpłatnie w aplikacji GetPodcast.








