How Capital One Delivers Multi-Agent Systems with Rashmi Shetty - #765
The TWIML AI Podcast · 2026-04-16 · 54 min
Episode notes
In this episode, Rashmi Shetty, senior director of enterprise generative AI platform at Capital One, joins us to explore how the company is designing, deploying, and scaling multi-agent systems in a highly regulated environment. Rashmi walks us through Chat Concierge, a multi-agent chat experience for auto dealerships that handles intent disambiguation, tool invocation, and human handoffs to deliver safer, more personalized customer journeys. We discuss Capital One’s platform-centric approach to AI agents and how it separates design from runtime governance, embedding policies, guardrails, and cyber controls across agent threat boundaries. Rashmi shares how the team approaches the developer experience for agent builders; observability and evals for stochastic, multi-agent workflows; and strategies for model specialization, including fine-tuning and distillation. We also cover standards and abstraction, closed-loop learning from production telemetry, and key lessons for enterprises building agentic systems. The complete show notes for this episode can be found at .
More from The TWIML AI Podcast
- Why AI Agents Break the GenAI Security Model with Devvret Rishi - #770
- Is RAG Dead? Lessons from Building AI for Tax Law with Alex Bowcut - #769
- Relational Foundation Models for Enterprise Data with Jure Leskovec - #768
- How to Find the Agent Failures Your Evals Miss with Scott Clark - #767
- How to Engineer AI Inference Systems with Philip Kiely - #766