← Practical AI
Listen to this episodeAll Practical AI episodes →
Optimizing for efficiency with IBM’s Granite
Practical AI · 2025-03-14 · 44 min
Episode notes
We often judge AI models by leaderboard scores, but what if efficiency matters more? Kate Soule from IBM joins us to discuss how Granite AI is rethinking AI at the edge - breaking tasks into smaller, efficient components and co-designing models with hardware. She also shares why AI should prioritize efficiency frontiers over incremental benchmark gains and how seamless model routing can optimize performance. Featuring: Kate Soule - LinkedIn Chris Benson - Website , GitHub , LinkedIn , X Daniel Whitenack - Website , GitHub , X Links: IBM Granite IBM Granite on Hugging Face IBM Expands Granite Model Family with New Multi-Modal and Reasoning AI Built for the Enterprise
More from Practical AI
All episodes →- AIUC-1: Building trust in AI agents54 / 100
- Zero Trust for AI Agents41 / 100
- Breaking down the 2026 Stanford AI Index Report33 / 100
- Rebooting Enterprise AI with MCP and Kubernetes
- Hermes Agent: Agents that grow with you