How Shopify Handles Black Friday Without Breaking
The CTO Podcast with Fexingo · 2026-05-29 · 8 min
Episode notes
Lucas and Luna dig into the engineering behind Shopify's Black Friday infrastructure - specifically, how the platform absorbs 10,000 requests per second per store during peak traffic without cascading failures. They break down the shift from monolithic scaling to a 'cell-based' architecture where each merchant's data lives in an isolated shard, preventing one viral store from taking down the whole platform. Lucas explains the surprising bottleneck: not database queries, but TLS handshake overhead at the load balancer layer. Luna challenges whether this level of isolation creates operational complexity that offsets the reliability gains. They also touch on how Shopify's engineering team stress-tests with 'failure injection Fridays' and why the company chose to open-source parts of its sharding toolkit. The episode ends with a candid look at whether cell-based architecture is overkill for smaller platforms, and a quick nod to how listener support keeps the podcast ad-free.
More from The CTO Podcast with Fexingo
All episodes →- How Airbnb Rebuilt Search for 8 Million Listings62 / 100
- How GitLab Built a Single Codebase for One Million CI Pipelines65 / 100
- How Slack Rebuilt Its Search Index for 10 Million Daily Queries57 / 100
- How Notion Rebuilt Its Sync Engine for Offline-First
- How Notion Rebuilt Its Block Engine for Hybrid Local-Sync