Code Room
System designHard
Question
Design a webhook delivery network that reliably delivers event notifications to millions of customer-owned HTTP endpoints. Peak is 500k events/sec, customer endpoints are flaky (timeouts, 5xx, slow, sometimes down for hours), and you must deliver at-least-once with exponential-backoff retries for up to 24h, preserve per-resource ordering where the customer requests it, sign payloads, and never let one slow customer's backlog starve everyone else. Walk through the delivery pipeline, the retry/ordering model, and the central trade-off.
What a strong answer looks like
Clarify scale and constraints first. Propose a clean component breakdown, then go deep on the hard parts — data model, bottlenecks, consistency, failure modes — and name the trade-offs you are making.
Learn the concepts
Loading whiteboard…
Run or narrate your approach, then ask the coach.