System designMediumsd-g052

Subject Request coalescingLevel Mid–Senior~35 minCommon in Distributed systems interviewsIndustries Technology, Software development

Question

Design request coalescing for an internal image-thumbnail service. On a cache miss, generating a thumbnail takes ~800ms and is CPU-heavy. During traffic spikes, hundreds of concurrent requests for the same uncached image all trigger generation simultaneously, melting the workers. Design a coalescing layer so only one generation runs per (image, size) while the rest wait and share the result, across a fleet of N stateless app servers.

What a strong answer looks like

Clarify scale and constraints first. Propose a clean component breakdown, then go deep on the hard parts — data model, bottlenecks, consistency, failure modes — and name the trade-offs you are making.

Learn the concepts

Narrate your design

Loading whiteboard…

Run or narrate your approach, then ask the coach.