Code Room
System designHardsd-g485
Subject Blob storeLevel Senior–Staff~45 minCommon in Storage & CDN interviewsIndustries Technology, Software development

Question

Design a content-addressed (CAS) deduplicating blob store for a backup/file-sync product holding 5 exabytes where the same byte ranges recur heavily across users (OS images, shared docs, common media). Blobs are split into variable-length content-defined chunks; identical chunks must be stored exactly once and shared across millions of users, while deletes must be safe (deleting your file must not corrupt mine). Target: write dedup at ingest, sub-100ms chunk-existence checks, and reclaiming space when the last reference to a chunk goes away.

What a strong answer looks like

Clarify scale and constraints first. Propose a clean component breakdown, then go deep on the hard parts — data model, bottlenecks, consistency, failure modes — and name the trade-offs you are making.

Narrate your design
Loading whiteboard…
Run or narrate your approach, then ask the coach.