Code Room
System design | Hard | sd-g152
Subject: Geospatial | Level: Senior–Staff | ~50 min | Common in: Distributed systems interviews | Industries: Technology

Question

Design a batch geospatial analytics pipeline that answers questions like 'how many of yesterday's 100M trips passed through each city district' and 'join 500M GPS pings to administrative boundary polygons'. At its core this is a large-scale point-in-polygon spatial join, on the order of billions of points against millions of polygons, run daily. Cover the partitioning, the join algorithm, and how you avoid skew.

What a strong answer looks like

Clarify scale and constraints first. Propose a clean component breakdown, then go deep on the hard parts — data model, bottlenecks, consistency, failure modes — and name the trade-offs you are making.
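
As one way to make the partitioning and join discussion concrete, here is a minimal single-process sketch of a grid-partitioned point-in-polygon join. It is an illustration under stated assumptions, not a reference answer. It assumes a uniform grid as the partition key, planar coordinates, simple polygons without holes, and a ray-casting containment test. All names (`CELL`, `cell_of`, `cells_covering`, `spatial_join`, `hot_cells`) are hypothetical. In a real pipeline the grid cell would act as the shuffle key in a distributed engine, and a hierarchical index could replace the uniform grid.

```python
from collections import defaultdict
from math import floor

CELL = 1.0  # hypothetical grid cell size, in the same units as the coordinates


def cell_of(x, y, size=CELL):
    """Map a point to the grid cell that serves as its partition key."""
    return (floor(x / size), floor(y / size))


def cells_covering(polygon, size=CELL):
    """Yield every grid cell touched by the polygon's bounding box."""
    xs = [p[0] for p in polygon]
    ys = [p[1] for p in polygon]
    x0, y0 = cell_of(min(xs), min(ys), size)
    x1, y1 = cell_of(max(xs), max(ys), size)
    for cx in range(x0, x1 + 1):
        for cy in range(y0, y1 + 1):
            yield (cx, cy)


def point_in_polygon(x, y, polygon):
    """Ray casting: count edge crossings of a ray going right from (x, y)."""
    inside = False
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        if (y1 > y) != (y2 > y):  # edge straddles the horizontal line through y
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside


def spatial_join(points, polygons, size=CELL):
    """Count points per polygon, testing only polygons that share the point's cell."""
    by_cell = defaultdict(list)
    for pid, poly in polygons.items():
        for c in cells_covering(poly, size):  # replicate polygon to each cell
            by_cell[c].append(pid)
    counts = defaultdict(int)
    for x, y in points:
        for pid in by_cell.get(cell_of(x, y, size), ()):
            if point_in_polygon(x, y, polygons[pid]):
                counts[pid] += 1
    return dict(counts)


def hot_cells(points, threshold, size=CELL):
    """Return cells whose point count exceeds threshold (skew candidates)."""
    load = defaultdict(int)
    for x, y in points:
        load[cell_of(x, y, size)] += 1
    return {c for c, n in load.items() if n > threshold}


polygons = {
    "A": [(0, 0), (2, 0), (2, 2), (0, 2)],
    "B": [(2, 0), (4, 0), (4, 2), (2, 2)],
}
points = [(0.5, 0.5), (1.5, 1.5), (3.0, 1.0), (5.0, 5.0)]
result = spatial_join(points, polygons)
```

In this toy input, two points fall in district `A`, one in `B`, and one outside both. The cells reported by `hot_cells` are the ones a skew-aware design might subdivide further or spread across several workers by salting the key. Which of those options fits best is a trade-off worth naming in the answer.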
