Code Room
On-callMedium
Question
A background worker consumes jobs from a queue (emails, image resizing) and normally keeps the queue near empty. The dashboard now shows the queue depth climbing steadily and the oldest message age growing — users report their actions are taking a long time to 'take effect.' Producers are enqueuing at the normal rate. The web tier is healthy. How do you triage why the backlog is growing?
What a strong answer looks like
Stop the bleeding first (mitigate), then form hypotheses from real signals. Separate root cause from symptom, communicate status as you go, and close with what prevents a repeat.
Learn the concepts
Loading whiteboard…
Run or narrate your approach, then ask the coach.