Question
A test suite has one test that fails in roughly 1 of every 20 CI runs. You paste the failing test and its assertion error into an AI agent, which concludes: "This is a timing flake — the assertion runs before the async write completes. Add a 200ms sleep before the assert and the flake goes away." The fix would likely turn the run green. Why is accepting that diagnosis risky, and how do you verify the real cause?
Treat the AI’s output as a draft to verify, not an answer to trust. Name the specific flaw and the input that triggers it, say how you’d catch it (tests, edge cases, critical reading), and explain how you’d re-prompt or break the problem down to get it right.
Vibe coding: describe the solution in plain language (or narrate it) and the coach grades your approach. Generating runnable code from your description is coming next.
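One piece of arithmetic may help when weighing the scenario above: a test that fails about 1 run in 20 passes most of the time anyway, so a green run after the sleep is added says very little. The sketch below is a rough illustration, not a prescribed method. It assumes the failure rate is exactly 1/20 and that runs are independent, neither of which the scenario guarantees, and the function names are hypothetical.

```python
import math


def prob_all_green(failure_rate: float, runs: int) -> float:
    """Chance that `runs` independent runs all pass while the flake is still present."""
    return (1.0 - failure_rate) ** runs


def runs_needed(failure_rate: float, confidence: float) -> int:
    """Smallest number of runs after which an unfixed flake would have
    failed at least once with probability >= `confidence`."""
    return math.ceil(math.log(1.0 - confidence) / math.log(1.0 - failure_rate))


if __name__ == "__main__":
    rate = 1 / 20  # assumption: "roughly 1 in 20" treated as exactly 5%

    # A single green run is the expected outcome even with no fix at all.
    print(f"P(1 green run, flake unfixed)   = {prob_all_green(rate, 1):.2f}")

    # Twenty green runs in a row still happen by chance about a third of the time.
    print(f"P(20 green runs, flake unfixed) = {prob_all_green(rate, 20):.2f}")

    # Consecutive green runs required before "it stopped failing" is 95% credible.
    print(f"Runs needed for 95% confidence  = {runs_needed(rate, 0.95)}")
```

Under these assumptions, a long streak of green runs is needed before the pass rate counts as evidence. Even then it shows only that the symptom stopped, not that the diagnosed cause was the real one.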