Question
An AI agent generated a 200-line test suite for a date-range overlap library and reports 97% line coverage, all green. You don't have time to read all 200 lines, and you've learned that AI tests can be high-coverage but vacuous. Without trusting the coverage number, how do you efficiently determine whether this suite would actually catch bugs — and what's your decision rule for accepting it?
Treat the AI’s output as a draft to verify, not an answer to trust. Name the specific flaw and the input that triggers it, say how you’d catch it — tests, edge cases, reading critically — and how you’d re-prompt or decompose to get it right.
Vibe coding: describe the solution in plain language (or narrate it) and the coach grades your approach. Generating runnable code from your description is coming next.