Question
You wired an autonomous AI agent into CI: on every failing test it reads the repo, forms a hypothesis, edits code, re-runs the full suite, and loops until green or 10 iterations. It works, but the platform team flags that CI cost tripled and median PR feedback time went from 4 to 19 minutes, because the agent burns tokens and re-runs the whole 12-minute suite each loop. How do you decide whether to keep this, and how would you reshape it?
Treat the AI’s output as a draft to verify, not an answer to trust. Name the specific flaw and the input that triggers it, say how you’d catch it — tests, edge cases, reading critically — and how you’d re-prompt or decompose to get it right.
Vibe coding: describe the solution in plain language (or narrate it) and the coach grades your approach. Generating runnable code from your description is coming next.