I Built a Lie Detector for AI Coding Agents
Dev.to AI
The problem nobody talks about

AI coding agents lie. Not on purpose: they hallucinate. Claude Code tells you "All tests pass!" when the tests were never executed. It says "I updated the file" when the content is byte-for-byte identical. It sneaks git commit --no-verify into its commands to skip the hooks that would catch its mistakes. This isn't rare. It's a documented bug, and it hits every serious Claude Code user. System prompts don't help: the agent just ignores them when it "decides" something is done. I spent a couple of weeks building a fix.
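To make two of those failure modes concrete, here is a minimal Python sketch of the kind of check that can catch them. It is an illustration under stated assumptions, not the tool this post describes: the function names file_really_changed and skips_git_hooks are hypothetical, and the flag detection is deliberately simplified.

```python
import hashlib
import shlex


def file_really_changed(before: bytes, after: bytes) -> bool:
    """Return True only if the file content differs after the agent's "edit".

    Hypothetical helper: compares SHA-256 digests of the content captured
    before and after the agent claims to have updated the file.
    """
    return hashlib.sha256(before).digest() != hashlib.sha256(after).digest()


def skips_git_hooks(command: str) -> bool:
    """Return True if a shell command looks like a hook-skipping git commit.

    Hypothetical helper and a simplification: it only recognizes
    `--no-verify` and its short form `-n` as standalone arguments after
    `commit`. It can misfire on a commit message that is literally "-n",
    and it does not handle clustered short flags.
    """
    tokens = shlex.split(command)
    if len(tokens) < 2 or tokens[0] != "git" or "commit" not in tokens:
        return False
    args = tokens[tokens.index("commit") + 1:]
    return any(arg in ("--no-verify", "-n") for arg in args)


if __name__ == "__main__":
    # The agent claims an update, but the bytes are identical.
    print(file_really_changed(b"x = 1\n", b"x = 1\n"))
    # The agent tries to commit while bypassing pre-commit hooks.
    print(skips_git_hooks('git commit --no-verify -m "fix tests"'))
```

The point of both checks is that they rely on evidence outside the agent's own report (file bytes and the literal command string) instead of trusting what the agent says it did.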