What I'd Tell a Manager About Running AI Agents on a Real Codebase
Dev.to AI
•
Generative AI
AI Research
The Problem No One Writes About for Managers Most writing about AI agents is aimed at engineers. "Here's how to prompt it. Here's the framework. Here's the benchmark." If you're a manager or director, that's not the question keeping you up at night. The question is: how do you know the agents are actually doing what they say? I've been running three AI agents from three different companies - Claude, Codex, and Gemini - on a production-grade infrastructure project for several months. Not s. Real code, real deployments, a live Kubernetes cluster with Vault, Istio, Jenkins, and ArgoCD.