The 15x Token Tax on Multi-Agent Coordination

Dev.to AI
Generative AI

Multi-agent systems that coordinate over HTTP serialize context into JSON, send it as request bodies, parse it on the other end, and feed it back into an LLM. Every message carries HTTP headers, content-type negotiations, and authentication tokens. Every coordination step re-transmits the accumulated context of the entire interaction because HTTP is stateless. Measurements show this coordination overhead costs up to 15x tokens than the actual task content. A three-agent pipeline processing a document: Agent A sends the document + instructions to Agent B (5K tokens.