SAT: Sequential Agent Tuning for Coordinator Free Plug and Play Multi-LLM Training with Monotonic Improvement Guarantees

arXiv:2605.05216v1 Announce Type: new

Large language models (LLMs) with many parameters achieve strong performance but are often prohibitively expensive to deploy. Recent work explores teams of smaller, more efficient LLMs that collectively match or even outperform a single large model. However, jointly updating multiple agents