AI RESEARCH
UI-Copilot: Advancing Long-Horizon GUI Automation via Tool-Integrated Policy Optimization
arXiv CS.LG
arXiv:2604.13822v1 Announce Type: new MLLM-based GUI agents have demonstrated strong capabilities in complex user interface interaction tasks. However, long-horizon scenarios remain challenging, as these agents are burdened with tasks beyond their intrinsic capabilities, suffering from memory degradation, progress confusion, and math hallucination. To address these challenges, we present UI-Copilot, a collaborative framework where the GUI agent focuses on task execution while a lightweight copilot provides on-demand assistance for memory retrieval and numerical computation. We