AI RESEARCH

A Comprehensive Survey of Agents for Computer Use: Foundations, Challenges, and Future Directions

arXiv CS.AI

arXiv:2501.16150v3 Announce Type: replace

Agents for computer use (ACUs) are an emerging class of systems capable of executing complex tasks on digital devices -- such as desktops, mobile phones, and web platforms -- given instructions in natural language. These agents automate tasks by controlling software via low-level actions such as mouse clicks and touchscreen gestures. However, despite rapid progress, ACUs are not yet mature enough for everyday use. In this survey, we investigate the state of the art, trends, and research gaps in the development of practical ACUs.