AI RESEARCH

GUIDE: A Benchmark for Understanding and Assisting Users in Open-Ended GUI Tasks

arXiv CS.AI

ArXi:2603.25864v1 Announce Type: cross Graphical User Interface (GUI) agents have the potential to assist users in interacting with complex software (e.g., PowerPoint, Photoshop). While prior research has primarily focused on automating user actions through clicks and keystrokes, this paradigm overlooks human intention, where users value the ability to explore, iterate, and refine their ideas while maintaining agency. To move beyond automation and toward collaboration, GUI agents must understand what users are doing and why. We.