AI RESEARCH
CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents
arXiv CS.LG
arXiv:2603.24440v1 Announce Type: new Computer-use agents (CUAs) hold great promise for automating complex desktop workflows, yet progress toward general-purpose agents is bottlenecked by the scarcity of continuous, high-quality human demonstration videos. Recent work emphasizes that continuous video, not sparse screenshots, is the critical missing ingredient for scaling these agents. However, the largest existing open dataset, ScaleCUA, contains only 2M screenshots, equating to less than 20 hours of video. To address this bottleneck, we