AI RESEARCH
PSPA-Bench: A Personalized Benchmark for Smartphone GUI Agent
arXiv CS.AI
•
ArXi:2603.29318v1 Announce Type: new SmartGUI agents execute tasks by operating directly on app interfaces, offering a path to broad capability without deep system integration. However, real-world smartuse is highly personalized: users adopt diverse workflows and preferences, challenging agents to deliver customized assistance rather than generic solutions. Existing GUI agent benchmarks cannot adequately capture this personalization dimension due to sparse user-specific data and the lack of fine-grained evaluation metrics.