FingerTip 20K: A Benchmark for Proactive and Personalized Mobile LLM Agents

arXiv:2507.21071v2 Announce Type: replace-cross Mobile GUI agents are becoming critical tools for improving user experience on smart devices, with multimodal large language models (MLLMs) emerging as the dominant paradigm in this domain. Current agents, however, rely on explicit human instructions and overlook the potential of contextual information (such as location, time, and user profile) and historical data for proactive task suggestions.