AI RESEARCH
Learning Next Action Predictors from Human-Computer Interaction
arXiv CS.CL
•
ArXi:2603.05923v1 Announce Type: new Truly proactive AI systems must anticipate what we will do next. This foresight demands far richer information than the sparse signals we type into our prompts -- it demands reasoning over the entire context of what we see and do. We formalize this as next action prediction (NAP): given a sequence of a user's multimodal interactions with a computer (screenshots, clicks, sensor data), predict that user's next action. Progress on this task requires both new data and modeling approaches.