AI RESEARCH
Out of Sight, Out of Mind? Evaluating State Evolution in Video World Models
arXiv CS.CV
•
ArXi:2603.13215v1 Announce Type: new Evolutions in the world, such as water pouring or ice melting, happen regardless of being observed. Video world models generate "worlds" via 2D frame observations. Can these generated "worlds" evolve regardless of observation? To probe this question, we design a benchmark to evaluate whether video world models can decouple state evolution from observation. Our benchmark, STEVO-Bench, applies observation control to evolving processes via instructions of occluder insertion, turning off the light, or specifying camera "lookaway" trajectories.