AI RESEARCH

Towards Spatio-Temporal World Scene Graph Generation from Monocular Videos

arXiv CS.CV

arXiv:2603.13185v1

Spatio-temporal scene graphs provide a principled representation for modeling evolving object interactions, yet existing methods remain fundamentally frame-centric: they reason only about currently visible objects, discard entities upon occlusion, and operate in 2D. To address this, we first