AI RESEARCH

Memorize When Needed: Decoupled Memory Control for Spatially Consistent Long-Horizon Video Generation

arXiv cs.CV

arXiv:2604.18215v1 Announce Type: new Spatially consistent long-horizon video generation aims to maintain temporal and spatial consistency along predefined camera trajectories. Existing methods mostly entangle memory modeling with video generation, leading to inconsistent content during scene revisits and diminished generative capacity when exploring novel regions, even when trained on extensive annotated data. To address these limitations, we propose a decoupled framework that separates memory conditioning from generation. Our approach significantly reduces.