AI RESEARCH
SLARM: Streaming and Language-Aligned Reconstruction Model for Dynamic Scenes
arXiv CS.CV
•
ArXi:2603.22893v1 Announce Type: new We propose SLARM, a feed-forward model that unifies dynamic scene reconstruction, semantic understanding, and real-time streaming inference. SLARM captures complex, non-uniform motion through higher-order motion modeling, trained solely on differentiable renderings without any flow supervision. Besides, SLARM distills semantic features from LSeg to obtain language-aligned representations.