AI RESEARCH

PanopticQuery: Unified Query-Time Reasoning for 4D Scenes

arXiv CS.CV

ArXi:2604.05638v1 Announce Type: new Understanding dynamic 4D environments through natural language queries requires not only accurate scene reconstruction but also robust semantic grounding across space, time, and viewpoints. While recent methods using neural representations have advanced 4D reconstruction, they remain limited in contextual reasoning, especially for complex semantics such as interactions, temporal actions, and spatial relations. A key challenge lies in transforming noisy, view-dependent predictions into globally consistent 4D interpretations. We.