AI RESEARCH

FrameVGGT: Frame Evidence Rolling Memory for streaming VGGT

arXiv CS.CV

ArXi:2603.07690v1 Announce Type: new Streaming Visual Geometry Transformers such as StreamVGGT enable strong online 3D perception but suffer from unbounded KV-cache growth, which limits deployment over long streams. We revisit bounded-memory streaming from the perspective of geometric. In geometry-driven reasoning, memory quality depends not only on how many tokens are retained, but also on whether the retained memory still preserves sufficiently coherent local.