AI RESEARCH

STORM: Segment, Track, and Object Re-Localization from a Single Image

arXiv CS.CV

ArXi:2511.09771v3 Announce Type: replace Accurate 6D pose estimation and tracking are core capabilities for physical AI systems, yet real-world deployment remains brittle and labor-intensive. Many pipelines rely on CAD models, manual masking, or per-object adaptation, and still fail under occlusion or fast motion without a principled way to recognize failure. We propose STORM, a unified framework for reference-conditioned 6D tracking that can operate from a single reference image, with minimal manual input and improved robustness.