AI RESEARCH
GemDepth: Geometry-Embedded Features for 3D-Consistent Video Depth
arXiv CS.CV
arXiv:2605.10525v1 Announce Type: new Video depth estimation extends monocular prediction into the temporal domain to ensure coherence. However, existing methods often suffer from spatial blurring in fine-detail regions and temporal inconsistencies. We argue that current approaches, which primarily rely on temporal smoothing via Transformers, struggle to maintain strict 3D geometric consistency, particularly under rotations or drastic view changes.