AI RESEARCH
RayD3D: Distilling Depth Knowledge Along the Ray for Robust Multi-View 3D Object Detection
arXiv CS.CV
•
ArXi:2603.07493v1 Announce Type: new Multi-view 3D detection with bird's eye view (BEV) is crucial for autonomous driving and robotics, but its robustness in real-world is limited as it struggles to predict accurate depth values. A mainstream solution, cross-modal distillation, transfers depth information from LiDAR to camera models but also unintentionally transfers depth-irrelevant information (e.g. LiDAR density). To mitigate this issue, we propose RayD3D, which transfers crucial depth knowledge along the ray: a line projecting from the camera to true location of an object.