Multi-Modal Decouple and Recouple Network for Robust 3D Object Detection

arXiv cs.CV

arXiv:2603.07486v1. Multi-modal 3D object detection in bird's-eye view (BEV) has made substantial progress on benchmarks. Nonetheless, accuracy may drop significantly in the real world because of data corruption arising from, for example, LiDAR sensor configurations and camera scene conditions. One design bottleneck of previous models is the tight coupling of multi-modal BEV features during fusion, which may degrade overall system performance if one or both modalities are corrupted.
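
To make the coupling concern concrete, here is a minimal toy sketch in NumPy. It is not the paper's architecture: the function name `fuse_tightly_coupled`, the tensor shapes, the random weights, and the choice of zeroing the camera features as the "corruption" are all assumptions made for illustration. It shows one common form of tight coupling, channel concatenation followed by a shared 1x1 projection, in which a fault in one modality reaches every fused output channel.

```python
import numpy as np


def fuse_tightly_coupled(cam_bev, lidar_bev, weight):
    """Toy fusion (hypothetical): concatenate both BEV maps along the
    channel axis, then mix all channels with one shared 1x1 projection."""
    stacked = np.concatenate([cam_bev, lidar_bev], axis=0)  # (C_cam + C_lidar, H, W)
    # A 1x1 convolution is a per-cell linear map over channels.
    return np.einsum("oc,chw->ohw", weight, stacked)


rng = np.random.default_rng(0)
cam_bev = rng.normal(size=(4, 8, 8))    # assumed camera BEV features
lidar_bev = rng.normal(size=(4, 8, 8))  # assumed LiDAR BEV features
weight = rng.normal(size=(6, 8))        # assumed shared projection weights

clean = fuse_tightly_coupled(cam_bev, lidar_bev, weight)

# Simulate a camera failure by zeroing its features; LiDAR is untouched.
corrupted = fuse_tightly_coupled(np.zeros_like(cam_bev), lidar_bev, weight)

# Largest deviation per fused output channel caused by the camera fault.
channel_shift = np.abs(clean - corrupted).max(axis=(1, 2))
all_channels_affected = bool((channel_shift > 0).all())
print("every fused channel changed:", all_channels_affected)
```

Because the projection mixes both modalities into each output channel, the deviation equals the camera's full contribution to the fused map, so no fused channel is left carrying clean LiDAR-only information.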