AI RESEARCH

SiMO: Single-Modality-Operable Multimodal Collaborative Perception

arXiv cs.CV

arXiv:2603.08240v1. Collaborative perception integrates multi-agent perspectives to extend sensing range and overcome occlusion. While existing multimodal approaches leverage complementary sensors to improve performance, they are highly prone to failure, especially when a key sensor such as LiDAR is unavailable. The root cause is that feature fusion leads to semantic mismatches between single-modality features and the downstream modules. This paper addresses this challenge for the first time in the field of collaborative perception.