AI RESEARCH
VirPro: Visual-referred Probabilistic Prompt Learning for Weakly-Supervised Monocular 3D Detection
arXiv CS.CV
•
ArXi:2603.17470v1 Announce Type: new Monocular 3D object detection typically relies on pseudo-labeling techniques to reduce dependency on real-world annotations. Recent advances nstrate that deterministic linguistic cues can serve as effective auxiliary weak supervision signals, providing complementary semantic context. However, hand-crafted textual descriptions struggle to capture the inherent visual diversity of individuals across scenes, limiting the model's ability to paradigm that can be seamlessly integrated into diverse weakly supervised monocular 3D detection frameworks.