AI RESEARCH

Towards Balanced Multi-Modal Learning in 3D Human Pose Estimation

arXiv cs.AI

arXiv:2501.05264v5 Announce Type: replace-cross

3D human pose estimation (3D HPE) has emerged as a prominent research topic, particularly for RGB-based methods. However, RGB images are often limited by occlusion and privacy constraints. Consequently, multi-modal sensing, which leverages non-intrusive sensors, is attracting growing attention. Nevertheless, multi-modal 3D HPE still faces challenges, including modality imbalance. In this work, we