AI RESEARCH
Align then Adapt: Rethinking Parameter-Efficient Transfer Learning in 4D Perception
arXiv CS.CV
•
ArXi:2602.23069v2 Announce Type: replace Point cloud video understanding is critical for robotics as it accurately encodes motion and scene interaction. We recognize that 4D datasets are far scarcer than 3D ones, which hampers the scalability of self-supervised 4D models. A promising alternative is to transfer 3D pre-trained models to 4D perception tasks. However, rigorous empirical analysis reveals two critical limitations that impede transfer capability: overfitting and the modality gap.