AI RESEARCH

Universal Pose Pretraining for Generalizable Vision-Language-Action Policies

arXiv CS.LG

ArXi:2602.19710v2 Announce Type: replace-cross Existing Vision-Language-Action (VLA) models often suffer from feature collapse and low