AI RESEARCH
Universal Pose Pretraining for Generalizable Vision-Language-Action Policies
arXiv CS.LG
arXiv:2602.19710v2 Announce Type: replace-cross Existing Vision-Language-Action (VLA) models often suffer from feature collapse and low