AI RESEARCH

MVP-LAM: Learning Action-Centric Latent Action via Cross-Viewpoint Reconstruction

arXiv CS.CV • May 05, 2026

ArXi:2602.03668v2 Announce Type: replace-cross Latent actions learned from diverse human videos serve as pseudo-labels for vision-language-action (VLA) pre

Read Full Article