AI RESEARCH

SCAR: Self-Supervised Continuous Action Representation Learning

arXiv CS.CV

ArXi:2605.16412v1 Announce Type: cross Despite the central role of action in embodied intelligence, learning transferable action representations from visual transitions remains a fundamental challenge, particularly when world models must generalize across embodiments under limited data. We argue that action is not merely an auxiliary conditioning signal, but a distinct representational factor that decouples the controllable change from embodiment-specific actuation.