AI RESEARCH

ViPRA: Video Prediction for Robot Actions

arXiv CS.AI

ArXi:2511.07732v2 Announce Type: replace-cross Can we turn a video prediction model into a robot policy? Videos, including those of humans or teleoperated robots, capture rich physical interactions. However, most of them lack labeled actions, which limits their use in robot learning. We present Video Prediction for Robot Actions (ViPRA), a simple pre