AI RESEARCH

WPT: World-to-Policy Transfer via Online World Model Distillation

arXiv CS.CV

arXiv:2511.20095v2 Announce Type: replace

Recent years have witnessed remarkable progress in world models, which primarily aim to capture the spatio-temporal correlations between an agent's actions and the evolving environment. However, existing approaches often suffer from tight runtime coupling or depend on offline reward signals, resulting in substantial inference overhead or hindering end-to-end optimization. To overcome these limitations, we