AI RESEARCH

TMRL: Diffusion Timestep-Modulated Pretraining Enables Exploration for Efficient Policy Finetuning

arXiv CS.AI

ArXi:2605.12236v1 Announce Type: cross Fine-tuning pre-trained robot policies with reinforcement learning (RL) often inherits the bottlenecks