AI RESEARCH
TMRL: Diffusion Timestep-Modulated Pretraining Enables Exploration for Efficient Policy Finetuning
arXiv CS.AI
•
ArXi:2605.12236v1 Announce Type: cross Fine-tuning pre-trained robot policies with reinforcement learning (RL) often inherits the bottlenecks