AI RESEARCH

From Prior to Pro: Efficient Skill Mastery via Distribution Contractive RL Finetuning

arXiv cs.LG

arXiv:2603.10263v1 Announce Type: cross