AI RESEARCH
From Prior to Pro: Efficient Skill Mastery via Distribution Contractive RL Finetuning
arXiv CS.LG
•
ArXi:2603.10263v1 Announce Type: cross