AI RESEARCH
Decoupling KL and Trajectories: A Unified Perspective for SFT, DAgger, Offline RL, and OPD in LLM Distillation
arXiv CS.LG
arXiv:2605.16826v1 Announce Type: new Knowledge distillation is central to LLM post-