AI RESEARCH

Decoupling KL and Trajectories: A Unified Perspective for SFT, DAgger, Offline RL, and OPD in LLM Distillation

arXiv CS.LG

ArXi:2605.16826v1 Announce Type: new Knowledge distillation is central to LLM post-