AI RESEARCH
Unmasking On-Policy Distillation: Where It Helps, Where It Hurts, and Why
arXiv CS.AI
•
ArXi:2605.10889v1 Announce Type: cross On-policy distillation offers dense, per-token supervision for