AI RESEARCH

Unmasking On-Policy Distillation: Where It Helps, Where It Hurts, and Why

arXiv CS.AI

arXiv:2605.10889v1 Announce Type: cross On-policy distillation offers dense, per-token supervision for