AI RESEARCH
The Illusion of Certainty: Decoupling Capability and Calibration in On-Policy Distillation
arXiv cs.LG
arXiv:2604.16830v1 Announce Type: new On-policy distillation (OPD) is an increasingly important paradigm for post-