AI RESEARCH

On-Policy Self-Distillation for Reasoning Compression

arXiv CS.LG

ArXi:2603.05433v2 Announce Type: replace Reasoning models think out loud, but much of what they say is noise. We