AI RESEARCH
How Fast Should a Model Commit to Supervision? Training Reasoning Models on the Tsallis Loss Continuum
arXiv CS.LG
•
ArXi:2604.25907v1 Announce Type: new Adapting reasoning models to new tasks during post-