AI RESEARCH

Step Rejection Fine-Tuning: A Practical Distillation Recipe

arXiv CS.AI

arXiv:2605.10674v1 Announce Type: cross

Rejection Fine-Tuning (RFT) is a standard method for