AI RESEARCH
Step Rejection Fine-Tuning: A Practical Distillation Recipe
arXiv CS.AI
•
ArXi:2605.10674v1 Announce Type: cross Rejection Fine-Tuning (RFT) is a standard method for