AI RESEARCH
Learning to Choose or Choosing to Learn: Best-of-N vs. Supervised Fine-Tuning for Bit String Generation
arXiv CS.LG
arXiv:2505.17288v2 Announce Type: replace-cross Using the bit string generation problem as a case study, we theoretically compare two standard methods for adapting large language models to new tasks. The first, referred to as supervised fine-tuning, involves