FoE: Forest of Errors Makes the First Solution the Best in Large Reasoning Models

ArXi:2604.02967v1 Announce Type: new Recent Large Reasoning Models (LRMs) like DeepSeek-R1 have nstrated remarkable success in complex reasoning tasks, exhibiting human-like patterns in exploring multiple alternative solutions. Upon closer inspection, however, we uncover a surprising phenomenon: The First is The Best, where alternative solutions are not merely suboptimal but potentially detrimental. This observation challenges widely accepted test-time scaling laws, leading us to hypothesize that errors within the reasoning path scale concurrently with test time.