AI RESEARCH
Beyond Accuracy: Evaluating Strategy Diversity in LLM Mathematical Reasoning
arXiv CS.AI
arXiv:2605.09292v1 Announce Type: new Large language models now achieve high final-answer accuracy on mathematical reasoning benchmarks, but accuracy alone does not capture reasoning flexibility. We