AI RESEARCH

From Abstract to Contextual: What LLMs Still Cannot Do in Mathematics

arXiv CS.AI

ArXi:2601.23048v3 Announce Type: replace Large language models now solve many benchmark math problems at near-expert levels, yet this progress has not fully translated into reliable performance in real-world applications. We study this gap through contextual mathematical reasoning, where the mathematical core must be formulated from descriptive scenarios. We