AI RESEARCH

Beyond Scalars: Evaluating and Understanding LLM Reasoning via Geometric Progress and Stability

arXiv CS.AI • March 12, 2026

arXiv:2603.10384v1 Announce Type: new Evaluating LLM reliability via scalar probabilities often fails to capture the structural dynamics of reasoning. We
