Think Less, Know More: State-Aware Reasoning Compression with Knowledge Guidance for Efficient Reasoning

arXiv:2604.09150v1 Announce Type: new

Large Reasoning Models (LRMs) achieve strong performance on complex tasks by leveraging long Chain-of-Thought (CoT) reasoning, but often suffer from overthinking, which leads to excessive reasoning steps and high inference latency. Existing CoT compression methods struggle to balance accuracy and efficiency, and lack fine-grained, step-level adaptation to redundancy and reasoning bias.