AI RESEARCH

Shorter, but Still Trustworthy? An Empirical Study of Chain-of-Thought Compression

arXiv CS.CL

ArXi:2604.04120v1 Announce Type: new Long chain-of-thought (Long-CoT) reasoning models have motivated a growing body of work on compressing reasoning traces to reduce inference cost, yet existing evaluations focus almost exclusively on task accuracy and token savings. Trustworthiness properties, whether acquired or reinforced through post-