AI SAFETY & ETHICS

R1 CoT illegibility revisited

LessWrong AI

This is a brief research note describing the results of running 's research code for the paper " Reasoning Models Sometimes Output Illegible Chains of Thought " using the Novita provider on OpenRouter. tl;dr: I re-ran the paper's R1 GPQA experiments with no changes except using Novita, and got an average illegibility score of only 2.30 (vs. 4.30 in the paper), with no examples scoring above 5 (vs. 29.4% of examples scoring above 7 in the paper