Monitoring Reasoning Through Chain‑of‑Thought Signals (7 minute read)

OpenAI studied whether reasoning models can deliberately manipulate their chain‑of‑thought to evade monitoring. Results showed current models struggle to control their reasoning traces, suggesting chain‑of‑thought monitoring remains a viable safety signal.