Entity · benchmark

CoT-Control

benchmarkactivecot-control-59db0b74·1 events·first seen May 20, 2026

Aliases: CoT-Control

Co-occurring entities

monitorability Chain-of-Thought Reasoning OpenAI

More like this (12)

J-CoT ControlNet IRCoT IV-CoT IS-CoT PPC (Preplan-Plan-CoT)OpenCoF shared control AI control CoTrace CoT-Output 2x2 safety matrix TCN

Recent events (1)

7Openai Blog·May 20, 2026·source ↗

Reasoning models struggle to control their chains of thought, and that's good

OpenAI introduces CoT-Control, a framework for evaluating how well reasoning models can deliberately manipulate or suppress their chain-of-thought outputs. The finding that models struggle to control their CoT is framed as a positive safety property, reinforcing the argument that visible reasoning traces serve as a meaningful monitorability safeguard. This contributes to ongoing research on whether chain-of-thought transparency is a reliable alignment and oversight tool.

Frontier Model Releases Evaluation and Benchmarking CoT-Control monitorability Chain-of-Thought Reasoning +3 more