Entity · paper

Agentic Chain-of-Thought Steering for Efficient and Controllable LLM Reasoning

paperactiveagentic-chain-of-thought-steering-for-efficient-and-controllable-llm-reasoning-8f8c0b82·1 events·first seen Jun 3, 2026

Aliases: Agentic Chain-of-Thought Steering for Efficient and Controllable LLM Reasoning

Co-occurring entities

ACTS Agentic Chain-of-Thought Steering

More like this (12)

Agentic Chain-of-Thought Steering Can We Break LLMs Out of Self-Loops? Fine-Grained Reasoning Control with Activation Steering Predicting Future Behaviors in Reasoning Models Enables Better Steering Look Light, Think Heavy: What Multimodal Chain-of-Thought Reasoning Can and Cannot Do Beyond the Commitment Boundary: Probing Epiphenomenal Chain-of-Thought in Large Reasoning Models When the Chain of Thought Knows Better: Failure Modes in Multi-Turn Reasoning Models Chain-of-Thought Reasoning CheckRLM: Effective Knowledge-Thought Coherence Checking in Retrieval-Augmented Reasoning Token Budget Saturation and Mechanistic Early Detection of Reasoning Non-Convergence in Chain-of-Thought Models What Makes Effective Supervision in Latent Chain-of-Thought: An Information-Theoretic Analysis Visual Verification Enables Inference-time Steering and Autonomous Policy Improvement AIR: Adaptive Interleaved Reasoning with Code in MLLMs

Recent events (1)

5arXiv · cs.CL·Jun 3, 2026·source ↗

ACTS: Agentic Chain-of-Thought Steering for efficient and controllable LLM reasoning

Researchers introduce Agentic Chain-of-Thought Steering (ACTS), a framework that formulates inference-time reasoning control as a Markov decision process, where a controller agent adaptively steers a frozen reasoner by issuing reasoning strategy directives and steering phrases at each step. The controller is initialized from synthetic steering trajectories with multi-budget augmentation and further optimized via reinforcement learning with budget-conditioned reward shaping. ACTS matches full-thinking performance with significant token savings and enables controllable accuracy-efficiency trade-offs across multiple benchmarks and reasoner models.

Inference Economics Agent and Tool Ecosystem ACTS Agentic Chain-of-Thought Steering Agentic Chain-of-Thought Steering for Efficient and Controllable LLM Reasoning