technique
Agentic Chain-of-Thought Steering
techniqueactiveprovisional
agentic-chain-of-thought-steering-617845b9·1 events·first seen 13d agoAliases: Agentic Chain-of-Thought Steering
Co-occurring entities
More like this (12)
Agentic Chain-of-Thought Steering for Efficient and Controllable LLM ReasoningActivation Steeringlatent chain-of-thoughtChain-of-Thought ReasoningState-Conditioned Dynamic SteeringPredicting Future Behaviors in Reasoning Models Enables Better Steeringchain-of-thought promptingchain-of-thought monitoringChain-of-Thought Self-ConsistencyAgentic AI SystemsAgentic CLEARagentic AI
Recent events (1)
ACTS: Agentic Chain-of-Thought Steering for efficient and controllable LLM reasoning
Researchers introduce Agentic Chain-of-Thought Steering (ACTS), a framework that formulates inference-time reasoning control as a Markov decision process, where a controller agent adaptively steers a frozen reasoner by issuing reasoning strategy directives and steering phrases at each step. The controller is initialized from synthetic steering trajectories with multi-budget augmentation and further optimized via reinforcement learning with budget-conditioned reward shaping. ACTS matches full-thinking performance with significant token savings and enables controllable accuracy-efficiency trade-offs across multiple benchmarks and reasoner models.