self-refinement

technique · active · self-refinement-7b7130f6 · 1 event · first seen Jun 1, 2026

Aliases: self-refinement

Co-occurring entities

student-teacher prompting · Chain-of-Thought Reasoning · inference-time intervention · gating policy · hidden state probing

More like this (12)

Self-Refine · Self-Verifying Refinement · verification-refinement loop · Introspection · adversarial refinement · Recursive Self-Improvement · Rubric-Conditioned Self-Distillation · Recursive Self-Improvement in AI: From Bounded Self-Refinement to Autonomous Research Loops · self-attention · Visual Contrastive Self-Distillation · Posterior Refinement · Self-Study Reconsidered: The Hidden Fragility of Learning from Self-Generated QA

Recent events (1)

6 · arXiv · cs.CL · Jun 1, 2026

Question-Answering as Hidden State Probing for Test-Time Reasoning Intervention

This paper proposes using question-asking as an inference-time intervention to surface information about an LLM's hidden state during chain-of-thought reasoning. The authors train a probe on a student model's hidden states before and after question generation and find it predictive of final-answer correctness even before the teacher responds, which suggests that the self-diagnosis performed during question generation carries meaningful signal. They frame question-asking as a sequential decision problem with a gating policy, but find a gap between detection and recovery: interventions are as likely to harm correct trajectories as to fix incorrect ones. The results point to limits on LLM self-refinement under uncertainty: detecting a likely error does not by itself make that error recoverable.

Evaluation and Benchmarking · Agent and Tool Ecosystem · student-teacher prompting · Chain-of-Thought Reasoning · inference-time intervention
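
The detection-versus-recovery gap described in the summary can be illustrated with a small sketch. This is not the paper's method or code: the `Trajectory` type, the threshold gate, and the `fix_rate` and `harm_rate` parameters are all hypothetical names introduced here. The sketch assumes a probe that outputs a probability of correctness per trajectory and a gate that intervenes when that probability falls below a threshold.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Trajectory:
    # Hypothetical probe output: estimated probability that the final answer is correct.
    p_correct: float
    # Whether the answer would actually be correct without any intervention.
    correct: bool


def should_intervene(p_correct: float, threshold: float = 0.5) -> bool:
    """Hypothetical gating policy: intervene when the probe is pessimistic."""
    return p_correct < threshold


def expected_accuracy_change(
    trajectories: List[Trajectory],
    fix_rate: float,
    harm_rate: float,
    threshold: float = 0.5,
) -> float:
    """Expected change in accuracy from gated interventions.

    fix_rate:  assumed chance an intervention repairs an incorrect trajectory.
    harm_rate: assumed chance an intervention breaks a correct trajectory.
    """
    delta = 0.0
    for t in trajectories:
        if not should_intervene(t.p_correct, threshold):
            continue  # gate closed: the trajectory is left untouched
        delta += -harm_rate if t.correct else fix_rate
    return delta / len(trajectories)


# Illustrative, made-up trajectories (not data from the paper).
demo = [
    Trajectory(p_correct=0.2, correct=False),
    Trajectory(p_correct=0.3, correct=True),
    Trajectory(p_correct=0.9, correct=True),
    Trajectory(p_correct=0.4, correct=False),
]
gain = expected_accuracy_change(demo, fix_rate=0.5, harm_rate=0.5)
```

Under these assumptions, the gate only yields a net gain when it selects more incorrect than correct trajectories, or when fixing is more likely than harming. If the gated set is balanced and the two rates are equal, the expected change is zero, which is one way to read the summary's point that good detection alone does not guarantee recovery.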