technique
self-refinement
techniqueactiveprovisional
self-refinement-7b7130f6·1 events·first seen 16d agoAliases: self-refinement
Co-occurring entities
More like this (12)
verification-refinement loopadversarial refinementRecursive Self-Improvementself-attentioniterative trajectory refinementsource-level self-rewritingself-trainingCORE (Contrastive Reflection)The Role of Feedback Alignment in Self-Distillationon-policy self-distillationChain-of-Thought Self-ConsistencyReflexion
Recent events (1)
Question-Answering as Hidden State Probing for Test-Time Reasoning Intervention
This paper proposes using question-asking as an inference-time intervention to surface information about an LLM's hidden state during chain-of-thought reasoning. The authors train a probe on a student model's hidden states before and after question generation, finding it predictive of final answer correctness even before the teacher responds—suggesting self-diagnosis during question generation carries meaningful signal. They frame question-asking as a sequential decision problem with a gating policy, but find a gap between detection and recovery: interventions are as likely to harm correct trajectories as to fix incorrect ones. The results have implications for the limits of LLM self-refinement under uncertainty.