Entity · paper

Failed Reasoning Traces Tell You What Is Fixable (But Not by Reading Them)

paperactivefailed-reasoning-traces-tell-you-what-is-fixable-but-not-by-reading-them--1e8ebd43·1 events·first seen Jun 4, 2026

Aliases: Failed Reasoning Traces Tell You What Is Fixable (But Not by Reading Them)

More like this (12)

When the Chain of Thought Knows Better: Failure Modes in Multi-Turn Reasoning Models Reasoning Enhancement Does Reasoning Preserve Alignment? On the Trustworthiness of Large Reasoning Models Long-context Reasoning Benchmarks Show Me How You Reason and I'll Tell You Who You Are: Reasoning Graphs for Robust LLM Authorship Attribution Predicting Future Behaviors in Reasoning Models Enables Better Steering Multilingual Reasoning Cascades Need More Context Exploring Extrinsic and Intrinsic Properties for Effective Reasoning with Code Interpreter Cognitive Episodes in LLM Reasoning Traces Enable Interpretable Human Item Difficulty Prediction Reasoning Language Models Fixed-Point Reasoning Model Reasoning over Grammar: Can Synthetic Linguistic Reasoning Traces Enhance Low-Resource Machine Translation?

Recent events (1)

6arXiv · cs.AI·Jun 4, 2026·source ↗

Failed reasoning traces encode recoverability structure for test-time routing and post-training analysis

A new arXiv paper argues that failed reasoning traces from post-trained LLMs contain exploitable signal about whether failures are recoverable via resampling or require structural intervention. The authors derive three trajectory features from the distributional signature of failed rollouts (not their text content) that cluster failures into stable regimes and characterize failure topography across post-training methods with 84.3% accuracy. A training-free routing rule built on these features lifts rescue rates by +12.2% on a deployment-relevant hard subset, and the features transfer across model families. The work reframes failed traces as diagnostic objects rather than discarded data, with implications for inference-time compute allocation and post-training analysis.

Evaluation and Benchmarking Inference Economics Failed Reasoning Traces Tell You What Is Fixable (But Not by Reading Them)+1 more