Entity · model

Large Reasoning Models

modelactivelarge-reasoning-models-f542c2fc·1 events·first seen May 19, 2026

Aliases: Large Reasoning Models

Co-occurring entities

Max-Pooling Chain-of-Thought Reasoning Probe Trajectories AUROC

More like this (12)

large language models Does Reasoning Preserve Alignment? On the Trustworthiness of Large Reasoning Models Reasoning Language Models OpenAI Reasoning Models large language model agents Multimodal Large Language Models Understanding Large Language Models The Riddle Riddle: Testing Flexible Reasoning in Large Language Models and Humans Large Language Models (frontier)Beyond the Commitment Boundary: Probing Epiphenomenal Chain-of-Thought in Large Reasoning Models frontier reasoning models Long-context Reasoning Benchmarks

Recent events (1)

6arXiv · cs.CL·May 19, 2026·source ↗

Probe Trajectories Reveal Reasoning Dynamics in Large Reasoning Models

This paper investigates whether hidden representations of Large Reasoning Models (LRMs) can predict future model behavior by analyzing probe trajectories—the continuous evolution of concept probabilities across Chain-of-Thought reasoning tokens. The authors find that temporal trajectory features (volatility, trend, steady-state) significantly outperform single static probes, with max-pooling achieving up to 95% AUROC across safety and mathematics domains. Two methodological insights are offered: template-based training data matches dynamically generated responses in quality, and pooling strategy is critical to probe performance. The work positions probe trajectories as a complementary safety monitoring framework for LRMs where CoT faithfulness cannot be assumed.

Frontier Model Releases Evaluation and Benchmarking Max-Pooling Chain-of-Thought Reasoning Probe Trajectories +4 more