Entity · technique

Probe Trajectories

techniqueactiveprobe-trajectories-25c9e90a·1 events·first seen May 19, 2026

Aliases: Probe Trajectories

Co-occurring entities

Max-Pooling Chain-of-Thought Reasoning Large Reasoning Models AUROC

More like this (12)

TypeProbe Kinematic Pose Trajectory Path Tracing Reverse Probing CoTrace Unified Latent Probe Facet-Probe Behavioral Trajectory Tracking Framework Trajectory Balance target-space recovery profiles Text-Only Probe iterative trajectory refinement

Recent events (1)

6arXiv · cs.CL·May 19, 2026·source ↗

Probe Trajectories Reveal Reasoning Dynamics in Large Reasoning Models

This paper investigates whether hidden representations of Large Reasoning Models (LRMs) can predict future model behavior by analyzing probe trajectories—the continuous evolution of concept probabilities across Chain-of-Thought reasoning tokens. The authors find that temporal trajectory features (volatility, trend, steady-state) significantly outperform single static probes, with max-pooling achieving up to 95% AUROC across safety and mathematics domains. Two methodological insights are offered: template-based training data matches dynamically generated responses in quality, and pooling strategy is critical to probe performance. The work positions probe trajectories as a complementary safety monitoring framework for LRMs where CoT faithfulness cannot be assumed.

Frontier Model Releases Evaluation and Benchmarking Max-Pooling Chain-of-Thought Reasoning Probe Trajectories +4 more