Mechanism-Driven Monitors for Preemptive Detection of LLM Training Instability

Co-occurring entities

Mixture of Experts · Flash Attention 2

More like this (12)

LLM-as-monitor
A sleep-like consolidation mechanism for LLMs
Language Model Safety Monitor
Paved with True Intents: Intent-Aware Training Improves LLM Safety Classification Across Training Regimes
Stateful Online Monitor
Backdoor Unlearning Generalization: A Path Toward the Removal of Unknown Triggers in LLMs
ExpRL: Exploratory RL for LLM Mid-Training
Continual LLM Upcycling: A Predictor-Gated Bank-Wise Sparsity Training Recipe for Dense-to-Sparse LLMs
Tool Monitor
Verifier-in-the-Loop Training (ViL)
SIMMER: Benchmarking Latent Failures in LLM Executable Planning with a World Model
Will the Agent Recuse Itself? Measuring LLM-Agent Compliance with In-Band Access-Deny Signals

Recent events (1)

arXiv · cs.CL · 38h ago

Mechanism-driven internal monitors detect LLM training instability thousands of steps before loss divergence

A new arXiv preprint proposes mechanism-driven monitoring signals derived from the functional roles of critical modules (low-precision flash attention, MoE routers) to detect training instability before it manifests in loss or gradient norms. The authors derive monitors such as spectral entropy of a QK bilinear decomposition and MoE router indicators, showing via fault-injection experiments that these signals trigger thousands of steps ahead of loss divergence. The work targets a high-cost failure mode in frontier LLM training where instability can persist undetected for thousands of steps on expensive accelerator fleets.
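
To make the two kinds of signal named above concrete, here is a minimal NumPy sketch. It is not the preprint's definition: the summary does not give the formulas, so the code assumes that "spectral entropy of a QK bilinear decomposition" means the Shannon entropy of the normalized singular values of W_q W_k^T for one attention head, and it uses a normalized expert-load entropy as a stand-in for an MoE router indicator. The function names, the (d_model, d_head) weight layout, and the normalization are all hypothetical choices made for illustration.

```python
import numpy as np


def shannon_entropy(p: np.ndarray, eps: float = 1e-12) -> float:
    """Shannon entropy in nats of a probability vector, ignoring near-zero mass."""
    p = p[p > eps]
    return float(-(p * np.log(p)).sum())


def qk_spectral_entropy(w_q: np.ndarray, w_k: np.ndarray) -> float:
    """Entropy of the normalized singular values of W_q @ W_k.T.

    ASSUMPTION: w_q and w_k are (d_model, d_head) projections for one head,
    so attention logits are x @ w_q @ w_k.T @ y.T. A falling value means the
    bilinear form is concentrating on fewer directions.
    """
    singular_values = np.linalg.svd(w_q @ w_k.T, compute_uv=False)
    total = singular_values.sum()
    if total == 0.0:
        return 0.0
    return shannon_entropy(singular_values / total)


def router_load_entropy(expert_ids: np.ndarray, num_experts: int) -> float:
    """Normalized entropy of token-to-expert assignments, in [0, 1].

    ASSUMPTION: a stand-in router indicator. 1.0 means perfectly balanced
    load; 0.0 means every token went to a single expert.
    """
    counts = np.bincount(expert_ids, minlength=num_experts).astype(float)
    total = counts.sum()
    if total == 0 or num_experts < 2:
        return 0.0
    return shannon_entropy(counts / total) / float(np.log(num_experts))
```

In a training loop, values like these would be logged per layer every few hundred steps and compared against a running baseline rather than a fixed threshold; which drift counts as an alarm is something the summary does not specify.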

Training Infrastructure · Evaluation and Benchmarking · Mixture of Experts · Flash Attention 2 · Mechanism-Driven Monitors for Preemptive Detection of LLM Training Instability