Recoverable but Not Stationary: Local Linear Structures in Weights and Activations
recoverable-but-not-stationary-local-linear-structures-in-weights-and-activations-d6928e90·1 events·first seen 7d agoAliases: Recoverable but Not Stationary: Local Linear Structures in Weights and Activations
Co-occurring entities
More like this (12)
Recent events (1)
Local linear structures in LLM weights and activations are dynamic, not fixed global directions
A new arXiv paper investigates the nature of linear structures in transformer weights and activations, finding strong local low-rank task-gradient structure but rejecting the hypothesis that fixed task planes exist. The authors show that useful bases drift substantially within 100 optimization steps, yet early recovery updates form a trajectory-prefix basis capturing 77% of LoRA recovery displacement. They also establish a formal connection between parameter perturbations and activation steering, finding a 0.58 cosine similarity between gradient-step-induced activation shifts and CAA steering vectors, suggesting linear structures are evolving local geometries rather than stable global task directions.