Entity · model

DistilGPT-2

modelactivedistilgpt-2-c87f6d46·1 events·first seen Jun 10, 2026

Aliases: DistilGPT-2

Co-occurring entities

CAA Qwen-0.5B LoRA GPT-2 Recoverable but Not Stationary: Local Linear Structures in Weights and Activations BoolQ

More like this (12)

GPT-2 GPT-3.5 GPT-5.3 GPT-2-small GPT-5.2-high GPT-5.2 LitGPT SparseGPT GPT-2 124M ChatGPT GPTs GPTBot GPT-2 355M

Recent events (1)

5arXiv · cs.LG·Jun 10, 2026·source ↗

Local linear structures in LLM weights and activations are dynamic, not fixed global directions

A new arXiv paper investigates the nature of linear structures in transformer weights and activations, finding strong local low-rank task-gradient structure but rejecting the hypothesis that fixed task planes exist. The authors show that useful bases drift substantially within 100 optimization steps, yet early recovery updates form a trajectory-prefix basis capturing 77% of LoRA recovery displacement. They also establish a formal connection between parameter perturbations and activation steering, finding a 0.58 cosine similarity between gradient-step-induced activation shifts and CAA steering vectors, suggesting linear structures are evolving local geometries rather than stable global task directions.

Evaluation and Benchmarking Alignment and RLHF CAA Qwen-0.5B LoRA +4 more