technique
key-value (KV) activation projection
techniqueactive
key-value-kv-activation-projection-f4d85c94·1 events·first seen 25d agoAliases: key-value (KV) activation projection
Co-occurring entities
More like this (12)
Recent events (1)
Self-Policy Distillation via Capability-Selective Subspace Projection
This paper introduces Self-Policy Distillation (SPD), a self-distillation method for LLMs that requires no external signals such as correctness filters or reward models. SPD extracts a low-rank capability subspace from the model's own gradients on correctness-defining tokens, then projects KV activations into this subspace during self-generation to isolate task-relevant signal from stylistic noise. Experiments across code generation, math reasoning, and QA show up to 13% improvement over prior signal-free self-distillation methods and 15% better out-of-domain generalization.