Entity · technique

low-rank subspace projection

techniqueactivelow-rank-subspace-projection-d36675ca·1 events·first seen May 22, 2026

Aliases: low-rank subspace projection

Co-occurring entities

large language models key-value (KV) activation projection on-policy self-distillation

More like this (12)

Subspace Projection low-rank structure Rank-Constrained Subspace Learning (RCSL)Recovery Subspace Dimensionality Orthogonal Residual Projection Routing-Conditioned Projection Boyle-Dykstra Projection Unstable Features, Reproducible Subspaces: Understanding Seed Dependence in Sparse Autoencoders LoRA (Low-Rank Adaptation)Hoyer sparsity PALS: Percentile-Aware Layerwise Sparsity for LLM Pruning Exact Posterior Score Estimation for Solving Linear Inverse Problems

Recent events (1)

6arXiv · cs.CL·May 22, 2026·source ↗

Self-Policy Distillation via Capability-Selective Subspace Projection

This paper introduces Self-Policy Distillation (SPD), a self-distillation method for LLMs that requires no external signals such as correctness filters or reward models. SPD extracts a low-rank capability subspace from the model's own gradients on correctness-defining tokens, then projects KV activations into this subspace during self-generation to isolate task-relevant signal from stylistic noise. Experiments across code generation, math reasoning, and QA show up to 13% improvement over prior signal-free self-distillation methods and 15% better out-of-domain generalization.

Frontier Model Releases Evaluation and Benchmarking large language models key-value (KV) activation projection low-rank subspace projection +2 more