Entity · technique

low-rank structure

techniqueactivelow-rank-structure-63ecf239·1 events·first seen Jun 1, 2026

Aliases: low-rank structure

Co-occurring entities

More like this (12)

low-rank subspace projection LoRA (Low-Rank Adaptation)hierarchical summarization Hoyer sparsity hybrid dense-sparse retrieval Rank-to-Distill lambda-scaled structural decoding adaptive-rank instantiation rank-1 approximation LearnWeak compositional generalization power-law scaling

Recent events (1)

6arXiv · cs.CL·Jun 1, 2026·source ↗

CoRP: Gradient-Free Consolidation of Rewarded Perturbations for LLM Post-Training

CoRP (Consolidating Rewarded Perturbations) is a gradient-free post-training operator that folds an ensemble of reward-weighted weight-space perturbations into a single deployable model, eliminating the inference-time cost of ensemble methods like RandOpt. A split-half analysis across 25 model-task pairs reveals reproducible low-rank structure in the rewarded perturbation population, which CoRP exploits via reward-weighted aggregation, compatibility-aware reweighting, and a held-out validation gate. Evaluated on five models (0.5B–8B) across math, code, and creative writing, CoRP improves the base model by 8.1 points on average, exceeds single-inference RandOpt by 6.5 points using one-tenth the perturbation budget, and recovers more than half the gain of a 50-pass majority-vote ensemble at one forward pass per test example.

Inference Economics Alignment and RLHF low-rank structure CoRP GRPO +2 more