Entity · technique

Late-Stage LoRA

techniqueactivelate-stage-lora-35206a61·1 events·first seen May 22, 2026

Aliases: Late-Stage LoRA

Co-occurring entities

Terminal Expansion large language models temperature scaling LoRA Hyperfitting

More like this (12)

LoRA κ-LoRA MaLoRA Doc-to-LoRA QLoRA Localized LoRA-MoE MoE²-LoRA TailLoR Multi-LoRA serving Code2LoRA LoRA (Low-Rank Adaptation)RLOO

Recent events (1)

6arXiv · cs.CL·May 22, 2026·source ↗

Hyperfitting Explained: Terminal Geometric Expansion in Final Transformer Layers Drives Diversity Gains

This paper investigates the 'hyperfitting' phenomenon—where fine-tuning LLMs to near-zero loss on small datasets improves open-ended generation and reduces repetition—and demonstrates it is mechanistically distinct from temperature scaling. Entropy-matched control experiments falsify both the temperature-equivalence and static vocabulary reweighting hypotheses, instead localizing the effect to a 'Terminal Expansion' in the final transformer block where feature-space dimensionality expands by ~80.8 dimensions, enabling promotion of deep-tail tokens via context-dependent rank reordering. The authors introduce Late-Stage LoRA, a targeted fine-tuning strategy updating only the final 5 layers, achieving robust generation with minimal parameter updates.

Inference Economics Alignment and RLHF Terminal Expansion large language models temperature scaling +3 more