Entity · technique

Parameter-Efficient Fine-Tuning

techniqueactiveparameter-efficient-fine-tuning-4b4d4d50·3 events·first seen May 21, 2026

Aliases: Parameter-Efficient Fine-Tuning, parameter-efficient finetuning, PEFT (Parameter-Efficient Fine-Tuning)

Co-occurring entities

LoRA MinT stability-plasticity dilemma orthogonal finetuning PEFT-Arena path-wise rewinding supervised fine-tuning Hadamard modulation SMoA

More like this (12)

supervised fine-tuning fine-tuning reinforcement fine-tuning behavioral fine-tuning OpenAI Fine-Tuning Super-Tuning: From Activation-Aware Pruning to Sparse Fine-Tuning Retrieval-Augmented Fine-Tuning finetuning Prompt Tuning adapter fine-tuning malicious fine-tuning Fine-tuning GPT-2 from Human Preferences

Recent events (3)

7arXiv · cs.CL·Jun 2, 2026·source ↗

On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters

This paper reframes parameter-efficient fine-tuning (PEFT) not merely as a cheaper alternative to full fine-tuning, but as a substrate for persistent, instance-specific personal models layered atop shared foundation models. The authors analyze three scaling axes: Scale Up (stronger base models amplifying adapter utility), Scale Down (minimum viable adapter size), and Scale Out (managing millions of concurrent adapted instances). They introduce MinT as an infrastructure reference for adapter identity, versioning, provenance, evaluation, and serving at scale.

Training Infrastructure Inference Economics LoRA Parameter-Efficient Fine-Tuning MinT +2 more

6arXiv · cs.LG·May 28, 2026·source ↗

PEFT-Arena: Benchmarking Parameter-Efficient Finetuning via Stability-Plasticity Trade-offs

PEFT-Arena is a new benchmark that evaluates parameter-efficient finetuning methods jointly on downstream task performance and retention of pretrained general capabilities, framing the problem as a stability-plasticity dilemma. Across methods tested under comparable parameter budgets, orthogonal finetuning achieves the best Pareto frontier. The paper provides geometric analyses in both weight space (spectral/singular-value structure) and activation space (representation distortion metrics) to explain why different PEFT methods differ in forgetting behavior. A practical finding is that final SFT checkpoints often overshoot an optimal retention operating point, motivating path-wise rewinding as a post-hoc correction.

Evaluation and Benchmarking Agent and Tool Ecosystem stability-plasticity dilemma orthogonal finetuning PEFT-Arena +4 more

5arXiv · cs.CL·May 21, 2026·source ↗

SMoA: Spectrum Modulation Adapter for Parameter-Efficient Fine-Tuning

SMoA is a new parameter-efficient fine-tuning method that addresses LoRA's trade-off between rank size and parameter budget. It partitions model layers into spectral blocks and applies Hadamard-modulated low-rank branches to each diagonal block, enabling broader coverage of pretrained spectral directions without proportionally increasing trainable parameters. Theoretical analysis and empirical results on multiple tasks show SMoA outperforms LoRA and competitive LoRA-style baselines in lower-budget settings.

Inference Economics Alignment and RLHF Hadamard modulation LoRA Parameter-Efficient Fine-Tuning +1 more