Entity · technique

Language Model Finetuning

techniqueactivelanguage-model-finetuning-92fe361d·1 events·first seen May 26, 2026

Aliases: Language Model Finetuning

Co-occurring entities

catastrophic forgetting Continual Learning Self-Generated Replay

More like this (12)

Language Modeling Loss Tapered Language Models Language Model Safety Monitor Towards Mechanistically Understanding Why Memorized Knowledge Fails to Generalize in Large Language Model Finetuning Transformer Language Models LanguageModel protocol Parameter-Efficient Fine-Tuning Random Language Model Test-Time Finetuning (TTFT)A Unifying Lens on Supervised Fine-Tuning Through Target Distribution Design Decomposing Factual Sycophancy in Language Models: How Size and Instruction Tuning Shape Robustness Super-Tuning: From Activation-Aware Pruning to Sparse Fine-Tuning

Recent events (1)

6arXiv · cs.LG·May 26, 2026·source ↗

Self-Generated Replay Nearly Eliminates Catastrophic Forgetting in Language Models

This paper investigates catastrophic forgetting in language models during continual learning, finding that models can use self-generated samples from their own training distribution as effective replay data, nearly eliminating forgetting without requiring stored exemplars. The authors identify two key conditions where forgetting persists: when models are pretrained near capacity saturation (leaving no room for new knowledge), and when low learning rates are used to reduce forgetting at the cost of requiring far more training steps. Self-generated replay breaks this learning-rate/forgetting tradeoff, enabling fast high-learning-rate finetuning without degradation on prior tasks.

Enterprise Deployment Patterns Agent and Tool Ecosystem catastrophic forgetting Language Model Finetuning Continual Learning +2 more