technique
Language Model Finetuning
techniqueactiveprovisional
language-model-finetuning-92fe361d·1 events·first seen 22d agoAliases: Language Model Finetuning
Co-occurring entities
More like this (12)
Language Modeling LossLanguage Model Safety MonitorTransformer Language ModelsParameter-Efficient Fine-TuningTest-Time Finetuning (TTFT)A Unifying Lens on Supervised Fine-Tuning Through Target Distribution DesignDecomposing Factual Sycophancy in Language Models: How Size and Instruction Tuning Shape Robustnessmulti-turn language modelsFine-tuning GPT-2 from Human PreferencesChain-of-Thought Fine-Tuningencoder-only language modelsOpenAI Fine-Tuning
Recent events (1)
Self-Generated Replay Nearly Eliminates Catastrophic Forgetting in Language Models
This paper investigates catastrophic forgetting in language models during continual learning, finding that models can use self-generated samples from their own training distribution as effective replay data, nearly eliminating forgetting without requiring stored exemplars. The authors identify two key conditions where forgetting persists: when models are pretrained near capacity saturation (leaving no room for new knowledge), and when low learning rates are used to reduce forgetting at the cost of requiring far more training steps. Self-generated replay breaks this learning-rate/forgetting tradeoff, enabling fast high-learning-rate finetuning without degradation on prior tasks.