Entity · technique

catastrophic forgetting

technique · active · catastrophic-forgetting-f9893c37 · 2 events · first seen May 18, 2026

Aliases: catastrophic forgetting

Co-occurring entities

Language Model Finetuning · Continual Learning · Self-Generated Replay · Elastic Weight Consolidation · Dark Experience Replay · University of California Los Angeles · LIBERO · GRPO · LoRA · Sony · Jiaheng Hu · Nanyang Technological University · Jay Shim · OpenVLA-OFT · University of Texas Austin

More like this (12)

catastrophic overtraining · supermemory · episodic memory retrieval · Retrospective Memory · non-parametric memory · memory · neurosymbolic learning · Thought Preservation · FlashbackCL: Mitigating Temporal Forgetting in Federated Learning · Counterfactual Editing · abliteration · Hindsight Experience Replay

Recent events (2)

6 · arXiv · cs.LG · May 26, 2026

Self-Generated Replay Nearly Eliminates Catastrophic Forgetting in Language Models

This paper investigates catastrophic forgetting in language models during continual learning. It finds that a model can use samples generated from its own training distribution as effective replay data, nearly eliminating forgetting without requiring stored exemplars. The authors identify two settings in which forgetting remains a problem: models pretrained near capacity saturation, which leaves no room for new knowledge, and finetuning with low learning rates, which reduces forgetting only at the cost of far more training steps. Self-generated replay breaks this tradeoff between learning rate and forgetting, enabling fast, high-learning-rate finetuning without degradation on prior tasks.

Enterprise Deployment Patterns · Agent and Tool Ecosystem · catastrophic forgetting · Language Model Finetuning · Continual Learning
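The replay idea summarized in this entry can be illustrated with a minimal sketch. It is not the paper's implementation: the function name `build_replay_mix`, the `generate_sample` callback (standing in for sampling text from the model before finetuning), and the `replay_ratio` knob are all hypothetical, and the source does not state what mixing ratio the authors used.

```python
import random
from typing import Callable, List


def build_replay_mix(
    new_examples: List[str],
    generate_sample: Callable[[], str],
    replay_ratio: float = 0.5,
    seed: int = 0,
) -> List[str]:
    """Mix new-task examples with text sampled from the model itself.

    replay_ratio is the fraction of the final mix that is self-generated.
    It is an assumed knob for illustration, not a value from the source.
    """
    if not 0.0 <= replay_ratio < 1.0:
        raise ValueError("replay_ratio must be in [0, 1)")
    # Number of replay samples needed so they make up replay_ratio of the mix.
    n_replay = round(len(new_examples) * replay_ratio / (1.0 - replay_ratio))
    # Replay data comes from the model's own generations, not stored exemplars.
    replay = [generate_sample() for _ in range(n_replay)]
    mixed = list(new_examples) + replay
    # Shuffle so new-task and replay examples are interleaved during training.
    random.Random(seed).shuffle(mixed)
    return mixed


if __name__ == "__main__":
    counter = iter(range(1000))
    # Stand-in generator; a real setup would sample from the pre-finetuning model.
    fake_model = lambda: f"replay-{next(counter)}"
    print(build_replay_mix(["new-a", "new-b"], fake_model, replay_ratio=0.5))
```

In a real finetuning run, the mixed list would feed the usual training loop; the only change from plain finetuning is where the extra examples come from.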

5 · The Batch · May 18, 2026

Sony and University Researchers Train Robots To Learn Without Catastrophic Forgetting

Researchers from UT Austin, UCLA, Nanyang Technological University, and Sony developed a sequential fine-tuning recipe combining LoRA and on-policy reinforcement learning (GRPO) to reduce catastrophic forgetting in vision-language-action (VLA) models for robotics. Applied to the OpenVLA-OFT model on the LIBERO benchmark, the method achieved 81.2% success on libero-spatial tasks with near-zero forgetting (0.3 percentage point drop), outperforming established continual learning baselines including Dark Experience Replay and Elastic Weight Consolidation. The approach requires no replay of prior task data and also showed modest generalization to unseen tasks. The authors note the method has not yet been tested outside robotics simulation contexts.

Evaluation and Benchmarking · Agent and Tool Ecosystem · Elastic Weight Consolidation · Dark Experience Replay · University of California Los Angeles
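The "percentage point drop" quoted in this entry is a forgetting measure. A common way to compute one in continual learning is to compare a task's best earlier success rate with its rate after all training stages. The sketch below assumes that definition; the source does not say exactly how the authors computed their figure, the function name `forgetting_pp` is hypothetical, and the numbers in the usage example are made up for illustration.

```python
from typing import Dict, List


def forgetting_pp(history: List[Dict[str, float]]) -> Dict[str, float]:
    """Per-task forgetting in percentage points.

    history[i] maps task name -> success rate (%) measured after training
    stage i. Forgetting is the best rate seen at any earlier stage minus the
    rate after the final stage. A negative value means the task improved.
    """
    if not history:
        return {}
    final = history[-1]
    result: Dict[str, float] = {}
    for task, final_rate in final.items():
        # Only tasks evaluated before the final stage can be "forgotten".
        earlier = [stage[task] for stage in history[:-1] if task in stage]
        if earlier:
            result[task] = max(earlier) - final_rate
    return result


if __name__ == "__main__":
    # Illustrative numbers only, not results from the source.
    stages = [
        {"task_a": 90.0},
        {"task_a": 89.5, "task_b": 80.0},
    ]
    print(forgetting_pp(stages))
```

Here `task_b` is omitted from the result because it was only learned in the final stage, so there is no earlier score to compare against.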