Entity · technique

Generalized Distillation

technique · active · generalized-distillation-77cf246f · 1 event · first seen Jun 3, 2026

Aliases: Generalized Distillation

Co-occurring entities

Language Models Need Sleep: Learning to Self-Modify and Consolidate Memories · Knowledge Seeding

More like this (12)

ensemble distillation · Weak-to-Strong Distillation · Self-Distillation · Model Distillation · distillation · distillation attacks · Weak-to-Strong Generalization via Direct On-Policy Distillation · Distill to Detect · on-policy distillation · Parallel Decoding Distillation · Rank-to-Distill · Random Network Distillation

Recent events (1)

arXiv · cs.LG · Jun 3, 2026

Sleep paradigm for LLMs enables continual learning and memory consolidation via distillation and RL

A new arXiv preprint proposes a 'Sleep' paradigm for language models that enables continual learning by consolidating short-term in-context memories into long-term parameters. The framework has two stages: Knowledge Seeding (distilling a smaller model's memories into a larger network via on-policy distillation combined with RL-based imitation learning) and Dreaming (self-improvement via RL-generated synthetic curricula without human supervision). Experiments cover long-horizon tasks, continual learning, knowledge incorporation, and few-shot generalization. The work targets a known weakness of current LLMs: retaining temporal knowledge across contexts.
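
To make the on-policy distillation idea in the Knowledge Seeding stage concrete, here is a minimal sketch of one common formulation: the student generates its own tokens, and at each position it visits it is penalized by the reverse KL divergence from the teacher's next-token distribution. This summary does not state the preprint's actual objective, so the reverse-KL choice, the toy logits, and the function names (`reverse_kl`, `on_policy_distillation_loss`) are all assumptions made for illustration.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def reverse_kl(student_logits, teacher_logits):
    """KL(student || teacher) at a single token position."""
    p_s = softmax(student_logits)
    p_t = softmax(teacher_logits)
    return sum(
        ps * (math.log(ps) - math.log(pt))
        for ps, pt in zip(p_s, p_t)
        if ps > 0.0
    )


def on_policy_distillation_loss(student_step_logits, teacher_step_logits):
    """Mean per-token reverse KL along a sequence sampled from the student.

    Both arguments are lists of per-position logit vectors. "On-policy"
    means the positions come from the student's own rollout (assumed here;
    sampling is not shown), with the teacher scoring those same prefixes.
    """
    kls = [reverse_kl(s, t) for s, t in zip(student_step_logits, teacher_step_logits)]
    return sum(kls) / len(kls)


# Hypothetical logits for a 3-token vocabulary over a 2-token rollout.
student = [[0.0, 0.0, 0.0], [1.0, 0.5, 0.0]]
teacher = [[2.0, 0.0, 0.0], [1.0, 0.5, 0.0]]
loss = on_policy_distillation_loss(student, teacher)
```

In this toy case the second position contributes nothing because the two distributions match, so the loss comes entirely from the first position, where the student is uniform and the teacher is peaked. In a real training loop the logits would come from model forward passes and the loss would be minimized with respect to the student's parameters.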

Alignment and RLHF · Language Models Need Sleep: Learning to Self-Modify and Consolidate Memories · Knowledge Seeding · Generalized Distillation