technique
Block-Size Curriculum Learning
techniqueactiveprovisional
block-size-curriculum-learning-a868a55b·1 events·first seen 2d agoAliases: Block-Size Curriculum Learning
Co-occurring entities
More like this (12)
Block-Size Curriculum Learning for Diffusion Reasoning Modelscurriculum learningClass-Incremental LearningMegablocksBlockCurriculum ContinuityArithmetic Pedagogy for Language ModelsBlock-Compositional Caption Supervisionblock-sparse weightsStructured Interactive LearningScaffold, Not Vocabulary? A Controlled, Two-Tier, Pre-Registered Study of a Popperian Code-Generation Skilllarge-batch training
Recent events (1)
DreamReasoner-8B: Block-size curriculum learning enables long-CoT reasoning in diffusion language models
Researchers introduce DreamReasoner-8B, an open-source block diffusion language model trained with a block-size curriculum learning strategy that gradually transitions from fine-grained to coarse-grained block sizes during training. The work identifies a critical failure mode: training with large block sizes severely degrades reasoning, while small block sizes preserve it. The proposed curriculum bridges this gap, achieving math and code reasoning performance competitive with Qwen3-8B while retaining the parallel decoding efficiency of block diffusion models. The model and code are publicly released.