model

DreamReasoner-8B

modelactiveprovisionaldreamreasoner-8b-7b0d8f72·1 events·first seen 3d ago

Aliases: DreamReasoner-8B

Co-occurring entities

Qwen3-4B Block-Size Curriculum Learning for Diffusion Reasoning Models Block-Size Curriculum Learning DreamLM

More like this (12)

LLaDA-8B AceReason-14B Breeze-7B LLaDA-8B-Base IS-Writer-8B Dream-7B-Base Qwen2.5-8B DeepSeek-R1-0528-Qwen3-8B Llama-Krikri-8B DeepSeek-R1-Distill-Llama-8B EvoCUA-8B Phi-4-reasoning-vision-15B

Recent events (1)

6arXiv · cs.CL·3d ago·source ↗

DreamReasoner-8B: Block-size curriculum learning enables long-CoT reasoning in diffusion language models

Researchers introduce DreamReasoner-8B, an open-source block diffusion language model trained with a block-size curriculum learning strategy that gradually transitions from fine-grained to coarse-grained block sizes during training. The work identifies a critical failure mode: training with large block sizes severely degrades reasoning, while small block sizes preserve it. The proposed curriculum bridges this gap, achieving math and code reasoning performance competitive with Qwen3-8B while retaining the parallel decoding efficiency of block diffusion models. The model and code are publicly released.

Frontier Model Releases Open Weights Progress Qwen3-4B Block-Size Curriculum Learning for Diffusion Reasoning Models Block-Size Curriculum Learning +3 more