Entity · dataset

LM1B

datasetactivelm1b-5a1309c0·1 events·first seen Jun 10, 2026

Aliases: LM1B

Co-occurring entities

K-Forcing OpenWebText

More like this (12)

OLMo-1B LLaMA-2-13B LFM2-8B-A1B Multi-LCB MNLI mlx-lm OLMoE-1B-7B CO-LMLM CM-LRS LaMP-2 LLaDA-1.5-8B SmolLM2

Recent events (1)

5arXiv · cs.CL·Jun 10, 2026·source ↗

K-Forcing: Joint multi-token decoding via push-forward language modeling distillation

K-Forcing is a new inference acceleration paradigm that distills an autoregressive model into a push-forward mapping that generates k tokens per forward pass rather than one. The method uses progressive self-forcing distillation to match the teacher's sequence distribution, achieving 2.4–3.5x speedup at k=4 with modest quality degradation. Unlike speculative decoding, K-Forcing is designed to address high-load batch serving scenarios common in industrial deployment, while remaining compatible with standard AR infrastructure.

Training Infrastructure Inference Economics LM1B K-Forcing OpenWebText