Entity · paper

Arithmetic Pedagogy for Language Models

paperactivearithmetic-pedagogy-for-language-models-e0c17f7f·1 events·first seen Jun 4, 2026

Aliases: Arithmetic Pedagogy for Language Models

Co-occurring entities

More like this (12)

Reasoning Language Models Tapered Language Models Language Modeling Loss Transformer Language Models Random Language Model On the Limits of Prompt-Conditioned Language Models as General-Purpose Learners Civil Court Simulation with Large Language Models Reinforcement Learning for Language Models Knowledge-Less Language Models 7B language model Language Model Finetuning Dango: A Strictly L1-Only Large Language Model for Studying Second Language Acquisition

Recent events (1)

5arXiv · cs.AI·Jun 4, 2026·source ↗

GASING pedagogy-guided CoT training enables strong arithmetic reasoning in 86M-parameter GPT-2 model

Researchers train a small 86M-parameter GPT-2 decoder from scratch using Chain-of-Thought supervision derived from GASING, an Indonesian left-to-right arithmetic pedagogy, without any reinforcement learning. The model achieves over 80% accuracy on held-out arithmetic problems and competes with substantially larger models. Mechanistic analyses reveal two emergent capabilities: an explicit procedural pathway and a subsequent associative 'mental arithmetic' capacity that bypasses step-by-step computation. The work suggests that pedagogically structured training data can yield efficient arithmetic capability at small scale.

Evaluation and Benchmarking Alignment and RLHF GASING TOBA tokenizer GPT-2 +1 more