Almanac
technique

binning semiring

techniqueactiveprovisionalbinning-semiring-e8634fd3·1 events·first seen 8d ago

Aliases: binning semiring

Co-occurring entities

More like this (12)

Recent events (1)

5arXiv · cs.CL·8d ago·source ↗

Causal evaluation framework for learnability of formal language tasks in LMs

A new arXiv preprint proposes a causal framework for evaluating how much task-specific data language models need to learn a given task. The authors use formal languages generated by probabilistic finite automata as a controlled testbed, introducing the 'binning semiring' algebraic object to control property frequency in training corpora. Experiments show that standard correlational evaluation practices produce incorrect learnability conclusions due to confounders, with implications for how natural-language task learning is studied.