Almanac
benchmark

chrF

benchmarkactiveprovisionalchrf-8259704e·1 events·first seen 11d ago

Aliases: chrF

Co-occurring entities

More like this (12)

Recent events (1)

5arXiv · cs.CL·11d ago·source ↗

Reinforcement learning enables meta-skill for translating unseen low-resource languages via in-context linguistic knowledge

Researchers propose an RL-based training approach for translating extremely low-resource or unseen languages by rewarding models for extracting and applying in-context linguistic knowledge (e.g., grammar books) rather than memorizing specific languages. Using chrF as a surface-level reward signal, RL-trained models outperform both in-context learning and supervised fine-tuning on completely unseen languages at test time. The work extends outcome-based RL beyond math and coding reasoning tasks, suggesting broader applicability to language learning from context.