Luar
Luar: Selective Translation via Reinforcement Learning for Multilingual Reasoning
Luar is a reinforcement learning framework that trains reasoning language models to invoke English translation selectively, only when direct understanding of a non-English input is deemed unreliable. Built on GRPO (Group Relative Policy Optimization), the approach outperforms standard multilingual baselines across reasoning benchmarks, with especially large gains on low-resource languages. Analysis confirms that the model learns to skip translation when direct reasoning suffices, and that the translation-call behavior generalizes to unseen low-resource languages.
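To make the GRPO connection concrete, here is a minimal sketch of the group-relative advantage that GRPO computes over several sampled responses to the same prompt. The summary above does not describe Luar's actual reward, so `toy_reward` and its `translation_cost` parameter are purely hypothetical stand-ins: they assume a correctness reward with a small penalty for calling translation. The population standard deviation is used here for simplicity.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Normalize rewards within one group of rollouts for the same prompt.

    GRPO scores each rollout relative to its group: subtract the group
    mean and divide by the group standard deviation.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population std; a simplifying choice
    return [(r - mu) / (sigma + eps) for r in rewards]


def toy_reward(correct, used_translation, translation_cost=0.1):
    """Hypothetical reward, NOT taken from the source.

    1.0 for a correct answer, minus a small assumed cost whenever the
    rollout invoked English translation.
    """
    base = 1.0 if correct else 0.0
    return base - (translation_cost if used_translation else 0.0)


# Four hypothetical rollouts for one non-English prompt:
# (answer correct?, translation invoked?)
rollouts = [(True, True), (True, False), (False, False), (False, True)]
rewards = [toy_reward(c, t) for c, t in rollouts]
advantages = group_relative_advantages(rewards)

for (correct, translated), adv in zip(rollouts, advantages):
    print(f"correct={correct!s:5} translated={translated!s:5} advantage={adv:+.3f}")
```

Under this assumed reward, a correct rollout that reasons directly receives the highest advantage in its group, while a correct rollout that translated still ranks above both incorrect ones. That ordering is one plausible way a policy could be nudged toward translating only when translation changes the outcome.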