other
Reasoning Language Models
otheractiveprovisional
reasoning-language-models-0ba1d62f·1 events·first seen 15d agoAliases: Reasoning Language Models
Co-occurring entities
More like this (12)
Large Reasoning ModelsDoes Reasoning Preserve Alignment? On the Trustworthiness of Large Reasoning ModelsTransformer Language ModelsRecursive Language Models (RLMs)Reasoning over Grammar: Can Synthetic Linguistic Reasoning Traces Enhance Low-Resource Machine Translation?Latent Context Language Modelsmulti-turn language modelsPredicting Future Behaviors in Reasoning Models Enables Better SteeringBeyond the Commitment Boundary: Probing Epiphenomenal Chain-of-Thought in Large Reasoning ModelsWhen the Chain of Thought Knows Better: Failure Modes in Multi-Turn Reasoning ModelsReinforcement Learning for Language ModelsLong-context Reasoning Benchmarks
Recent events (1)
Luar: Selective Translation via Reinforcement Learning for Multilingual Reasoning
Luar is a reinforcement learning framework that trains reasoning language models to selectively invoke English translation only when direct understanding of a non-English input is deemed unreliable. The approach, built on top of GRPO, outperforms standard multilingual baselines across reasoning benchmarks, with especially large gains on low-resource languages. Analysis confirms the model learns to avoid unnecessary translation when direct reasoning suffices, and generalizes the translation-call behavior to unseen low-resource languages.