arXiv submission date: 2026-03-11
📄 Abstract - Multilingual Reasoning Gym: Multilingual Scaling of Procedural Reasoning Environments

We present the Multilingual Reasoning Gym, an extension of Reasoning Gym (Stojanovski et al., 2025), that procedurally generates verifiable reasoning problems across 14 languages. We translate templates for 94 tasks with native-speaker validation in 10 languages and targeted code or template adaptations to ensure linguistic naturalness. The Multilingual Reasoning Gym preserves the core benefits of the procedural generation approach used in the original Reasoning Gym, such as virtually unlimited problem instance generation and adjustable difficulty, and remains directly usable for Reinforcement Learning from Verifiable Rewards and evaluation settings. Problems in the Multilingual Reasoning Gym are parallel across languages, enabling crosslingually parallel data generation at massive scale due to the procedural nature of the environments. We release our implementation to support research into multilingual reasoning models.

Top-level tags: llm benchmark agents
Detailed tags: multilingual reasoning procedural generation evaluation reinforcement learning crosslingual data

Multilingual Reasoning Gym: Multilingual Scaling of Procedural Reasoning Environments


1️⃣ One-sentence summary

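This paper introduces the Multilingual Reasoning Gym, a platform that extends the procedurally generated, verifiable reasoning problems of the original Reasoning Gym to 14 languages. Careful translation and adaptation keep the problems linguistically natural, which makes it possible to generate cross-lingually parallel training data at large scale in support of research on multilingual reasoning models.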
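
The idea of seeded, cross-lingually parallel problem generation with a verifiable answer can be illustrated with a toy sketch. This is not the paper's implementation or the Reasoning Gym API: the templates, the `generate_parallel` and `verify` functions, and the `difficulty` parameter are all hypothetical names chosen for illustration, and the task (integer addition) is an assumed example.

```python
import random

# Hypothetical templates for a toy addition task (assumption, not from the paper).
TEMPLATES = {
    "en": "What is {a} + {b}?",
    "de": "Was ist {a} + {b}?",
    "es": "¿Cuánto es {a} + {b}?",
}


def generate_parallel(seed: int, difficulty: int = 1) -> dict:
    """Generate one addition problem and render it in every template language.

    The same seed always yields the same operands, so the prompts are
    parallel across languages and share a single reference answer.
    """
    rng = random.Random(seed)
    upper = 10 ** difficulty  # larger difficulty -> larger operands
    a = rng.randint(0, upper)
    b = rng.randint(0, upper)
    prompts = {lang: tpl.format(a=a, b=b) for lang, tpl in TEMPLATES.items()}
    return {"prompts": prompts, "answer": str(a + b)}


def verify(problem: dict, response: str) -> float:
    """Return a reward of 1.0 if the response matches the answer, else 0.0."""
    return 1.0 if response.strip() == problem["answer"] else 0.0
```

Calling `generate_parallel(seed, difficulty)` with different seeds gives an effectively unbounded stream of instances, and `verify` provides the kind of binary, checkable reward signal that verifiable-reward training and evaluation rely on.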

Source: arXiv: 2603.10793