多语言推理级联需要更多上下文 / Multilingual Reasoning Cascades Need More Context
1️⃣ 一句话总结
这篇论文发现,在多语言推理任务中,传统的“翻译-推理-翻译回译”流程会丢失关键上下文信息(如文化背景和歧义消解),而通过简单地将原始问题、英文译文和推理过程一并保留并传递给最后的翻译模块,可以显著提升模型在多种语言上的回答质量,且原始语言问题贡献最大。
Translation cascades for reasoning translate the query from another language to English, reason in English, and translate the answer back to the original language. This is a competitive approach to multilingual reasoning, but structurally lossy, since each stage discards information later stages may need, including cues for cultural grounding, register, and disambiguation. We examine the benefits of a simple and training-free intervention: a context-aware translation cascade, which additionally provides the original question, the English translated question, and the reasoning trace to the context of the final translation module. We evaluate gains across nine multilingual benchmarks including various task types, three backbone models, and 285 high-, mid-, and low-resource languages, and demonstrate strong gains for open-ended generation across models and resource regimes. We show that the original language question carries most of the beneficial context. Our study emphasizes the need to better design information flow in machine translation cascades for mitigating error propagation, and provides a simple and actionable default strategy: preserve the original user question until the end of the pipeline.
多语言推理级联需要更多上下文 / Multilingual Reasoning Cascades Need More Context
这篇论文发现,在多语言推理任务中,传统的“翻译-推理-翻译回译”流程会丢失关键上下文信息(如文化背景和歧义消解),而通过简单地将原始问题、英文译文和推理过程一并保留并传递给最后的翻译模块,可以显著提升模型在多种语言上的回答质量,且原始语言问题贡献最大。
源自 arXiv: 2606.27306