水涨船高:利用机器翻译质量评估奖励提升习语翻译,进而改善整体翻译质量 / A Rising Tide Lifts All Boats: MTQE Rewards for Idioms Improve General Translation Quality
1️⃣ 一句话总结
这篇论文提出了一种新方法,通过使用机器翻译质量评估模型作为奖励来专门训练模型更好地翻译习语等非直译表达,结果发现这不仅显著提升了习语翻译水平,还意外地连带提高了模型的通用翻译能力和跨语言泛化能力。
Non-compositional expressions (e.g., idioms, proverbs, and metaphors) pose significant challenges for neural machine translation systems because their meanings cannot be derived from individual words alone. These expressions encode rich, cultural meaning, and have both figurative and literal meanings, making accurate translation difficult. Because models are fairly good at translating compositional text, we investigate GRPO-style fine-tuning using Machine Translation Quality Estimation (MTQE) models as reward functions to train models to better translate idioms. Using Chinese and Hindi idiom datasets, we find that idiom translation abilities improve by ~14 points, general, non-idiomatic translation implicitly improves by ~8 points, and cross-lingual translation abilities (trained on one language, evaluated on another) improves by ~6 points. Overall, our work quantifies the non-compositional translation gap and offers insights for developing LLMs with stronger cross-cultural and figurative language understanding.
水涨船高:利用机器翻译质量评估奖励提升习语翻译,进而改善整体翻译质量 / A Rising Tide Lifts All Boats: MTQE Rewards for Idioms Improve General Translation Quality
这篇论文提出了一种新方法,通过使用机器翻译质量评估模型作为奖励来专门训练模型更好地翻译习语等非直译表达,结果发现这不仅显著提升了习语翻译水平,还意外地连带提高了模型的通用翻译能力和跨语言泛化能力。
源自 arXiv: 2601.06307