Structured Document Translation via Format Reinforcement Learning

📄 Abstract - Structured Document Translation via Format Reinforcement Learning

Recent works on structured text translation remain limited to the sentence level, as they struggle to effectively handle the complex document-level XML or HTML structures. To address this, we propose \textbf{Format Reinforcement Learning (FormatRL)}, which employs Group Relative Policy Optimization on top of a supervised fine-tuning model to directly optimize novel structure-aware rewards: 1) TreeSim, which measures structural similarity between predicted and reference XML trees and 2) Node-chrF, which measures translation quality at the level of XML nodes. Additionally, we apply StrucAUC, a fine-grained metric distinguishing between minor errors and major structural failures. Experiments on the SAP software-documentation benchmark demonstrate improvements across six metrics and an analysis further shows how different reward functions contribute to improvements in both structural and translation quality.

通过格式强化学习实现结构化文档翻译 / Structured Document Translation via Format Reinforcement Learning

1️⃣ 一句话总结

这篇论文提出了一种名为‘格式强化学习’的新方法，通过优化两个专门衡量文档结构和翻译质量的奖励指标，有效解决了复杂文档（如XML/HTML格式）在翻译时难以保持原有结构完整性的难题，从而在软件文档翻译任务上取得了更好的效果。

← 返回列表

菜单

AI 帮我研读全文

1️⃣ 一句话总结

密码管理

设置密码

修改密码

移除密码

菜单

AI 帮我研读全文

1️⃣ 一句话总结

获取最新论文摘要