菜单

关于 🐙 GitHub
arXiv 提交日期: 2025-12-04
📄 Abstract - Structured Document Translation via Format Reinforcement Learning

Recent works on structured text translation remain limited to the sentence level, as they struggle to effectively handle the complex document-level XML or HTML structures. To address this, we propose \textbf{Format Reinforcement Learning (FormatRL)}, which employs Group Relative Policy Optimization on top of a supervised fine-tuning model to directly optimize novel structure-aware rewards: 1) TreeSim, which measures structural similarity between predicted and reference XML trees and 2) Node-chrF, which measures translation quality at the level of XML nodes. Additionally, we apply StrucAUC, a fine-grained metric distinguishing between minor errors and major structural failures. Experiments on the SAP software-documentation benchmark demonstrate improvements across six metrics and an analysis further shows how different reward functions contribute to improvements in both structural and translation quality.

顶级标签: natural language processing machine learning model training
详细标签: document translation reinforcement learning structured text xml evaluation metrics 或 搜索:

通过格式强化学习实现结构化文档翻译 / Structured Document Translation via Format Reinforcement Learning


1️⃣ 一句话总结

这篇论文提出了一种名为‘格式强化学习’的新方法,通过优化两个专门衡量文档结构和翻译质量的奖励指标,有效解决了复杂文档(如XML/HTML格式)在翻译时难以保持原有结构完整性的难题,从而在软件文档翻译任务上取得了更好的效果。


源自 arXiv: 2512.05100