ATD-Trans:一个基于地理信息的日英游记翻译数据集 / ATD-Trans: A Geographically Grounded Japanese-English Travelogue Translation Dataset
1️⃣ 一句话总结
该论文提出了一个名为ATD-Trans的日英双语游记翻译数据集,专门用于评估机器翻译系统在处理富含地理信息文本时的表现,实验发现针对日语优化的翻译模型效果更好,且翻译游记中提到的日本国内地理实体比国外地理实体更困难。
Geographic text, or textual data rich in geographic (geo-) information is a valuable source for various geographic applications, e.g., tourism management. Making such information accessible to speakers of other languages further enhances its utility; thus, accurate machine translation (MT) is essential for equity in multilingual geo-information access. To facilitate in-depth analysis for geographic text, we introduce ATD-Trans, a geographically grounded Japanese--English travelogue translation dataset, which enables evaluation of MT quality at both the overall and geo-entity levels across domestic (within Japan) and overseas regions. Our experiments on existing language models examine two factors: model language focus and geographic regions. The results highlight advantages of Japanese-enhanced models and greater difficulty in translating domestic-region geo-entities mentioned in travel blogs.
ATD-Trans:一个基于地理信息的日英游记翻译数据集 / ATD-Trans: A Geographically Grounded Japanese-English Travelogue Translation Dataset
该论文提出了一个名为ATD-Trans的日英双语游记翻译数据集,专门用于评估机器翻译系统在处理富含地理信息文本时的表现,实验发现针对日语优化的翻译模型效果更好,且翻译游记中提到的日本国内地理实体比国外地理实体更困难。
源自 arXiv: 2605.12933