压力下的翻译:面向危机沟通的领域感知大语言模型 / Translating Under Pressure: Domain-Aware LLMs for Crisis Communication
1️⃣ 一句话总结
本文提出了一种领域自适应方法,通过扩充少量危机语料并优化小语言模型,使其在应急通信中能够生成简化的英语翻译,从而在无法覆盖所有语言时提供一种实用的通用沟通工具。
Timely and reliable multilingual communication is critical during natural and human-induced disasters, but developing effective solutions for crisis communication is limited by the scarcity of curated parallel data. We propose a domain-adaptive pipeline that expands a small reference corpus, by retrieving and filtering data from general corpora. We use the resulting dataset to fine-tune a small language model for crisis-domain translation and then apply preference optimization to bias outputs toward CEFR A2-level English. Automatic and human evaluation shows that this approach improves readability, while maintaining strong adequacy. Our results indicate that simplified English, combined with domain adaptation, can function as a practical lingua franca for emergency communication when full multilingual coverage is not feasible.
压力下的翻译:面向危机沟通的领域感知大语言模型 / Translating Under Pressure: Domain-Aware LLMs for Crisis Communication
本文提出了一种领域自适应方法,通过扩充少量危机语料并优化小语言模型,使其在应急通信中能够生成简化的英语翻译,从而在无法覆盖所有语言时提供一种实用的通用沟通工具。
源自 arXiv: 2604.26597