Abstract - Aladdin-FTI @ AMIYA Three Wishes for Arabic NLP: Fidelity, Diglossia, and Multidialectal Generation
Arabic dialects have long been under-represented in Natural Language Processing (NLP) research due to their non-standardization and high variability, which pose challenges for computational modeling. Recent advances in the field, such as Large Language Models (LLMs), offer promising avenues to address this gap by enabling Arabic to be modeled as a pluricentric language rather than a monolithic system. This paper presents Aladdin-FTI, our submission to the AMIYA shared task. The proposed system is designed to both generate and translate dialectal Arabic (DA). Specifically, the model supports text generation in Moroccan, Egyptian, Palestinian, Syrian, and Saudi dialects, as well as bidirectional translation between these dialects, Modern Standard Arabic (MSA), and English. The code and trained model are publicly available.
Aladdin-FTI @ AMIYA Three Wishes for Arabic NLP: Fidelity, Diglossia, and Multidialectal Generation
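1️⃣ One-Sentence Summary
This paper presents Aladdin-FTI, a system that generates and translates multiple Arabic dialects. It aims to use large language models to address the challenges that non-standardization and high variability have long posed for Arabic dialects in natural language processing.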