arXiv submission date: 2026-03-15
📄 Abstract - Vavanagi: a Community-run Platform for Documentation of the Hula Language in Papua New Guinea

We present Vavanagi, a community-run platform for Hula (Vula'a), an Austronesian language of Papua New Guinea with approximately 10,000 speakers. Vavanagi supports crowdsourced English-Hula text translation and voice recording, with elder-led review and community-governed data infrastructure. To date, 77 translators and 4 reviewers have produced over 12k parallel sentence pairs covering 9k unique Hula words. We also propose a multi-level framework for measuring community involvement, from consultation to fully community-initiated and governed projects. We position Vavanagi at Level 5: initiative, design, implementation, and data governance all sit within the Hula community, making it, to our knowledge, the first community-led language technology initiative for a language of this size. Vavanagi shows how language technology can bridge village-based and urban members, connect generations, and support cultural heritage on the community's own terms.

Top-level tags: natural language processing, data systems
Detailed tags: language documentation, crowdsourcing, community governance, low-resource languages, digital preservation

Vavanagi: a Community-run Platform for Documentation of the Hula Language in Papua New Guinea


1️⃣ One-sentence summary

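This paper introduces Vavanagi, a platform initiated, designed, and governed entirely by the community, which uses crowdsourced translation and voice recording to document and preserve Hula, a language of Papua New Guinea with about 10,000 speakers; it also proposes a multi-level framework for measuring community involvement and shows how community-led language technology can connect village-based and urban members, sustain ties between generations, and protect cultural heritage.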

Source: arXiv: 2603.14210