WAXAL: A Large-Scale Multilingual African Language Speech Corpus

📄 Abstract - WAXAL: A Large-Scale Multilingual African Language Speech Corpus

The advancement of speech technology has predominantly favored high-resource languages, creating a significant digital divide for speakers of most Sub-Saharan African languages. To address this gap, we introduce WAXAL, a large-scale, openly accessible speech dataset for 21 languages representing over 100 million speakers. The collection consists of two main components: an Automated Speech Recognition (ASR) dataset containing approximately 1,250 hours of transcribed, natural speech from a diverse range of speakers, and a Text-to-Speech (TTS) dataset with over 180 hours of high-quality, single-speaker recordings reading phonetically balanced scripts. This paper details our methodology for data collection, annotation, and quality control, which involved partnerships with four African academic and community organizations. We provide a detailed statistical overview of the dataset and discuss its potential limitations and ethical considerations. The WAXAL datasets are released at this https URL under the permissive CC-BY-4.0 license to catalyze research, enable the development of inclusive technologies, and serve as a vital resource for the digital preservation of these languages.

WAXAL：一个大规模多语言非洲语言语音语料库 / WAXAL: A Large-Scale Multilingual African Language Speech Corpus

1️⃣ 一句话总结

这篇论文发布了一个名为WAXAL的大规模、公开的多语言非洲语言语音数据集，旨在通过提供超过1250小时的语音识别数据和180小时的语音合成数据，来弥合数字鸿沟，促进针对非洲语言的包容性语音技术研究和语言数字保存。

← 返回列表

菜单

AI 帮我研读全文

1️⃣ 一句话总结

密码管理

设置密码

修改密码

移除密码

菜单

AI 帮我研读全文

1️⃣ 一句话总结

获取最新论文摘要