← 返回列表

arXiv 提交日期: 2026-06-30

📄 Abstract - LuxEmo: Expressive Text-to-Speech Corpus for Luxembourgish

State-of-the-art speech datasets predominantly focus on widely spoken languages, often overlooking low-resource languages such as Luxembourgish, which remain underrepresented in speech technology research. In this work, we introduce LuxEmo, a 21-hour conversational expressive speech corpus for Luxembourgish with 4 emotion categories. LuxEmo is derived from Radio Télévision Luxembourg (RTL) youth broadcasts, using automated detection followed by human validation. We propose a semi-automatic curation workflow combining voice activity detection, denoising, language identification, LuxASR-based segmentation, automatic emotion prediction, lexical cues, and targeted human review. Additionally, we benchmark five expressive TTS systems covering German-based cross-lingual transfer, multilingual Luxembourgish support, Luxembourgish adaptation, and non-parametric prosody transfer. Performance is evaluated using both objective metrics and human evaluation.

顶级标签: audio multi-modal model training

LuxEmo：面向卢森堡语的表达性文本转语音语料库 / LuxEmo: Expressive Text-to-Speech Corpus for Luxembourgish

1️⃣ 一句话总结

本文构建了LuxEmo——一个21小时的卢森堡语情感语音数据集，并开发了一套半自动化的语音筛选流程，在此基础上测试了五种不同的情感语音合成方法，旨在推动低资源语言在表达性语音技术上的发展。

👋 没兴趣 ☆ 感兴趣 📌 待读

打开原文 PDF

源自 arXiv: 2606.31947

← 返回列表

菜单

AI 帮我研读全文

1️⃣ 一句话总结

密码管理

设置密码

修改密码

移除密码

菜单

AI 帮我研读全文

1️⃣ 一句话总结

获取最新论文摘要