← 返回列表

arXiv 提交日期: 2026-03-31

📄 Abstract - Can LLM Agents Identify Spoken Dialects like a Linguist?

Due to the scarcity of labeled dialectal speech, audio dialect classification is a challenging task for most languages, including Swiss German. In this work, we explore the ability of large language models (LLMs) as agents in understanding the dialects and whether they can show comparable performance to models such as HuBERT in dialect classification. In addition, we provide an LLM baseline and a human linguist one. Our approach uses phonetic transcriptions produced by ASR systems and combines them with linguistic resources such as dialect feature maps, vowel history, and rules. Our findings indicate that, when linguistic information is provided, the LLM predictions improve. The human baseline shows that automatically generated transcriptions can be beneficial for such classifications, but also presents opportunities for improvement.

顶级标签: llm natural language processing audio

大语言模型代理能否像语言学家一样识别口语方言？ / Can LLM Agents Identify Spoken Dialects like a Linguist?

1️⃣ 一句话总结

这篇论文研究发现，当结合语音转写文本和方言特征图等语言学知识时，大语言模型在瑞士德语等方言分类任务上的表现会得到提升，其潜力接近专门的音频模型，但仍有改进空间。

👋 没兴趣 ☆ 感兴趣 📌 待读

打开原文 PDF

源自 arXiv: 2603.29541

← 返回列表

菜单

AI 帮我研读全文

1️⃣ 一句话总结

密码管理

设置密码

修改密码

移除密码

菜单

AI 帮我研读全文

1️⃣ 一句话总结

获取最新论文摘要