菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-06-23
📄 Abstract - Phoneme-Level Mispronunciation Screening in Polish-Speaking Children with an Explainable Assistant

Early identification of speech sound errors in children is often limited by access to specialists, motivating lightweight screening tools that can operate outside the clinic. We present a screening pipeline for Polish-speaking children focused on sibilant substitutions, coupling a wav2vec2-based CTC token recognizer with alignment-based error typing and a template-grounded caregiver assistant for screening, not diagnosis. On a held-out test set of 10 unseen children comprising 559 utterances, the recognizer achieves 88.7 percent exact sequence match. As a conservative screening proxy, we flag a mismatch when the system emits substitution-evidence bracketed tokens at the target segment, yielding 72.9 percent precision, 61.4 percent recall, F1 = 0.67, and a 2.7 percent false-alarm rate on target-correct items. We describe the assistant's safety boundaries and outline a clinician-in-the-loop validation plan for future deployment.

顶级标签: audio machine learning systems
详细标签: speech recognition wav2vec2 phoneme detection child speech explainable ai 或 搜索:

面向波兰语儿童音素级发音错误的可解释筛查助手 / Phoneme-Level Mispronunciation Screening in Polish-Speaking Children with an Explainable Assistant


1️⃣ 一句话总结

本论文介绍了一款针对波兰语儿童发音错误的轻量化筛查工具,它利用语音识别模型检测特定辅音发音错误,并通过模板辅助解释结果,旨在帮助家长在非临床环境中快速识别需要进一步专业评估的孩子。

源自 arXiv: 2606.25181