面向波兰语儿童音素级发音错误的可解释筛查助手 / Phoneme-Level Mispronunciation Screening in Polish-Speaking Children with an Explainable Assistant
1️⃣ 一句话总结
本论文介绍了一款针对波兰语儿童发音错误的轻量化筛查工具,它利用语音识别模型检测特定辅音发音错误,并通过模板辅助解释结果,旨在帮助家长在非临床环境中快速识别需要进一步专业评估的孩子。
Early identification of speech sound errors in children is often limited by access to specialists, motivating lightweight screening tools that can operate outside the clinic. We present a screening pipeline for Polish-speaking children focused on sibilant substitutions, coupling a wav2vec2-based CTC token recognizer with alignment-based error typing and a template-grounded caregiver assistant for screening, not diagnosis. On a held-out test set of 10 unseen children comprising 559 utterances, the recognizer achieves 88.7 percent exact sequence match. As a conservative screening proxy, we flag a mismatch when the system emits substitution-evidence bracketed tokens at the target segment, yielding 72.9 percent precision, 61.4 percent recall, F1 = 0.67, and a 2.7 percent false-alarm rate on target-correct items. We describe the assistant's safety boundaries and outline a clinician-in-the-loop validation plan for future deployment.
面向波兰语儿童音素级发音错误的可解释筛查助手 / Phoneme-Level Mispronunciation Screening in Polish-Speaking Children with an Explainable Assistant
本论文介绍了一款针对波兰语儿童发音错误的轻量化筛查工具,它利用语音识别模型检测特定辅音发音错误,并通过模板辅助解释结果,旨在帮助家长在非临床环境中快速识别需要进一步专业评估的孩子。
源自 arXiv: 2606.25181