← 返回列表

arXiv 提交日期: 2026-02-02

📄 Abstract - HuPER: A Human-Inspired Framework for Phonetic Perception

We propose HuPER, a human-inspired framework that models phonetic perception as adaptive inference over acoustic-phonetics evidence and linguistic knowledge. With only 100 hours of training data, HuPER achieves state-of-the-art phonetic error rates on five English benchmarks and strong zero-shot transfer to 95 unseen languages. HuPER is also the first framework to enable adaptive, multi-path phonetic perception under diverse acoustic conditions. All training data, models, and code are open-sourced. Code and demo avaliable at this https URL.

顶级标签: natural language processing audio model training

HuPER：一种受人类启发的语音感知框架 / HuPER: A Human-Inspired Framework for Phonetic Perception

1️⃣ 一句话总结

这篇论文提出了一个受人类听觉系统启发的语音感知框架HuPER，它通过结合声学证据和语言知识来识别语音，仅用少量数据就在英语语音识别上取得了顶尖效果，并能直接识别95种从未训练过的语言。

👋 没兴趣 ☆ 感兴趣 📌 待读

打开原文 PDF

源自 arXiv: 2602.01634

← 返回列表

菜单

AI 帮我研读全文

1️⃣ 一句话总结

密码管理

设置密码

修改密码

移除密码

菜单

AI 帮我研读全文

1️⃣ 一句话总结

获取最新论文摘要