迈向具备交互智能的数字人 / Towards Interactive Intelligence for Digital Humans
1️⃣ 一句话总结
这篇论文提出了一种名为‘交互智能’的新概念,并开发了一个叫做Mio的完整系统,让数字人不仅能模仿人的外表和动作,还能根据个性进行表达、适应不同互动场景并自我学习进化,从而实现了更自然、更智能的人机交互。
We introduce Interactive Intelligence, a novel paradigm of digital human that is capable of personality-aligned expression, adaptive interaction, and self-evolution. To realize this, we present Mio (Multimodal Interactive Omni-Avatar), an end-to-end framework composed of five specialized modules: Thinker, Talker, Face Animator, Body Animator, and Renderer. This unified architecture integrates cognitive reasoning with real-time multimodal embodiment to enable fluid, consistent interaction. Furthermore, we establish a new benchmark to rigorously evaluate the capabilities of interactive intelligence. Extensive experiments demonstrate that our framework achieves superior performance compared to state-of-the-art methods across all evaluated dimensions. Together, these contributions move digital humans beyond superficial imitation toward intelligent interaction.
迈向具备交互智能的数字人 / Towards Interactive Intelligence for Digital Humans
这篇论文提出了一种名为‘交互智能’的新概念,并开发了一个叫做Mio的完整系统,让数字人不仅能模仿人的外表和动作,还能根据个性进行表达、适应不同互动场景并自我学习进化,从而实现了更自然、更智能的人机交互。
源自 arXiv: 2512.13674