📄 论文总结
FreeAskWorld:面向以人为中心的具身人工智能的交互式闭环模拟器 / FreeAskWorld: An Interactive and Closed-Loop Simulator for Human-Centric Embodied AI
1️⃣ 一句话总结
这篇论文提出了一个名为FreeAskWorld的交互式模拟平台,它利用大语言模型和社交认知理论来模拟复杂的人类行为,并通过扩展的导航任务和大型数据集证明,该平台能有效提升AI系统的语义理解和人机交互能力。
As embodied intelligence emerges as a core frontier in artificial intelligence research, simulation platforms must evolve beyond low-level physical interactions to capture complex, human-centered social behaviors. We introduce FreeAskWorld, an interactive simulation framework that integrates large language models (LLMs) for high-level behavior planning and semantically grounded interaction, informed by theories of intention and social cognition. Our framework supports scalable, realistic human-agent simulations and includes a modular data generation pipeline tailored for diverse embodied this http URL validate the framework, we extend the classic Vision-and-Language Navigation (VLN) task into a interaction enriched Direction Inquiry setting, wherein agents can actively seek and interpret navigational guidance. We present and publicly release FreeAskWorld, a large-scale benchmark dataset comprising reconstructed environments, six diverse task types, 16 core object categories, 63,429 annotated sample frames, and more than 17 hours of interaction data to support training and evaluation of embodied AI systems. We benchmark VLN models, and human participants under both open-loop and closed-loop settings. Experimental results demonstrate that models fine-tuned on FreeAskWorld outperform their original counterparts, achieving enhanced semantic understanding and interaction competency. These findings underscore the efficacy of socially grounded simulation frameworks in advancing embodied AI systems toward sophisticated high-level planning and more naturalistic human-agent interaction. Importantly, our work underscores that interaction itself serves as an additional information modality.
FreeAskWorld:面向以人为中心的具身人工智能的交互式闭环模拟器 / FreeAskWorld: An Interactive and Closed-Loop Simulator for Human-Centric Embodied AI
这篇论文提出了一个名为FreeAskWorld的交互式模拟平台,它利用大语言模型和社交认知理论来模拟复杂的人类行为,并通过扩展的导航任务和大型数据集证明,该平台能有效提升AI系统的语义理解和人机交互能力。