菜单

🤖 系统
📄 Abstract - SimWorld: An Open-ended Realistic Simulator for Autonomous Agents in Physical and Social Worlds

While LLM/VLM-powered AI agents have advanced rapidly in math, coding, and computer use, their applications in complex physical and social environments remain challenging. Building agents that can survive and thrive in the real world (for example, by autonomously earning income or running a business) requires massive-scale interaction, reasoning, training, and evaluation across diverse embodied scenarios. However, existing world simulators for such development fall short: they often rely on limited hand-crafted environments, simulate simplified game-like physics and social rules, and lack native support for LLM/VLM agents. We introduce SimWorld, a new simulator built on Unreal Engine 5, designed for developing and evaluating LLM/VLM agents in rich, real-world-like settings. SimWorld offers three core capabilities: (1) realistic, open-ended world simulation, including accurate physical and social dynamics and language-driven procedural environment generation; (2) a rich interface for LLM/VLM agents, with multimodal world inputs and open-vocabulary actions at varying levels of abstraction; and (3) diverse and extensible physical and social reasoning scenarios that are easily customizable by users. We demonstrate SimWorld by deploying frontier LLM agents (e.g., GPT-4o, Gemini-2.5-Flash, Claude-3.5, and DeepSeek-Prover-V2) on long-horizon multi-agent delivery tasks involving strategic cooperation and competition. The results reveal distinct reasoning patterns and limitations across models. We open-source SimWorld and hope it becomes a foundational platform for advancing real-world agent intelligence across disciplines: this https URL.

顶级标签: agents systems multi-modal
详细标签: simulation embodied ai autonomous agents evaluation multi-agent systems 或 搜索:

SimWorld:一个面向物理与社交世界中自主智能体的开放式真实模拟器 / SimWorld: An Open-ended Realistic Simulator for Autonomous Agents in Physical and Social Worlds


1️⃣ 一句话总结

这篇论文介绍了一个名为SimWorld的新型高仿真模拟器,它基于虚幻引擎5构建,旨在为大型语言模型和视觉语言模型驱动的智能体提供一个开放、真实且可定制的物理与社交环境,以训练和评估它们在复杂现实任务(如多智能体协作与竞争)中的表现,并揭示了不同前沿模型的推理模式与局限。


📄 打开原文 PDF