菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-01-14
📄 Abstract - World Craft: Agentic Framework to Create Visualizable Worlds via Text

Large Language Models (LLMs) motivate generative agent simulation (e.g., AI Town) to create a ``dynamic world'', holding immense value across entertainment and research. However, for non-experts, especially those without programming skills, it isn't easy to customize a visualizable environment by themselves. In this paper, we introduce World Craft, an agentic world creation framework to create an executable and visualizable AI Town via user textual descriptions. It consists of two main modules, World Scaffold and World Guild. World Scaffold is a structured and concise standardization to develop interactive game scenes, serving as an efficient scaffolding for LLMs to customize an executable AI Town-like environment. World Guild is a multi-agent framework to progressively analyze users' intents from rough descriptions, and synthesizes required structured contents (\eg environment layout and assets) for World Scaffold . Moreover, we construct a high-quality error-correction dataset via reverse engineering to enhance spatial knowledge and improve the stability and controllability of layout generation, while reporting multi-dimensional evaluation metrics for further analysis. Extensive experiments demonstrate that our framework significantly outperforms existing commercial code agents (Cursor and Antigravity) and LLMs (Qwen3 and Gemini-3-Pro). in scene construction and narrative intent conveyance, providing a scalable solution for the democratization of environment creation.

顶级标签: llm agents systems
详细标签: world generation agentic framework text-to-environment multi-agent system visual simulation 或 搜索:

World Craft:通过文本创建可视化世界的智能体框架 / World Craft: Agentic Framework to Create Visualizable Worlds via Text


1️⃣ 一句话总结

这篇论文提出了一个名为World Craft的智能体框架,它能让普通用户仅通过文字描述,就能轻松创建出可运行、可交互的可视化虚拟世界,无需编程技能。

源自 arXiv: 2601.09150