菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-02-12
📄 Abstract - Code2Worlds: Empowering Coding LLMs for 4D World Generation

Achieving spatial intelligence requires moving beyond visual plausibility to build world simulators grounded in physical laws. While coding LLMs have advanced static 3D scene generation, extending this paradigm to 4D dynamics remains a critical frontier. This task presents two fundamental challenges: multi-scale context entanglement, where monolithic generation fails to balance local object structures with global environmental layouts; and a semantic-physical execution gap, where open-loop code generation leads to physical hallucinations lacking dynamic fidelity. We introduce Code2Worlds, a framework that formulates 4D generation as language-to-simulation code generation. First, we propose a dual-stream architecture that disentangles retrieval-augmented object generation from hierarchical environmental orchestration. Second, to ensure dynamic fidelity, we establish a physics-aware closed-loop mechanism in which a PostProcess Agent scripts dynamics, coupled with a VLM-Motion Critic that performs self-reflection to iteratively refine simulation code. Evaluations on the Code4D benchmark show Code2Worlds outperforms baselines with a 41% SGS gain and 49% higher Richness, while uniquely generating physics-aware dynamics absent in prior static methods. Code: this https URL. Website: this https URL.

顶级标签: llm systems multi-modal
详细标签: 4d generation world simulation code generation physics-aware hierarchical orchestration 或 搜索:

Code2Worlds:赋能编码大语言模型进行四维世界生成 / Code2Worlds: Empowering Coding LLMs for 4D World Generation


1️⃣ 一句话总结

这篇论文提出了一个名为Code2Worlds的新框架,它通过让大语言模型生成可执行的模拟代码,来创建不仅外观合理、而且符合物理规律的动态四维(3D+时间)虚拟世界,解决了现有方法在生成复杂场景和真实动态效果方面的不足。

源自 arXiv: 2602.11757