CoRe3D:作为三维智能基础的协同推理 / CoRe3D: Collaborative Reasoning as a Foundation for 3D Intelligence
1️⃣ 一句话总结
这篇论文提出了一个名为CoRe3D的框架,它通过将语言推理与结构化空间推理紧密结合,让AI能够根据文字描述,以分步骤、可解释的方式理解和生成高质量、符合描述的3D内容。
Recent advances in large multimodal models suggest that explicit reasoning mechanisms play a critical role in improving model reliability, interpretability, and cross-modal alignment. While such reasoning-centric approaches have been proven effective in language and vision tasks, their extension to 3D remains underdeveloped. CoRe3D introduces a unified 3D understanding and generation reasoning framework that jointly operates over semantic and spatial abstractions, enabling high-level intent inferred from language to directly guide low-level 3D content formation. Central to this design is a spatially grounded reasoning representation that decomposes 3D latent space into localized regions, allowing the model to reason over geometry in a compositional and procedural manner. By tightly coupling semantic chain-of-thought inference with structured spatial reasoning, CoRe3D produces 3D outputs that exhibit strong local consistency and faithful alignment with linguistic descriptions.
CoRe3D:作为三维智能基础的协同推理 / CoRe3D: Collaborative Reasoning as a Foundation for 3D Intelligence
这篇论文提出了一个名为CoRe3D的框架,它通过将语言推理与结构化空间推理紧密结合,让AI能够根据文字描述,以分步骤、可解释的方式理解和生成高质量、符合描述的3D内容。
源自 arXiv: 2512.12768