ManiTwin:将可用于数据生成的数字物体数据集扩展至10万个 / ManiTwin: Scaling Data-Generation-Ready Digital Object Dataset to 100K
1️⃣ 一句话总结
这篇论文提出了一个名为ManiTwin的自动化流程,能够仅用一张图片就快速生成大量高质量、带注释的3D数字物体模型,并构建了一个包含10万个此类模型的庞大数据集,为机器人模拟训练和数据生成提供了丰富的资源基础。
Learning in simulation provides a useful foundation for scaling robotic manipulation capabilities. However, this paradigm often suffers from a lack of data-generation-ready digital assets, in both scale and diversity. In this work, we present ManiTwin, an automated and efficient pipeline for generating data-generation-ready digital object twins. Our pipeline transforms a single image into simulation-ready and semantically annotated 3D asset, enabling large-scale robotic manipulation data generation. Using this pipeline, we construct ManiTwin-100K, a dataset containing 100K high-quality annotated 3D assets. Each asset is equipped with physical properties, language descriptions, functional annotations, and verified manipulation proposals. Experiments demonstrate that ManiTwin provides an efficient asset synthesis and annotation workflow, and that ManiTwin-100K offers high-quality and diverse assets for manipulation data generation, random scene synthesis, and VQA data generation, establishing a strong foundation for scalable simulation data synthesis and policy learning. Our webpage is available at this https URL.
ManiTwin:将可用于数据生成的数字物体数据集扩展至10万个 / ManiTwin: Scaling Data-Generation-Ready Digital Object Dataset to 100K
这篇论文提出了一个名为ManiTwin的自动化流程,能够仅用一张图片就快速生成大量高质量、带注释的3D数字物体模型,并构建了一个包含10万个此类模型的庞大数据集,为机器人模拟训练和数据生成提供了丰富的资源基础。
源自 arXiv: 2603.16866