前馈式3D编辑从语义部件变换中学习 / Feedforward 3D Editing Learns from Semantic-Part Transformation
1️⃣ 一句话总结
这篇论文提出了一种基于语义部件变换的3D编辑方法,通过构建包含10万对前后编辑样本的高质量数据集Pxform和专门设计的前馈神经网络PartFlow,实现了无需额外掩膜即可对3D物体进行几何和外观编辑,显著提升了编辑的精度、一致性和可控性。
3D editing is a fundamental capability for scalable 3D content creation. While image editing has rapidly evolved toward large-scale feedforward generative paradigms, 3D AI generation remains dominated by training-free editing pipelines. A central challenge of feedforward 3D editing lies in the lack of high-quality paired supervision. Editable 3D assets require simultaneous preservation of geometry, multi-view consistency, structural coherence, and localized edit controllability. Existing 3D editing datasets often rely on independently generated assets, image-mediated reconstruction or narrow edit taxonomies, leading to inaccurate localization, weak preservation, blurred edit boundaries, and limited semantic consistency. In this work, we introduce a new perspective: scalable feedforward 3D editing should be learned from semantic-part transformations. Based on this insight, we propose Pxform, a high-quality 3D editing dataset with over 100K consistent before/after editing pairs across seven edit types. Instead of treating objects as unstructured shapes, our pipeline grounds edits directly in semantic 3D parts. Built upon Pxform, we further propose PartFlow, a feedforward 3D editing network that injects source-aware latent control into pretrained 3D generative priors. PartFlow introduces mask-aware velocity preservation and render-space consistency supervision to jointly improve edit fidelity and source preservation, while requiring no 3D edit mask during inference. Extensive experiments demonstrate that high-quality semantic-part supervision substantially improves scalable 3D editing, enabling PartFlow to achieve state-of-the-art performance on both geometric and appearance editing benchmarks.
前馈式3D编辑从语义部件变换中学习 / Feedforward 3D Editing Learns from Semantic-Part Transformation
这篇论文提出了一种基于语义部件变换的3D编辑方法,通过构建包含10万对前后编辑样本的高质量数据集Pxform和专门设计的前馈神经网络PartFlow,实现了无需额外掩膜即可对3D物体进行几何和外观编辑,显著提升了编辑的精度、一致性和可控性。
源自 arXiv: 2605.27351