菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-01-01
📄 Abstract - MorphAny3D: Unleashing the Power of Structured Latent in 3D Morphing

3D morphing remains challenging due to the difficulty of generating semantically consistent and temporally smooth deformations, especially across categories. We present MorphAny3D, a training-free framework that leverages Structured Latent (SLAT) representations for high-quality 3D morphing. Our key insight is that intelligently blending source and target SLAT features within the attention mechanisms of 3D generators naturally produces plausible morphing sequences. To this end, we introduce Morphing Cross-Attention (MCA), which fuses source and target information for structural coherence, and Temporal-Fused Self-Attention (TFSA), which enhances temporal consistency by incorporating features from preceding frames. An orientation correction strategy further mitigates the pose ambiguity within the morphing steps. Extensive experiments show that our method generates state-of-the-art morphing sequences, even for challenging cross-category cases. MorphAny3D further supports advanced applications such as decoupled morphing and 3D style transfer, and can be generalized to other SLAT-based generative models. Project page: this https URL.

顶级标签: computer vision model training aigc
详细标签: 3d morphing latent representations attention mechanisms generative models temporal consistency 或 搜索:

MorphAny3D:释放结构化隐空间在三维形变中的潜力 / MorphAny3D: Unleashing the Power of Structured Latent in 3D Morphing


1️⃣ 一句话总结

这篇论文提出了一个无需额外训练的框架MorphAny3D,它通过巧妙融合源物体和目标物体在3D生成器内部的结构化隐空间特征,实现了高质量、语义一致且时间流畅的三维物体形变,甚至能处理跨类别物体的变形。

源自 arXiv: 2601.00204