菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-05-19
📄 Abstract - Structural Energy Guidance for View-Consistent Text-to-3D Generation

Text-to-3D generation based on diffusion models often suffers from the Janus problem, leading to inconsistent geometry across viewpoints. This work identifies viewpoint bias in 2D diffusion priors as the main cause and proposes Structural Energy-Guided Sampling (SEGS), a training-free and plug-and-play framework to improve multi-view consistency. SEGS constructs a structural energy in the PCA subspace of U-Net features and injects its gradient into the denoising process. It can be easily integrated into SDS/VSD pipelines without retraining. Experiments show that SEGS reduces the Janus Rate by about 10% on average and improves View-CS scores across multiple baselines, including DreamFusion, Magic3D, and LucidDreamer. This method effectively alleviates viewpoint artifacts while preserving appearance fidelity, providing a flexible solution for high-quality text-to-3D content generation.

顶级标签: computer vision aigc
详细标签: text-to-3d generation view consistency diffusion model janus problem energy guidance 或 搜索:

结构能量引导:实现视角一致的文本到3D生成 / Structural Energy Guidance for View-Consistent Text-to-3D Generation


1️⃣ 一句话总结

本文提出一种无需训练的SEGS方法,通过在扩散模型特征的主成分子空间中构建结构能量,并注入去噪过程,有效解决了文本生成3D模型时的多视角不一致(即“头像”问题),将视角矛盾率降低约10%,并提升了生成质量。

源自 arXiv: 2605.19876