菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-03-10
📄 Abstract - BridgeDiff: Bridging Human Observations and Flat-Garment Synthesis for Virtual Try-Off

Virtual try-off (VTOFF) aims to recover canonical flat-garment representations from images of dressed persons for standardized display and downstream virtual try-on. Prior methods often treat VTOFF as direct image translation driven by local masks or text-only prompts, overlooking the gap between on-body appearances and flat layouts. This gap frequently leads to inconsistent completion in unobserved regions and unstable garment structure. We propose BridgeDiff, a diffusion-based framework that explicitly bridges human-centric observations and flat-garment synthesis through two complementary components. First, the Garment Condition Bridge Module (GCBM) builds a garment-cue representation that captures global appearance and semantic identity, enabling robust inference of continuous details under partial visibility. Second, the Flat Structure Constraint Module (FSCM) injects explicit flat-garment structural priors via Flat-Constraint Attention (FC-Attention) at selected denoising stages, improving structural stability beyond text-only conditioning. Extensive experiments on standard VTOFF benchmarks show that BridgeDiff achieves state-of-the-art performance, producing higher-quality flat-garment reconstructions while preserving fine-grained appearance and structural integrity.

顶级标签: computer vision multi-modal aigc
详细标签: virtual try-off diffusion models garment reconstruction image synthesis structure constraint 或 搜索:

BridgeDiff:连接人体观察与平面服装合成以实现虚拟试穿 / BridgeDiff: Bridging Human Observations and Flat-Garment Synthesis for Virtual Try-Off


1️⃣ 一句话总结

这篇论文提出了一个名为BridgeDiff的新方法,它通过两个互补模块有效弥合了穿着者图像与标准平面服装图之间的差异,从而在虚拟试穿任务中实现了更高质量、结构更稳定的服装重建。

源自 arXiv: 2603.09236