菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-03-02
📄 Abstract - HiFi-Inpaint: Towards High-Fidelity Reference-Based Inpainting for Generating Detail-Preserving Human-Product Images

Human-product images, which showcase the integration of humans and products, play a vital role in advertising, e-commerce, and digital marketing. The essential challenge of generating such images lies in ensuring the high-fidelity preservation of product details. Among existing paradigms, reference-based inpainting offers a targeted solution by leveraging product reference images to guide the inpainting process. However, limitations remain in three key aspects: the lack of diverse large-scale training data, the struggle of current models to focus on product detail preservation, and the inability of coarse supervision for achieving precise guidance. To address these issues, we propose HiFi-Inpaint, a novel high-fidelity reference-based inpainting framework tailored for generating human-product images. HiFi-Inpaint introduces Shared Enhancement Attention (SEA) to refine fine-grained product features and Detail-Aware Loss (DAL) to enforce precise pixel-level supervision using high-frequency maps. Additionally, we construct a new dataset, HP-Image-40K, with samples curated from self-synthesis data and processed with automatic filtering. Experimental results show that HiFi-Inpaint achieves state-of-the-art performance, delivering detail-preserving human-product images.

顶级标签: computer vision aigc model training
详细标签: image inpainting reference-based generation detail preservation human-product images attention mechanism 或 搜索:

HiFi-Inpaint:面向生成细节保留的人-物图像的高保真参考修复方法 / HiFi-Inpaint: Towards High-Fidelity Reference-Based Inpainting for Generating Detail-Preserving Human-Product Images


1️⃣ 一句话总结

这篇论文提出了一个名为HiFi-Inpaint的新框架,通过引入共享增强注意力和细节感知损失,并构建一个新的大规模数据集,专门用于生成能高保真保留产品细节的人与产品融合图像。

源自 arXiv: 2603.02210