用于分层图像分解的循环一致性调优 / Cycle-Consistent Tuning for Layered Image Decomposition
1️⃣ 一句话总结
这篇论文提出了一种利用大型扩散模型来分离图像中不同视觉层(如物体表面的标志)的新方法,它通过一种循环训练策略和自增强过程,让模型在分解和重组图像时保持一致性,从而更准确地处理复杂的图像层间交互。
Disentangling visual layers in real-world images is a persistent challenge in vision and graphics, as such layers often involve non-linear and globally coupled interactions, including shading, reflection, and perspective distortion. In this work, we present an in-context image decomposition framework that leverages large diffusion foundation models for layered separation. We focus on the challenging case of logo-object decomposition, where the goal is to disentangle a logo from the surface on which it appears while faithfully preserving both layers. Our method fine-tunes a pretrained diffusion model via lightweight LoRA adaptation and introduces a cycle-consistent tuning strategy that jointly trains decomposition and composition models, enforcing reconstruction consistency between decomposed and recomposed images. This bidirectional supervision substantially enhances robustness in cases where the layers exhibit complex interactions. Furthermore, we introduce a progressive self-improving process, which iteratively augments the training set with high-quality model-generated examples to refine performance. Extensive experiments demonstrate that our approach achieves accurate and coherent decompositions and also generalizes effectively across other decomposition types, suggesting its potential as a unified framework for layered image decomposition.
用于分层图像分解的循环一致性调优 / Cycle-Consistent Tuning for Layered Image Decomposition
这篇论文提出了一种利用大型扩散模型来分离图像中不同视觉层(如物体表面的标志)的新方法,它通过一种循环训练策略和自增强过程,让模型在分解和重组图像时保持一致性,从而更准确地处理复杂的图像层间交互。
源自 arXiv: 2602.20989