菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-06-11
📄 Abstract - DuET: Dual Expert Trajectories for Diffusion Image Editing

Recent diffusion editors perform diverse instruction-based edits while conditioning on the source image at every denoising step. Yet persistent source-image conditioning can limit how fully an edit is executed and how natural the result appears, especially when the target scene diverges substantially from the input. We introduce DuET (Dual Expert Trajectories), a training-free inference method that temporarily relaxes source-image conditioning by transitioning through a text-to-image phase before returning to edit mode, allowing the denoising trajectory to move toward the target distribution while retaining the structural benefits of image-conditioned editing. Without modifying model weights or increasing sampling cost, DuET consistently improves instruction relevance, semantic fidelity, and perceptual quality across diverse models and benchmarks. In some cases, these gains come with a modest reduction in source-image preservation, revealing a predictable trade-off between source preservation and edit fidelity.

顶级标签: computer vision aigc
详细标签: diffusion models image editing inference method text-to-image edit fidelity 或 搜索:

DuET:用于扩散图像编辑的双专家轨迹 / DuET: Dual Expert Trajectories for Diffusion Image Editing


1️⃣ 一句话总结

本文提出了一种无需重新训练的推理方法DuET,通过在编辑过程中暂时放松对源图像的依赖、引入文本到图像的生成阶段,有效提升了扩散模型在复杂场景下的编辑质量和语义一致性。

源自 arXiv: 2606.13303