菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-05-20
📄 Abstract - Semantic Granularity Navigation in Image Editing

Despite the generative capabilities of diffusion and flow models, real-image editing remains constrained by a persistent trade-off between semantic editability and structural fidelity. We trace a primary cause of this limitation to the implicit coupling of edit progress with model scale in existing paradigms. Under this coupling, stronger edits typically require visiting noisier states, which spends computation on destabilizing layout before the semantic change is well localized. We introduce NaviEdit, a training-free inference-time controller that decouples edit progress from model scale traversal through a strict self-consistency contract. NaviEdit operates at the rollout level and leaves the underlying pretrained model unchanged. It treats scale as a control input and reallocates a fixed step budget toward semantically responsive intermediate scales instead of destructive high-noise regimes. Experiments show positive average gains across compatible editors and flow backbones, supporting decoupling as a portable inference-time control principle.

顶级标签: computer vision model training model evaluation
详细标签: image editing diffusion models inference-time control semantic editability structural fidelity 或 搜索:

图像编辑中的语义粒度导航 / Semantic Granularity Navigation in Image Editing


1️⃣ 一句话总结

本文提出了一种名为NaviEdit的轻量级方法,在不修改预训练模型的前提下,通过重新分配计算资源、避免过度干扰图像结构,从而在图像编辑中更好地平衡语义修改强度和画面保真度。

源自 arXiv: 2605.21190