概率守恒流引导 / Probability-Conserving Flow Guidance
1️⃣ 一句话总结
本文提出一种名为AdaMaG的新型图像生成引导方法,通过数学分析将传统引导方式拆解为“发散项”和“分数平行项”,并分别进行动态控制,从而在增强生成质量的同时避免图像失真或脱离真实数据分布,无需额外计算成本。
Diffusion and flow-based generative models dominate visual synthesis, with guidance aligning samples to user input and improving perceptual quality. However, Classifier-Free Guidance (CFG) and extrapolation-based methods are heuristic linear combinations of velocities/scores that ignore the generative manifold geometry, breaking probability conservation and driving samples off the learned manifold under strong guidance. We analyse guidance through the continuity equation and show its effect decomposes into a divergence term and a score-parallel term defined invariantly across parameterisations. We prove the divergence term blows up structurally as sampling approaches the data manifold, motivating a time-dependent schedule alongside score-parallel attenuation. The resulting plug-and-play rule, Adaptive Manifold Guidance (AdaMaG), bounds both terms at no additional inference cost. Finally, we show that most empirical heuristics for reducing saturation or improving generation quality correspond directly to the two terms in our decomposition. Across image generation benchmarks, AdaMaG improves realism, reduces hallucinations, and induces controlled desaturation in high-guidance regimes.
概率守恒流引导 / Probability-Conserving Flow Guidance
本文提出一种名为AdaMaG的新型图像生成引导方法,通过数学分析将传统引导方式拆解为“发散项”和“分数平行项”,并分别进行动态控制,从而在增强生成质量的同时避免图像失真或脱离真实数据分布,无需额外计算成本。
源自 arXiv: 2605.20079