曲率自适应一致性流匹配:基于强化学习的自主轨迹优化 / Curvature-Adaptive Consistency Flow Matching: Autonomous Trajectory Optimization via Reinforcement Learning
1️⃣ 一句话总结
本文提出了一种名为CACFM的新方法,通过引入强化学习智能体动态调整训练重点,解决了扩散模型在快速生成图像时边界阶段(初始和末尾)优化困难的问题,从而在极少步数下生成更清晰、结构更准确的图像。
Consistency distillation has significantly accelerated the inference of diffusion models. In this work, we reveal an intriguing asymmetry: while Logit-Normal sampling priors are highly efficacious for standard iterative generation, consistency distillation exhibits a distinctly different difficulty profile (e.g., U-shaped). We identify that the primary optimization bottlenecks reside at the boundary stages (initialization or final refinement) rather than the intermediate steps. To address the limitations of static sampling in accommodating evolving learning requirements, we propose Curvature-Adaptive Consistency Flow Matching (CACFM). By formulating distillation as a dynamic decision process, CACFM employs a lightweight Reinforcement Learning agent to actively probe Probability Flow ODE trajectories, automatically constructing an efficiency-oriented curriculum that prioritizes critical regions without manual scheduling. Integrated with a novel Flow Distribution Matching Distillation (DMD) objective, our approach achieves new state-of-the-art results on large-scale models such as FLUX and SDXL. It effectively mitigates structural deformities and preserves high-frequency details in extreme few-step regimes, achieving unprecedented visual fidelity.
曲率自适应一致性流匹配:基于强化学习的自主轨迹优化 / Curvature-Adaptive Consistency Flow Matching: Autonomous Trajectory Optimization via Reinforcement Learning
本文提出了一种名为CACFM的新方法,通过引入强化学习智能体动态调整训练重点,解决了扩散模型在快速生成图像时边界阶段(初始和末尾)优化困难的问题,从而在极少步数下生成更清晰、结构更准确的图像。
源自 arXiv: 2606.22394