基于流映射的扩散模型测试时缩放 / Test-time scaling of diffusions with flow maps
1️⃣ 一句话总结
这篇论文提出了一种名为FMTT的新方法,通过直接利用流映射而非奖励梯度,在扩散模型生成过程中更有效地引导样本向用户指定的奖励方向优化,从而实现了比现有方法更好的图像编辑和控制效果。
A common recipe to improve diffusion models at test-time so that samples score highly against a user-specified reward is to introduce the gradient of the reward into the dynamics of the diffusion itself. This procedure is often ill posed, as user-specified rewards are usually only well defined on the data distribution at the end of generation. While common workarounds to this problem are to use a denoiser to estimate what a sample would have been at the end of generation, we propose a simple solution to this problem by working directly with a flow map. By exploiting a relationship between the flow map and velocity field governing the instantaneous transport, we construct an algorithm, Flow Map Trajectory Tilting (FMTT), which provably performs better ascent on the reward than standard test-time methods involving the gradient of the reward. The approach can be used to either perform exact sampling via importance weighting or principled search that identifies local maximizers of the reward-tilted distribution. We demonstrate the efficacy of our approach against other look-ahead techniques, and show how the flow map enables engagement with complicated reward functions that make possible new forms of image editing, e.g. by interfacing with vision language models.
基于流映射的扩散模型测试时缩放 / Test-time scaling of diffusions with flow maps
这篇论文提出了一种名为FMTT的新方法,通过直接利用流映射而非奖励梯度,在扩散模型生成过程中更有效地引导样本向用户指定的奖励方向优化,从而实现了比现有方法更好的图像编辑和控制效果。