流控制与模型融合的统一密度算子视角 / A Unified Density Operator View of Flow Control and Merging
1️⃣ 一句话总结
这篇论文提出了一个统一的概率框架,将生成模型的奖励优化控制和多个模型的融合两大挑战联系起来,并设计了一种新算法,能够根据特定任务需求(如药物发现)有原则地组合多个预训练模型,从而生成更符合目标的新数据。
Recent progress in large-scale flow and diffusion models raised two fundamental algorithmic challenges: (i) control-based reward adaptation of pre-trained flows, and (ii) integration of multiple models, i.e., flow merging. While current approaches address them separately, we introduce a unifying probability-space framework that subsumes both as limit cases, and enables reward-guided flow merging, allowing principled, task-aware combination of multiple pre-trained flows (e.g., merging priors while maximizing drug-discovery utilities). Our formulation renders possible to express a rich family of operators over generative models densities, including intersection (e.g., to enforce safety), union (e.g., to compose diverse models), interpolation (e.g., for discovery), their reward-guided counterparts, as well as complex logical expressions via generative circuits. Next, we introduce Reward-Guided Flow Merging (RFM), a mirror-descent scheme that reduces reward-guided flow merging to a sequence of standard fine-tuning problems. Then, we provide first-of-their-kind theoretical guarantees for reward-guided and pure flow merging via RFM. Ultimately, we showcase the capabilities of the proposed method on illustrative settings providing visually interpretable insights, and apply our method to high-dimensional de-novo molecular design and low-energy conformer generation.
流控制与模型融合的统一密度算子视角 / A Unified Density Operator View of Flow Control and Merging
这篇论文提出了一个统一的概率框架,将生成模型的奖励优化控制和多个模型的融合两大挑战联系起来,并设计了一种新算法,能够根据特定任务需求(如药物发现)有原则地组合多个预训练模型,从而生成更符合目标的新数据。
源自 arXiv: 2602.08012