菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-05-14
📄 Abstract - The Velocity Deficit: Initial Energy Injection for Flow Matching

While Flow Matching theoretically guarantees constant-velocity trajectories, we identify a critical breakdown in high-dimensional practice: the Velocity Deficit. We show that the MSE objective systematically underestimates velocity magnitude, causing generated samples to fail to reach the data manifold-a phenomenon we term Integration Lag. To rectify this, we propose Initial Energy Injection, instantiated via two complementary methods: the training-based Magnitude-Aware Flow Matching (MAFM) and the training-free Scale Schedule Corrector (SSC). Both are grounded in our discovery of a crucial asymmetry: velocity contraction causes harmful kinetic stagnation at the trajectory's start, yet acts as a beneficial denoising mechanism at its end. Empirically, SSC yields significant efficiency gains with zero retraining and just one line of code. On ImageNet-1k (256x256), it improves FID by 44.6% (from 13.68 to 7.58) and achieves a 5x speedup, enabling a 50-step generator (FID 7.58) to beat a 250-step baseline (FID 8.65). Furthermore, our methods generalize to Text-to-Image tasks and high-resolution generation, improving FID on MS-COCO by ~22%.

顶级标签: machine learning computer vision model training
详细标签: flow matching velocity deficit integration lag energy injection text-to-image 或 搜索:

速度赤字:为流匹配注入初始能量 / The Velocity Deficit: Initial Energy Injection for Flow Matching


1️⃣ 一句话总结

本文发现流匹配模型在高维空间中存在“速度赤字”问题,即模型预测的速度偏小导致生成样本无法到达目标数据分布,并提出了两种解决方案(基于训练的MAFM和无须训练的SSC),后者只需修改一行代码就能在图像生成任务中大幅提升速度和质量。

源自 arXiv: 2605.14819