菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-05-18
📄 Abstract - Stabilizing, Scaling & Enhancing MeanFlow for Large-scale Diffusion Distillation

Diffusion models exhibit remarkable generative capability, but their high latency limits practical deployment. Many studies have attempted to reduce sampling steps to accelerate inference. Among them, MeanFlow has attracted considerable attention due to its concise formulation and remarkable performance. Nevertheless, the instability of its optimization objective and the ''mean-seeking bias'' have limited its applicability to distill large-scale industrial models. To stabilize MeanFlow for distilling large-scale models, we first introduce a warm-up technique, in which the original differential solution of MeanFlow is replaced by a discrete solution. This design avoids training collapse caused by the MeanFlow target containing a stop-gradient term from an undertrained model. Once the model acquires a preliminary ability to fit the average velocity field, we switch the optimization objective back to the differential solution, enabling further refinement. Meanwhile, to alleviate the ''mean-seeking bias'' of MeanFlow under extremely few-step inference with complex target distributions, we incorporate trajectory distribution alignment as an auxiliary objective, encouraging the student model's trajectory distribution to align more closely with that of the teacher model. Our proposed distillation framework achieves superior performance compared to existing distillation approaches when applied to the text-to-image (T2I) model FLUX.1-dev (up to 12B parameters). Furthermore, when extended to the 80B-parameter state-of-the-art (SOTA) T2I model HunyuanImage 3.0, our method continues to demonstrate robust generalization and strong performance.

顶级标签: llm model training model evaluation
详细标签: diffusion models knowledge distillation meanflow text-to-image large-scale 或 搜索:

稳定、扩展并增强MeanFlow以实现大规模扩散模型蒸馏 / Stabilizing, Scaling & Enhancing MeanFlow for Large-scale Diffusion Distillation


1️⃣ 一句话总结

本文针对现有扩散模型蒸馏方法MeanFlow在大规模模型上训练不稳定且存在“均值偏差”的问题,提出了预热训练和轨迹分布对齐两种改进策略,成功将该方法应用于千亿参数级别的强文生图模型,显著提升了生成速度与质量。

源自 arXiv: 2605.17834