📄 Abstract - Image-Free Timestep Distillation via Continuous-Time Consistency with Trajectory-Sampled Pairs

Timestep distillation is an effective approach for improving the generation efficiency of diffusion models. The Consistency Model (CM), as a trajectory-based framework, demonstrates significant potential due to its strong theoretical foundation and high-quality few-step generation. Nevertheless, current continuous-time consistency distillation methods still rely heavily on training data and computational resources, which hinders their deployment in resource-constrained scenarios and limits their scalability to diverse domains. To address this issue, we propose the Trajectory-Backward Consistency Model (TBCM), which eliminates the dependence on external training data by extracting latent representations directly from the teacher model's generation trajectory. Unlike conventional methods that require VAE encoding and large-scale datasets, our self-contained distillation paradigm significantly improves both efficiency and simplicity. Moreover, the trajectory-extracted samples naturally bridge the distribution gap between training and inference, thereby enabling more effective knowledge transfer. Empirically, TBCM achieves an FID of 6.52 and a CLIP score of 28.08 on MJHQ-30k under one-step generation, while reducing training time by approximately 40% compared to Sana-Sprint and saving a substantial amount of GPU memory, demonstrating superior efficiency without sacrificing quality. We further reveal the diffusion-generation space discrepancy in continuous-time consistency distillation and analyze how sampling strategies affect distillation performance, offering insights for future distillation research. GitHub Link: this https URL.
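
For readers less familiar with consistency models, the constraint that continuous-time consistency distillation enforces can be sketched as follows. This is the standard self-consistency condition along the probability-flow ODE from the consistency-model literature, written here for context rather than quoted from the paper.

```latex
% A consistency function f_theta maps any point x_t on a
% probability-flow-ODE trajectory to the trajectory's endpoint:
%   f_theta(x_t, t) = x_0   for every t on the trajectory.
% Differentiating along the trajectory yields the continuous-time
% consistency condition that distillation enforces:
\frac{\mathrm{d}}{\mathrm{d}t} f_\theta(x_t, t)
  = \partial_t f_\theta(x_t, t)
  + \nabla_{x_t} f_\theta(x_t, t)\,\frac{\mathrm{d} x_t}{\mathrm{d} t}
  = 0
```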

Top-level tags: model training, machine learning systems
Detailed tags: timestep distillation, consistency models, efficient generation, training-free, diffusion models

📄 Paper Summary

Image-Free Timestep Distillation via Continuous-Time Consistency with Trajectory-Sampled Pairs


1️⃣ One-Sentence Summary

This work proposes an efficient distillation method for diffusion models that requires no external training data: features are extracted directly from the teacher model's generation trajectory and used to train a lightweight generative model, significantly reducing training time and resource consumption while maintaining high-quality image generation.
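
To make the idea concrete, below is a minimal sketch of what an image-free, trajectory-sampled distillation step could look like. It is an illustration under assumptions, not the paper's implementation: `teacher.sample_trajectory` and the `student(x_t, t)` interface are hypothetical placeholders, and a simple endpoint regression stands in for the paper's continuous-time consistency objective.

```python
# Minimal sketch of an image-free, trajectory-sampled distillation step.
# `teacher.sample_trajectory` and the student call signature are
# hypothetical placeholders, not the paper's actual API.
import torch
import torch.nn.functional as F

def distill_step(teacher, student, optimizer, batch_size, latent_shape, device="cuda"):
    # 1. Start from pure noise: no dataset, no VAE encoding.
    z = torch.randn(batch_size, *latent_shape, device=device)

    # 2. Roll out the teacher's sampler and keep the intermediate
    #    latents (x_t, t) along its generation trajectory.
    with torch.no_grad():
        trajectory = teacher.sample_trajectory(z)  # list of (x_t, t) pairs

    # 3. Draw a training point from the trajectory itself, so the
    #    student trains on the distribution it will see at inference.
    idx = torch.randint(len(trajectory) - 1, (1,)).item()
    x_t, t = trajectory[idx]
    x_end, _ = trajectory[-1]  # teacher's trajectory endpoint

    # 4. Consistency-style regression: the student's one-step
    #    prediction from (x_t, t) should land on the endpoint.
    #    (Plain MSE here is a stand-in for the continuous-time loss.)
    loss = F.mse_loss(student(x_t, t), x_end)

    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```

The step that makes the method image-free is the rollout in step 2: every training latent comes from the teacher's own sampling trajectory, so no external images, dataset loading, or VAE encoder ever enters the loop.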

