TC-Padé:用于扩散加速的轨迹一致性帕德逼近 / TC-Padé: Trajectory-Consistent Padé Approximation for Diffusion Acceleration
1️⃣ 一句话总结
这篇论文提出了一种名为TC-Padé的新方法,它通过更精确的数学逼近和自适应策略来加速扩散模型的图像和视频生成过程,在减少计算步骤的同时保持高质量输出,显著超越了现有的加速技术。
Despite achieving state-of-the-art generation quality, diffusion models are hindered by the substantial computational burden of their iterative sampling process. While feature caching techniques achieve effective acceleration at higher step counts (e.g., 50 steps), they exhibit critical limitations in the practical low-step regime of 20-30 steps. As the interval between steps increases, polynomial-based extrapolators like TaylorSeer suffer from error accumulation and trajectory drift. Meanwhile, conventional caching strategies often overlook the distinct dynamical properties of different denoising phases. To address these challenges, we propose Trajectory-Consistent Padé approximation, a feature prediction framework grounded in Padé approximation. By modeling feature evolution through rational functions, our approach captures asymptotic and transitional behaviors more accurately than Taylor-based methods. To enable stable and trajectory-consistent sampling under reduced step counts, TC-Padé incorporates (1) adaptive coefficient modulation that leverages historical cached residuals to detect subtle trajectory transitions, and (2) step-aware prediction strategies tailored to the distinct dynamics of early, mid, and late sampling stages. Extensive experiments on DiT-XL/2, FLUX.1-dev, and Wan2.1 across both image and video generation demonstrate the effectiveness of TC-Padé. For instance, TC-Padé achieves 2.88x acceleration on FLUX.1-dev and 1.72x on Wan2.1 while maintaining high quality across FID, CLIP, Aesthetic, and VBench-2.0 metrics, substantially outperforming existing feature caching methods.
TC-Padé:用于扩散加速的轨迹一致性帕德逼近 / TC-Padé: Trajectory-Consistent Padé Approximation for Diffusion Acceleration
这篇论文提出了一种名为TC-Padé的新方法,它通过更精确的数学逼近和自适应策略来加速扩散模型的图像和视频生成过程,在减少计算步骤的同时保持高质量输出,显著超越了现有的加速技术。
源自 arXiv: 2603.02943