SoftCap: Soft-Budget Control for Diffusion Transformer Acceleration

📄 Abstract - SoftCap: Soft-Budget Control for Diffusion Transformer Acceleration

Diffusion Transformers (DiTs) achieve strong visual quality, but their iterative denoising process requires many costly Transformer evaluations. Training-free acceleration methods reduce this cost by caching, forecasting, or verifying intermediate features, yet the runtime decision of when to execute a Full step is often driven by fixed schedules or hand-tuned thresholds. We propose \textbf{SoftCap}, a training-free control layer for cache-based DiT inference. SoftCap couples a Trajectory Drift Observer, which estimates local cache risk from lightweight hidden-state statistics, with a Soft-Budget PI Controller, which adjusts the Full-triggering threshold from realized compute relative to a fixed reference profile. The budget is a soft ceiling: it shapes the threshold but does not require a run to spend a prescribed number of Full evaluations. On FLUX.1-dev, SoftCap improves over SpeCa at a comparable middle-compute operating point, raising ImageReward from 0.967 to 0.981 and reducing LPIPS-Full from 0.518 to 0.498 at nearly identical FLOPs, while target-sweep diagnostics show the intended soft-ceiling behavior as the budget is relaxed.

SoftCap：面向扩散Transformer加速的软预算控制方法 / SoftCap: Soft-Budget Control for Diffusion Transformer Acceleration

1️⃣ 一句话总结

本文提出一种无需额外训练的加速框架SoftCap，它通过实时监测模型状态并动态调整计算预算，在保证图像质量的前提下，智能决定何时执行完整计算步骤，从而显著提升扩散Transformer的推理效率。

← 返回列表

菜单

AI 帮我研读全文

1️⃣ 一句话总结

密码管理

设置密码

修改密码

移除密码

菜单

AI 帮我研读全文

1️⃣ 一句话总结

获取最新论文摘要