PeLAP-A:面向轻量级潜在扩散模型的自适应潜在通道剪枝 / PeLAP-A: Adaptive Latent Pruning for Lightweight Latent Diffusion Models
1️⃣ 一句话总结
本文提出一个轻量级框架PeLAP-A,通过学习一个通道重要性预测器,自动剪枝掉VAE潜在空间中冗余的通道,实验发现即使将所有潜在通道都压制到接近零,去噪UNet的性能反而略有提升,揭示出潜在扩散模型对输入信息存在大量冗余,为模型轻量化提供了新思路。
Latent diffusion models achieve strong generative performance by operating in a compressed latent space produced by a variational autoencoder (VAE). However, it remains unclear whether all latent channels contribute equally to the diffusion process, or whether significant redundancy exists. We introduce PeLAP-A (Adaptive Latent Pruning for Diffusion), a lightweight framework that augments a standard latent diffusion pipeline with a learnable channel-wise importance predictor. A two-layer MLP operating on globally pooled latent features produces a soft mask that suppresses unimportant latent channels before they enter the denoising UNet. The entire system is trained jointly on CIFAR-10 under a combined diffusion, reconstruction, and sparsity loss. Experiments reveal a striking result: under aggressive sparsity regularization (lambda = 0.01), the importance predictor drives all latent channels to near-zero yet the denoising UNet achieves lower diffusion loss (0.0236 vs. 0.0240) and lower VAE reconstruction MSE (22.59 vs. 24.67) compared to the unpruned baseline. We term this the sparsity collapse phenomenon and provide an analysis of why it occurs and what it reveals about the information requirements of latent diffusion models. These findings constitute an exploratory study of sparsity dynamics in latent diffusion training, and demonstrate that denoising UNets can remain remarkably robust to latent channel suppression even under aggressive regularization. Code is available at: this https URL.
PeLAP-A:面向轻量级潜在扩散模型的自适应潜在通道剪枝 / PeLAP-A: Adaptive Latent Pruning for Lightweight Latent Diffusion Models
本文提出一个轻量级框架PeLAP-A,通过学习一个通道重要性预测器,自动剪枝掉VAE潜在空间中冗余的通道,实验发现即使将所有潜在通道都压制到接近零,去噪UNet的性能反而略有提升,揭示出潜在扩散模型对输入信息存在大量冗余,为模型轻量化提供了新思路。
源自 arXiv: 2606.23086