菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-05-13
📄 Abstract - On the Limits of Latent Reuse in Diffusion Models

Diffusion models are often trained in low-dimensional latent spaces, which are then reused for related but shifted datasets. In this work, we study when such latent reuse remains reliable under distribution shift. We consider a source-target setting in which both datasets are approximately low-dimensional but may lie near different subspaces. We show that freezing and reusing a source latent space induces a target-domain score error governed by two quantities: the principal-angle misalignment between the source and target subspaces, and the target ambient noise amplified by the diffusion time scale. Motivated by these limits, we further study mixed source-target training and characterize how the required shared latent dimension depends on the relative geometry of the two distributions. Our results provide theoretical guidance on when latent reuse is reliable and when learning a shared representation may be necessary.

顶级标签: machine learning theory
详细标签: diffusion models latent space distribution shift representation learning information theory 或 搜索:

扩散模型中潜在空间复用的局限性 / On the Limits of Latent Reuse in Diffusion Models


1️⃣ 一句话总结

本文研究了扩散模型在训练后将预先学习到的低维潜在空间直接用于新数据集时,因数据分布变化(特别是子空间方向不一致和环境噪声放大)而导致的性能下降问题,并提出了混合训练策略下所需共享潜在空间维度的理论指导。

源自 arXiv: 2605.13448