现代Hopfield网络中的持续学习及其在扩散模型中的应用 / Continual Learning in Modern Hopfield Networks with an Application to Diffusion Models
1️⃣ 一句话总结
本文通过分析现代Hopfield网络的能量特性,发现任务切换后高能量、孤立的数据样本更容易被遗忘,而回放这些高能量样本能有效缓解遗忘,并将这一发现成功应用于扩散模型(如Stable Diffusion)的持续学习中。
Generative models, including diffusion models, are increasingly used as foundation models and adapted through sequential fine-tuning, making continual learning an essential problem setting. However, continual learning in such generative models remains poorly understood: after a task change, what aspects of the learned distribution are most easily lost, and what replay samples should be prioritized? We address these questions through the modern Hopfield energy. Recent links between modern Hopfield networks (MHNs) and diffusion models allow analyses in MHNs to be transferred to diffusion models. We introduce intrinsic forgetting as an increase in Hopfield energy after the task change. In tractable settings in an MHN, we prove that high-energy, outlier-like samples undergo a larger energy increase than cluster-like samples, implying that samples located in sharp, isolated basins are more forgettable. We further analyze memory replay and show that replay is particularly effective for high-energy samples, enabling an energy-based selection of replay samples. We validate these predictions in experiments on MHNs and two diffusion models under continual-learning settings: Stable Diffusion and a pixel-space DDPM. In these diffusion models, Hopfield energy tracks reconstruction-based forgetting, and replay experiments reveal energy-dependent mitigation of forgetting that is consistent with the MHN analysis.
现代Hopfield网络中的持续学习及其在扩散模型中的应用 / Continual Learning in Modern Hopfield Networks with an Application to Diffusion Models
本文通过分析现代Hopfield网络的能量特性,发现任务切换后高能量、孤立的数据样本更容易被遗忘,而回放这些高能量样本能有效缓解遗忘,并将这一发现成功应用于扩散模型(如Stable Diffusion)的持续学习中。
源自 arXiv: 2605.27975