菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-05-19
📄 Abstract - Understanding Dynamics of Adam in Zero-Sum Games: An ODE Approach

The remarkable success of the Adam in training neural networks has naturally led to the widespread use of its descent-ascent counterpart, Adam-DA, for solving zero-sum games. Despite its popularity in practice, a rigorous theoretical understanding of Adam-DA still lags behind. In this paper, we derive ordinary differential equations (ODEs) that serve as continuous-time limits of the Adam-DA. These ODEs closely approximate the discrete-time dynamics of Adam-DA, providing a tractable analytical framework for understanding its behavior in zero-sum games. Using this ODE approach, we investigate two fundamental aspects of Adam-DA: local convergence and implicit gradient regularization. Our analysis reveals that the roles of the first- and second-order momentum parameters in zero-sum games are exactly the opposite of their well-documented effects in minimization problems. We validate these predictions through GAN experiments across multiple architectures and datasets, demonstrating the practical implications of this reversed momentum effect.

顶级标签: theory machine learning
详细标签: adam optimizer zero-sum games ode analysis convergence gradient regularization 或 搜索:

零和博弈中Adam优化器动力学机制的常微分方程研究 / Understanding Dynamics of Adam in Zero-Sum Games: An ODE Approach


1️⃣ 一句话总结

本文通过建立常微分方程模型,首次严谨分析了Adam优化器在零和博弈(如生成对抗网络训练)中的行为,发现其一阶和二阶动量参数的作用与经典最小化问题完全相反,并通过实验验证了这一反直觉现象的实用性。

源自 arXiv: 2605.19392