菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-06-01
📄 Abstract - Stochastic convergence of parallel asynchronous adaptive first-order methods

A new class of asynchronous adaptive first-order optimization methods is introduced, comprising asynchronous variants of several popular algorithms. Versions of these methods using momentum and/or inexact normalization are also considered. The convergence of methods in the class on non-convex functions is analyzed in a fully stochastic setting, and is shown to be (up to logarithmic factors) of order O(1/sqrt{t}) under reasonable assumptions. Numerical experiments suggest that such asynchronous adaptive algorithms are very relevant in heterogeneous large-scale machine learning systems.

顶级标签: machine learning systems theory
详细标签: asynchronous optimization adaptive methods stochastic convergence non-convex optimization large-scale learning 或 搜索:

并行异步自适应一阶方法的随机收敛性分析 / Stochastic convergence of parallel asynchronous adaptive first-order methods


1️⃣ 一句话总结

本文提出了一类新的并行异步自适应优化算法,涵盖了多种经典算法的异步版本,并在非凸函数的全随机环境下证明其收敛速度可达O(1/√t),数值实验表明这类算法在异构大规模机器学习系统中非常实用。

源自 arXiv: 2606.01787