通过自收缩性改进约束在线凸优化的性能保证 / Improved Guarantees for Constrained Online Convex Optimization via Self-Contraction
1️⃣ 一句话总结
本文提出了一种简单的基于投影的算法,能够显著降低在线决策问题中违反约束的累积次数,在强凸损失下将此指标从平方根级改进为对数级,同时保持最优的决策后悔度,从而解决了长期存在的性能瓶颈。
We consider Constrained Online Convex Optimization (COCO) with adversarially chosen constraints. At each round, the learner chooses an action before observing the loss and constraint function for that round. The goal is to achieve small static regret against the best point satisfying all constraints while also controlling cumulative constraint violation ($\mathsf{CCV}$). For strongly convex losses, state-of-the-art algorithms achieve $O(\log T)$ regret and $O(\sqrt{T \log T})$ $\mathsf{CCV}.$ The corresponding best-known bounds for convex losses is $O(\sqrt{T})$ regret and $O(\sqrt{T} \log T)$ $\mathsf{CCV}$. In this paper, we give a simple projection-based algorithm that simultaneously achieves $O(\log T)$ regret and $O(\log T)$ $\mathsf{CCV}$ for strongly-convex losses, yielding an exponential improvement in the $\mathsf{CCV}$. For the convex losses, our algorithm improves the $\mathsf{CCV}$ to $O(\sqrt{T})$ while maintaining the optimal $O(\sqrt{T})$ regret. The key to our improvement is a recent geometric result for self-contracted curves, which may be of independent interest.
通过自收缩性改进约束在线凸优化的性能保证 / Improved Guarantees for Constrained Online Convex Optimization via Self-Contraction
本文提出了一种简单的基于投影的算法,能够显著降低在线决策问题中违反约束的累积次数,在强凸损失下将此指标从平方根级改进为对数级,同时保持最优的决策后悔度,从而解决了长期存在的性能瓶颈。
源自 arXiv: 2605.21107