菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-04-20
📄 Abstract - Spectral bandits for smooth graph functions

Smooth functions on graphs have wide applications in manifold and semi-supervised learning. In this paper, we study a bandit problem where the payoffs of arms are smooth on a graph. This framework is suitable for solving online learning problems that involve graphs, such as content-based recommendation. In this problem, each item we can recommend is a node and its expected rating is similar to its neighbors. The goal is to recommend items that have high expected ratings. We aim for the algorithms where the cumulative regret with respect to the optimal policy would not scale poorly with the number of nodes. In particular, we introduce the notion of an effective dimension, which is small in real-world graphs, and propose two algorithms for solving our problem that scale linearly and sublinearly in this dimension. Our experiments on real-world content recommendation problem show that a good estimator of user preferences for thousands of items can be learned from just tens of nodes evaluations.

顶级标签: machine learning theory systems
详细标签: bandit algorithms graph-based learning recommendation systems regret analysis online learning 或 搜索:

平滑图函数的光谱多臂老虎机 / Spectral bandits for smooth graph functions


1️⃣ 一句话总结

这篇论文提出了一种基于图结构的光谱多臂老虎机算法,用于解决像内容推荐这样的在线学习问题,其核心创新是引入了一个‘有效维度’的概念,并设计了两种算法,使得累计遗憾不会随图中节点数大幅增加,从而仅需评估少量节点就能有效学习用户对大量物品的偏好。

源自 arXiv: 2604.18420