菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-04-02
📄 Abstract - Best-Arm Identification with Noisy Actuation

In this paper, we consider a multi-armed bandit (MAB) instance and study how to identify the best arm when arm commands are conveyed from a central learner to a distributed agent over a discrete memoryless channel (DMC). Depending on the agent capabilities, we provide communication schemes along with their analysis, which interestingly relate to the zero-error capacity of the underlying DMC.

顶级标签: theory machine learning
详细标签: multi-armed bandit best-arm identification communication noisy actuation zero-error capacity 或 搜索:

带有噪声驱动的多臂老虎机最优臂识别 / Best-Arm Identification with Noisy Actuation


1️⃣ 一句话总结

这篇论文研究了一种特殊的多臂老虎机问题,即当中央学习者的指令需要通过一个有噪声的通信信道传递给远程执行代理时,如何设计有效的通信方案来准确识别出最优的选项。

源自 arXiv: 2604.02255