菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-03-03
📄 Abstract - Reinforcement Learning with Symbolic Reward Machines

Reward Machines (RMs) are an established mechanism in Reinforcement Learning (RL) to represent and learn sparse, temporally extended tasks with non-Markovian rewards. RMs rely on high-level information in the form of labels that are emitted by the environment alongside the observation. However, this concept requires manual user input for each environment and task. The user has to create a suitable labeling function that computes the labels. These limitations lead to poor applicability in widely adopted RL frameworks. We propose Symbolic Reward Machines (SRMs) together with the learning algorithms QSRM and LSRM to overcome the limitations of RMs. SRMs consume only the standard output of the environment and process the observation directly through guards that are represented by symbolic formulas. In our evaluation, our SRM methods outperform the baseline RL approaches and generate the same results as the existing RM methods. At the same time, our methods adhere to the widely used environment definition and provide interpretable representations of the task to the user.

顶级标签: reinforcement learning theory systems
详细标签: reward machines symbolic reasoning non-markovian rewards interpretable rl automated reward design 或 搜索:

基于符号奖励机的强化学习 / Reinforcement Learning with Symbolic Reward Machines


1️⃣ 一句话总结

这篇论文提出了一种名为‘符号奖励机’的新方法,它能自动理解强化学习任务的目标,无需人工预先设定规则,在保持高性能的同时让任务目标对用户更透明易懂。

源自 arXiv: 2603.03068