What if Pinocchio Were a Reinforcement Learning Agent: A Normative End-to-End Pipeline

📄 Abstract - What if Pinocchio Were a Reinforcement Learning Agent: A Normative End-to-End Pipeline

In the past decade, artificial intelligence (AI) has developed quickly. With this rapid progression came the need for systems capable of complying with the rules and norms of our society so that they can be successfully and safely integrated into our daily lives. Inspired by the story of Pinocchio in ``Le avventure di Pinocchio - Storia di un burattino'', this thesis proposes a pipeline that addresses the problem of developing norm compliant and context-aware agents. Building on the AJAR, Jiminy, and NGRL architectures, the work introduces \pino, a hybrid model in which reinforcement learning agents are supervised by argumentation-based normative advisors. In order to make this pipeline operational, this thesis also presents a novel algorithm for automatically extracting the arguments and relationships that underlie the advisors' decisions. Finally, this thesis investigates the phenomenon of \textit{norm avoidance}, providing a definition and a mitigation strategy within the context of reinforcement learning agents. Each component of the pipeline is empirically evaluated. The thesis concludes with a discussion of related work, current limitations, and directions for future research.

如果匹诺曹是一个强化学习智能体：一种规范化的端到端流程 / What if Pinocchio Were a Reinforcement Learning Agent: A Normative End-to-End Pipeline

1️⃣ 一句话总结

这篇论文借鉴匹诺曹的故事，提出了一种让强化学习智能体理解和遵守社会规范的监督框架，通过论证式顾问来指导智能体行为，并研究了如何防止它们钻规则空子。

← 返回列表

菜单

AI 帮我研读全文

1️⃣ 一句话总结

密码管理

设置密码

修改密码

移除密码

菜单

AI 帮我研读全文

1️⃣ 一句话总结

获取最新论文摘要