菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-04-13
📄 Abstract - MADQRL: Distributed Quantum Reinforcement Learning Framework for Multi-Agent Environments

Reinforcement learning (RL) is one of the most practical ways to learn from real-life use-cases. Motivated from the cognitive methods used by humans makes it a widely acceptable strategy in the field of artificial intelligence. Most of the environments used for RL are often high-dimensional, and traditional RL algorithms becomes computationally expensive and challenging to effectively learn from such systems. Recent advancements in practical demonstration of quantum computing (QC) theories, such as compact encoding, enhanced representation and learning algorithms, random sampling, or the inherent stochastic nature of quantum systems, have opened up new directions to tackle these challenges. Quantum reinforcement learning (QRL) is seeking significant traction over the past few years. However, the current state of quantum hardware is not enough to cater for such high-dimensional environments with complex multi-agent setup. To tackle this issue, we propose a distributed framework for QRL where multiple agents learn independently, distributing the load of joint training from individual machines. Our method works well for environments with disjoint sets of action and observation spaces, but can also be extended to other systems with reasonable approximations. We analyze the proposed method on cooperative-pong environment and our results indicate ~10% improvement from other distribution strategies, and ~5% improvement from classical models of policy representation.

顶级标签: reinforcement learning multi-agents quantum computing
详细标签: quantum reinforcement learning distributed learning multi-agent systems cooperative environments policy representation 或 搜索:

MADQRL:面向多智能体环境的分布式量子强化学习框架 / MADQRL: Distributed Quantum Reinforcement Learning Framework for Multi-Agent Environments


1️⃣ 一句话总结

这篇论文提出了一个名为MADQRL的分布式量子强化学习框架,通过让多个智能体在量子计算环境中独立学习以分担计算负荷,有效解决了当前量子硬件难以处理复杂多智能体任务的瓶颈,并在实验中取得了优于传统分布式策略和经典模型的性能提升。

源自 arXiv: 2604.11131