菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-05-14
📄 Abstract - MetaAgent-X : Breaking the Ceiling of Automatic Multi-Agent Systems via End-to-End Reinforcement Learning

Automatic multi-agent systems aim to instantiate agent workflows without relying on manually designed or fixed orchestration. However, existing automatic MAS approaches remain only partially adaptive: they either perform training-free test-time search or optimize the meta-level designer while keeping downstream execution agents frozen, which creating a frozen-executor ceiling and leaving the end-to-end training of self-designing and self-executing agentic models unexplored. To address this, we introduce MetaAgent-X, an end-to-end reinforcement learning framework that jointly optimizes automatic MAS design and execution. MetaAgent-X enables script-based MAS generation, execution rollout collection, and credit assignment for both designer and executor trajectories. To support stable and scalable optimization, we propose Executor Designer Hierarchical Rollout and Stagewise Co-evolution to improve training stability and expose the dynamics of designer-executor co-evolution. MetaAgent-X consistently outperforms existing automatic MAS baselines, achieving up to 21.7% gains. Comprehensive ablations show that both designer and executor improve throughout training, and that effective automatic MAS learning follows a stagewise co-evolution process. These results establish end-to-end trainable automatic MAS as a practical paradigm for building self-designing and self-executing agentic models.

顶级标签: agents reinforcement learning systems
详细标签: multi-agent systems end-to-end training hierarchical rollout co-evolution workflow generation 或 搜索:

MetaAgent-X:通过端到端强化学习突破自动多智能体系统的天花板 / MetaAgent-X : Breaking the Ceiling of Automatic Multi-Agent Systems via End-to-End Reinforcement Learning


1️⃣ 一句话总结

本文提出了一种名为MetaAgent-X的端到端强化学习框架,能够同时优化多智能体系统的自动设计与执行过程,通过让系统自己学习如何设计任务流程和执行任务,显著提升了现有自动多智能体系统的性能,实现了最高21.7%的提升。

源自 arXiv: 2605.14212