SEMAG:自进化的多智能体代码生成框架 / SEMAG: Self-Evolutionary Multi-Agent Code Generation
1️⃣ 一句话总结
这篇论文提出了一个名为SEMAG的自进化多智能体框架,它能像人类编程一样分阶段、自适应地处理复杂编程任务,并通过实时选用最新模型来自动升级,从而在多个代码生成基准测试中取得了最先进的性能。
Large Language Models (LLMs) have made significant progress in handling complex programming tasks. However, current methods rely on manual model selection and fixed workflows, which limit their ability to adapt to changing task complexities. To address this, we propose SEMAG, a Self-Evolutionary Multi-Agent code Generation framework that mimics human coding practices. It decomposes programming tasks into stages, including planning, coding, debugging, and discussion, while adapting workflows to task difficulty. Its self-evolutionary agents can access the latest models in real time and automatically upgrade the backbone model. SEMAG sets new state-of-the-art Pass@1 accuracy across benchmarks. Using identical backbone models, SEMAG outperforms prior methods by 3.3% on CodeContests. When augmented with self-evolutionary model selection that automatically identifies optimal backbones, SEMAG reaches 52.6%, showcasing both framework effectiveness and adaptability to evolving LLM capabilities.
SEMAG:自进化的多智能体代码生成框架 / SEMAG: Self-Evolutionary Multi-Agent Code Generation
这篇论文提出了一个名为SEMAG的自进化多智能体框架,它能像人类编程一样分阶段、自适应地处理复杂编程任务,并通过实时选用最新模型来自动升级,从而在多个代码生成基准测试中取得了最先进的性能。
源自 arXiv: 2603.15707