菜单

🤖 系统
📄 Abstract - Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning

Large Language Model (LLM) Agents, often trained with Reinforcement Learning (RL), are constrained by a dependency on human-curated data, limiting scalability and tethering AI to human knowledge. Existing self-evolution frameworks offer an alternative but are typically restricted by the model's inherent capabilities and single-round interactions, hindering the development of complex curricula involving tool use or dynamic reasoning. We introduce Agent0, a fully autonomous framework that evolves high-performing agents without external data through multi-step co-evolution and seamless tool integration. Agent0 establishes a symbiotic competition between two agents initialized from the same base LLM: a curriculum agent that proposes increasingly challenging frontier tasks, and an executor agent that learns to solve them. We integrate external tools to enhance the executor's problem-solving capacity; this improvement, in turn, pressures the curriculum agent to construct more complex, tool-aware tasks. Through this iterative process, Agent0 establishes a self-reinforcing cycle that continuously produces high-quality curricula. Empirically, Agent0 substantially boosts reasoning capabilities, improving the Qwen3-8B-Base model by 18% on mathematical reasoning and 24% on general reasoning benchmarks. Code is available at this https URL.

顶级标签: agents llm model training
详细标签: self-evolution tool integration co-evolution autonomous agents reasoning 或 搜索:

📄 论文总结

Agent0:通过工具集成推理从零数据释放自进化智能体 / Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning


1️⃣ 一句话总结

这篇论文提出了一个名为Agent0的自主框架,它通过让两个智能体在工具辅助下相互竞争与学习,无需外部数据就能自我进化,显著提升了语言模型在数学和通用推理任务上的能力。


📄 打开原文 PDF