菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-01-26
📄 Abstract - Yunjue Agent Tech Report: A Fully Reproducible, Zero-Start In-Situ Self-Evolving Agent System for Open-Ended Tasks

Conventional agent systems often struggle in open-ended environments where task distributions continuously drift and external supervision is scarce. Their reliance on static toolsets or offline training lags behind these dynamics, leaving the system's capability boundaries rigid and unknown. To address this, we propose the In-Situ Self-Evolving paradigm. This approach treats sequential task interactions as a continuous stream of experience, enabling the system to distill short-term execution feedback into long-term, reusable capabilities without access to ground-truth labels. Within this framework, we identify tool evolution as the critical pathway for capability expansion, which provides verifiable, binary feedback signals. Within this framework, we develop Yunjue Agent, a system that iteratively synthesizes, optimizes, and reuses tools to navigate emerging challenges. To optimize evolutionary efficiency, we further introduce a Parallel Batch Evolution strategy. Empirical evaluations across five diverse benchmarks under a zero-start setting demonstrate significant performance gains over proprietary baselines. Additionally, complementary warm-start evaluations confirm that the accumulated general knowledge can be seamlessly transferred to novel domains. Finally, we propose a novel metric to monitor evolution convergence, serving as a function analogous to training loss in conventional optimization. We open-source our codebase, system traces, and evolved tools to facilitate future research in resilient, self-evolving intelligence.

顶级标签: agents systems model training
详细标签: self-evolving agents tool synthesis open-ended tasks zero-start learning evolutionary efficiency 或 搜索:

云珏智能体技术报告:一个完全可复现、零起点、原位自进化的智能体系统,用于开放任务 / Yunjue Agent Tech Report: A Fully Reproducible, Zero-Start In-Situ Self-Evolving Agent System for Open-Ended Tasks


1️⃣ 一句话总结

这篇论文提出了一个名为‘原位自进化’的新方法,让智能体系统能够在没有预先设定工具和外部监督的开放环境中,通过与任务互动来自我学习、创造并优化新工具,从而持续提升其解决问题的能力。

源自 arXiv: 2601.18226