菜单

🤖 系统
📄 Abstract - WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents

Recent advances in deep-research systems have demonstrated the potential for AI agents to autonomously discover and synthesize knowledge from external sources. In this paper, we introduce WebResearcher, a novel framework for building such agents through two key components: (1) WebResearcher, an iterative deep-research paradigm that reformulates deep research as a Markov Decision Process, where agents periodically consolidate findings into evolving reports while maintaining focused workspaces, overcoming the context suffocation and noise contamination that plague existing mono-contextual approaches; and (2) WebFrontier, a scalable data synthesis engine that generates high-quality training data through tool-augmented complexity escalation, enabling systematic creation of research tasks that bridge the gap between passive knowledge recall and active knowledge construction. Notably, we find that the training data from our paradigm significantly enhances tool-use capabilities even for traditional mono-contextual methods. Furthermore, our paradigm naturally scales through parallel thinking, enabling concurrent multi-agent exploration for more comprehensive conclusions. Extensive experiments across 6 challenging benchmarks demonstrate that WebResearcher achieves state-of-the-art performance, even surpassing frontier proprietary systems.

顶级标签: agents systems model training
详细标签: long-horizon reasoning deep research tool-augmented training multi-agent exploration knowledge synthesis 或 搜索:

📄 论文总结

WebResearcher:释放长视野智能体的无限推理能力 / WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents


1️⃣ 一句话总结

这篇论文提出了一个名为WebResearcher的新型AI智能体框架,通过将深度研究建模为决策过程并生成高质量训练数据,有效解决了传统方法在长周期任务中的信息过载和噪声干扰问题,从而显著提升了智能体的工具使用能力和多任务并行推理性能。


📄 打开原文 PDF