面向长程智能体任务的并行扩展:智能体聚合方法 / Agentic Aggregation for Parallel Scaling of Long-Horizon Agentic Tasks
1️⃣ 一句话总结
本文提出了一种名为AggAgent的智能体聚合方法,它能像‘导演’一样,高效整合多个并行运行的AI智能体在复杂长程任务(如深度研究)中产生的不同解决方案,从中筛选并综合出最佳答案,从而以极低的额外计算成本显著提升任务完成质量。
We study parallel test-time scaling for long-horizon agentic tasks such as agentic search and deep research, where multiple rollouts are generated in parallel and aggregated into a final response. While such scaling has proven effective for chain-of-thought reasoning, agentic tasks pose unique challenges: trajectories are long, multi-turn, and tool-augmented, and outputs are often open-ended. Aggregating only final answers discards rich information from trajectories, while concatenating all trajectories exceeds the model's context window. To address this, we propose AggAgent, an aggregation agent that treats parallel trajectories as an environment. We equip it with lightweight tools to inspect candidate solutions and search across trajectories, enabling it to navigate and synthesize information on demand. Across six benchmarks and three model families (GLM-4.7, Qwen3.5, MiniMax-M2.5), AggAgent outperforms all existing aggregation methods-by up to 5.3% absolute on average and 10.3% on two deep research tasks-while adding minimal overhead, as the aggregation cost remains bounded by a single agentic rollout. Our findings establish agentic aggregation as an effective and cost-efficient approach to parallel test-time scaling.
面向长程智能体任务的并行扩展:智能体聚合方法 / Agentic Aggregation for Parallel Scaling of Long-Horizon Agentic Tasks
本文提出了一种名为AggAgent的智能体聚合方法,它能像‘导演’一样,高效整合多个并行运行的AI智能体在复杂长程任务(如深度研究)中产生的不同解决方案,从中筛选并综合出最佳答案,从而以极低的额外计算成本显著提升任务完成质量。
源自 arXiv: 2604.11753