Atlas:为多领域复杂推理编排异构模型与工具 / Atlas: Orchestrating Heterogeneous Models and Tools for Multi-Domain Complex Reasoning
1️⃣ 一句话总结
这篇论文提出了一个名为ATLAS的智能框架,它能像一位经验丰富的指挥家一样,根据不同的复杂任务(如数学、编程或视觉推理),自动选择最合适的大语言模型和外部工具进行组合与协作,从而在多项测试中超越了GPT-4o等顶尖模型的表现。
The integration of large language models (LLMs) with external tools has significantly expanded the capabilities of AI agents. However, as the diversity of both LLMs and tools increases, selecting the optimal model-tool combination becomes a high-dimensional optimization challenge. Existing approaches often rely on a single model or fixed tool-calling logic, failing to exploit the performance variations across heterogeneous model-tool pairs. In this paper, we present ATLAS (Adaptive Tool-LLM Alignment and Synergistic Invocation), a dual-path framework for dynamic tool usage in cross-domain complex reasoning. ATLAS operates via a dual-path approach: (1) \textbf{training-free cluster-based routing} that exploits empirical priors for domain-specific alignment, and (2) \textbf{RL-based multi-step routing} that explores autonomous trajectories for out-of-distribution generalization. Extensive experiments across 15 benchmarks demonstrate that our method outperforms closed-source models like GPT-4o, surpassing existing routing methods on both in-distribution (+10.1%) and out-of-distribution (+13.1%) tasks. Furthermore, our framework shows significant gains in visual reasoning by orchestrating specialized multi-modal tools.
Atlas:为多领域复杂推理编排异构模型与工具 / Atlas: Orchestrating Heterogeneous Models and Tools for Multi-Domain Complex Reasoning
这篇论文提出了一个名为ATLAS的智能框架,它能像一位经验丰富的指挥家一样,根据不同的复杂任务(如数学、编程或视觉推理),自动选择最合适的大语言模型和外部工具进行组合与协作,从而在多项测试中超越了GPT-4o等顶尖模型的表现。
源自 arXiv: 2601.03872