菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-03-06
📄 Abstract - RouteGoT: Node-Adaptive Routing for Cost-Efficient Graph of Thoughts Reasoning

Large Language Models (LLMs) excel at multi-step reasoning, yet increasing the structural complexity of inference does not consistently improve system-level returns. Methods such as Tree of Thoughts (ToT), Graph of Thoughts (GoT), and Adaptive Graph of Thoughts (AGoT) can boost accuracy on some benchmarks, but often introduce substantial overhead in token consumption and latency, and their gains can be unstable across task distributions-sometimes underperforming simpler Chain-of-Thought (CoT) or direct input-output prompting (IO). We attribute this inefficiency to stage-wise and node-wise heterogeneity inside GoT-style reasoning pipelines: high-quality planning and final synthesis are globally coupled and typically benefit from strong models, whereas many intermediate subtasks are localized and can be solved accurately by lighter models with far fewer tokens. Motivated by these observations, we propose RouteGoT, a budget-controllable, node-adaptive routing framework for graph-structured reasoning. RouteGoT performs in-graph routing by prioritizing strong models for planning and synthesis, while dynamically allocating lightweight models and cost-effective strategies to leaf subtasks based on predicted difficulty. It further integrates explicit budget constraints into a global inference scheduler to control graph expansion under a user-specified token budget, enabling predictable performance-cost trade-offs. Experiments across reasoning, retrieval, and multi-hop QA benchmarks show that RouteGoT matching or improving accuracy while substantially reducing token usage; specifically, it achieves an average 8.1 percentage points accuracy improvement and 79.1\% output token reduction compared to AGoT. Furthermore, RouteGoT outperforms existing routing baselines by maintaining a superior cost-accuracy trade-off, demonstrating improved robustness under varying budget targets and tasks.

顶级标签: llm agents model evaluation
详细标签: graph of thoughts adaptive routing cost efficiency budget control reasoning framework 或 搜索:

RouteGoT:面向成本高效思维图推理的节点自适应路由方法 / RouteGoT: Node-Adaptive Routing for Cost-Efficient Graph of Thoughts Reasoning


1️⃣ 一句话总结

这篇论文提出了一种名为RouteGoT的智能路由框架,它通过为思维图推理中不同难度的节点任务动态分配不同规模的语言模型,在保证或提升推理准确率的同时,大幅降低了计算成本(如令牌消耗),实现了性能与成本的高效平衡。

源自 arXiv: 2603.05818