菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-04-07
📄 Abstract - AgentOpt v0.1 Technical Report: Client-Side Optimization for LLM-Based Agent

AI agents are increasingly deployed in real-world applications, including systems such as Manus, OpenClaw, and coding agents. Existing research has primarily focused on \emph{server-side} efficiency, proposing methods such as caching, speculative execution, traffic scheduling, and load balancing to reduce the cost of serving agentic workloads. However, as users increasingly construct agents by composing local tools, remote APIs, and diverse models, an equally important optimization problem arises on the client side. Client-side optimization asks how developers should allocate the resources available to them, including model choice, local tools, and API budget across pipeline stages, subject to application-specific quality, cost, and latency constraints. Because these objectives depend on the task and deployment setting, they cannot be determined by server-side systems alone. We introduce AgentOpt, the first framework-agnostic Python package for client-side agent optimization. We first study model selection, a high-impact optimization lever in multi-step agent pipelines. Given a pipeline and a small evaluation set, the goal is to find the most cost-effective assignment of models to pipeline roles. This problem is consequential in practice: at matched accuracy, the cost gap between the best and worst model combinations can reach 13--32$\times$ in our experiments. To efficiently explore the exponentially growing combination space, AgentOpt implements eight search algorithms, including Arm Elimination, Epsilon-LUCB, Threshold Successive Elimination, and Bayesian Optimization. Across four benchmarks, Arm Elimination recovers near-optimal accuracy while reducing evaluation budget by 24--67\% relative to brute-force search on three of four tasks. Code and benchmark results available at this https URL.

顶级标签: agents systems model evaluation
详细标签: client-side optimization model selection multi-step pipelines search algorithms cost efficiency 或 搜索:

AgentOpt v0.1 技术报告:面向基于大语言模型智能体的客户端优化 / AgentOpt v0.1 Technical Report: Client-Side Optimization for LLM-Based Agent


1️⃣ 一句话总结

这篇论文提出了首个框架无关的客户端智能体优化工具AgentOpt,它通过高效的搜索算法,帮助开发者在构建多步骤AI智能体时,自动选择最符合成本、质量和速度要求的大模型组合,从而显著降低部署成本。

源自 arXiv: 2604.06296