菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-05-13
📄 Abstract - CANTANTE: Optimizing Agentic Systems via Contrastive Credit Attribution

LLM-based multi-agent systems have demonstrated strong performance across complex real-world tasks, such as software engineering, predictive modeling, and retrieval-augmented generation. Yet automating their configuration remains a structural challenge, as scores are available only at the system level, whereas the parameters governing agent behavior are local. We argue that optimizing these systems is fundamentally a credit-assignment problem. We therefore introduce CANTANTE, a framework that decomposes system-level rewards into per-agent update signals by contrasting rollouts of multiple joint configurations on the same query. We instantiate it for prompt optimization, treating agent prompts as learnable system parameters. We evaluate CANTANTE against GEPA and MIPROv2 on programming (MBPP), mathematical reasoning (GSM8K), and multi-hop question answering (HotpotQA). Across these benchmarks, CANTANTE achieves the best average rank among all evaluated optimizers and consistently outperforms unoptimized prompts. It improves over the strongest baseline by +18.9 percentage points on MBPP and +12.5 percentage points on GSM8K, while incurring a lower inference cost. It remains within one standard deviation of the strongest baseline on HotpotQA. Crucially, our credit correlation analysis confirms that the attributer produces meaningful per-agent signals rather than echoing the global system score.

顶级标签: agents llm machine learning
详细标签: credit assignment prompt optimization multi-agent systems benchmark 或 搜索:

CANTANTE:通过对比信用分配优化智能体系统 / CANTANTE: Optimizing Agentic Systems via Contrastive Credit Attribution


1️⃣ 一句话总结

本文提出了一种名为CANTANTE的框架,通过对比不同智能体组合在同一任务上的表现,将整个系统的奖励分数合理分配给每个智能体,从而自动优化多智能体系统的提示词,显著提升了编程、数学推理和多跳问答等任务的性能,同时降低了计算成本。

源自 arXiv: 2605.13295