CANTANTE: Optimizing Agentic Systems via Contrastive Credit Attribution

📄 Abstract - CANTANTE: Optimizing Agentic Systems via Contrastive Credit Attribution

LLM-based multi-agent systems have demonstrated strong performance across complex real-world tasks, such as software engineering, predictive modeling, and retrieval-augmented generation. Yet automating their configuration remains a structural challenge, as scores are available only at the system level, whereas the parameters governing agent behavior are local. We argue that optimizing these systems is fundamentally a credit-assignment problem. We therefore introduce CANTANTE, a framework that decomposes system-level rewards into per-agent update signals by contrasting rollouts of multiple joint configurations on the same query. We instantiate it for prompt optimization, treating agent prompts as learnable system parameters. We evaluate CANTANTE against GEPA and MIPROv2 on programming (MBPP), mathematical reasoning (GSM8K), and multi-hop question answering (HotpotQA). Across these benchmarks, CANTANTE achieves the best average rank among all evaluated optimizers and consistently outperforms unoptimized prompts. It improves over the strongest baseline by +18.9 percentage points on MBPP and +12.5 percentage points on GSM8K, while incurring a lower inference cost. It remains within one standard deviation of the strongest baseline on HotpotQA. Crucially, our credit correlation analysis confirms that the attributer produces meaningful per-agent signals rather than echoing the global system score.

CANTANTE：通过对比信用分配优化智能体系统 / CANTANTE: Optimizing Agentic Systems via Contrastive Credit Attribution

1️⃣ 一句话总结

本文提出了一种名为CANTANTE的框架，通过对比不同智能体组合在同一任务上的表现，将整个系统的奖励分数合理分配给每个智能体，从而自动优化多智能体系统的提示词，显著提升了编程、数学推理和多跳问答等任务的性能，同时降低了计算成本。

← 返回列表

菜单

AI 帮我研读全文

1️⃣ 一句话总结

密码管理

设置密码

修改密码

移除密码

菜单

AI 帮我研读全文

1️⃣ 一句话总结

获取最新论文摘要