菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-02-09
📄 Abstract - PRISM: A Principled Framework for Multi-Agent Reasoning via Gain Decomposition

Multi-agent collaboration has emerged as a promising paradigm for enhancing reasoning capabilities of Large Language Models (LLMs). However, existing approaches remain largely heuristic, lacking principled guidance on what drives performance gains and how to systematically optimize multi-agent reasoning. Specifically, it remains unclear why multi-agent collaboration outperforms single-agent reasoning and which design choices contribute most to these gains, making it difficult to build better systems. We address this gap by introducing a unified theoretical framework that decomposes multi-agent reasoning gains into three conceptually independent dimensions: Exploration for diverse solution coverage, Information for high-fidelity feedback, and Aggregation for principled consensus. Through this lens, existing methods can be understood as special cases that optimize only subsets of these dimensions. Building upon this decomposition, a novel framework called PRISM (Propose-Review-Integrate Synthesis for Multi-agent Reasoning) is proposed, which jointly maximizes all three dimensions through role-based diversity, execution-grounded feedback with evidence-based cross-evaluation, and iterative synthesis with closed-loop validation. Extensive experiments across mathematical reasoning, code generation, and function calling benchmarks demonstrate that PRISM achieves state-of-the-art performance with superior compute-efficiency compared to methods optimizing partial dimensions. The theoretical framework provides actionable design principles for future multi-agent reasoning systems.

顶级标签: llm agents theory
详细标签: multi-agent reasoning gain decomposition theoretical framework role-based diversity principled consensus 或 搜索:

PRISM:一种基于增益分解的多智能体推理原则性框架 / PRISM: A Principled Framework for Multi-Agent Reasoning via Gain Decomposition


1️⃣ 一句话总结

这篇论文提出了一个名为PRISM的理论框架,将多智能体协作提升大语言模型推理能力的原因分解为探索、信息和聚合三个核心维度,并基于此设计了一个能同时优化这三个维度的新系统,在多个任务上取得了领先的性能和计算效率。

源自 arXiv: 2602.08586