📄
Abstract - Critic-Guided Heterogeneous Multi-Agent Reasoning for Reliable Mathematical Problem Solving
Recent Large Language Models (LLMs) have shown impressive reasoning abilities; but they are still susceptible to hallucinations, intermediate reasoning mistakes, and unreliable reasoning results in complex mathematical reasoning problems. In this study, we introduce a critic-based heterogeneous multi-agent approach to improve the dependability of mathematical reasoning. This framework incorporates several LLM agents of different specialties and employs a critic-driven adaptive learning system to assess and guide the reasoning process based on intermediate feedback. The system adopts a generator-validator framework, with the validator not only determining correctness but also offering critiques to guide regeneration of solutions. This allows for adaptive error correction and prevents error cascading. Our experiments on the GSM8K benchmark show that the proposed method achieves up to 13% accuracy improvement over single-shot and non-critic models. Additionally, findings suggest that heterogeneity and critique reduce the need for large models, allowing smaller models to perform on par. Ablation studies reveal the main performance gains are due to the critic-based feedback loop and not model size. In summary, the proposed approach showcases the benefits of combining heterogeneous multi-agent collaboration and critique to obtain reliable and interpretable reasoning systems.
基于批判引导的异构多智能体推理:实现可靠的数学问题求解 /
Critic-Guided Heterogeneous Multi-Agent Reasoning for Reliable Mathematical Problem Solving
1️⃣ 一句话总结
本文提出一种融合多个不同专长的大语言模型(LLM)智能体的方法,通过一个“生成器-验证器”框架,让验证器不仅能判断答案对错,还能给出具体改进建议,从而在数学推理中自适应纠错、防止错误累积,并在GSM8K数据集上显著提升了13%的准确率,且允许用小模型达到与大模型相当的效果。