菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-03-11
📄 Abstract - Resource-constrained Amazons chess decision framework integrating large language models and graph attention

Artificial intelligence has advanced significantly through the development of intelligent game-playing systems, providing rigorous testbeds for decision-making, strategic planning, and adaptive learning. However, resource-constrained environments pose critical challenges, as conventional deep learning methods heavily rely on extensive datasets and computational resources. In this paper, we propose a lightweight hybrid framework for the Game of the Amazons, which explores the paradigm of weak-to-strong generalization by integrating the structural reasoning of graph-based learning with the generative capabilities of large language models. Specifically, we leverage a Graph Attention Autoencoder to inform a multi-step Monte Carlo Tree Search, utilize a Stochastic Graph Genetic Algorithm to optimize evaluation signals, and harness GPT-4o-mini to generate synthetic training data. Unlike traditional approaches that rely on expert demonstrations, our framework learns from noisy and imperfect supervision. We demonstrate that the Graph Attention mechanism effectively functions as a structural filter, denoising the LLM's outputs. Experiments on a 10$\times$10 Amazons board show that our hybrid approach not only achieves a 15\%--56\% improvement in decision accuracy over baselines but also significantly outperforms its teacher model (GPT-4o-mini), achieving a competitive win rate of 45.0\% at N=30 nodes and a decisive 66.5\% at only N=50 nodes. These results verify the feasibility of evolving specialized, high-performance game AI from general-purpose foundation models under stringent computational constraints.

顶级标签: llm agents model training
详细标签: game ai graph attention monte carlo tree search synthetic data generation weak-to-strong generalization 或 搜索:

集成大语言模型与图注意力的资源受限亚马逊棋决策框架 / Resource-constrained Amazons chess decision framework integrating large language models and graph attention


1️⃣ 一句话总结

这篇论文提出了一种在计算资源有限的情况下,通过结合图注意力网络和大语言模型来提升亚马逊棋AI决策能力的新方法,该方法无需依赖大量专家数据,仅利用有噪声的监督信号就能训练出超越其教师模型的强大游戏智能。

源自 arXiv: 2603.10512