菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-01-29
📄 Abstract - Embodied Task Planning via Graph-Informed Action Generation with Large Lanaguage Model

While Large Language Models (LLMs) have demonstrated strong zero-shot reasoning capabilities, their deployment as embodied agents still faces fundamental challenges in long-horizon planning. Unlike open-ended text generation, embodied agents must decompose high-level intent into actionable sub-goals while strictly adhering to the logic of a dynamic, observed environment. Standard LLM planners frequently fail to maintain strategy coherence over extended horizons due to context window limitation or hallucinate transitions that violate constraints. We propose GiG, a novel planning framework that structures embodied agents' memory using a Graph-in-Graph architecture. Our approach employs a Graph Neural Network (GNN) to encode environmental states into embeddings, organizing these embeddings into action-connected execution trace graphs within an experience memory bank. By clustering these graph embeddings, the framework enables retrieval of structure-aware priors, allowing agents to ground current decisions in relevant past structural patterns. Furthermore, we introduce a novel bounded lookahead module that leverages symbolic transition logic to enhance the agents' planning capabilities through the grounded action projection. We evaluate our framework on three embodied planning benchmarks-Robotouille Synchronous, Robotouille Asynchronous, and ALFWorld. Our method outperforms state-of-the-art baselines, achieving Pass@1 performance gains of up to 22% on Robotouille Synchronous, 37% on Asynchronous, and 15% on ALFWorld with comparable or lower computational cost.

顶级标签: llm agents systems
详细标签: embodied ai task planning graph neural networks action generation long-horizon planning 或 搜索:

基于图结构信息引导与大型语言模型的具身任务规划 / Embodied Task Planning via Graph-Informed Action Generation with Large Lanaguage Model


1️⃣ 一句话总结

这篇论文提出了一种名为GiG的新框架,它通过图神经网络和记忆库来组织环境信息,帮助大型语言模型驱动的机器人或虚拟智能体在复杂环境中进行更连贯、更高效的长程任务规划,并在多个测试平台上取得了显著优于现有方法的性能。

源自 arXiv: 2601.21841