菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-04-07
📄 Abstract - Experience Transfer for Multimodal LLM Agents in Minecraft Game

Multimodal LLM agents operating in complex game environments must continually reuse past experience to solve new tasks efficiently. In this work, we propose Echo, a transfer-oriented memory framework that enables agents to derive actionable knowledge from prior interactions rather than treating memory as a passive repository of static records. To make transfer explicit, Echo decomposes reusable knowledge into five dimensions: structure, attribute, process, function, and interaction. This formulation allows the agent to identify recurring patterns shared across different tasks and infer what prior experience remains applicable in new situations. Building on this formulation, Echo leverages In-Context Analogy Learning (ICAL) to retrieve relevant experiences and adapt them to unseen tasks through contextual examples. Experiments in Minecraft show that, under a from-scratch learning setting, Echo achieves a 1.3x to 1.7x speed-up on object-unlocking tasks. Moreover, Echo exhibits a burst-like chain-unlocking phenomenon, rapidly unlocking multiple similar items within a short time interval after acquiring transferable experience. These results suggest that experience transfer is a promising direction for improving the efficiency and adaptability of multimodal LLM agents in complex interactive environments.

顶级标签: agents multi-modal llm
详细标签: experience transfer memory framework in-context learning minecraft multimodal agents 或 搜索:

面向《我的世界》游戏的多模态大语言模型智能体经验迁移研究 / Experience Transfer for Multimodal LLM Agents in Minecraft Game


1️⃣ 一句话总结

这项研究提出了一个名为Echo的记忆框架,通过将过去的游戏经验分解为五个维度并利用类比学习进行迁移,使得多模态AI智能体在《我的世界》游戏中学习新任务的速度提升了30%到70%,并能快速解锁一系列相似物品。

源自 arXiv: 2604.05533