ELITE: Experiential Learning and Intent-Aware Transfer for Self-improving Embodied Agents

📄 Abstract - ELITE: Experiential Learning and Intent-Aware Transfer for Self-improving Embodied Agents

Vision-language models (VLMs) have shown remarkable general capabilities, yet embodied agents built on them fail at complex tasks, often skipping critical steps, proposing invalid actions, and repeating mistakes. These failures arise from a fundamental gap between the static training data of VLMs and the physical interaction for embodied tasks. VLMs can learn rich semantic knowledge from static data but lack the ability to interact with the world. To address this issue, we introduce ELITE, an embodied agent framework with {E}xperiential {L}earning and {I}ntent-aware {T}ransfer that enables agents to continuously learn from their own environment interaction experiences, and transfer acquired knowledge to procedurally similar tasks. ELITE operates through two synergistic mechanisms, \textit{i.e.,} self-reflective knowledge construction and intent-aware retrieval. Specifically, self-reflective knowledge construction extracts reusable strategies from execution trajectories and maintains an evolving strategy pool through structured refinement operations. Then, intent-aware retrieval identifies relevant strategies from the pool and applies them to current tasks. Experiments on the EB-ALFRED and EB-Habitat benchmarks show that ELITE achieves 9\% and 5\% performance improvement over base VLMs in the online setting without any supervision. In the supervised setting, ELITE generalizes effectively to unseen task categories, achieving better performance compared to state-of-the-art training-based methods. These results demonstrate the effectiveness of ELITE for bridging the gap between semantic understanding and reliable action execution.

ELITE：面向自我提升具身智能体的经验学习与意图感知迁移框架 / ELITE: Experiential Learning and Intent-Aware Transfer for Self-improving Embodied Agents

1️⃣ 一句话总结

这篇论文提出了一个名为ELITE的智能体框架，它能让机器人通过‘在实践中反思和总结’的方式，自动从自己的失败经验中学习有效策略，并把这些策略灵活应用到类似的新任务上，从而显著提升其在复杂物理环境中完成任务的可靠性和成功率。

← 返回列表

菜单

AI 帮我研读全文

1️⃣ 一句话总结

密码管理

设置密码

修改密码

移除密码

菜单

AI 帮我研读全文

1️⃣ 一句话总结

获取最新论文摘要