连接价值观与行为:面向主动型具身智能体的层次化框架 / Bridging Values and Behavior: A Hierarchical Framework for Proactive Embodied Agents
1️⃣ 一句话总结
本文提出一个名为 ValuePlanner 的智能体架构,通过将高层价值观决策与低层动作执行分离,让机器人能像人类一样根据内在价值权衡来主动规划长期行为,解决了现有智能体只能被动执行指令或单纯满足需求的局限。
Current embodied agents are often limited to passive instruction-following or reactive need-satisfaction, lacking a stable, high-order value framework essential for long-term, self-directed behavior and resolving motivational conflicts. We introduce \textit{ValuePlanner}, a hierarchical cognitive architecture that decouples high-level value scheduling from low-level action execution. \textit{ValuePlanner} employs an LLM-based cognitive module to generate symbolic subgoals by reasoning through abstract value trade-offs, which are then translated into executable action plans by a classical PDDL planner. This process is refined via a closed-loop feedback mechanism. Evaluating such autonomy requires methods beyond task-success rates, and we therefore propose a value-centric evaluation suite measuring cumulative value gain, preference alignment, and behavioral diversity. Experiments in the TongSim household environment demonstrate that \textit{ValuePlanner} arbitrates competing values to generate coherent, long-horizon, self-directed behavior absent from instruction-following and needs-driven baselines. Our work offers a structured approach to bridging intrinsic values and grounded behavior for autonomous agents.
连接价值观与行为:面向主动型具身智能体的层次化框架 / Bridging Values and Behavior: A Hierarchical Framework for Proactive Embodied Agents
本文提出一个名为 ValuePlanner 的智能体架构,通过将高层价值观决策与低层动作执行分离,让机器人能像人类一样根据内在价值权衡来主动规划长期行为,解决了现有智能体只能被动执行指令或单纯满足需求的局限。
源自 arXiv: 2604.27699