MAP: A Map-then-Act Paradigm for Long-Horizon Interactive Agent Reasoning

📄 Abstract - MAP: A Map-then-Act Paradigm for Long-Horizon Interactive Agent Reasoning

Current interactive LLM agents rely on goal-conditioned stepwise planning, where environmental understanding is acquired reactively during execution rather than established beforehand. This temporal inversion leads to Delayed Environmental Perception: agents must infer environmental constraints through trial-and-error, resulting in an Epistemic Bottleneck that traps them in inefficient failure cycles. Inspired by human affordance perception and cognitive map theory, we propose the Map-then-Act Paradigm (MAP), a plug-and-play framework that shifts environment understanding before execution. MAP consists of three stages: (1) Global Exploration, acquiring environment-general priors; (2) Task-Specific Mapping, constructing a structured cognitive map; and (3) Knowledge-Augmented Execution, solving tasks grounded on the map. Experiments show consistent gains across benchmarks and LLMs. On ARC-AGI-3, MAP enables frontier models to surpass near-zero baseline performance in 22 of 25 game environments. We further introduce MAP-2K, a dataset of map-then-act trajectories, and show that training on it outperforms expert execution traces, suggesting that understanding environments is more fundamental than imitation.

MAP：一种用于长期交互式智能体推理的“先构建地图，再行动”范式 / MAP: A Map-then-Act Paradigm for Long-Horizon Interactive Agent Reasoning

1️⃣ 一句话总结

这篇论文提出了一种名为MAP的新型智能体框架，让AI在执行复杂任务前先主动探索环境、构建一张结构化的“认知地图”，从而避免了传统方法靠反复试错才能理解环境的低效循环，实验证明该范式在多种游戏和基准测试中显著提升了性能，甚至比直接模仿专家行为更有效。

← 返回列表

菜单

AI 帮我研读全文

1️⃣ 一句话总结

密码管理

设置密码

修改密码

移除密码

菜单

AI 帮我研读全文

1️⃣ 一句话总结

获取最新论文摘要