菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-03-18
📄 Abstract - AERR-Nav: Adaptive Exploration-Recovery-Reminiscing Strategy for Zero-Shot Object Navigation

Zero-Shot Object Navigation (ZSON) in unknown multi-floor environments presents a significant challenge. Recent methods, mostly based on semantic value greedy waypoint selection, spatial topology-enhanced memory, and Multimodal Large Language Model (MLLM) as a decision-making framework, have led to improvements. However, these architectures struggle to balance exploration and exploitation for ZSON when encountering unseen environments, especially in multi-floor settings, such as robots getting stuck at narrow intersections, endlessly wandering, or failing to find stair entrances. To overcome these challenges, we propose AERR-Nav, a Zero-Shot Object Navigation framework that dynamically adjusts its state based on the robot's environment. Specifically, AERR-Nav has the following two key advantages: (1) An Adaptive Exploration-Recovery-Reminiscing Strategy, enables robots to dynamically transition between three states, facilitating specialized responses to diverse navigation scenarios. (2) An Adaptive Exploration State featuring Fast and Slow-Thinking modes helps robots better balance exploration, exploitation, and higher-level reasoning based on evolving environmental information. Extensive experiments on the HM3D and MP3D benchmarks demonstrate that our AERR-Nav achieves state-of-the-art performance among zero-shot methods. Comprehensive ablation studies further validate the efficacy of our proposed strategy and modules.

顶级标签: robotics agents computer vision
详细标签: zero-shot navigation exploration strategy multi-floor environments adaptive planning embodied ai 或 搜索:

AERR-Nav:用于零样本目标导航的自适应探索-恢复-回忆策略 / AERR-Nav: Adaptive Exploration-Recovery-Reminiscing Strategy for Zero-Shot Object Navigation


1️⃣ 一句话总结

这篇论文提出了一种名为AERR-Nav的新方法,通过让机器人在探索、恢复和回忆三种状态间智能切换,并采用快慢思考模式,有效解决了机器人在未知多层环境中寻找从未见过物体时容易迷路或卡住的问题,从而实现了更优的导航性能。

源自 arXiv: 2603.17712