菜单

🤖 系统
📄 Abstract - Nav-R1: Reasoning and Navigation in Embodied Scenes

Embodied navigation requires agents to integrate perception, reasoning, and action for robust interaction in complex 3D environments. Existing approaches often suffer from incoherent and unstable reasoning traces that hinder generalization across diverse environments, and difficulty balancing long-horizon semantic reasoning with low-latency control for real-time navigation. To address these challenges, we propose Nav-R1, an embodied foundation model that unifies reasoning in embodied environments. We first construct Nav-CoT-110K, a large-scale dataset of step-by-step Chains-of-Thought (CoT) for embodied tasks, which enables cold-start initialization with structured reasoning. Building on this foundation, we design a GRPO-based reinforcement learning framework with three complementary rewards: format, understanding, and navigation, to improve structural adherence, semantic grounding, and path fidelity. Furthermore, we introduce a Fast-in-Slow reasoning paradigm, decoupling deliberate semantic reasoning from low-latency reactive control for efficient yet coherent navigation. Extensive evaluations on embodied AI benchmarks demonstrate that Nav-R1 consistently outperforms strong baselines, with over 8% average improvement in reasoning and navigation performance. Real-world deployment on a mobile robot further validates its robustness under limited onboard resources. Code: this https URL. Website: this https URL.

顶级标签: robotics agents model training
详细标签: embodied navigation reasoning reinforcement learning chains-of-thought mobile robotics 或 搜索:

📄 论文总结

Nav-R1:具身场景中的推理与导航 / Nav-R1: Reasoning and Navigation in Embodied Scenes


1️⃣ 一句话总结

这篇论文提出了一个名为Nav-R1的智能体模型,它通过结合思维链数据集和强化学习奖励机制,解决了机器人在复杂环境中实时导航时推理不稳定和控制延迟的难题,显著提升了导航和推理性能。


📄 打开原文 PDF