菜单

关于 🐙 GitHub
arXiv 提交日期: 2025-12-22
📄 Abstract - LoGoPlanner: Localization Grounded Navigation Policy with Metric-aware Visual Geometry

Trajectory planning in unstructured environments is a fundamental and challenging capability for mobile robots. Traditional modular pipelines suffer from latency and cascading errors across perception, localization, mapping, and planning modules. Recent end-to-end learning methods map raw visual observations directly to control signals or trajectories, promising greater performance and efficiency in open-world settings. However, most prior end-to-end approaches still rely on separate localization modules that depend on accurate sensor extrinsic calibration for self-state estimation, thereby limiting generalization across embodiments and environments. We introduce LoGoPlanner, a localization-grounded, end-to-end navigation framework that addresses these limitations by: (1) finetuning a long-horizon visual-geometry backbone to ground predictions with absolute metric scale, thereby providing implicit state estimation for accurate localization; (2) reconstructing surrounding scene geometry from historical observations to supply dense, fine-grained environmental awareness for reliable obstacle avoidance; and (3) conditioning the policy on implicit geometry bootstrapped by the aforementioned auxiliary tasks, thereby reducing error this http URL evaluate LoGoPlanner in both simulation and real-world settings, where its fully end-to-end design reduces cumulative error while metric-aware geometry memory enhances planning consistency and obstacle avoidance, leading to more than a 27.3\% improvement over oracle-localization baselines and strong generalization across embodiments and environments. The code and models have been made publicly available on the \href{this https URL}{project page}.

顶级标签: robotics computer vision agents
详细标签: visual navigation end-to-end planning metric geometry obstacle avoidance localization 或 搜索:

LoGoPlanner:基于定位与度量感知视觉几何的导航策略 / LoGoPlanner: Localization Grounded Navigation Policy with Metric-aware Visual Geometry


1️⃣ 一句话总结

这篇论文提出了一种名为LoGoPlanner的新型端到端导航框架,它通过整合度量感知的视觉几何理解和历史观测信息,让机器人在没有独立定位模块的情况下,也能在复杂未知环境中实现更精准、更鲁棒的自主导航和避障。

源自 arXiv: 2512.19629