菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-03-16
📄 Abstract - Thermal Image Refinement with Depth Estimation using Recurrent Networks for Monocular ORB-SLAM3

Autonomous navigation in GPS-denied and visually degraded environments remains challenging for unmanned aerial vehicles (UAVs). To this end, we investigate the use of a monocular thermal camera as a standalone sensor on a UAV platform for real-time depth estimation and simultaneous localization and mapping (SLAM). To extract depth information from thermal images, we propose a novel pipeline employing a lightweight supervised network with recurrent blocks (RBs) integrated to capture temporal dependencies, enabling more robust predictions. The network combines lightweight convolutional backbones with a thermal refinement network (T-RefNet) to refine raw thermal inputs and enhance feature visibility. The refined thermal images and predicted depth maps are integrated into ORB-SLAM3, enabling thermal-only localization. Unlike previous methods, the network is trained on a custom non-radiometric dataset, obviating the need for high-cost radiometric thermal cameras. Experimental results on datasets and UAV flights demonstrate competitive depth accuracy and robust SLAM performance under low-light conditions. On the radiometric VIVID++ (indoor-dark) dataset, our method achieves an absolute relative error of approximately 0.06, compared to baselines exceeding 0.11. In our non-radiometric indoor set, baseline errors remain above 0.24, whereas our approach remains below 0.10. Thermal-only ORB-SLAM3 maintains a mean trajectory error under 0.4 m.

顶级标签: computer vision robotics systems
详细标签: thermal imaging depth estimation monocular slam recurrent networks uav navigation 或 搜索:

利用循环网络进行深度估计的热图像优化及其在单目ORB-SLAM3中的应用 / Thermal Image Refinement with Depth Estimation using Recurrent Networks for Monocular ORB-SLAM3


1️⃣ 一句话总结

这项研究提出了一种新方法,通过一个轻量级神经网络优化热成像图像并估计深度,从而让无人机仅凭一个普通热像摄像头就能在黑暗或无GPS信号的环境中实现精准定位和地图构建。

源自 arXiv: 2603.14998