菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-03-08
📄 Abstract - Scale-Aware UAV-to-Satellite Cross-View Geo-Localization: A Semantic Geometric Approach

Cross-View Geo-Localization (CVGL) between UAV imagery and satellite images plays a crucial role in target localization and UAV self-positioning. However, most existing methods rely on the idealized assumption of scale consistency between UAV queries and satellite galleries, overlooking the severe scale ambiguity commonly encountered in real-world scenarios. This discrepancy leads to field-of-view misalignment and feature mismatch, significantly degrading CVGL robustness. To address this issue, we propose a geometric framework that recovers the absolute metric scale from monocular UAV images using semantic anchors. Specifically, small vehicles (SVs), characterized by relatively stable prior size distributions and high detectability, are exploited as metric references. A Decoupled Stereoscopic Projection Model is introduced to estimate the absolute image scale from these semantic targets. By decomposing vehicle dimensions into radial and tangential components, the model compensates for perspective distortions in 2D detections of 3D vehicles, enabling more accurate scale estimation. To further reduce intra-class size variation and detection noise, a dual-dimension fusion strategy with Interquartile Range (IQR)-based robust aggregation is employed. The estimated global scale is then used as a physical constraint for scale-adaptive satellite image cropping, improving UAV-to-satellite feature alignment. Experiments on augmented DenseUAV and UAV-VisLoc datasets demonstrate that the proposed method significantly improves CVGL robustness under unknown UAV image scales. Additionally, the framework shows strong potential for downstream applications such as passive UAV altitude estimation and 3D model scale recovery.

顶级标签: computer vision systems robotics
详细标签: cross-view geo-localization scale estimation semantic anchors uav localization monocular vision 或 搜索:

尺度感知的无人机到卫星跨视角地理定位:一种语义几何方法 / Scale-Aware UAV-to-Satellite Cross-View Geo-Localization: A Semantic Geometric Approach


1️⃣ 一句话总结

这篇论文提出了一种新方法,通过识别无人机图像中的小汽车作为‘尺子’,来估算图像的真实物理尺度,从而显著提升了无人机图像与卫星地图进行跨视角匹配定位的准确性和鲁棒性。

源自 arXiv: 2603.07535