零训练导向的遥感目标检测与分割方法 / ZODS-RS -- Zero-training Oriented Detection & Segmentation for Remote Sensing
1️⃣ 一句话总结
该论文提出了一种无需任何训练即可同时完成遥感图像中水平框检测和实例分割的统一方法ZODS-RS,通过结合DINOv3特征与SAM提案,并设计旋转尺度适配、原型纯化等算法,在飞机、舰船等小目标密集场景及不同数据集下均取得了显著优于现有零训练方法的效果。
Remote-sensing and UAV applications need models that generalize across platforms and viewpoints without task-specific training. Yet training-free pipelines often falter on oriented geometry, scale/rotation variation, and crowded ports or airfields, and rarely unify detection and segmentation. We introduce ZODS-RS, a training-free, closed-form pipeline that outputs horizontal boxes (HBB) and instance masks. Built on DINOv3 dense features and SAM-style proposals, ZODS-RS chains: PP (prototype purification via Tyler covariance), R-SEM (rotation-scale equivariant matching with separable kernels and global Hungarian assignment), and UAM (uncertainty-aware pixelwise merging with adaptive priors and optional negative prototypes). A lightweight CWLA fuses multiple DINOv3 layers. On FAIR1M (HBB) we obtain $\mathrm{mAP}_{0.50:0.95}=\mathbf{13.06}$ and $\mathrm{AP}_S=\mathbf{2.93}$ \emph{(class-averaged over ship/airplane)}; on xView (HBB) we report $\mathrm{mAP}=\mathbf{16.69}$. On our UAV dataset, ZODS-RS achieves mask $\mathrm{mIoU}=\mathbf{31.10}$ and improves small-object AP by $\mathbf{+30.70}$ over Grounded-SAM on a single 5090. This work offers a unified, \emph{no-training} solution for horizontal-box detection plus instance segmentation in aerial imagery; provides explicit closed-form formulations for PP/R-SEM/UAM tightly coupled with DINOv3; and demonstrates \emph{consistent} gains on small and crowded targets and under cross-domain shifts while keeping deployment simple.
零训练导向的遥感目标检测与分割方法 / ZODS-RS -- Zero-training Oriented Detection & Segmentation for Remote Sensing
该论文提出了一种无需任何训练即可同时完成遥感图像中水平框检测和实例分割的统一方法ZODS-RS,通过结合DINOv3特征与SAM提案,并设计旋转尺度适配、原型纯化等算法,在飞机、舰船等小目标密集场景及不同数据集下均取得了显著优于现有零训练方法的效果。
源自 arXiv: 2606.10769