复杂背景下基于多尺度注意力与全局关系建模的小目标检测 / Small Object Detection in Complex Backgrounds with Multi-Scale Attention and Global Relation Modeling
1️⃣ 一句话总结
这篇论文提出了一种专门用于复杂背景下小目标检测的新方法,通过结合小波变换保留细节、全局建模抑制背景噪声以及跨尺度注意力融合多级特征,有效提升了小目标的检测精度和定位稳定性。
Small object detection under complex backgrounds remains a challenging task due to severe feature degradation, weak semantic representation, and inaccurate localization caused by downsampling operations and background interference. Existing detection frameworks are mainly designed for general objects and often fail to explicitly address the unique characteristics of small objects, such as limited structural cues and strong sensitivity to localization errors. In this paper, we propose a multi-level feature enhancement and global relation modeling framework tailored for small object detection. Specifically, a Residual Haar Wavelet Downsampling module is introduced to preserve fine-grained structural details by jointly exploiting spatial-domain convolutional features and frequency-domain representations. To enhance global semantic awareness and suppress background noise, a Global Relation Modeling module is employed to capture long-range dependencies at high-level feature stages. Furthermore, a Cross-Scale Hybrid Attention module is designed to establish sparse and aligned interactions across multi-scale features, enabling effective fusion of high-resolution details and high-level semantic information with reduced computational overhead. Finally, a Center-Assisted Loss is incorporated to stabilize training and improve localization accuracy for small objects. Extensive experiments conducted on the large-scale RGBT-Tiny benchmark demonstrate that the proposed method consistently outperforms existing state-of-the-art detectors under both IoU-based and scale-adaptive evaluation metrics. These results validate the effectiveness and robustness of the proposed framework for small object detection in complex environments.
复杂背景下基于多尺度注意力与全局关系建模的小目标检测 / Small Object Detection in Complex Backgrounds with Multi-Scale Attention and Global Relation Modeling
这篇论文提出了一种专门用于复杂背景下小目标检测的新方法,通过结合小波变换保留细节、全局建模抑制背景噪声以及跨尺度注意力融合多级特征,有效提升了小目标的检测精度和定位稳定性。
源自 arXiv: 2603.03788