菜单

关于 🐙 GitHub
arXiv 提交日期: 2025-12-15
📄 Abstract - UAGLNet: Uncertainty-Aggregated Global-Local Fusion Network with Cooperative CNN-Transformer for Building Extraction

Building extraction from remote sensing images is a challenging task due to the complex structure variations of the buildings. Existing methods employ convolutional or self-attention blocks to capture the multi-scale features in the segmentation models, while the inherent gap of the feature pyramids and insufficient global-local feature integration leads to inaccurate, ambiguous extraction results. To address this issue, in this paper, we present an Uncertainty-Aggregated Global-Local Fusion Network (UAGLNet), which is capable to exploit high-quality global-local visual semantics under the guidance of uncertainty modeling. Specifically, we propose a novel cooperative encoder, which adopts hybrid CNN and transformer layers at different stages to capture the local and global visual semantics, respectively. An intermediate cooperative interaction block (CIB) is designed to narrow the gap between the local and global features when the network becomes deeper. Afterwards, we propose a Global-Local Fusion (GLF) module to complementarily fuse the global and local representations. Moreover, to mitigate the segmentation ambiguity in uncertain regions, we propose an Uncertainty-Aggregated Decoder (UAD) to explicitly estimate the pixel-wise uncertainty to enhance the segmentation accuracy. Extensive experiments demonstrate that our method achieves superior performance to other state-of-the-art methods. Our code is available at this https URL

顶级标签: computer vision model training
详细标签: building extraction remote sensing cnn-transformer fusion uncertainty modeling semantic segmentation 或 搜索:

UAGLNet:一种用于建筑物提取的、结合CNN与Transformer协同工作的不确定性聚合全局-局部融合网络 / UAGLNet: Uncertainty-Aggregated Global-Local Fusion Network with Cooperative CNN-Transformer for Building Extraction


1️⃣ 一句话总结

这篇论文提出了一种名为UAGLNet的新方法,通过协同使用CNN和Transformer来更好地捕捉建筑物的全局和局部特征,并引入不确定性估计来提升从遥感图像中提取建筑物轮廓的准确性。


源自 arXiv: 2512.12941