UAGLNet: Uncertainty-Aggregated Global-Local Fusion Network with Cooperative CNN-Transformer for Building Extraction

This repository contains the official implementation of UAGLNet, a model for building extraction from remote sensing images, as presented in the paper "UAGLNet: Uncertainty-Aggregated Global-Local Fusion Network with Cooperative CNN-Transformer for Building Extraction".

UAGLNet addresses the challenges of building extraction from remote sensing images due to complex structure variations. It proposes an Uncertainty-Aggregated Global-Local Fusion Network capable of exploiting high-quality global-local visual semantics under the guidance of uncertainty modeling. Specifically, it features a novel cooperative encoder with hybrid CNN and transformer layers, an intermediate cooperative interaction block (CIB) to narrow feature gaps, and a Global-Local Fusion (GLF) module. Additionally, an Uncertainty-Aggregated Decoder (UAD) is introduced to explicitly estimate pixel-wise uncertainty and mitigate segmentation ambiguity in uncertain regions.

Paper

Code

Main Results

The following table presents the performance of UAGLNet on building extraction benchmarks.

Benchmark IoU F1 P R Weight
Inria 83.74 91.15 92.09 90.22 UAGLNet_Inria
Mass 76.97 86.99 88.28 85.73 UAGLNet_Mass
WHU 92.07 95.87 96.21 95.54 UAGLNet_WHU

Citation

If you find this project useful in your research, please cite it as:

@article{UAGLNet,
  title   = {UAGLNet: Uncertainty-Aggregated Global-Local Fusion Network with Cooperative CNN-Transformer for Building Extraction}, 
  author  = {Siyuan Yao and Dongxiu Liu and Taotao Li and Shengjie Li and Wenqi Ren and Xiaochun Cao},
  journal = {arXiv preprint arXiv:2512.12941},
  year    = {2025}
}
Downloads last month
20
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Collection including ldxxx/UAGLNet_Backbone