Building Extraction Network Based on High-Resolution Remote Sensing Images
Keywords: Building extraction, Poly kernel inception network, Multi-scale feature fusion, High-resolution remote sensing images
Abstract. Segmenting buildings from the background in high-resolution remote sensing images faces several challenges, including difficulty in extracting multi-scale information, insufficient capture of long-range contextual information, and the underutilization of multi-scale features. Existing methods often struggle to capture features at different scales effectively, which limits segmentation accuracy. Long-range contextual information is also frequently overlooked, hindering the model’s ability to understand the global structure of buildings. In addition, balancing low-level detail with high-level semantic information makes it difficult to fuse multi-scale features from high-resolution imagery effectively. To address these issues, this paper proposes the Multi-Scale Multi-Kernel Building Extraction Network (MMAENet), which integrates the Poly Kernel Inception Network (PKINet) to substantially enhance multi-scale feature capture and improve the modelling of long-range contextual information. A Panoramic Feature Pyramid (PFP) structure is introduced to ensure full integration of both high-level and low-level information. Performance evaluation on the WHU Aerial dataset demonstrates that the model achieves higher building segmentation accuracy than ConvNeXt, PSPNet, and Swin Transformer.
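
To illustrate the multi-kernel idea the abstract refers to, the sketch below shows a simplified inception-style block that applies depthwise convolutions with progressively larger kernels and fuses the responses. The module name (PolyKernelBlock), the kernel sizes, and the channel handling are assumptions made for exposition; this is not the exact PKINet or MMAENet implementation described in the paper.

# Illustrative sketch only (PyTorch); names and hyperparameters are assumed.
import torch
import torch.nn as nn

class PolyKernelBlock(nn.Module):
    """Captures multi-scale features by running depthwise convolutions with
    increasing kernel sizes over a shared 1x1-projected input, then fusing
    the branch outputs with a pointwise convolution and a residual link."""

    def __init__(self, channels, kernel_sizes=(3, 5, 7, 9)):
        super().__init__()
        self.proj_in = nn.Conv2d(channels, channels, kernel_size=1)
        # One depthwise branch per kernel size; padding keeps the spatial size fixed.
        self.branches = nn.ModuleList(
            nn.Conv2d(channels, channels, kernel_size=k,
                      padding=k // 2, groups=channels)
            for k in kernel_sizes
        )
        self.proj_out = nn.Conv2d(channels, channels, kernel_size=1)
        self.act = nn.GELU()

    def forward(self, x):
        y = self.act(self.proj_in(x))
        # Sum the branch responses so local detail (small kernels) and wider
        # context (large kernels) are mixed without inflating the channel count.
        y = sum(branch(y) for branch in self.branches)
        return x + self.proj_out(y)  # residual connection

if __name__ == "__main__":
    feat = torch.randn(1, 64, 128, 128)       # a hypothetical encoder feature map
    print(PolyKernelBlock(64)(feat).shape)    # torch.Size([1, 64, 128, 128])

The large depthwise kernels widen the receptive field cheaply, which is one common way to approximate long-range context; how MMAENet combines this with the PFP fusion of high- and low-level features is detailed in the method section.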