The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences
Articles | Volume XLVIII-M-9-2025
https://doi.org/10.5194/isprs-archives-XLVIII-M-9-2025-1251-2025
03 Oct 2025

Towards LOD-2 Building Reconstruction: Leveraging Segmentation and Roof Shape Extraction Methods from VHR Imagery

Vaibhav Rajan, Sander Münster, Jonas Bruschke, and Ferdinand Maiwald

Keywords: LOD-2 Models, VHR Satellite Imagery, Roof Shape Detection, Building Segmentation, Edge Reconstruction, Feature Extraction

Abstract. Accurate extraction of roof structures from aerial imagery is a critical step in the creation of detailed 3D models for digital heritage reconstruction. This study explores a hybrid methodology that combines prompt-based segmentation with structured vector reconstruction to enhance the extraction of roof skeletons from Very High Resolution (VHR) orthophotos. Using HEAT (Holistic Edge Attention Transformer) as the primary reconstruction model, we fine-tuned it on a domain-specific dataset containing representative gabled and hipped roofs to adapt to the unique geometries found in the city of Jena, Germany. To test whether prior roof isolation could improve reconstruction performance, we integrated mask outputs from two segmentation models — RobustSAM and LangSAM — into the HEAT pipeline. While segmentation offered visually precise results in several instances, overall evaluation revealed that prior segmentation did not consistently improve HEAT’s reconstruction accuracy. These findings underscore HEAT’s robustness and adaptability, especially when properly fine-tuned. Moreover, while SAM variants did not significantly boost performance here, their ease of use and potential for improvement through domain-specific fine-tuning suggest promising applications in other contexts.
