Towards LOD-2 Building Reconstruction: Leveraging Segmentation and Roof Shape Extraction Methods from VHR Imagery

Rajan, Vaibhav; Münster, Sander; Bruschke, Jonas; Maiwald, Ferdinand

doi:https://doi.org/10.5194/isprs-archives-XLVIII-M-9-2025-1251-2025

Articles | Volume XLVIII-M-9-2025

https://doi.org/10.5194/isprs-archives-XLVIII-M-9-2025-1251-2025

© Author(s) 2025. This work is distributed under
the Creative Commons Attribution 4.0 License.

https://doi.org/10.5194/isprs-archives-XLVIII-M-9-2025-1251-2025

© Author(s) 2025. This work is distributed under
the Creative Commons Attribution 4.0 License.

Articles | Volume XLVIII-M-9-2025

03 Oct 2025

| 03 Oct 2025

Towards LOD-2 Building Reconstruction: Leveraging Segmentation and Roof Shape Extraction Methods from VHR Imagery

Vaibhav Rajan, Sander Münster, Jonas Bruschke, and Ferdinand Maiwald

Keywords: LOD-2 Models, VHR Satellite Imagery, Roof Shape Detection, Building Segmentation, Edge Reconstruction, Feature Extraction

Abstract. Accurate extraction of roof structures from aerial imagery is a critical step in the creation of detailed 3D models for digital heritage reconstruction. This study explores a hybrid methodology that combines prompt-based segmentation with structured vector reconstruction to enhance the extraction of roof skeletons from Very High Resolution (VHR) orthophotos. Using HEAT (Holistic Edge Attention Transformer) as the primary reconstruction model, we fine-tuned it on a domain-specific dataset containing representative gabled and hipped roofs to adapt to the unique geometries found in the city of Jena, Germany. To test whether prior roof isolation could improve reconstruction performance, we integrated mask outputs from two segmentation models — RobustSAM and LangSAM — into the HEAT pipeline. While segmentation offered visually precise results in several instances, overall evaluation revealed that prior segmentation did not consistently improve HEAT’s reconstruction accuracy. These findings underscore HEAT’s robustness and adaptability, especially when properly fine-tuned. Moreover, while SAM variants did not significantly boost performance here, their ease of use and potential for improvement through domain-specific fine-tuning suggest promising applications in other contexts.

Towards LOD-2 Building Reconstruction: Leveraging Segmentation and Roof Shape Extraction Methods from VHR Imagery

Useful Links

Useful External Links

Our Contact