DEEP LEARNING-BASED DOOR AND WINDOW DETECTION FROM BUILDING FAÇADE

Sezen, G.; Cakir, M.; Atik, M. E.; Duran, Z.

doi:https://doi.org/10.5194/isprs-archives-XLIII-B4-2022-315-2022

Articles | Volume XLIII-B4-2022

https://doi.org/10.5194/isprs-archives-XLIII-B4-2022-315-2022

© Author(s) 2022. This work is distributed under
the Creative Commons Attribution 4.0 License.

https://doi.org/10.5194/isprs-archives-XLIII-B4-2022-315-2022

© Author(s) 2022. This work is distributed under
the Creative Commons Attribution 4.0 License.

Articles | Volume XLIII-B4-2022

01 Jun 2022

| 01 Jun 2022

DEEP LEARNING-BASED DOOR AND WINDOW DETECTION FROM BUILDING FAÇADE

G. Sezen, M. Cakir, M. E. Atik, and Z. Duran

Keywords: Deep learning, Building Façade Elements, Object Detection, YOLO, Faster R-CNN

Abstract. Detecting building façade elements is a crucial problem in computer vision for image interpretation. In Building Information Modeling (BIM) studies, the detection of building façade elements has an important role. BIM is a tool that allows maintaining a digital representation of all aspects of building information; therefore, it will enable the storage of almost any data related to a given structure, regarding its geometric and non-geometric aspects. Façade segmentation was first studied in the 1970s using hand-crafted expertise. Later, detection and segmentation studies emerged based on shapes of objects and parametric rules. With the developing technology, deep learning approaches in object detection studies have intensified. It is obvious that the desired analyses can be performed faster with deep learning approaches. However, deep learning methods require large training data. Algorithms that consider different situations and are suitable for real-world scenarios continue to be developed. The need in this direction continues in the literature. In this study, door and window detection was carried out with deep learning on an original data set. The algorithms used are YOLOv3, YOLOv4, YOLOv5, and Faster R-CNN. Precision, recall and mean average precision (mAP) are used as evaluation metrics. As a result of the study, precision, recall, and mAP values with YOLOv5 were obtained as 0.85, 0.72, and 0.79, respectively. With Faster R-CNN with the lowest performance, precision, recall, and mAP were obtained as 0.54, 0.63, and 0.54, respectively.

DEEP LEARNING-BASED DOOR AND WINDOW DETECTION FROM BUILDING FAÇADE

Useful Links

Useful External Links

Our Contact