A 3D INDOOR-OUTDOOR BENCHMARK DATASET FOR LoD3 BUILDING POINT CLOUD SEMANTIC SEGMENTATION
Keywords: 3D Building Benchmark, Deep Learning, Machine Learning, Indoor/Outdoor Dataset, Mesh, Point Cloud
Abstract. Deep learning (DL) algorithms require high quality training samples as well as accurate and thorough annotations to work effectively. Up until now a limited number of datasets are available to train DL techniques for semantic segmentation of 3D building point clouds, except a few ones focusing on specific categories of constructions (e.g., cultural heritage buildings). This paper presents a new 3D Indoor/Outdoor building dataset (BIO dataset), which is aimed to provide a highly accurate, detailed, and comprehensive dataset to be used for applications related to sematic classification of buildings based on point clouds and meshes. This benchmark dataset contains 100 building models generated from existing polygonal models and belonging to different categories. These include commercial buildings, residential houses, industrial and institutional buildings. Structural elements of buildings are annotated into 11 semantic categories, following standards from IFC and CityGML. To verify the applicability of the BIO dataset for the semantic segmentation task, it has been successfully tested by using one machine learning technique and four different DL algorithms.