<i>k</i>CV-B: BOOTSTRAP WITH CROSS-VALIDATION FOR DEEP LEARNING MODEL DEVELOPMENT, ASSESSMENT AND SELECTION

Nurunnabi, A.; Teferle, F. N.; Laefer, D. F.; Remondino, F.; Karas, I. R.; Li, J.

doi:10.5194/isprs-archives-XLVIII-4-W3-2022-111-2022

Articles | Volume XLVIII-4/W3-2022

https://doi.org/10.5194/isprs-archives-XLVIII-4-W3-2022-111-2022

© Author(s) 2022. This work is distributed under
the Creative Commons Attribution 4.0 License.

https://doi.org/10.5194/isprs-archives-XLVIII-4-W3-2022-111-2022

© Author(s) 2022. This work is distributed under
the Creative Commons Attribution 4.0 License.

Articles | Volume XLVIII-4/W3-2022

02 Dec 2022

| 02 Dec 2022

kCV-B: BOOTSTRAP WITH CROSS-VALIDATION FOR DEEP LEARNING MODEL DEVELOPMENT, ASSESSMENT AND SELECTION

A. Nurunnabi, F. N. Teferle, D. F. Laefer, F. Remondino, I. R. Karas, and J. Li

Keywords: Classification, Cross-Validation, Neural Network, PointNet, Semantic Segmentation, Supervised Machine Learning

Abstract. This study investigates the inability of two popular data splitting techniques: train/test split and k-fold cross-validation that are to create training and validation data sets, and to achieve sufficient generality for supervised deep learning (DL) methods. This failure is mainly caused by their limited ability of new data creation. In response, the bootstrap is a computer based statistical resampling method that has been used efficiently for estimating the distribution of a sample estimator and to assess a model without having knowledge about the population. This paper couples cross-validation and bootstrap to have their respective advantages in view of data generation strategy and to achieve better generalization of a DL model. This paper contributes by: (i) developing an algorithm for better selection of training and validation data sets, (ii) exploring the potential of bootstrap for drawing statistical inference on the necessary performance metrics (e.g., mean square error), and (iii) introducing a method that can assess and improve the efficiency of a DL model. The proposed method is applied for semantic segmentation and is demonstrated via a DL based classification algorithm, PointNet, through aerial laser scanning point cloud data.

kCV-B: BOOTSTRAP WITH CROSS-VALIDATION FOR DEEP LEARNING MODEL DEVELOPMENT, ASSESSMENT AND SELECTION

Useful Links

Useful External Links

Our Contact