Efficient Flood Detection through Hybrid Machine Learning and Metaheuristic Methods using Sentinel-1
Keywords: Flood Detection, Machine Learning, VGG-16, Hyperparameter Optimization, Metaheuristic Algorithms, Sentinel-1
Abstract. Floods are considered among the most destructive natural disasters, requiring precise and timely management. Remote sensing, utilizing diverse satellite imagery data, enables effective monitoring and assessment of flood impacts. In this context, machine learning and deep learning methods, as effective and scalable approaches, can significantly enhance the accuracy of flood detection and management by analyzing remote sensing data, thereby playing a crucial role in mitigating flood-related risks. In this study, to flood detection using Sentinel-1 SAR data, machine learning algorithms, including Random Forest (RF) and Histogram-based Gradient Boosting Decision Tree, were employed, along with two metaheuristic algorithms, Harris Hawks Optimization (HHO) and Ant Colony Optimization (ACO), for hyperparameter optimization. Additionally, to enhance the models' ability to detect flooded pixels and improve overall performance accurately, a pre-trained VGG-16 Neural Network was used as a deep feature extractor. Finally, four ensemble flood detection models—RF-HHO, RF-ACO, HGBDT-HHO, and HGBDT-ACO—were implemented, and their performance was evaluated and compared based on statistical metrics. Based on the obtained results, all four ensemble flood detection models demonstrated excellent performance in the validation and testing phases. The overall accuracy of these models reached over 95% in the validation phase and exceeded 97% in the testing phase. However, the HGBDT-ACO model achieved the highest accuracy and the lowest error rate in detecting flood pixels, making it the best-performing model in this study. Generally, HGBDT models showed a relative advantage over RF models, as they required significantly less time for training while achieving comparable results. Therefore, they were efficient and performed better in terms of computational complexity.