Flood risk mapping and performance efficiency evaluation of machine learning algorithms: Best practice in northern Iran

Shirmohammadi, Mahdieh; Pirasteh, Saied; Li, Weilian; Mafi-Gholami, Davood

doi:https://doi.org/10.5194/isprs-archives-XLVIII-G-2025-1347-2025

Articles | Volume XLVIII-G-2025

https://doi.org/10.5194/isprs-archives-XLVIII-G-2025-1347-2025

© Author(s) 2025. This work is distributed under
the Creative Commons Attribution 4.0 License.

https://doi.org/10.5194/isprs-archives-XLVIII-G-2025-1347-2025

© Author(s) 2025. This work is distributed under
the Creative Commons Attribution 4.0 License.

Articles | Volume XLVIII-G-2025

31 Jul 2025

| 31 Jul 2025

Flood risk mapping and performance efficiency evaluation of machine learning algorithms: Best practice in northern Iran

Mahdieh Shirmohammadi, Saied Pirasteh, Weilian Li, and Davood Mafi-Gholami

Keywords: Flood, Machine Learning, GIS, Risk Mapping, Performance Efficiency

Abstract. Flooding is one of the most devastating natural hazards, and inadequate management can amplify its impacts, leading to severe social, economic, and environmental consequences. Accurate and efficient flood risk mapping is essential for mitigating these effects and supporting effective disaster management strategies. However, challenges remain in optimizing the accuracy and reliability of machine learning (ML) algorithms for flood susceptibility assessment. In this study, we applied several ML algorithms, including Random Forest (RF), XGBoost (Extreme Gradient Boosting), LightGBM, CatBoost, and Support Vector Machine (SVM), to develop flood risk maps for a region in northern Iran. For the analysis, we selected a comprehensive set of environmental and geographical parameters influencing flood susceptibility. These included the Digital Elevation Model (DEM), slope, aspect, Topographic Wetness Index (TWI), Stream Power Index (SPI), river distance, river density, rainfall, lithology, Normalized Difference Vegetation Index (NDVI), Normalized Difference Moisture Index (NDMI), soil texture, and land use. Data processing, feature extraction, and model training were conducted using Python, Google Earth Engine, and ArcGIS. Our results demonstrate a strong level of consistency across the models. XGBoost achieved the highest Area Under the Curve (AUC) of 0.87, closely followed by CatBoost at 0.86, Random Forest (RF), and LightGBM, each reaching 0.85. SVM recorded a slightly lower AUC of 0.82. These findings underscore the robust performance of advanced ML algorithms, particularly ensemble methods with tree-based structures, in flood risk mapping, especially within complex environmental contexts.

Flood risk mapping and performance efficiency evaluation of machine learning algorithms: Best practice in northern Iran

Useful Links

Useful External Links

Our Contact