DOI: 10.3390/su18126347 ISSN: 2071-1050

Cluster-Based Interpretable Machine Learning for Landslide Susceptibility Mapping: A Case Study in Northern Guangdong

Zhanhui Qing, Wenfeng Cui, Chuangeng Sun, Zhiwen Zheng, Wei Zhang, Jinxiang Li, Muhammad Zeeshan Ali

Operational landslide susceptibility mapping (LSM) remains challenging in regions with pronounced geo-environmental heterogeneity, where single global models often overlook spatially variable landslide-environment relationships. Northern Guangdong, China, is a typical humid mountainous region where steep terrain, diverse lithology, and highly variable rainfall produce non-stationary landslide controls. To address this challenge, we develop a cluster-informed LSM framework that integrates unsupervised consensus K-means sub-zoning with localized Random Forest (RF) models and SHapley Additive exPlanations (SHAP). We use a harmonized inventory of 1510 landslides (2011–2022), together with twelve 30 m conditioning factors, for model training and validation. Compared with logistic regression, Support Vector Machines (SVM), and Light Gradient Boosting Machine (LightGBM), RF consistently achieves higher accuracy across clusters, and the cluster-wise RF ensemble attains pooled ACC = 0.8212, F1 = 0.8176, and AUC = 0.8956. SHAP highlights both regionally consistent predictors (e.g., NDVI, distance to road) and distinct cluster-specific controls linked to geomorphic and hydrologic settings. The proposed framework enhances predictive accuracy, produces finer susceptibility gradients, and yields better-calibrated probability estimates than a single global model. These results demonstrate that explicitly accounting for geo-environmental heterogeneity can generate interpretable, spatially adaptive susceptibility outputs. By identifying high-risk zones for priority monitoring, land-use regulation, infrastructure protection, and mitigation planning, the proposed framework provides a practical decision-support tool for sustainable mountain development and disaster risk reduction in heterogeneous mountainous regions.

More from our Archive