Hydrological changes induced by extreme weather events have increased prolonged monsoon periods and localized heavy rainfall, rapidly saturating the ground and reducing the shear resistance of slopes. This, in turn, accelerates the activity along pote...
Hydrological changes induced by extreme weather events have increased prolonged monsoon periods and localized heavy rainfall, rapidly saturating the ground and reducing the shear resistance of slopes. This, in turn, accelerates the activity along potential failure surfaces and expands weak zones, thereby increasing the likelihood of slope failures.
Such slope hazards cause significant property damage and loss of life. Consequently, the Ministry of the Interior and Safety has been conduction Field surveys to identify steep slopes nationwide and implement management measures to prevent disasters.
Because slope failures occur unpredictably in both time and space, it is essential to evaluate the factors influencing slope stability and to manage them in advance in order to ensure disaster prevention and public safety.
The assessment of unstable steep slopes is typically performed using tools such as the “Disaster Risk Assessnent Table.” Establishing appropriate evaluation criteria and determining key assessment factors are therefore critical for accurately evaluation hazard levels and predicting failures. To improve the reliability of assessment tables, numerous studies have applied diverse methodologies.
Past research on developing assessment tables has utilized probabilistic and statistical approaches such as logistic regression, artificial neural networks, and satellite imagery analysis. Models based on AHP analysis derived from expert surveys have also been widely used. More recently, big data techniques, machine learning, and deep learning have been applied to overcome the limitations inherent in traditional mathematical and engineering approaches.
However, existing models that rely on binary classification(failure vs. non-failure) face limitations in data generation and sampling. In particular, the scarcity of field data on failed slopes has resulted in insufficient consideration of critical risk factors that strongly influence slope failure.
Therefore, this study aims to develop a framework for classifying steep slopes with high failure potential by applying various maching-learing algorithms to field-survey-based hazard-factor data and evaluating their classification performance. Using field investigation data from 7,791 non-failure slopes and 90 failure slopes, along with disaster risk assessment records, we conducted analyses and identified major risk factors influencing slope failure.
Analysis of failed slopes showed that failures predominantly occurred in rock slopes(77.9%), on irregular-shaped transverse profiles(62.2%), and in small-to medium-sized slopes(5-24m) with high gradients(>63°). For soil slopes, rotational failure and surface erosion were strongly associated with failure, whereas rock slopes were primarily affected by rockfall and well-developed joints.
For model training, a portion of the dataset was separated and cross-validation was performed. After hyperparameter tuning, the XGBoost model demonstrated superior performance across most evaluation metrics, with significantly higher sensitivity compared to other models, indication strong predictive capability for slope failures.
To analyze the relative importance of various factors, the XGBoost model was combined with SHAP, an explainable AI technique. The results showed that transverse shape, soil depth, rockfall, surface erosion, herbaceous vegetation density, woody vegetation density, joint development, slope length, and slope angle exhibited the highest variable importance in this order. These findings were further validated through SHAP summary plots, force plots, and waterfall plots for individual slope profiles.
The analysis indicates that the currently used disaster risk assessment table does not adequately capture the hazard characteristics of slopes, as discrepancies between risk factors and evaluation items contribute to underestimation-particularly in terms of social factors and the relative scale of failures. Based on the results of this study, we recommend the development of more refined data exploration techniques and continuous research to enhance the performance and reliability of machine learning models that account for characteristics of slope-failure datasets.