中国安全科学学报 ›› 2022, Vol. 32 ›› Issue (7): 113-120.doi: 10.16265/j.cnki.issn1003-3033.2022.07.1473

• 安全工程技术 • 上一篇    下一篇

基于LightGBM算法的边坡稳定性预测研究

张凯1(), 张科1,2,**()   

  1. 1 昆明理工大学 电力工程学院,云南 昆明 650500
    2 昆明理工大学 建筑工程学院,云南 昆明 650500
  • 收稿日期:2022-01-14 修回日期:2022-05-05 出版日期:2022-08-12 发布日期:2023-01-28
  • 通讯作者: 张科
  • 作者简介:

    作者简介:张 凯 (1997—),男,重庆人,博士研究生,主要研究方向为岩土力学与工程。E-mail:

  • 基金资助:
    国家自然科学基金资助(11902128); 国家自然科学基金资助(41762021); 云南省应用基础研究计划项目(2019FI012)

Prediction study on slope stability based on LightGBM algorithm

ZHANG Kai1(), ZHANG Ke1,2,**()   

  1. 1 Faculty of Electric Power Engineering, Kunming University of Science and Technology, Kunming Yunnan 650500, China
    2 Faculty of Civil and Architectural Engineering, Kunming University of Science and Technology, Kunming Yunnan 650500, China
  • Received:2022-01-14 Revised:2022-05-05 Online:2022-08-12 Published:2023-01-28
  • Contact: ZHANG Ke

摘要:

为减少边坡失稳造成的灾害和事故,提出基于轻量级梯度提升机(LightGBM)算法的边坡稳定性预测模型;将容重、黏聚力、内摩擦角、边坡角、边坡高度和孔隙压力比6个主要影响因素作为模型的输入,将边坡稳定性作为模型的输出;引入基于混淆矩阵的分类性能度量指标和被试工作特性曲线线下面积(AUC),评估模型的泛化性能。结果表明:提出的边坡稳定性预测模型能很好地描述影响因素与边坡稳定性之间复杂的非线性关系;与其他算法相比,LightGBM算法的F1-Score和AUC分别为0.92和0.91,分别提高1.10%~61.40%和1.11%~28.17%;相较于0-均值归一化和反正切归一化,考虑正负相关性的最大值和最小值归一化更适合作为LightGBM模型的前处理方法;通过改变训练数据集长度,进行单因素分析,发现模型的泛化性能与训练数据集长度呈正相关关系。

关键词: 轻量级梯度提升机(LightGBM), 边坡稳定性, 机器学习算法, 混淆矩阵, 归一化

Abstract:

In order to reduce the disasters and accidents caused by the slope instability, a slope stability prediction model based on the LightGBM algorithm was proposed. Six main influencing factors, i.e. volume weight, cohesion, internal friction angle, slope angle, slope height and pore pressure ratio, were taken as the input of the model, and the slope stability was taken as the output of the model. The classification performance metrics based on the confusion matrix and the area under curve (AUC) of the test performance curve were introduced to evaluate the generalization performance of this model. The results show that the proposed prediction model can well describe the complex nonlinear relationship between the influencing factors and the slope stability. Compared with other algorithms, the F1-Score and AUC of the LightGBM algorithm are 0.92 and 0.91, respectively, increasing by 1.10%-61.40% and 1.11%-28.17%, respectively. Compared with zero-mean normalization and arctangent normalization, the normalization of the minimum-maximum values considering the positive and negative correlations is more applicable to the data preprocessing of LightGBM model. Furthermore, by changing the length of the training dataset and single factor analysis, it is found that the generalization performance of the model is positively correlated with the length of the training dataset.

Key words: light gradient boosting machine(LightGBM), slope stability, machine learning algorithm, confusion matrix, normalization