China Safety Science Journal ›› 2017, Vol. 27 ›› Issue (6): 61-66.doi: 10.16265/j.cnki.issn1003-3033.2017.06.011

• Safety Science of Engineering and Technology • Previous Articles     Next Articles

Prediction of coal spontaneous combustion in caving zone with unbalanced data

SHAO Liangshan, LI Xiangchen   

  1. System Engineering Institute, Liaoning Technical University, Huludao Liaoning 125105,China
  • Received:2017-02-24 Revised:2017-04-10 Published:2020-10-16

Abstract: In order to improve the prediction accuracy of spontaneous combustion of a few samples under the unbalanced data, a prediction model based on K-means-Relief-HSMOTE-SVM was built. First, the K-means method was used to optimize the traditional Relief method for index selection, to make up its deficiency-combustion indexes' unreasonablly high weights caused by the feature extraction under unbalanced data. Then, in view of problems such as overfitting encountered in dealing with unbalanced data with the SMOTE method an h dimensional spherical space thought was introduced, and the clustering algorithm was used to determine the center and establish the spherical space, and an improved HSMOTE algorithm was developed for balancing the spontaneous combustion data. Next, the SVM was used to predict the spontaneous combustion data. Actual samples from Xuandong No.2 coal mine were used to conduct 50 experiments, and a result comparison was made between the model built by the authors, the traditional SVM and other models. The results indicate that K-means-Relief-HSMOTE-SVM can effectively extract feature factors and overcome SMOTE defect,that compared with other models, K-means-Relief-HSMOTE-SVM can more effectively improve the traditional SVM in unbalanced prediction accuracy and geometric mean correct rate for combustion samples of the minority class natural data.

Key words: unbalanced data, spontaneous combustion in caving zone, support vector machine(SVM), prediction, h dimensional synthetic minority over sampling technique(HSMOTE)

CLC Number: