China Safety Science Journal ›› 2022, Vol. 32 ›› Issue (5): 112-118.doi: 10.16265/j.cnki.issn1003-3033.2022.05.2389

• Safety engineering technology • Previous Articles     Next Articles

Track circuit fault diagnosis method for massive imbalanced data

XING Yulong1(), WANG Jian1,2, SHANGGUAN Wei1,2, PENG Cong1, ZHU Linfu3,4   

  1. 1 School of Electronics and Information Engineering, Beijing Jiaotong University, Beijing 100044, China
    2 State Key Laboratory of Rail Traffic Control and Safety, Beijing Jiaotong University, Beijing 100044, China
    3 Standards & Metrology Research Institute, China Academy of Railway Sciences Corporation Limited, Beijing 100081, China
    4 China Railway Test & Certification Center, Beijing 100081, China
  • Received:2021-12-16 Revised:2022-04-09 Online:2022-08-17 Published:2022-11-28

Abstract:

In order to address deviation of decision-making boundary of track circuit diagnosis model due to imbalanced monitoring data and slow training speed caused by massive data, a fault diagnosis method based on data resampling and ensemble learning algorithm was proposed. Firstly, imbalanced data were processed by feature synthesis and resampling including random down-sampling and Synthetic Minority Oversampling Technique (SMOTE). Secondly, a fault diagnosis module for massive monitoring data was constructed based on LightGBM algorithm which could be trained efficiently, training and diagnosis flow was designed, and key parameters were selected by grid search and cross-validation. Finally, Macro-F1, which was not affected by imbalanced data, was introduced as an evaluation indicator of the model. The results show that the comprehensive performance of each diagnosis model for imbalanced data can be improved by feature synthesis and data resampling. Compared with other algorithms, LightGBM is the best in terms of comprehensive performance and training time, ensuring superiority and rapidity when faced with massive data.

Key words: imbalanced data, track circuit, fault diagnosis, ensemble learning, light gradient boosting machine(LightGBM)