China Safety Science Journal ›› 2024, Vol. 34 ›› Issue (12): 56-63.doi: 10.16265/j.cnki.issn1003-3033.2024.12.0713

• Safety engineering technology • Previous Articles     Next Articles

Tunnel construction safety domain named entity recognition based on BERT-BiLSTM-CRF

ZHANG Nian1,2(), ZHOU Caifeng1,2, WAN Fei3, LIU Fei1, WANG Yaoyao1, XU Dongliang1,2   

  1. 1 College of Civil Engineering, Taiyuan University of Technology, Taiyuan Shanxi 030024, China
    2 Research Center of Tunneling and Underground Engineering of Ministry of Education, Beijing Jiaotong University, Beijing 100044, China
    3 Research Institute of Highway Ministry of Transport, Beijing 100088, China
  • Received:2024-09-14 Revised:2024-11-18 Online:2024-12-28 Published:2025-06-28

Abstract:

To solve the problems existing in the traditional NER methods in the domain of tunnel construction safety, such as fuzzy entity boundary, difficulty in small-sample learning, and insufficiently comprehensive extraction of feature information, an entity recognition method for tunnel construction accident text based on the BERT-BiLSTM-CRF model was proposed. Firstly, the BERT model was used to encode the tunnel construction accident text to obtain word vectors containing semantic features. Then, the word vectors output after the training of the BERT model were input into the BiLSTM model to further obtain the context feature of the tunnel construction accident text and conduct label probability prediction. Finally, by utilizing the constraints of the annotation rules of the CRF layer, the output result of the BiLSTM model was corrected, and the maximum probability sequence annotation result was obtained, so as to realize the intelligent classification of the labels of the tunnel construction accident texts. Comparative experiments were conducted between this model and other four commonly used traditional NER models on the tunnel construction safety accident corpus dataset. The results show that the recognition accuracy rate, recall rate and F1 value of the BERT-BiLSTM-CRF model are 88%, 89% and 88% respectively, and the entity recognition effect is better than other benchmark models. By using the established NER model to recognize the entities in the actual tunnel construction accident texts, its application effect in the domain of tunnel construction safety is verified.

Key words: bidirectional encoder representations from transformers (BERT), bidirectional long short-term memory(BiLSTM), conditional random fields (CRF), tunnel construction, safety field, named entity recognition (NER), deep learning

CLC Number: