China Safety Science Journal ›› 2025, Vol. 35 ›› Issue (6): 51-59.doi: 10.16265/j.cnki.issn1003-3033.2025.06.1237

• Safety social science and safety management • Previous Articles     Next Articles

Extracting construction safety requirement information using natural language processing

WU Zhijiang1(), LIU Mengyao1, MA Guofeng2   

  1. 1 College of Architectural Science and Engineering,Yangzhou University,Yangzhou Jiangsu 225009,China
    2 School of Economics and Management,Tongji University,Shanghai 200092,China
  • Received:2025-02-15 Revised:2025-04-19 Online:2025-06-28 Published:2025-07-30

Abstract:

To solve the problem that construction safety requirement information hidden in project documents is hard to be discovered without relevance and semantic ambiguity, a two-stage integration framework combining NLP techniques was developed for project document analysis and classification and extraction of requirement information. First, the safety targets of the project to be evaluated were obtained by combining the multivariate techniques of NLP, and an association model was established based on the topic model to recommend the appropriate requirement types. Then, the semantic features of the three types of elements were considered, and keyword analysis, sentiment analysis, and dependency analysis were adopted to extract the three types of elements, respectively. Finally, two types of construction projects (civil and industrial) were used as case to test the type recommendation and extraction of construction safety requirements. The results show that the two-stage integration framework recommends four appropriate requirement types for civil and industrial buildings respectively, and the combination of lexical properties and lexical sentiment can effectively extract the requirement keywords and behavior opinion words, and the extraction accuracy of the main elements can reach 88.6% after supplementing the description of building types. The test results confirm that responding to safety target features can recommend suitable types from the complicated requirement information, and the classification and extraction of requirement information combined with NLP avoids subjective preferences and improves the accuracy of information extraction.

Key words: natural language processing (NLP), construction safety requirements, requirement information, project documentation, requirement types

CLC Number: