China Safety Science Journal ›› 2025, Vol. 35 ›› Issue (9): 185-192.doi: 10.16265/j.cnki.issn1003-3033.2025.09.1298

• Safety engineering technology • Previous Articles     Next Articles

Multimodal large model-based approach for construction safety hazard recognition

AN Siqi1(), CAI Anglin2, MA Zicheng2, ZHU Baoyan1,**()   

  1. 1 School of Safety Science and Engineering, Liaoning Technical University, Huludao Liaoning 125105, China
    2 School of Science, China University of Mining and Technology-Beijing, Beijing 100083, China
  • Received:2025-04-11 Revised:2025-06-15 Online:2025-09-28 Published:2026-03-28
  • Contact: ZHU Baoyan

Abstract:

In order to enhance the automatic recognition of safety hazards and improve safety management in construction scenarios, a multimodal large-model-based method for construction safety hazard recognition was proposed and its core component—the multimodal safety hazard recognition model, LLaVA(Large Language and Vision Assistant)-CS(Construction Site), was implemented. The system integrated images (construction site photos) with safety operating procedures (worker behavior descriptions), leveraging multimodal learning and deep learning technologies to perform real-time monitoring and analysis of construction sites. To support the system's effective operation, a multimodal dataset covering complex conditions such as varying lighting, occlusions, and multi-person scenarios was constructed, addressing the gaps in existing public datasets. Through prompt tuning of the LLaVA-1.5 model, the LLaVA-CS model effectively integrated visual and textual information, enhancing the accuracy and interpretability of safety hazard recognition. Experimental results show that this method achieves an accuracy of 0.722 2 in multiple real-world construction projects, generating detailed explanatory texts in real time to help managers quickly understand specific safety hazard contexts, thereby improving decision-making in safety management. This study innovatively applies multimodal large models to construction safety management systems, providing real-time, interpretable safety monitoring solutions and offering new technical support and optimization directions for construction safety management.

Key words: multimodal large model, construction safety hazards, complex construction scenarios, accident prevention, tips for tuning

CLC Number: