China Safety Science Journal ›› 2026, Vol. 36 ›› Issue (3): 104-112.doi: 10.16265/j.cnki.issn1003-3033.2026.03.0881
• Safety Technology and Engineering • Previous Articles Next Articles
NIE Benwu1,2,3(
), CHEN Shu1,3,**(
), CHEN Yun1,3, TIAN Xueqi2, CAO Kunyu1, LI Zhi4
Received:2025-09-14
Revised:2025-12-11
Online:2026-03-31
Published:2026-09-28
Contact:
CHEN Shu
CLC Number:
NIE Benwu, CHEN Shu, CHEN Yun, TIAN Xueqi, CAO Kunyu, LI Zhi. An image-text multimodal intelligent identification method for construction safety hazards in hydropower engineering[J]. China Safety Science Journal, 2026, 36(3): 104-112.
Add to citation manager EndNote|Ris|BibTeX
URL: http://www.cssjj.com.cn/EN/10.16265/j.cnki.issn1003-3033.2026.03.0881
Table 1
Classification characteristics of construction safety hazards in hydropower engineering
| 隐患类型 | 隐患特征 |
|---|---|
| 高处坠落 | 存在从坠落高度基准面2 m以上(含2 m)处坠落的隐患 |
| 触电事故 | 电流有通过人体造成伤害或死亡的隐患 |
| 物体打击 | 失控物体存在(如工具、材料)击中人体造成伤害的隐患 |
| 起重伤害 | 存在因吊具、吊物等引发的伤害的隐患 |
| 机械伤害 | 存在机械设备运动部件(如齿轮、皮带)导致的夹击、切割等伤害的隐患 |
| 火灾事故 | 易燃物堆积、电气短路或明火管理不当等发生燃烧的隐患 |
| 车辆伤害 | 存在机动车辆(如自卸车、挖掘机)碰撞、碾压导致伤害的隐患 |
| 爆炸事故 | 存在压力容器、危险化学品等因瞬间能量释放造成破坏的隐患 |
| 坍塌事故 | 存在土方、模板或建筑结构塌落的隐患 |
| 设备损坏 | 生产设备非正常损毁和机械故障、超负荷运行或维护不足引发连锁的隐患 |
| 文明施工 | 因现场管理混乱(如杂物堆放、通道堵塞)等脏乱差导致的间接隐患 |
| 其他事故 | 未涵盖在上述分类中的其他风险或隐患 |
Table 3
Sample size of multimodal experimental data for construction safety hazards
| 隐患类型 | 训练集 | 验证集 | 测试集 | 总数 | 占比/% | 隐患类型 | 训练集 | 验证集 | 测试集 | 总数 | 占比/% |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 爆炸事故 | 57 | 16 | 8 | 81 | 4 | 起重伤害 | 85 | 24 | 12 | 121 | 5 |
| 车辆伤害 | 15 | 4 | 2 | 22 | 1 | 设备损坏 | 20 | 6 | 3 | 29 | 1 |
| 触电事故 | 505 | 144 | 72 | 721 | 32 | 坍塌事故 | 41 | 12 | 6 | 59 | 3 |
| 高处坠落 | 192 | 55 | 27 | 274 | 12 | 文明施工 | 246 | 70 | 35 | 352 | 15 |
| 火灾事故 | 205 | 59 | 29 | 293 | 13 | 物体打击 | 44 | 13 | 6 | 63 | 3 |
| 机械伤害 | 41 | 12 | 6 | 58 | 3 | 正常照片 | 700 | 200 | 100 | 1 000 | — |
| 其他事故 | 146 | 42 | 21 | 209 | 9 | 合计 | 2 297 | 656 | 328 | 3 282 | 100 |
Table 6
Effect of image-text multimodal fusion on hazard type recognition
| 隐患类型 | 精确率 | 召回率 | F1值 |
|---|---|---|---|
| 其他事故 | 0.600 0 | 0.250 0 | 0.352 9 |
| 坍塌事故 | 0.307 7 | 0.307 7 | 0.307 7 |
| 文明施工 | 0.685 5 | 0.885 4 | 0.772 7 |
| 机械伤害 | 0.875 0 | 0.875 0 | 0.875 0 |
| 火灾事故 | 0.913 8 | 0.929 8 | 0.921 7 |
| 爆炸事故 | 0.875 0 | 0.875 0 | 0.875 0 |
| 物体打击 | 1.000 0 | 0.428 6 | 0.600 0 |
| 触电事故 | 0.905 1 | 0.925 4 | 0.915 1 |
| 设备损坏 | 1.000 0 | 0.571 4 | 0.727 3 |
| 起重伤害 | 0.892 9 | 0.925 9 | 0.909 1 |
| 车辆伤害 | 0.666 7 | 0.400 0 | 0.500 0 |
| 高处坠落 | 0.857 1 | 0.857 1 | 0.857 1 |
Table 7
Differences in F1-score of comparative models
| 隐患 类型 | 文本 模型 | 图像 模型 | 图文 模型 | 门控融合 图文模型 |
|---|---|---|---|---|
| 起重伤害 | 0.925 9 | 0.763 3 | 0.924 0 | 0.909 3 |
| 触电事故 | 0.908 6 | 0.834 7 | 0.909 2 | 0.915 2 |
| 火灾事故 | 0.891 7 | 0.875 2 | 0.920 4 | 0.921 8 |
| 机械伤害 | 0.875 0 | 0.802 4 | 0.936 1 | 0.875 0 |
| 爆炸事故 | 0.774 7 | 0.606 6 | 0.833 6 | 0.875 0 |
| 文明施工 | 0.776 3 | 0.722 8 | 0.757 2 | 0.781 2 |
| 设备损坏 | 0.766 2 | 0.676 2 | 0.766 2 | 0.766 2 |
| 高处坠落 | 0.845 4 | 0.563 7 | 0.850 2 | 0.857 1 |
| 车辆伤害 | 0.783 3 | 0.522 2 | 0.600 0 | 0.522 2 |
| 物体打击 | 0.449 2 | 0.338 1 | 0.619 0 | 0.676 2 |
| 其他事故 | 0.395 5 | 0.335 2 | 0.483 7 | 0.401 0 |
| 坍塌事故 | 0.198 1 | 0.278 9 | 0.444 6 | 0.307 7 |
| [1] |
卢冰, 陈述, 曹坤煜, 等. 水电工程施工安全隐患类别辅助校正方法[J]. 水力发电学报, 2025, 44(4): 42-49.
|
|
|
|
| [2] |
陈述, 王典学, 杨应柳, 等. 水电工程施工安全隐患语义匹配模型[J]. 中国安全科学学报, 2024, 34(12): 40-47.
doi: 10.16265/j.cnki.issn1003-3033.2024.12.0795 |
|
doi: 10.16265/j.cnki.issn1003-3033.2024.12.0795 |
|
| [3] |
杨阳蕊, 潘世峰, 刘雪梅, 等. 多模态知识图谱与大模型协同的水利工程风险应对决策推荐[J]. 水利学报, 2025, 56(4): 519-530.
|
|
|
|
| [4] |
张泽辉, 张乾隆, 徐晓滨, 等. 基于视觉的工人高处攀爬不安全行为识别模型[J]. 中国安全科学学报, 2025, 35(2): 144-151.
doi: 10.16265/j.cnki.issn1003-3033.2025.02.0278 |
|
doi: 10.16265/j.cnki.issn1003-3033.2025.02.0278 |
|
| [5] |
|
| [6] |
doi: 10.1109/TEM.2021.3093166 |
| [7] |
陈述, 张超, 陈云, 等. 基于命名实体识别的水电工程施工安全规范实体识别模型[J]. 中国安全科学学报, 2024, 34(9): 19-26.
doi: 10.16265/j.cnki.issn1003-3033.2024.09.0008 |
|
doi: 10.16265/j.cnki.issn1003-3033.2024.09.0008 |
|
| [8] |
周佳一, 郑霞忠, 田丹, 等. 水电工程施工安全隐患多标签文本智能分类方法[J]. 水力发电学报, 2024, 43(11): 114-124.
|
|
|
|
| [9] |
|
| [10] |
田丹, 许仁乐, 邵波, 等. 强化特征表达的水电工程施工安全隐患自动辨识方法[J/OL]. 河海大学学报:自然科学版: 1-11.[2025-05-28]. https://kns.cnki.net/kcms2/article.html.
|
|
|
|
| [11] |
陈岩松, 张乐, 张雷瀚, 等. 基于跨模态注意力和门控单元融合网络的多模态情感分析方法[J]. 数据分析与知识发现, 2024, 8(7): 67-76.
doi: 10.11925/infotech.2096-3467.2023.0591 |
|
doi: 10.11925/infotech.2096-3467.2023.0591 |
|
| [12] |
|
| [13] |
|
| [14] |
doi: 10.1007/s00521-019-04559-1 |
| [15] |
|
| [16] |
|
| [1] | CAO Haiqing, YAO Zhiying, LYU Shuran, YAO Cuiyou. Evaluation of employee's psychological stress status using LSTM with attention mechanism [J]. China Safety Science Journal, 2026, 36(3): 229-237. |
| [2] | SHAO Shuyu, LI Yanping, HAN Jiaqi. Driver cognitive state recognition in autonomous driving takeover decision making [J]. China Safety Science Journal, 2026, 36(3): 255-263. |
| [3] | AN Siqi, CAI Anglin, MA Zicheng, ZHU Baoyan. Multimodal large model-based approach for construction safety hazard recognition [J]. China Safety Science Journal, 2025, 35(9): 185-192. |
| [4] | WANG Haiquan, YU Haowei, YANG Yueyi, XU Xiaobin, BU Xiangzhou, KURKOVA P. Multimodal information fusion decision-making strategy for personnel behavior in industrial scene [J]. China Safety Science Journal, 2025, 35(8): 84-92. |
| [5] | HAO Qinxia, ZHEN Haolong. Unsafe behavior recognition of miners in coal mine belt area based on multimodal feature fusion [J]. China Safety Science Journal, 2025, 35(11): 32-41. |
| [6] | WANG Zhe, HUANG Haichen, LI Ruiqin, WEI Yongchang. Intelligent question answering model for construction safety hazards based on vision-language multimodality [J]. China Safety Science Journal, 2025, 35(10): 106-114. |
| [7] | JIANG Yuanliang, REN Qingying, REN Yuan, LIU Haipeng, DONG Shaohua. High-consequence area indentation of remote sensing images of China-Myanmar oil and gas pipeline based on improved YOLO model [J]. China Safety Science Journal, 2025, 35(1): 103-111. |
| [8] | JIANG Song, LI Yanbo, HE Xuqian, HE Runfeng, ZHANG Chao, ZHANG Cunliang. Intelligent identification of landslide disaster based on deep learning of UAV images [J]. China Safety Science Journal, 2024, 34(7): 229-238. |
| [9] | JIN Lianghai, WANG Shuqing, WANG Xinyu. Research on multimodal emotion characteristics based on short video of rainstorm disaster [J]. China Safety Science Journal, 2024, 34(7): 219-228. |
| [10] | ZHENG Xiazhong, LIU Yicheng, SHAO Bo, WANG Shuo, KE Shan'gang. Accident causal analysis of object strike in hydropower project construction based on text mining [J]. China Safety Science Journal, 2024, 34(4): 50-57. |
| [11] | CHEN Shu, WANG Dianxue, YANG Yingliu, CAO Kunyu, NIE Benwu. Semantic matching model of potential safety hazards in hydroelectric project construction [J]. China Safety Science Journal, 2024, 34(12): 40-47. |
| [12] | DUAN Bin, HE Jiaping, QIN Shihe, YAN Siyuan, CHEN Zhichao. Surface deformation monitoring of high slope in hydropower project based on GB-InSAR technology [J]. China Safety Science Journal, 2022, 32(S2): 64-69. |
| [13] | CHEN Shu, ZHU Liping, CHEN Yun, ZHENG Xiazhong, JI Qin. Sequential characteristics of safety hazards in hydropower project construction based on complex networks [J]. China Safety Science Journal, 2022, 32(8): 61-66. |
| [14] | FA Huiyan, SHUAI Bin, LYU Min, HUANG Wencheng. Safety risk assessment of multimodal transportation of China Railway Express based on WBS-RBS and IFWA operator [J]. China Safety Science Journal, 2022, 32(6): 200-206. |
| [15] | LIU Song, SHAO Yiming, PENG Yong, XIAO Yunpeng. Multi-modal transport route optimization of emergency relief materials [J]. China Safety Science Journal, 2019, 29(12): 152-157. |
| Viewed | ||||||
|
Full text |
|
|||||
|
Abstract |
|
|||||