中国安全科学学报 ›› 2026, Vol. 36 ›› Issue (4): 19-27.doi: 10.16265/j.cnki.issn1003-3033.2026.04.1106

• 安全科学理论与方法 • 上一篇    下一篇

基于多元拓扑图卷积网络的建筑工人不安全行为识别

余洋1(), 蒋琳1, 胡启军2,**(), 何乐平3, 蔡其杰3, 白羽3   

  1. 1 西南石油大学 计算机与软件学院, 四川 成都 610500
    2 西南交通大学 土木工程学院, 四川 成都 610031
    3 西南石油大学 土木工程与测绘学院, 四川 成都 610500
  • 收稿日期:2025-11-14 修回日期:2026-01-22 出版日期:2026-04-28
  • 通信作者:
    **胡启军(1977—),男,湖南衡阳人,博士,教授,博士生导师,主要从事工程安全与深度学习研究。E-mail:
  • 作者简介:

    余洋 (1987—),男,四川泸州人,博士,讲师,硕士生导师,主要从事人因安全、工程安全、工业互联网、工业大数据等方面的研究。E-mail:

    何乐平, 副教授

    蔡其杰, 讲师

    白羽, 讲师

  • 基金资助:
    国家自然科学基金资助(U23A2046); 国家自然科学基金资助(62441610); 四川省“天府万人计划”天府科技菁英项目(川万人第658号); 四川省科技计划项目(2025YFHZ0015)

Recognition of construction workers' unsafe behaviors based on a multi-component topology graph convolutional network

Yu Yang1(), Jiang Lin1, Hu Qijun2,**(), He Leping3, Cai Qijie3, Bai Yu3   

  1. 1 School of Computer Science and Software Engineering, Southwest Petroleum University, Chengdu Sichuan 610500, China
    2 School of Civil Engineering, Southwest Jiaotong University, Chengdu Sichuan 610031, China
    3 School of Civil Engineering and Geomatics, Southwest Petroleum University, Chengdu Sichuan 610500, China
  • Received:2025-11-14 Revised:2026-01-22 Published:2026-04-28

摘要:

为有效识别高空作业环境下建筑工人的不安全行为,提出一种基于多元拓扑图卷积网络(MCT-GCN)的工人不安全行为识别模型。首先,设计数据预处理模块,将视频监控数据转换为三维骨架数据;其次,构建三元动态邻接图卷积,融合可学习结构先验拓扑、通道相关性自适应拓扑和相对位置编码拓扑,动态构建出适应不同动作的邻接矩阵;然后,提出多尺度可分离时间卷积,将标准卷积分解为深度时序卷积与逐点卷积,分别对施工人员的动作时序特征和空间分布特征独立建模;最后,在公共数据集和自建的工人不安全行为数据集上进行试验验证和对比分析。结果表明:该模型在识别准确率和跨场景泛化能力上均优于现有方法;在自建的数据集上,该模型达到95.8%的最高识别准确率,能够显著提升复杂动态施工环境下工人不安全行为的识别准确度。

关键词: 多元拓扑图卷积网络(MCT-GCN), 建筑工人, 不安全行为识别, 多尺度, 骨架

Abstract:

In order to effectively identify the unsafe behaviors of construction workers in high-altitude environments, a recognition model based on MCT-GCN was proposed. Firstly, a data preprocessing module was designed to convert video surveillance data into three-dimensional skeleton data. Secondly, a tri-component dynamic adjacency graph convolution was constructed. It integrated learnable structural prior topology, channel correlation adaptive topology, and relative position encoding topology to dynamically build adjacency matrices adapted to different actions. Furthermore, a multi-scale separable temporal convolution was proposed to decompose standard convolutions into deep temporal convolution and point-by-point convolution, thereby independently modeling the temporal characteristics and spatial distribution characteristics of construction workers' actions. Finally, experimental verification and comparative analysis were conducted on both public datasets and a self-built dataset of workers' unsafe behaviors. The results demonstrate that the proposed model outperforms existing methods in terms of recognition accuracy and cross-scene generalization. On the self-built dataset, the model achieved a peak recognition accuracy of 95.8%, significantly enhancing the detection of workers' unsafe behaviors in complex and dynamic construction environments, and making a significant contribution to the development of intelligent monitoring and control in the construction industry.

Key words: multi-component topology graph convolutional network(MCT-GCN), construction workers, unsafe behavior recognition, multi-scale, skeleton

中图分类号: