轻量化神经网络结合深度相机的矿工目标检测与定位

doi:10.16265/j.cnki.issn1003-3033.2025.03.0863

中国安全科学学报 ›› 2025, Vol. 35 ›› Issue (3): 115-124.doi: 10.16265/j.cnki.issn1003-3033.2025.03.0863

轻量化神经网络结合深度相机的矿工目标检测与定位

张淼¹^,²(), 王孝军¹^,², 雷经发¹^,²^,³^,^**(), 赵汝海¹^,², 李永玲¹^,²

¹ 安徽建筑大学机械与电气工程学院,安徽合肥 230601
² 工程机械智能制造安徽省教育厅重点实验室,安徽合肥 230601
³ 过程装备与控制工程四川省高校重点实验室,四川自贡 643000

收稿日期:2024-10-23 修回日期:2024-12-25 出版日期:2025-03-28
通信作者:
** 雷经发(1978—),男,安徽巢湖人,博士,教授,主要从事视觉检测、人因工程等方面的研究。E-mail: rain78828@163.com。
作者简介:
张淼 (1986—),男,河南项城人,博士,讲师,主要从事机器视觉、人机工程等方面的研究。E-mail: miao@ahjzu.edu.cn。
赵汝海，副教授；
李永玲，讲师
基金资助:
安徽高校自然科学研究重大项目(KJ2021ZD0068); 安徽高校协同创新项目(GXXT2022-019); 过程装备与控制工程四川省高校重点实验室开放基金资助(GK202308)

Lightweight neural network combined with depth camera for miner target detection and localization

ZHANG Miao¹^,²(), WANG Xiaojun¹^,², LEI Jingfa¹^,²^,³^,^**(), ZHAO Ruhai¹^,², LI Yongling¹^,²

¹ School of Mechanical and Electrical Engineering, Anhui Jianzhu University, Hefei Anhui 230601, China
² Key Laboratory of Intelligent Manufacturing of Construction Machinery, Anhui Education Department, Hefei Anhui 230601, China
³ Sichuan Provincial Key Laboratory of Process Equipment and Control Engineering, Zigong Sichuan 643000, China

Received:2024-10-23 Revised:2024-12-25 Published:2025-03-28

摘要/Abstract

摘要：

为防止矿工误入危险区域,提出一种YOLOv5s-MPD轻量化井下矿工目标检测模型,并结合深度相机定位矿工目标,实时检测矿工是否进入危险区域。首先,使用MobileNetv3轻量化神经网络作为主干特征提取网络,大幅降低模型体积;其次,引入极化自注意力模块(PSA),增强目标的感知能力;最后,采用可变形卷积网络(DCNv2)替代特征融合层中C3模块的标准卷积,解决常规卷积丢失部分特征信息的问题,利用改进模型结合深度相机获取的彩色图像检测矿工目标,并得到目标中心点的空间三维坐标。结果表明:改进模型相比于YOLOv5s,参数量和计算量分别减少83.54%和77.03%,模型体积大小仅为3.4 MB,检测速度为70.2帧/s,提升54.97%,平均精度均值(mAP)为0.825。与主流目标检测模型相比,改进模型的参数量、计算量、模型体积、检测速度和mAP较为均衡。在实际定位精度试验中,1~8 m范围内测得相机与矿工目标间距离的平均绝对误差和平均相对误差分别为0.11 m和1.74%;最大绝对误差和最大相对误差分别为0.25 m和2.96%。在动态检测中,均能检测到矿工目标并输出其位置信息,检测成功率达97.5%。

关键词: 轻量化, 神经网络, 深度相机, 目标检测, 目标定位, 安全预警

Abstract:

To prevent miners from mistakenly entering dangerous areas, a lightweight underground miner object detection model based on YOLOv5s-MPD was proposed, which combined with depth camera to locate miner targets and detect whether miners had entered dangerous areas in real time. Specifically, the MobileNetv3 lightweight neural network was used as the backbone feature extraction network to significantly reduce the model size. Secondly, Polarized Self-Attention (PSA) module was introduced to enhance the perception of targets. Finally, Deformable Convolution Network v2 (DCNv2) was used to replace the standard convolution in the C3 module of the feature fusion layer, solving the problem of partial feature information loss in conventional convolution. The improved model was used in combination with the color images obtained by the depth camera to detect miner targets and obtain the spatial three-dimensional coordinates of the target center points. The results show that compared with YOLOv5s, the improved model reduces the number of parameters and computation by 83.54% and 77.03%, respectively. The model size is only 3.4 MB, and a detection speed of 70.2 f/s, which is increased by 54.97%. The mean average precision is 0.825. Compared with mainstream object detection models, the improved model has a more balanced number of parameters, computation, model size, detection speed, and mean average precision. In the actual positioning accuracy test, within a range of 1-8 meters, the average absolute error and average relative error of the distance between the camera and the miner target were 0.11 meters and 1.74%, respectively. The maximum absolute error and maximum relative error were 0.25 meters and 2.96%, respectively. In the dynamic detection, the miner target could be detected and its location information output, with a detection success rate of 97.5%.

Key words: lightweight, neural network, deep camera, target detection, target localization, security warning

中图分类号:

X924.3安全监控系统

张淼, 王孝军, 雷经发, 赵汝海, 李永玲. 轻量化神经网络结合深度相机的矿工目标检测与定位[J]. 中国安全科学学报, 2025, 35(3): 115-124.

ZHANG Miao, WANG Xiaojun, LEI Jingfa, ZHAO Ruhai, LI Yongling. Lightweight neural network combined with depth camera for miner target detection and localization[J]. China Safety Science Journal, 2025, 35(3): 115-124.

图/表 14

图1

图2

图3

图4

图5

图6

表1

表2

表3

图7

图8

表4

表5

图9

参考文献 18

[1]	LI Ailing, ZHANG Jixiong, ZHOU Nan, et al. A model for evaluating the production system of an intelligent mine based on unascertained measurement theory[J]. Journal of Intelligent & Fuzzy Systems, 2020, 38(2): 1865-1875.
[2]	ZHANG Kexue, KANG Lei, CHEN Xuexi, et al. A review of intelligent unmanned mining current situation and development trend[J]. Energies, 2022, 15(2): DOI: 10.3390/en15020513.
[3]	付玉平, 陈兆波, 赵振保, 等. 煤矿安全事故的综合致因模型[J]. 安全与环境学报, 2024, 24(7): 2731-2740.
	FU Yuping, CHEN Zhaobo, ZHAO Zhenbao, et al. Research on the comprehensive causative model of coal mine safety accidents[J]. Journal of Safety and Environment, 2024, 24(7): 2731-2740.
[4]	王国庆, 赵鑫, 杨春雨, 等. 煤矿地下空间定位技术研究进展[J]. 工程科学学报, 2024, 46(10): 1713-1727.
	WANG Guoqing, ZHAO Xin, YANG Chunyu, et al. Recent advances in research on underground space positioning technology for coal mining[J]. Chinese Journal of Engineering, 2024, 46(10): 1713-1727.
[5]	孙哲星. 煤矿井下人员精确定位方法[J]. 煤炭科学技术, 2018, 46(3): 130-134.
	SUN Zhexing. Personnel position method in underground coal mine[J]. Coal Science and Technology, 2018, 46(3): 130-134.
[6]	李梅, 姜展, 姜龙飞, 等. 三维可视化技术在智慧矿山领域的研究进展[J]. 煤炭科学技术, 2021, 49(2): 153-162.
	LI Mei, JIANG Zhan, JIANG Longfei, et al. Research progress on 3D visualization technology for intelligent mine[J]. Coal Science and Technology, 2021, 49(2): 153-162.
[7]	左明成, 焦文华. 面向煤矿井下作业场景的安全帽佩戴识别算法[J]. 中国安全科学学报, 2024, 34(3): 237-246. doi: 10.16265/j.cnki.issn1003-3033.2024.03.1985
	ZUO Mingcheng, JIAO Wenhua. Helmet-wearing recognition algorithm for coal mine underground operation scenarios[J]. China Safety Science Journal, 2024, 34(3): 237-246. doi: 10.16265/j.cnki.issn1003-3033.2024.03.1985
[8]	张磊, 李熙尉, 燕倩如, 等. 基于改进YOLOv5s的综采工作面人员检测算法[J]. 中国安全科学学报, 2023, 33(7): 82-89. doi: 10.16265/j.cnki.issn1003-3033.2023.07.2226
	ZHANG Lei, LI Xiwei, YAN Qianru, et al. Personnel detection algorithm in fully mechanized coal face based on improved YOLOv5s[J]. China Safety Science Journal, 2023, 33(7): 82-89. doi: 10.16265/j.cnki.issn1003-3033.2023.07.2226
[9]	XIN Fangfang, HE Xinyu, YAO Chaoxiu, et al. A real-time detection for miner behavior via DYS-YOLOv8n model[J]. Journal of Real-Time Image Processing, 2024, 21(3): DOI: 10.1007/s11554-024-01466-0.
[10]	张春堂, 管利聪. 基于SSD-MobileNet的矿工安保穿戴设备检测系统[J]. 工矿自动化, 2019, 45(6): 96-100.
	ZHANG Chuntang, GUAN Licong. Detection system of miners' wearable security equipments based on SSD-MobileNet[J]. Industry and Mine Automation, 2019, 45(6): 96-100.
[11]	郭永存, 童佳乐, 王爽. 井下无人驾驶电机车行驶场景中多目标检测研究[J]. 工矿自动化, 2022, 48(6): 56-63.
	GUO Yongcun, TONG Jiale, WANG Shuang. Research on multi-object detection in driving scene of underground unmanned electric locomotive[J]. Journal of Mine Automation, 2022, 48(6): 56-63.
[12]	李伟山, 卫晨, 王琳. 改进的Faster RCNN煤矿井下行人检测算法[J]. 计算机工程与应用, 2019, 55(4): 200-207. doi: 10.3778/j.issn.1002-8331.1711-0282
	LI Weishan, WEI Chen, WANG Lin. Improved Faster RCNN approach for pedestrian detection in underground coal mine[J]. Computer Engineering and Applications, 2019, 55(4): 200-207. doi: 10.3778/j.issn.1002-8331.1711-0282
[13]	王科平, 连凯海, 杨艺, 等. 基于改进YOLOv4的综采工作面目标检测[J]. 工矿自动化, 2023, 49(2): 70-76.
	WANG Keping, LIAN Kaihai, YANG Yi, et al. Target detection of the fully mechanized working face based on improved YOLOv4[J]. Journal of Mine Automation, 2023, 49(2): 70-76.
[14]	郝明月, 闵冰冰, 张新建, 等. 基于改进YOLOv5s的矿工排队检测方法[J]. 工矿自动化, 2023, 49(11): 160-166.
	HAO Mingyue, MIN Bingbing, ZHANG Xinjian, et al. A miner queue detection method based on improved YOLOv5s[J]. Journal of Mine Automation, 2023, 49(11): 160-166.
[15]	杨豚, 郭永存, 王爽, 等. 煤矿井下无人驾驶轨道电机车障碍物识别[J]. 浙江大学学报:工学版, 2024, 58(1): 29-39.
	YANG Tun, GUO Yongcun, WANG Shuang, et al. Obstacle recognition of unmanned rail electric locomotive in underground coal mine[J]. Journal of Zhejiang University: Engineering Science, 2024, 58(1): 29-39.
[16]	郭曦, 谢炜, 朱红秀, 等. 井下目标跟踪与测距方法研究[J]. 煤炭工程, 2019, 51(3): 117-121. doi: 10.11799/ce201903026
	GUO Xi, XIE Wei, ZHU Hongxiu, et al. Target tracking and ranging method in underground mine[J]. Coal Engineering, 2019, 51(3): 117-121.
[17]	ZHOU Cheng, REN Dacong, ZHANG Xiangyan, et al. Human position detection based on depth camera image information in mechanical safety[J]. Advances in Mathematical Physics, 2022, 2022(1): DOI: 10.1155/2022/9170642.
[18]	韩江洪, 袁稼轩, 卫星, 等. 基于深度学习的井下巷道行人视觉定位算法[J]. 计算机应用, 2019, 39(3): 688-694. doi: 10.11772/j.issn.1001-9081.2018071501
	HAN Jianghong, YUAN Jiaxuan, WEI Xing, et al. Pedestrian visual positioning algorithm for underground roadway based on deep learning[J]. Journal of Computer Applications, 2019, 39(3): 688-694. doi: 10.11772/j.issn.1001-9081.2018071501

名称	配置
CPU	Intel Core i5-13600KF
GPU	NVIDIA GeForce RTX4060
PyTorch	1.12.1
CUDA	12.2

模型	查准率	召回率	mAP
YOLOv5s-MobileNetv3	0.859	0.688	0.79
+SE	0.868	0.677	0.792
+CBAM	0.864	0.674	0.785
+CA	0.881	0.661	0.787
+ECA	0.880	0.664	0.787
+PSA	0.872	0.702	0.811

模型	改进策略			查准率	计算量/ G	参数量/ M	模型体积/ MB	mAP	FPS/ (帧·s^-1)
模型	MobileNetv3	PSA	DCNv2	查准率	计算量/ G	参数量/ M	模型体积/ MB	mAP	FPS/ (帧·s^-1)
YOLOv5s	—	—	—	0.891	15.8	7.01	13.7	0.848	45.3
	√	—	—	0.859	2.5	1.39	3.0	0.79	77.2
	√	√	—	0.872	2.9	1.54	3.3	0.811	67.4
	√	—	√	0.869	2.2	1.45	3.1	0.806	71.8
YOLOv5s-MPD	√	√	√	0.884	2.6	1.61	3.4	0.825	70.2

模型	参数量/M	计算量/G	模型体积/MB	mAP	FPS/ (帧·s^-1)
YOLOv5s	7.01	15.8	13.7	0.848	45.3
YOLOv5n	1.76	4.1	3.8	0.792	61.8
YOLOv5m	20.85	47.9	42.1	0.857	24.5
YOLOv7tiny	6.01	13.2	12.0	0.833	30.2
YOLOv8s	11.13	28.4	22.0	0.851	39.2
Faster R-CNN	136.69	401.7	110.8	0.795	13.4
SSD	23.61	273.2	92.8	0.764	17.2
YOLOv5s-MPD	1.61	2.6	3.4	0.825	70.2

编号	定位坐标	预测距离/m	测量距离/m	绝对误差/m	相对误差/%
1	(-45,11,1112)	1.11	1.11	0.00	0.27
2	(-61,37,2125)	2.13	2.12	0.01	0.24
3	(-430,124,3056)	3.09	3.13	0.06	1.25
4	(354,-162,4174)	4.19	4.12	0.07	1.77
5	(-412,325,5012)	5.04	4.96	0.08	1.70
6	(601,412,6213)	6.26	6.44	0.18	2.88
7	(1054,-141,7019)	7.10	7.30	0.20	2.88
8	(-723,-211,8622)	8.66	8.41	0.25	2.96

轻量化神经网络结合深度相机的矿工目标检测与定位

Lightweight neural network combined with depth camera for miner target detection and localization

RichHTML

PDF

可视化

摘要/Abstract

引用本文

使用本文

图/表 14

参考文献 18

相关文章 15

编辑推荐

Metrics

本文评价

[1]	许慧, 叶泽鸿, 周启琳, 张日芬. 中国大陆核心城市韧性发展预测及动态空间分异研究[J]. 中国安全科学学报, 2025, 35(3): 179-186.
[2]	柯巍, 朱权洁, 陈长茂, 吴成毅, 刘衍, 张艳林. 基于改进YOLOV11的卷烟仓储人员不安全行为分类及识别[J]. 中国安全科学学报, 2025, 35(3): 36-44.
[3]	王春源, 刘权捷. 基于改进YOLOv8s模型的隧道火灾检测[J]. 中国安全科学学报, 2025, 35(3): 69-76.
[4]	汪伟, 崔欣超, 祁云, 李绪萍, 王璜瑞, 齐庆杰. 改进SSA优化BPNN的煤体瓦斯渗透率预测模型[J]. 中国安全科学学报, 2025, 35(2): 137-143.
[5]	张泽辉, 张乾隆, 徐晓滨, 赵祖国, 王海泉, 李昊. 基于视觉的工人高处攀爬不安全行为识别模型[J]. 中国安全科学学报, 2025, 35(2): 144-151.
[6]	邵舒羽, 张扬, 刘艳. 基于KGCN的地质地震灾害事件演化结果预测[J]. 中国安全科学学报, 2025, 35(2): 212-219.
[7]	姜垣良, 任庆滢, 任远, 刘海鹏, 董绍华. 基于改进YOLO模型的中缅油气管道遥感图像高后果区识别方法[J]. 中国安全科学学报, 2025, 35(1): 103-111.
[8]	顾清华, 殷书檀, 王丹, 李学现, 尹慧民. 基于改进YOLOv8的矿卡司机疲劳驾驶检测[J]. 中国安全科学学报, 2025, 35(1): 60-66.
[9]	赵江平, 王欣然, 吴立舟. 改进YOLOv5s的路面坑槽目标检测模型[J]. 中国安全科学学报, 2025, 35(1): 67-74.
[10]	刘惠临, 方琼, 江宇, 魏华章, 王涛, 张树川. 基于YOLOv5s的轻量化森林火灾探测算法[J]. 中国安全科学学报, 2025, 35(1): 75-83.
[11]	孙国思, 马鹏飞, 王威淳, 郜普浩, 朱健伟. 基于YOLOv8算法的露天煤矿大型工程车辆识别与闯入安全预警[J]. 中国安全科学学报, 2024, 34(S1): 191-198.
[12]	刘迪, 杨辉, 卢才武, 阮顺领, 江松. 基于MISSA-CNN-BiLSTM模型的尾矿坝位移预测[J]. 中国安全科学学报, 2024, 34(9): 145-154.
[13]	陈述, 张超, 陈云, 张光飞, 李智. 基于命名实体识别的水电工程施工安全规范实体识别模型[J]. 中国安全科学学报, 2024, 34(9): 19-26.
[14]	高飞, 梁宁, 贾喆, 侯青. 基于SSA-RBF神经网络的煤自然发火预测模型[J]. 中国安全科学学报, 2024, 34(8): 128-137.
[15]	郝景开, 李红艳, 张峰, 张翀, 毛立波, 刘大为. 基于MCDM-BPNN的城市内涝风险评价及调蓄池选址[J]. 中国安全科学学报, 2024, 34(8): 214-221.