China Safety Science Journal ›› 2026, Vol. 36 ›› Issue (5): 139-149.doi: 10.16265/j.cnki.issn1003-3033.2026.05.1332

• Safety Technology and Engineering • Previous Articles     Next Articles

YOLOv8n-based personnel detection model for underground mines optimized with SPDs-Conv and WIoU

Rong Hai1,2(), Xi Zhouyong1,2,**(), Li Jincheng1, Pan Xiangyin3, Zhang Weida1, Han Mingyu4   

  1. 1 College of Mining, Liaoning Technical University, Fuxin Liaoning 123000, China
    2 Ordos Research Institute, Liaoning Technical University, Ordos Inner Mongolia 017010, China
    3 China Northeast Architectural Design & Research Institute Co., Ltd., Shenyang Liaoning 110004, China
    4 Ordos City Haohua Coking Coal Co., Ltd., Ordos Inner Mongolia 017200, China
  • Received:2025-12-01 Revised:2026-02-26 Online:2026-05-28 Published:2026-11-28
  • Contact: Xi Zhouyong

Abstract:

To address the issues of low accuracy and weak robustness in existing detection algorithms due to insufficient lighting, scale differences among personnel, and frequent obstruction by equipment in coal mine environments, as well as the challenges posed by high parameter and computational requirements of some models, which make them difficult to adapt to edge devices underground, an improved YOLOv8n model was proposed to optimize personnel detection tasks in complex mine environments. An enhanced SPDs-Conv module was introduced to enhance the extraction of small target features and improve the recognition accuracy of low-pixel personnel in distant views. Cross stage partial feature fusion + selective kernel attention (C2f_SKAttention) module was designed to strengthen the model's focus on targets of different scales and cope with the scale differences of underground personnel. A dynamic detection head was constructed to adapt to the diversity and complexity of targets, and to improve robustness to occlusion and other scenarios. The WIoU loss function was improved to increase the bounding box localization accuracy and reduce the localization deviation caused by low illumination. The results show that the proposed improved YOLOv8n model achieves an mean average precision (mAP) @0.5 of 83.5% and an mAP@0.5:0.95 of 39.0% on the mine personnel detection dataset. Compared with the original YOLOv8n, the P is improved by 8.5%, the R by 11.9%, the mAP@0.5 by 4.7%, and the mAP@0.5:0.95 by 3.3%. The number of parameters only increases from 3.1M to 3.2M, and the Giga Floating-point operations per second (GFLOPS) rises from 14.0G to 14.4G. The proposed model maintains a lightweight structure while improving detection accuracy and robustness. It effectively alleviates missed detection of small underground targets, insufficient multi-scale adaptation and weak anti-interference capability in complex environments, making it suitable for the limited computing power of underground edge equipment.

Key words: space-to-depth separable convolution (SPDs-Conv), weighted intersection over union (WIoU), YOLOv8n, underground personnel detection, lightweighting, attention mechanism, loss function

CLC Number: