Citation: | XIE Beijing,LI Heng,DONG Hang,et al. Intelligent recognition algorithm and application of coal mine overhead passenger device based on multiscale feature fusion[J]. Coal Science and Technology,2024,52(12):272−286. DOI: 10.12438/cst.2024-0081 |
The intelligent recognition technology for Coal mine overhead passenger devices(Cmopd) plays a crucial role in achieving automated inspection, real-time monitoring, and warning tasks for cmopd, thereby promoting the intelligent development of coal mines. However, there are several challenges that need to be addressed, such as the limited number of samples in the cmopd dataset, poor lighting conditions in underground images, overlapping and occlusion between operating cmopd, varying sitting postures of miners, difficulty in detecting small cmopd targets, complex model deployment, and low efficiency of traditional recognition methods for cmopd with different passenger-carrying statuses.To overcome these challenges, a cmopd dataset was created from various coal mines in Guizhou province. The passenger-carrying status of cmopd was classified into two categories: cmopd with passengers (HC_miner) and cmopd without passengers (HC_nominer). The YOLOv8n single-stage object detection algorithm was used as the baseline model, and a coal mine cmopd intelligent recognition algorithm based on multi-scale feature fusion was proposed.In the image preprocessing stage, adaptive histogram equalization was employed to enhance image quality, and random rectangle masking was applied to simulate real scenarios where cmopd is occluded by underground objects during operation. This approach addressed the scarcity of cmopd image datasets and reduced the interference from negative underground environments. In the feature extraction stage, the partial convolution of the backbone network C2f module is replaced by deformable convolution, and a novel C2f_DCN module is designed. This enhancement increased the dynamic adjustment capability of the target receptive field for cmopd with different passenger-carrying statuses, allowing the model to capture different scale information and better learn the coupled features of cmopd and miners. As a result, the model became more adaptable to various sitting postures of miners and improved its ability to identify cmopd targets with different passenger-carrying statuses. In the feature fusion stage, a path aggregation network with a coordinate attention mechanism (CLC−PAN−CA) was proposed to achieve cross-level contat of features and adaptively capture the contextual information of cmopd. The CLC−PAN−CA module effectively integrated multi-scale features and improved the accuracy of cmopd recognition. The experimental results show that the proposed model achieves a precision of 95.8%, which is 7.4% higher than the baseline model. The recall is 93.3%, representing an improvement of 9.8%, and the mean average precision is 95.6%, indicating a 7.7% increase. Furthermore, the model parameters and size are only 3.1×106 and 6.1 MB, respectively. The recognition speed is 71 frames per second Compare to a variety of mainstream single-stage two-stage detection models, the proposed model demonstrated effective identification of cmopd targets with and without passengers, significantly improved the accuracy of cmopd recognition, reduced false positives and false negatives, and exhibited faster recognition speed and better extraction of contextual information. The proposed algorithm can meet the requirements of practical inspection scenarios and provide a feasible method for accurate recognition of cmopd with different passenger-carrying statuses. Finally, the proposed cmopd intelligent recognition algorithm and the underground monitoring video stream were embedded into the designed cmopd intelligent recognition system. Partial implementation approaches for deploying the video media stream into the cmopd intelligent recognition system were provided. The concept of an end-to-end integrated cmopd intelligent recognition system, which integrates the dispatching system on the ground and the monitoring system underground, was proposed. This increases the expectations for intelligent inspection applications in coal mines and provides real-time warnings for the safe transportation of cmopd with passengers.
[1] |
王国法. 煤矿智能化最新技术进展与问题探讨[J]. 煤炭科学技术,2022,50(1):1−27. doi: 10.3969/j.issn.0253-2336.2022.1.mtkxjs202201001
WANG Guofa. New technological progress of coal mine intelligence and its problems[J]. Coal Science and Technology,2022,50(1):1−27. doi: 10.3969/j.issn.0253-2336.2022.1.mtkxjs202201001
|
[2] |
王国法,赵国瑞,任怀伟. 智慧煤矿与智能化开采关键核心技术分析[J]. 煤炭学报,2019,44(1):34−41.
WANG Guofa,ZHAO Guorui,REN Huaiwei. Analysis on key technologies of intelligent coal mine and intelligent mining[J]. Journal of China Coal Society,2019,44(1):34−41.
|
[3] |
尹茂振. 煤矿架空乘人装置集中控制系统设计[D]. 徐州:中国矿业大学,2021.
YIN Maozhen. Design of centralized control system for coal mine overhead passenger device[D]. Xuzhou:China University of Mining and Technology,2021.
|
[4] |
王国法. 煤矿智能化十大“痛点”解析及对策[J]. 智能矿山,2021,2(3):1−4.
|
[5] |
王国法,杜毅博,任怀伟,等. 智能化煤矿顶层设计研究与实践[J]. 煤炭学报,2020,45(6):1909−1924.
WANG Guofa,DU Yibo,REN Huaiwei,et al. Top level design and practice of smart coal mines[J]. Journal of China Coal Society,2020,45(6):1909−1924.
|
[6] |
AL-KARKHI N K,ABBOOD W T,KHALID E A,et al. Intelligent robotic welding based on a computer vision technology approach[J]. Computers,2022,11(11):155. doi: 10.3390/computers11110155
|
[7] |
YANG W J,ZHANG X H,MA B,et al. An open dataset for intelligent recognition and classification of abnormal condition in longwall mining[J]. Scientific Data,2023,10(1):416. doi: 10.1038/s41597-023-02322-9
|
[8] |
WANG L Y,WANG X W,LI B. Data-driven model SSD-BSP for multi-target coal-gangue detection[J]. Measurement,2023,219:113244. doi: 10.1016/j.measurement.2023.113244
|
[9] |
YAN P C,SUN Q S,YIN N N,et al. Detection of coal and gangue based on improved YOLOv5.1 which embedded scSE module[J]. Measurement,2022,188:110530. doi: 10.1016/j.measurement.2021.110530
|
[10] |
刘普壮. 基于改进YOLO算法的煤矸识别方法与实验研究[D]. 淮南:安徽理工大学,2022.
LIU Puzhuang. Experimental research on coal gangue recognition method based on improved YOLO algorithm[D]. Huainan:Anhui University of Science & Technology,2022.
|
[11] |
张磊,王浩盛,雷伟强,等. 基于YOLOv5s-SDE的带式输送机煤矸目标检测[J]. 工矿自动化,2023,49(4):106−112.
ZHANG Lei,WANG Haosheng,LEI Weiqiang,et al. Coal gangue target detection of belt conveyor based on YOLOv5s-SDE[J]. Journal of Mine Automation,2023,49(4):106−112.
|
[12] |
徐志强,吕子奇,王卫东,等. 煤矸智能分选的机器视觉识别方法与优化[J]. 煤炭学报,2020,45(6):2207−2216.
XU Zhiqiang,LYU Ziqi,WANG Weidong,et al. Machine vision recognition method and optimization for intelligent separation of coal and gangue[J]. Journal of China Coal Society,2020,45(6):2207−2216.
|
[13] |
程德强,钱建生,郭星歌,等. 煤矿安全生产视频AI识别关键技术研究综述[J]. 煤炭科学技术,2023,51(2):349−365.
CHENG Deqiang,QIAN Jiansheng,GUO Xingge,et al. Review on key technologies of AI recognition for videos in coal mine[J]. Coal Science and Technology,2023,51(2):349−365.
|
[14] |
程德强,徐进洋,寇旗旗,等. 融合残差信息轻量级网络的运煤皮带异物分类[J]. 煤炭学报,2022,47(3):1361−1369.
CHENG Deqiang,XU Jinyang,KOU Qiqi,et al. Lightweight network based on residual information for foreign body classification on coal conveyor belt[J]. Journal of China Coal Society,2022,47(3):1361−1369.
|
[15] |
程德强,陈杰,寇旗旗,等. 融合层次特征和注意力机制的轻量化矿井图像超分辨率重建方法[J]. 仪器仪表学报,2022,43(8):73−84.
CHENG Deqiang,CHEN Jie,KOU Qiqi,et al. Lightweight super-resolution reconstruction method based on hierarchical features fusion and attention mechanism for mine image[J]. Chinese Journal of Scientific Instrument,2022,43(8):73−84.
|
[16] |
郝帅,张旭,马旭,等. 基于CBAM-YOLOv5的煤矿输送带异物检测[J]. 煤炭学报,2022,47(11):4147−4156.
HAO Shuai,ZHANG Xu,MA Xu,et al. Foreign object detection in coal mine conveyor belt based on CBAM-YOLOv5[J]. Journal of China Coal Society,2022,47(11):4147−4156.
|
[17] |
闫志蕊,王宏伟,耿毅德. 基于改进DeeplabV3+和迁移学习的煤岩界面图像识别方法[J]. 煤炭科学技术,2023,51(S1):429−439.
YAN Zhirui,WANG Hongwei,GENG Yide. Coal-rock interface image recognition method based on improved DeeplabV3+ and transfer learning[J]. Coal Science and Technology,2023,51(S1):429−439.
|
[18] |
李伟山,卫晨,王琳. 改进的Faster RCNN煤矿井下行人检测算法[J]. 计算机工程与应用,2019,55(4):200−207. doi: 10.3778/j.issn.1002-8331.1711-0282
LI Weishan,WEI Chen,WANG Lin. Improved faster RCNN approach for pedestrian detection in underground coal mine[J]. Computer Engineering and Applications,2019,55(4):200−207. doi: 10.3778/j.issn.1002-8331.1711-0282
|
[19] |
张明臻. 基于Dense-YOLO网络的井下行人检测模型[J]. 工矿自动化,2022,48(3):86−90.
ZHANG Mingzhen. Underground pedestrian detection model based on Dense-YOLO network[J]. Industry and Mine Automation,2022,48(3):86−90.
|
[20] |
邵小强,李鑫,杨涛,等. 改进YOLOv5s和DeepSORT的井下人员检测及跟踪算法[J]. 煤炭科学技术,2023,51(10):291−301.
SHAO Xiaoqiang,LI Xin,YANG Tao,et al. Underground personnel detection and tracking based on improved YOLOv5s and DeepSORT[J]. Coal Science and Technology,2023,51(10):291−301.
|
[21] |
YANG W J,WU J C,ZHANG J L,et al. Deformable convolution and coordinate attention for fast cattle detection[J]. Computers and Electronics in Agriculture,2023,211:108006. doi: 10.1016/j.compag.2023.108006
|
[22] |
ZHU X Z,HU H,LIN S,et al. Deformable ConvNets V2:more deformable,better results[C]//2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Long Beach,CA,USA. IEEE,2019.
|
[23] |
范瑶瑶,王兴芬,刘亚辉. 改进DeepLabv3+网络的钢板表面缺陷检测研究[J]. 计算机工程与应用,2023,59(16):150−158. doi: 10.3778/j.issn.1002-8331.2210-0249
FAN Yaoyao,WANG Xingfen,LIU Yahui. Improved DeepLabv3+Model for surface defect detection on steel plates[J]. Computer Engineering and Applications,2023,59(16):150−158. doi: 10.3778/j.issn.1002-8331.2210-0249
|
[24] |
颜玉松,尹芳洁,王彩玲. 融合Xception特征提取和坐标注意力机制的血细胞分割[J]. 计算机系统应用,2023,32(1):275−280.
YAN Yusong,YIN Fangjie,WANG Cailing. Blood cell segmentation fusing xception feature extraction and coordinate attention mechanism[J]. Computer Systems & Applications,2023,32(1):275−280.
|
[25] |
OMAR N,SENGUR A,AL-ALI S G S. Cascaded deep learning-based efficient approach for license plate detection and recognition[J]. Expert Systems with Applications,2020,149:113280. doi: 10.1016/j.eswa.2020.113280
|
[26] |
DEVRIES T , TAYLOR G W. Improved regularization of convolutional neural networks with cutout[EB/OL]. 2017. [2024−01−06]. https://arxiv.org/abs/2107.08430.
|
[27] |
DING C,CHEN Y F,LI R Z,et al. Integrating hybrid pyramid feature fusion and coordinate attention for effective small sample hyperspectral image classification[J]. Remote Sensing,2022,14(10):2355. doi: 10.3390/rs14102355
|
[28] |
ZHANG J M,XIE Z P,SUN J,et al. A cascaded R-CNN with multiscale attention and imbalanced samples for traffic sign detection[J]. IEEE Access,2020,8:29742−29754. doi: 10.1109/ACCESS.2020.2972338
|
[29] |
REN S Q,HE K M,GIRSHICK R,et al. Faster R-CNN:towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence,2017,39(6):1137−1149. doi: 10.1109/TPAMI.2016.2577031
|
[30] |
LIU W,ANGUELOV D,ERHAN D,et al. SSD:Single shot MultiBox detector[M]//Computer vision–ECCV 2016. Cham:Springer International Publishing,2016:21−37.
|
[31] |
LIN T Y,GOYAL P,GIRSHICK R,et al. Focal loss for dense object detection[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence,2020,42(2):318−327. doi: 10.1109/TPAMI.2018.2858826
|
[32] |
DANG F Y,CHEN D,LU Y Z,et al. YOLOWeeds:a novel benchmark of YOLO object detectors for multi-class weed detection in cotton production systems[J]. Computers and Electronics in Agriculture,2023,205:107655. doi: 10.1016/j.compag.2023.107655
|
[33] |
GE Z , LIU S , WANG F ,et al. YOLOX: Exceeding YOLO Series in 2021[EB/OL]. [2024−01−06]. https://arxiv.org/abs/2107.08430.
|
[34] |
SELVARAJU R R,COGSWELL M,DAS A,et al. Grad-CAM:visual explanations from deep networks via gradient-based localization[J]. International Journal of Computer Vision,2020,128(2):336−359. doi: 10.1007/s11263-019-01228-7
|
1. |
黄东亮,顾存凯,蔡素燕,刘淑莲. 基于流场仿真的高压高效大型轴流风机设计研发. 浙江科技大学学报. 2025(01): 20-32 .
![]() |