Citation: | WANG Yuanbin,GUO Yaru,LIU Jia,et al. Low illumination image enhancement algorithm of CycleGAN coal mine based on attention mechanism and Dilated convolution[J]. Coal Science and Technology,2024,52(S2):375−383. DOI: 10.12438/cst.2023-1597 |
The complex underground environment, filled with a large amount of dust and water vapor, and uneven illumination of artificial light source, leads to problems such as low illumination and loss of detail features in images collected by underground monitoring equipment, which seriously affects the real-time performance of mining safety monitoring equipment, is not good for subsequent computer vision tasks, and it is difficult to collect underground data. It is difficult to make paired low-light image data sets for model training. To solve these problems, a low illumination image enhancement algorithm based on CycleGAN is proposed. In view of the difficulty of collecting paired data set under mine, CycleGAN network is used for unsupervised learning. In order to improve the detail feature extraction ability of the generator network, the image enhancement network was constructed by using the Parameter-Free Attention Mechanism (simAM) and the Dual-Channel Attention Mechanism (CBAM) to improve the anti-interference ability of the model in complex background, so that the model could recover the image detail features better, which improved the anti-interference ability of the model under complex background and made the model recover the detail features better. A luminance enhancement module based on residual cavity convolution is introduced to increase the luminance of the image while enlarging the receptive field of the generator network, which is conducive to detail recovery and visual quality improvement. Patch-GAN is apply for the discriminator of the network, and the input is mapped into a matrix to pay more comprehensive attention to the details of different regions of the image, and improve the discriminator's resolution of image details. Experimental results show that compared with the CycleGAN algorithm, the proposed method improves the peak signal-to-noise ratio (PSNR), structural similarity (SSIM), information entropy and visual information fidelity (VIF) by 11.31%, 8.07%, 2.58% and 6.18% on average.
[1] |
程德强,钱建生,郭星歌,等. 煤矿安全生产视频AI识别关键技术研究综述[J]. 煤炭科学技术,2023,51(2):349−365.
CHENG Deqiang,QIAN Jiansheng,GUO Xingge,et al. Review on key technologies of AI recognition for videos in coal mine[J]. Coal Science and Technology,2023,51(2):349−365.
|
[2] |
贺磊,魏明生,仇欣宇,等. 基于UWB的井下人员定位算法研究[J]. 工矿自动化,2022,48(6):134−138.
HE Lei,WEI Mingsheng,QIU Xinyu,et al. Research on positioning algorithm of underground personnel based on UWB[J]. Journal of Mine Automation,2022,48(6):134−138.
|
[3] |
田文洪,曾柯铭,莫中勤,等. 基于卷积神经网络的驾驶员不安全行为识别[J]. 电子科技大学学报,2019,48(3):381−387.
TIAN Wenhong,ZENG Keming,MO Zhongqin,et al. Recognition of unsafe driving behaviors based on convolutional neural network[J]. Journal of University of Electronic Science and Technology of China,2019,48(3):381−387.
|
[4] |
XU Dawen,LIU Yong. Reversible data hiding in H. 264/AVC videos based on hybrid-dimensional histogram modification[J]. Multimedia Tools and Applications,2022,81(20):29305−29319. doi: 10.1007/s11042-022-12740-3
|
[5] |
PAUL A. Adaptive tri-plateau limit tri-histogram equalization algorithm for digital image enhancement[J]. The Visual Computer,2023,39(1):297−318. doi: 10.1007/s00371-021-02330-z
|
[6] |
于林泉,陆军. 复杂背景下的人眼状态与虹膜位置检测[J]. 哈尔滨工程大学学报,2023,44(4):594−605.
YU Linquan,LU Jun. Human eye state and iris position detection in a complex background[J]. Journal of Harbin Engineering University,2023,44(4):594−605.
|
[7] |
王满利,田子建. 基于非下采样轮廓波变换的矿井图像增强算法[J]. 煤炭学报,2020,45(9):3351−3362.
WANG Manli,TIAN Zijian. Mine image enhancement algorithm based on nonsubsampled contourlet transform[J]. Journal of China Coal Society,2020,45(9):3351−3362.
|
[8] |
王满利,张航,李佳悦,等. 基于深度神经网络的煤矿井下低光照图像增强算法[J]. 煤炭科学技术,2023,51(9):231−241. doi: 10.12438/cst.2022-1626
WANG Manli,ZHANG Hang,LI Jiayue,et al. Deep neural network-based image enhancement algorithm for low-illumination images underground coal mines[J]. Coal Science and Technology,2023,51(9):231−241. doi: 10.12438/cst.2022-1626
|
[9] |
吴一全,史骏鹏. 基于多尺度Retinex的非下采样Contourlet域图像增强[J]. 光学学报,2015,35(3):87−96.
WU Yiquan,SHI Junpeng. Image enhancement in non-subsampled contourlet transform domain based on multi-scale retinex[J]. Acta Optica Sinica,2015,35(3):87−96.
|
[10] |
江兴方,王戈,沈为民. 一种色彩改进型Retinex彩色图像增强方法[J]. 光电子. 激光,2008,19(10):1402−1404.
JIANG Xingfang,WANG Ge,SHEN Weimin. A method of color image enhancement using color advanced retinex[J]. Journal of Optoelectronics. Laser,2008,19(10):1402−1404.
|
[11] |
LORE K G,AKINTAYO A,SARKAR S. LLNet:A deep autoencoder approach to natural low-light image enhancement[J]. Pattern Recognition,2017,61:650−662. doi: 10.1016/j.patcog.2016.06.008
|
[12] |
WU Dongmei,ZHANG Siqi. Research on image enhancement algorithm of coal mine dust[C]//2018 International Conference on Sensor Networks and Signal Processing (SNSP). Piscataway,NJ,USA. IEEE,2018:261−265.
|
[13] |
LYU Feifan,LU Feng,WU Jianhua,et al. MBLLEN:low-light image video enhancement using CNNs[C]//Proceedings of BMVC’18. Washington,DC,USA. BMVC,2018:220−226.
|
[14] |
GUO Xiaojie,LI Yu,LING Haibin. LIME:Low-light image enhancement via illumination map estimation[J]. IEEE Transactions on Image Processing,2017,26(2):982−993. doi: 10.1109/TIP.2016.2639450
|
[15] |
刘超,张晓晖. 超低照度下微光图像的深度卷积自编码网络复原[J]. 光学 精密工程,2018,26(4):951−961. doi: 10.3788/OPE.20182604.0951
LIU Chao,ZHANG Xiaohui. Deep convolutional autoencoder networks approach to low-light level image restoration under extreme low-light illumination[J]. Optics and Precision Engineering,2018,26(4):951−961. doi: 10.3788/OPE.20182604.0951
|
[16] |
TAO Li,ZHU Chuang,XIANG Guoqing,et al. LLCNN:A convolutional neural network for low-light image enhancement[C]//2017 IEEE Visual Communications and Image Processing (VCIP). Piscataway,NJ,USA. IEEE,2017:1−4.
|
[17] |
GOODFELLOW I J,POUGET-Abadie J,MIRZA M,et al. Generative adversarial nets[C]//Proceedings of the 27th International Conference on Neural Information Processing Systems. Cambridge,MA,USA. MIT Press,2014:2672−2680.
|
[18] |
ZHANG He,SINDAGI V,PATEL V M. Image de-raining using a conditional generative adversarial network[J]. IEEE Transactions on Circuits and Systems for Video Technology,2020,30(11):3943−3956. doi: 10.1109/TCSVT.2019.2920407
|
[19] |
ISOLA P,ZHU Junyan,ZHOU Tinghui,et al. Image-to-image translation with conditional adversarial networks[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway,NJ,USA. IEEE,2017:5967−5976.
|
[20] |
JIANG Yifan,GONG Xinyu,LIU Ding,et al. EnlightenGAN:Deep light enhancement without paired supervision[J]. IEEE Transactions on Image Processing,2021,30:2340−2349. doi: 10.1109/TIP.2021.3051462
|
[21] |
ZHU Junyan,PARK T,ISOLA P. Unpaired image-to-image translation using cycle-consistent adversarial networks [C]//2017 IEEE International Conference on Computer Vision (ICCV). Piscataway,NJ,USA. IEEE,2017:2223−2232.
|
[22] |
阮顺领,李少博,顾清华,等. 基于双向特征融合的露天矿区道路障碍检测[J]. 煤炭学报,2023,48(3):1425−1438.
RUAN Shunling,LI Shaobo,GU Qinghua,et al. Road obstacle detection in open-pit mines based on bidirectional feature fusion[J]. Journal of China Coal Society,2023,48(3):1425−1438.
|
[23] |
吴佳奇,张文琪,陈伟,等. 基于改进CycleGAN的煤矿井下低照度图像增强方法[J]. 华中科技大学学报(自然科学版),2023,51(5):40−46.
WU Jiaqi,ZHANG Wenqi,CHEN Wei,et al. Image enhancement method of underground low illumination in coal mine based on improved CycleGAN[J]. Journal of Huazhong University of Science and Technology (Natural Science Edition),2023,51(5):40−46.
|