基于改进型生成对抗网络的矿井图像超分辨重建方法研究

张帆; 刘莹; 宋惠; 张嘉荣; 程海星

doi:10.12438/cst.2024-0298

基于改进型生成对抗网络的矿井图像超分辨重建方法研究

Super-resolution reconstruction of mine image based on generative adversarial network

摘要

摘要: 智能化无人开采是煤炭资源绿色、智能、安全、高效开采的技术发展趋势，高分辨率的矿井图像能够为煤矿智能开采和智能监控提供关键技术支撑。针对煤矿井下雾尘环境，目前采用常规的深度学习方法虽然能够提高矿井图像重建效果，但是受井下环境噪声影响，模型训练的稳定性较差，难以获得矿井图像的重建高频信息，导致图像重构质量欠佳，易出现矿井图像模糊和分辨率下降等问题。针对上述问题，提出一种基于生成对抗网络的矿井图像超分辨率重建方法。该方法基于SRGAN网络，对网络结构和损失函数进行改进优化，在生成器的浅层特征提取层和重建层分别采用2个5×5的卷积层，并在浅层特征提取层的每个卷积层后加入非线性激活函数，深层特征提取层采用残差结构，通过级联亚像素卷积层以实现矿井图像不同倍数的超分辨重建；采用Wasserstein距离对损失函数进行改进，并去掉判别器输出层的Sigmoid，使用RMSProp方法对网络进行优化，提高模型训练的收敛速度和稳定性；利用训练好的生成器模型，据此分别对矿井图像进行2倍和4倍超分辨重建，并对实验结果进行主观视觉分析和客观评价。结果表明，与传统的双三次插值、SRCNN、SRGAN相比，在相同缩放因子条件下，所提方法的峰值信噪比分别提升了2.68、1.50和1.59 dB，结构相似性分别提升了0.0334、0.0048和0.0061，所提方法能够重建出清晰的矿井图像纹理和细节信息，在主观视觉上以及峰值信噪比和结构相似性上都实现了更好的重建效果，且整体性能优于其他几种方法，有效提高了矿井图像的分辨率。

Abstract: Intelligent unmanned mining of coal mine is the technological development trend of green, intelligent and safe mining of coal resources. High-resolution mine images can provide key technical support for intelligent unmanned mining of coal mine. Aiming at the degradation phenomenon of mine images, in order to improve the resolution of mine images, a super-resolution reconstruction method mine image based on generative adversarial network is proposed. Based on SRGAN, this method improves the network structure and loss function. First, two 5×5 convolutional layers are used in the low-level feature extraction layer and reconstruction layer of the generator, and non-linearity is added after each convolutional layer of the low-level feature extraction layer, and the high-level feature extraction layer adopts the residual structure, and the sub-pixel convolutional layer is cascaded to achieve super-resolution reconstruction of different multiples. Secondly, the Wasserstein distance is used to improve the loss function, and the Sigmoid of the output layer of the discriminator is removed. The RMSProp method is used to improve the network optimization method, which can make the model training more stable. Finally, the trained generator model is used to reconstruct the mine images by 2 times and 4 times super-resolution, and subjective visual analysis and objective evaluation were carried out on the experimental results. The results show that compared with the traditional bicubic interpolation, SRCNN, and SRGAN, when the scaling factor is 4, the peak signal-to-noise ratio of the proposed method is increased by 2.68, 1.50 and 1.59 dB; the structural similarity is increased by 0.0334, 0.0048 and 0.0061. The method proposed achieves better reconstruction results in terms of subjective vision, peak signal-to-noise ratio and structural similarity. The improved method can reconstruct clear texture and detail information, and its overall performance is better than several other methods.

HTML全文

参考文献(25)

施引文献

资源附件(0)