多尺度融合增强与注意力机制结合的图像语义分割

SEMANTIC SEGMENTATION OF IMAGES COMBINED WITH MULTI-SCALE FUSION ENHANCEMENT AND ATTENTION MECHANISM

摘要: 针对当前图像语义分割中分割效率不高与分割边界不连续问题，提出一种多尺度融合增强与注意力机制结合的语义分割算法。该算法对原有DeepLabv3 +网络结构进行改进，在编码器部分提出一种特征提取增强网络结构，充分利用相邻层各个尺度的特征信息进行融合，在解码器末端使用改进的轻量化卷积注意力模块，使得对于物体边界分割更加充分。通过在Pascal VOC2007和Cityscapes数据集上进行实验验证，结果表明该方法较原有网络的精确度有显著的提高。

Abstract: Aimed at the problems of low segmentation efficiency and discontinuity of segmentation boundaries in the current image semantic segmentation, a semantic segmentation algorithm combined with multi-scale fusion enhancement and attention mechanism is proposed. The algorithm improved the original DeepLabv3 + network structure, proposed a feature extraction enhancement network structure in the encoder part, made full use of the feature information of each scale of the adjacent layer for fusion, and used the improved lightweight convolution attention module at the end of the decoder, making the object boundary segmentation more complete. Experimental verification on the Pascal VOC2007 and Cityscapes datasets shows that the accuracy of the proposed method is significantly improved compared with the original network.