基于多尺度注意力机制的实例分割卷积神经网络

AN INSTANCE SEGMENTATION CONVOLUTIONAL NEURAL NETWORK BASED ON MULTISCALE ATTENTION MECHANISM

  • 摘要: 在Mask R-CNN实例分割模型的基础上提出一种新的深度学习方法MixedMask。该方法提出并应用两种有效的策略:(1) 使用混合尺度的卷积核,提高网络对分辨率较低实例的提取能力;(2) 在压缩激励网络的基础上进行改进,解决原网络中降低维度导致的通道信息丢失问题。在气球数据集和xBD数据集上进行测试,该算法分别达到了83.46%和58.92%的AP(IoU=50),相比Mask R-CNN模型,分别提升了1.3%和5.9%。

     

    Abstract: Based on the instance segmentation model of Mask R-CNN, this paper proposes a new deep learning method named MixedMask. It offered two effective strategies. (1) The convolution kernel with mixed scale was used to improve the network's capability of extracting low-resolution instances. (2) Based on the squeeze-and-excitation networks, the improvement was made to solve the problem of channel information loss caused by the dimension reduction in the original network. Test on balloon datasets and xBD datasets shows that this method reaches 83.46% and 58.92% AP (IoU=50) respectively. Compared with the Mask R-CNN, the results were increase by 1.3% and 5.9% respectively.

     

/

返回文章
返回