基于深度可分离卷积与注意力的SSD目标检测模型

SINGLE SHOT MULTI-BOX DETECTOR BASED ON DEPTH-WISE SEPARABLE CONVOLUTION AND ATTENTION

  • 摘要: SSD是基于深度学习的单阶段目标检测模型,但其特征金字塔中的特征图缺乏多尺度信息融合,导致对中小型目标识别效果不佳。针对该问题,提出一种基于注意力机制与深度可分离卷积的SSD目标检测模型(Attention&DSC Single Shot MultiBox Detector, AD-SSD)。AD-SSD首先归一化融合特征金字塔中的特征图,再引入注意力机制加强对目标信息的表征,并采用深度可分离卷积降低参数量。该方法提高了SSD的检测精度的同时,还加快了检测速度。在PASCAL VOC07+12数据集中,AD-SSD获得了81.7%的平均精度(mAP),小型目标精度提高6.3百分点,中型目标精度提高6.4百分点,检测速度达到55.1FPS。

     

    Abstract: SSD, a one-stage detector based on deep learning with simple architecture of feature pyramid network (FPN), has limitation of feature extraction due to lack of cross-scale information fusion which causes poor performance on detecting small and medium objects. To alleviate the problem, this paper proposes AD-SSD (Attention&DSC Single Shot Multi-Box Detector). AD-SSD introduced fast normalized fusion to FPN, and made use of self-attention mechanism to enhance feature information of objects at different scales. Depth-wise separable convolution was used to reduce model parameters. This method not only improved the mean average precision (mAP) of SSD but also accelerated the detection speed. Results on PASCAL VOC07+12 dataset show that it achieves 81.7% mAP at speed of 55.1 FPS while mAP for small and medium object are improved by 6.3% and 6.4% respectively.

     

/

返回文章
返回