基于MobileViT轻量化网络的目标检测算法

OBJECT DETECTION ALGORITHM BASED ON MOBILEVIT LIGHTWEIGHT NETWORK

  • 摘要: 针对当前目标检测卷积神经网络参数量大、结构复杂、不善于提取全局特征信息的问题,基于MobileViT轻量化网络并结合YOLO系列算法的研究成果,设计一个轻量化的目标检测网络模型。将MobileViT作为骨干网络,增加提取全局特征信息的能力,而后使用PAN拓扑结构进行多尺度特征融合,再采用Decoupled Head结构提高检测精度。提出的网络模型以更小的参数量,在VOC07+12上的检测精度平均达到85.5%。在NVIDIA RTX3060 GPU上的检测速度为80.3FPS。实验结果表明,提出的算法与其他体量相当的目标检测算法相比,参数量更小,检测精度更高。

     

    Abstract: Object detection algorithms based on deep convolutional neural networks usually have a large number of parameters and complex structures and are poor at extracting global feature information. In view of this, we propose a lightweight object detection model based on MobileViT, which combines the results of the YOLO family of algorithms. The MobileViT and SimPPF modules were used as the backbone network, and the PAN topological structure was used for multi-scale feature fusion, while the Decoupled Head structure was used to improve detection accuracy. On VOC07+12, the proposed model achieved a detection accuracy of 85.5% AP with smaller parameters. On NVIDIA RTX3060 GPU, the detection speed is 80.3 FPS. The experimental results show that the algorithm in this paper has a smaller number of parameters and higher detection accuracy than other object detection algorithms of similar scales.

     

/

返回文章
返回