基于FPT改进的Mask RCNN算法的道路信息检测研究

ROAD INFORMATION DETECTION BASED ON IMPROVED MASK RCNN ALGORITHM OF FPT

  • 摘要: 针对目前目标检测算法应用于道路信息目标检测效果差,容易出现漏检、错检等问题,提出一种基于FPT (Feature Pyramid Transformer)改进的Mask RCNN道路信息检测算法用于帮助城市道路建设提高工作质量。在特征融合网络中引入借鉴Transformer思想的FPT代替原有的FPN,对特征进行跨空间跨尺度的融合达到特征增强的效果,提高模型检测精度。在实验中,采用迁移学习思想,对PASCAL-VOC2012数据集进行预测结果得到模型的预测结果权重,实验结果表明,该算法相较于原算法在分别采用ResNet50, ResNet101时平均精度分别提高了7.5百分点、10.6百分点,对小目标的检测效果变得更好。

     

    Abstract: In view of the poor effect of target detection algorithm applied to road information target detection and the problems of missed detection and false detection, an improved mask RCNN road information detection algorithm based on FPT (feature pyramid transformer) is proposed to help urban road construction to improve the quality of work. In the feature fusion network, FPT based on transformer idea was introduced to replace the original FPN to fuse features across space and scale, so as to achieve the effect of feature enhancement and improve the accuracy of model detection. In the experiment, the idea of transfer learning was used to pre-train PASCAL-VOC2012 data set, and the pre-training weight was obtained. The experimental results show that the average accuracy of the algorithm is improved by 7.5/10.6 percentage points compared with the original algorithm when ResNet50/ResNet101 is used respectively, and the performance on small targets is better than other commonly-used algorithms.

     

/

返回文章
返回