基于多尺度特征聚合和密集连接的人群计数网络

CROWD COUNTING NETWORK BASED ON MULTI-SCALE FEATURE AGGREGATION AND DENSE CONNECTION

  • 摘要: 人群计数任务处理的图像受到遮挡、视角变化和透视效应、背景成像干扰等多方面的挑战。针对复杂场景中人群计数任务存在的尺度变化、图片噪声等问题,设计一种基于多尺度特征聚合和密集连接的人群计数网络模型。模型的一个重要组成部分是密集连接而成的多尺度特征聚合模块,它通过不同卷积核提取多尺度特征并聚合其跨尺度的信息进行更准确的估计。该网络模型在三个公开数据集Shanghai Tech、UCF_QNRF、UCF_CC_50上进行测试,实验结果表明,该模型相比目前先进算法CSRNet在平均绝对误差(MAE)、均方误差(MSE)均有不同程度的减少,模型精度更好。与其他模型相比,该模型更充分地利用多尺度特征信息,提高了人群计数任务的精度。

     

    Abstract: Crowd counting is challenged by many aspects, such as occlusion, perspective change and perspective effect, background imaging interference, etc. In this paper, we propose a multi-scale feature aggregation and dense connection network for crowd counting. A vital component of this network called multi-scale feature aggregation model (MFA) extracted multi-scale features through different convolution kernels and aggregated their cross-scale information for more accurate estimation. This model was tested on three public datasets Shanghai Tech A, Shanghai Tech B, UCF_QNRF and UCF_CC_50. The results show that the model has reduced the mean absolute error (MAE) and mean square error (MSE) to different degrees, and the model accuracy is better compared with CSRNet. In contrast with other models, the model makes full use of multi-scale feature information and improves the accuracy of the crowd counting.

     

/

返回文章
返回