融合深度字典学习和特征重建的遮挡人脸检测研究

AN OCCLUSION FACE DETECTION METHOD COMBINING DEPTH DICTIONARY LEARNING AND FEATURE RECONSTRUCTION

  • 摘要: 针对基于短时能量和短时过零率的传统端点检测算法在信噪比低于10 dB时其准确性明显下降的问题,提出一种基于声谱图特征增强的能零积端点检测改进算法。首先,对音频信号进行谱减法降噪,提升信号的信噪比;其次,依次对声谱图进行腐蚀、二值化和膨胀处理,以实现声谱图特征增强;最后,提取能零积特征,并利用双阈值端点检测算法对音频信号进行端点检测。实验结果表明,该算法在不同信噪比条件下可以有效捕获有环境异常音片段,具有良好的鲁棒性。

     

    Abstract: Aimed at the low accuracy of occluded face detection in complex scenes, an occlusion face detection method combining depth dictionary learning and feature reconstruction is proposed. A shallow CNN was used to generate face candidate regions, and the pre-trained VGG16 network was used to characterize them. A sparse coding method was used to establish a deep retrieval dictionary composed of typical faces and non-faces. Using the locality preserving projections method, the feature descriptor of the face candidate region was reconstructed into a similarity-based feature vector by using the retrieval dictionary. The reconstructed feature vector was sent to the deep neural network to perform face/nonface classification and face bounding box location regression simultaneously. The experimental results on the MAFA occlusion face dataset show that the detection accuracy of this method is about 12.3 percentage points higher than the current mainstream face detection method.

     

/

返回文章
返回