基于分阶段注意时序对齐的少样本动作识别

FEW-SHOT ACTION RECOGNITION BASED ON TEMPORAL ALIGNMENT WITH MULTI-PHASE ATTENTION MECHANISM

  • 摘要: 由于视频中动作各进展阶段发生时间长短不一,时刻各不相同,动作的时序对齐直接影响少样本动作识别性能。对此提出基于分阶段注意时序对齐的少样本动作识别方法,通过分阶段的注意力机制更精确地实现视频片段的时间对齐,避免视频片段级别的时序错配,更合理地获取利用视频中动作的时序信息;通过剔除相似度过低的视频片段特征时,可有效降低非动作片段的干扰,以提高少样本动作识别的准确率。模型训练采用了c路k样本的元学习的训练方式。将所提出的方法在公开数据集UCF101和Kinetics上进行实验,与相关先进方法比较分析,表明所提出方法的有效性。

     

    Abstract: Since the lengths and moments of actions occurring in the videos vary, the effectiveness of action recognition is directly affected by temporal alignment. A few-shot action recognition method based on temporal alignment with multi-phase attention mechanism is presented in this paper. Through the phased attention mechanism, the temporal alignment of video clips was realized more accurately, the temporal mismatch at the video stage level was avoided, and the temporal information of actions in the video was more reasonably obtained and utilized. By eliminating segment-wise features pairs with low similarity scores, the interference of non-action segments was reduced, and the accuracy of few-shot action recognition was improved. c-way k-shot meta learning was adopted in the training procedure. The experiments were conducted on the UCF-101 and Kinetics datasets, which verified the effectiveness of the proposed method compared with related advanced methods.

     

/

返回文章
返回