Review of Textual Similarity Calculation in Deep Learning

Abstract: Textual similarity calculation is one of the important tasks in natural language processing. Drawing on the classical methods and recent advances proposed in the academic literature, this paper classifies and comprehensively reviews the two modules of deep-learning-based textual similarity calculation: text representation and similarity calculation. The development of social networks has given rise to short-text similarity calculation as an important subtask. For each module, the paper therefore surveys the relevant techniques and theoretical foundations of textual similarity calculation and summarizes their specific applications and improvements for short texts. It also compiles the datasets and evaluation metrics commonly used in the field and discusses possible future directions for textual similarity calculation.

     
