基于语义拼写理解和门控注意力机制的不良言论检测

OFFENSIVE LANGUAGE DETECTION BASED ON SEMANTIC SPELLING COMPREHENSION AND GATED ATTENTION MECHANISM

  • 摘要: 如何自动检测网络传播的不良言论信息是自然语言处理研究领域的热门研究内容之一。针对不良言论中语义表达和拼写习惯的特点,提出一种基于语义拼写理解和门控注意力机制的不良言论检测方法。该方法采用自注意力机制获取文本的语义特征,采用卷积神经网络提取文本的拼写特征,采用前期特征融合和门控注意力机制相结合的方式融合语义和拼写特征。在两个公共数据集上的实验结果表明,提出的模型能够有效地提取不良言论的语义特征,提高不良言论检测的性能。

     

    Abstract: How to automatically detect offensive language information spread on the Internet is one of the hot research contents in the field of natural language processing. Aiming at the characteristics of semantic expression and spelling habits in offensive language, this paper proposes a offensive language detection method based on semantic spelling understanding and gating attention mechanism. This method used a self-attention mechanism to obtain the semantic features of the text, used a convolutional neural network to extract the spelling features of the text, and used a combination of early feature fusion and gated attention mechanism to fuse semantic and spelling features. Experimental results on two public datasets show that the proposed model can effectively extract the semantic features of offensive language and improve the performance of offensive language detection.

     

/

返回文章
返回