基于多粒度语义匹配的编程任务解决方案推荐

PROGRAMMING TASK-ORIENTED SOLUTION RECOMMENDATION METHOD BASED ON MULTI-GRANULARITY SEMANTIC MATCHING

  • 摘要: 现有的编程任务检索工作或是仅基于问题标题进行检索;或是将问题全文与标题拼接后一起表征,忽视了不同类型文本之间的差异,导致推荐结果与任务之间存在差距。为解决上述问题,提出一种基于多粒度语义匹配的编程任务解决方案推荐方法MGSMR。该方法分别基于标题和全文两个粒度使用不同的语义匹配模型寻找相关问答讨论作为候选解决方案,整合两个粒度的检索结果进行重排,补充API文档等外部知识生成解决方案推荐给开发人员。实验结果表明该方法在两个数据集的多个评测指标上较对比方法提升了18%~26%和2%~4%,可缩短用户23%的搜索时间,并提升所选答案22%的正确性。

     

    Abstract: Existing programming solution recommendation methods either directly ignore the full text of the question or fail to find a suitable way to exert its value, resulting in inaccurate recommendation results. In this paper, we propose MGSMR, a programming task-oriented solution recommendation method based on multi-granularity semantic matching. For a given programming task, MGSMR used the semantic matching model to find relevant question-and-answer discussions as candidate solutions based on two different granularities of the question title and the full text of the question. It further re-ranked the candidate solutions based on the multi-granularity matching method. It combined candidate solutions with external knowledge such as API documentation and third-party libraries to generate recommendation solutions. The evaluation shows that this method outperforms baselines in terms of multiple IR indicators of two datasets by 18%~26% and 2%~4%, and solution generation can help participants solve programming tasks 23% faster and 22% more accurately.

     

/

返回文章
返回