基于LBC模型的公共资源交易数据命名实体识别研究

LBC: A NAMED ENTITY RECOGNITION FRAMEWORK IN PUBLIC RESOURCE TRANSACTION DATA

  • 摘要: 公共资源交易数字化推动了公共资源交易数据指数级的增长,提升交易主体、项目属性等实体识别的准确度与泛化能力成为关键挑战。提出一种基于LBC(LIFL-BERT-CRF)模型的公共资源交易数据命名实体识别研究方法,使用LIFL改善单个字符表达语义不足的问题,基于BERT进行预训练,通过CRF考虑整个序列的上下文关系,实现公共资源交易命名实体识别全局优化的决策。实验结果表明该模型在准确性和有效性方面明显优于现有的基线方法。

     

    Abstract: With the acceleration of digitalization in public resource transactions, data such as tender announcements, bid-winning notices, and bidding records have shown an exponential growth trend. These data contain core information including transaction entities, project attributes, and related entities. How to achieve accurate and generalized entity recognition in such data remains an urgent challenge in this field. This paper proposes a named entity recognition (NER) method for public transaction data based on the LBC (LIFL-BERT-CRF) model. The LIFL module addressed the limitation of insufficient semantic representation in individual characters, BERT provided pre-trained contextual embedding, and the CRF layer optimized global sequence dependencies by considering contextual relationships across the entire sequence. This integrated approach achieved globally optimized decision-making for NER in public resource transactions. The experimental results demonstrate that the proposed model significantly outperforms existing baseline methods in both accuracy and effectiveness.

     

/

返回文章
返回