一种多路召回与加权排序的图像检索系统

A MULTI-WAY RECALL AND WEIGHTED SORTING IMAGE RETRIEVAL SYSTEM

  • 摘要: 随着数字图像数据的爆发式增长,图像检索技术面临准确率与效率难以平衡的核心挑战。传统方法过度依赖局部特征,但易忽略全局语义信息,导致检索结果准确性受限。为此,提出一种多路召回与动态加权融合的图像检索系统,通过全局、局部特征协同优化机制实现高效精准检索,主要被金融公司用于识别商户的证件照片和门店照片等实际应用场景。基于DINOv2模型提取图像深层语义特征,结合SIFT/VLAD编码的局部细节特征,构建双路召回策略,兼顾语义一致性与细粒度差异;设计自适应加权函数,以局部特征重合度为依据动态调整全局与局部特征的融合权重,解决传统固定权重策略的泛化性问题。实验表明,在负样本十万分之一误识率条件下,该算法较DINOv2模型有明显提升,且显著降低对单一模型的依赖。

     

    Abstract: With the explosive growth of digital image data, image retrieval technology faces the core challenge of balancing accuracy and efficiency. Traditional methods overly rely on local features but tend to neglect global semantic information, which limits the accuracy of retrieval results. To address this issue, this paper proposes an image retrieval system featuring multi-path recall and dynamic weighted fusion. By optimizing the global and local features collaboratively, it achieved efficient and precise retrieval, mainly applied in the financial company practical scenarios such as identifying merchant’s ID photos and store photos. Deep semantic features were extracted based on the DINOv2 model, combined with local detail features encoded by SIFT/VLAD, to construct a dual-path recall strategy that took into account both semantic consistency and fine-grained differences. An adaptive weighting function was designed to dynamically adjust the fusion weights of global and local features based on the degree of local feature overlap, solving the generalization problem of traditional fixed-weight strategies. Experiments show that under the condition of a false acceptance rate of one in 100 000 for negative samples, this algorithm significantly outperforms the DINOv2 model and reduces the reliance on a single model.

     

/

返回文章
返回