基于GoogleNet-ViT模型的心律不齐多标签诊断算法

黄浩; 朱俊江

doi:10.3969/j.issn.1000-386x.2025.05.034

基于GoogleNet-ViT模型的心律不齐多标签诊断算法

黄浩,
朱俊江

MULTI-LABEL DIAGNOSIS ALGORITHM OF ARRHYTHMIA VIA GOOGLENET-ViT MODEL

摘要

摘要: 心电图是诊断心律失常疾病的重要依据，利用计算机自动诊断心电图类型属于多标签分类问题。Vision Transformer (ViT) 模型在多标签分类上具有良好的表现。然而直接将其应用到心电图分类时，会破坏心电信号内部的形状特征，进而导致模型准确率较低。为此，提出一种基于GoogleNet-ViT模型心律不齐多标签分类算法。该算法使用预测的GoogleNet提取心电信号特征，然后采用Transformer Encoder构建特征全局关系，最后采用全连接层完成多标签分类。选用20 409例临床心电图数据进行测试，结果表明算法平均F1值达到7.0、862.3，平均准确率为97.68%，诊断标签完全正确的比例为83.14%，相比于ViT模型和其他常规CNN网络，该算法具有明显优势。

Abstract: Electrocardiogram (ECG) has been proved to be the most common and effective approach to investigate arrhythmia. The automatic diagnosis algorithm of arrhythmia can be seen as a multi-label classification problem. The vision transformer (ViT) model has a good performance on classification problems. However, when it is directly applied to ECG classification, it will destroy the shape features inside the ECG signal, resulting in lower model accuracy. To this end, a multi-label classification algorithm for arrhythmia based on the GoogleNet-ViT model is proposed. The algorithm used the pre-trained GoogleNet to extract features instead of directly segmenting the ECG signal, and only used a single Transformer Encoder to complete the construction of the global relationship of features, and finally inputted the fully connected layer to complete multi-label classification. 20 409 cases of clinical ECG data were selected for testing. The results show that the average F1 value of the algorithm reached 0.862 3, the average accuracy rate is 97.68%, and the proportion of diagnostic labels that are completely correct is 83.14%. Compared with the ViT model and the conventional CNN network, the proposed algorithm has clear advantages.

HTML全文

参考文献(0)

施引文献

资源附件(0)