计算机科学
瓶颈
卷积神经网络
人工智能
深度学习
模式识别(心理学)
编码器
变压器
特征学习
人工神经网络
机器学习
数据挖掘
工程类
电压
电气工程
操作系统
嵌入式系统
作者
Siyuan Zhang,Cheng Lian,Bingrong Xu,Yixin Su,Adi Alhudhaif
标识
DOI:10.1016/j.ins.2024.120239
摘要
The 12-lead electrocardiogram (ECG) is a reliable diagnostic tool for detecting and treating severe cardiovascular conditions like arrhythmia and heart attack. Deep neural networks (DNNs) have achieved higher accuracy in recent years than traditional ECG signal classification task methods. Convolutional neural network (CNN) and Transformer are the two mainstream architectures of DNN, respectively good at extracting local and global features from input data. This paper proposes the multi-scale convolutional Transformer network (MCTnet), an efficient combination of Transformer encoder and CNN for ECG signal classification. MCTnet utilizes the advantages of CNN and self-attention mechanisms to capture potential features in ECG signal accurately. The dual-branch Transformer encoder extracts different-scale feature representations, enabling the capture of both local and global information. Additionally, an information bottleneck method eliminates redundant information and enhances task-relevant information in the learned representations. To evaluate the performance of MCTnet, comprehensive experiments are conducted on three commonly used ECG datasets. The results demonstrate that MCTnet outperforms current deep learning-based models, highlighting its effectiveness in ECG signal classification. It also shows that the performance of the model can be effectively improved by utilizing multi-scale representation learning and information bottleneck.
科研通智能强力驱动
Strongly Powered by AbleSci AI