Can Left Atrial Enlargement Be Estimated From the Electrocardiogram? An Artificial Intelligence Approach to Screening Risk of Atrial Fibrillation and Heart Disease

医学 心房颤动 左心房扩大 心脏病学 内科学 P波 心脏病 疾病 窦性心律
作者
Vijay S. Chauhan
出处
期刊:Canadian Journal of Cardiology [Elsevier BV]
卷期号:40 (4): 595-597
标识
DOI:10.1016/j.cjca.2023.11.015
摘要

Assessing the risk of cardiovascular disease is challenging, but essential to appropriate clinical work-up and management. Among the plethora of cardiac risk markers available to primary care physicians, internists, and cardiologist, left atrial enlargement (LAE) is rarely considered, but it has been shown to independently predict atrial fibrillation (AF), heart failure, coronary artery disease, and all-cause mortality.1Bouzas-Mosquera A. Broullón F.J. Álvarez-García N. et al.Left atrial size and risk for all-cause mortality and ischemic stroke.CMAJ. 2011; 183: r657-r664Crossref PubMed Scopus (122) Google Scholar,2Gardin J.M. McClelland R. Kitzman D. et al.M-Mode echocardiographic predictors of six- to seven-year incidence of coronary heart disease, stroke, congestive heart failure, and mortality in an elderly cohort (the Cardiovascular Health Study).Am J Cardiol. 2001; 87: 1051-1057Abstract Full Text Full Text PDF PubMed Scopus (352) Google Scholar In fact, severe LAE (left atrial diameter [LAD] ≥ 50 mm) is associated with a 4-fold increased risk of new-onset AF.3Psaty B.M. Manolio T.A. Kuller L.H. et al.Incidence of and risk factors for atrial fibrillation in older adults.Circulation. 1997; 96: 2455-2461Crossref PubMed Scopus (1255) Google Scholar As such, there may be value in screening for LAE even before it is apparent on echocardiography. The 12-lead electrocardiogram (ECG) is a readily available diagnostic tool, but standard criteria for LAE are insensitive because only a few P-wave dimensions in leads II and V1 are considered.4Munuswamy K. Alpert M.A. Martin R.H. et al.Sensitivity and specificity of commonly used electrocardiographic criteria for left atrial enlargement determined by M-mode echocardiography.Am J Cardiol. 1984; 53: 829-832Abstract Full Text PDF PubMed Scopus (72) Google Scholar,5Lee K.S. Appleton C.P. Lester S.J. et al.Relation of electrocardiographic criteria for left atrial enlargement to two-dimensional echocardiographic left atrial volume measurements.Am J Cardiol. 2007; 99: 113-118Abstract Full Text Full Text PDF PubMed Scopus (38) Google Scholar In the past decade, artificial intelligence using deep learning has been applied to a variety of complex biological signals, including the ECG, to enhance the diagnosis and prognosis of cardiovascular disease.6Siontis K.C. Noseworthy P.A. Attia Z.I. Friedman P.A. Artificial intelligence-enhanced electrocardiography in cardiovascular disease management.Nate Rev. 2021; 18: 465-478Google Scholar Given the nuances in abnormal P-wave morphology and its company with the QRS-T wave, analysis of the 12-lead ECG waveform with the use of deep learning may improve ECG-based detection of LAE. In this issue of the Canadian Journal of Cardiology, Chou et al.7Chou C.C. Liu Z.Y. Chang P.C. et al.Comparing artificial intelligence-enabled electrocardiogram models in identifying left atrium enlargement and long-term cardiovascular risk.Can J Cardiol. 2024; 40: 585-594Abstract Full Text Full Text PDF Scopus (1) Google Scholar develop a deep learning–enabled ECG model to detect LAE ≥ 50 mm and its long-term cardiovascular risk. Their single-centre, retrospective study included 382,593 consecutive adults each with a 12-lead ECG and echocardiogram performed within 2 weeks of one another. The echocardiogram provided the gold standard for classifying LAE (ie, LAD in the parasternal long-axis view). A convolutional neural network (CNN) was trained using 1 of 4 different 10-second ECG data sets—12-lead P-QRS-T waves, 12-lead segmented P-waves, and single-lead I or V2 P-QRS-T waves—to discriminate LAD ranging in size from 38 to 50 mm (ECGs for training 35%, internal validation 15%, testing 50%). External validation was performed using a separate cohort of 91,425 patient ECGs from another hospital. Cardiovascular events associated with LAE ≥ 50 mm were assessed from administrative International Classification of Diseases codes. The main study finding was that all 4 deep learning ECG models performed similarly in classifying LAE, with performance improving as LAE cutoff points increased from 38 mm (receiver operating characteristic (ROC) area under curve (AUC) 0.74-0.79) to 50 mm (ROC AUCs 0.86-0.88). The models performed equally well with external validation (ROC AUCs 0.85-0.88). When all 4 deep learning ECG models were developed separately on the 338,264 sinus rhythm ECGs vs 60,766 non–sinus rhythm ECGs (eg, AF, flutter, frequent premature atrial beats), model performance was better for the sinus rhythm ECGs in detecting LAE ≥ 50 mm (ROC AUCs 0.85-0.73). With deep learning ECG–detected LAE ≥ 50 mm, the hazard ratios (HRs) for predicting new-onset AF (HR 9.7-10.1), heart failure (HR 4.6-5.3), stroke (HR 1.5-1.6) and acute myocardial infarction (HR 1.7-2.1) over 1-year follow-up was similar with all 4 ECG models. These HRs were higher than those for echocardiography-detected LAE ≥ 50 mm, due in part to a lower predicted event rate among patients classified as LAE < 50 mm using the CNN ECG models. Overall, this is a well designed, rigourous study with novel findings that build on the literature of deep learning–enabled ECG analysis for diagnostic screening and prognostication. The selection of the database used for training significantly affects the quality and generalisability of machine learning models. Here, a large, diverse real-world ECG database was used that included sinus, nonsinus and even noisy ECGs unique to each patient. In addition, the model was externally validated to improve generalisability. The optimal ECG input format for deep neural network training remains an open question, but the signal-based ECG used in this study permitted single lead training without ECG grid interference, and this may offer higher fidelity than 2-dimensional ECG images. The CNN architecture itself is well described and necessary for the multidimensional ECG waveform analysis that would not be possible with supervised machine learning. With CNN-assisted diagnosis comes the concern of whether the output can be trusted and interpreted by the clinician and end-user. Chou et al.6Siontis K.C. Noseworthy P.A. Attia Z.I. Friedman P.A. Artificial intelligence-enhanced electrocardiography in cardiovascular disease management.Nate Rev. 2021; 18: 465-478Google Scholar demonstrate better performance with sinus rhythm ECGs than non–sinus rhythm ECGs, suggesting that the P-wave is relevant in their CNN-based classification of LAE ≥ 50 mm. Among all ECGs, the 12-lead P-QRS-T waves only marginally improved model performance compared with the 12-lead segmented P waves, further supporting the relevance of the P-wave in model classification. Before considering the clinical relevance and application of this study, it is important to understand its shortcomings. Foremost, although a large number of ECGs were included with random allocation to training, validation, and testing, significant class imbalance exists in these cohorts because only 5% of all ECGs were from patients with LAE ≥ 50 mm. Standard classification metrics such as ROC AUC do not adequately quantify model performance when data sets are imbalanced. Instead, the precision-recall AUC and F1 score (ie, harmonic mean of precision and recall) are more accurate,8Saito T. Rehmsmeier M. The precision-recall plot is more informative than the ROC plot when evaluating binary classifiers on imbalanced datasets.PLoS One. 2015; 10e0118432Crossref Scopus (2226) Google Scholar and in this study both were low (F1 score 0.27, precision-recall AUC 0.31) indicating poor performance in the internal testing and external validation cohorts. Data augmentation strategies9Perez L, Wang J. The effectiveness of data augmentation in image classification using deep learning [preprint; December 13, 2017]. arXiv:1712.04621.Google Scholar can help to improve class imbalance, but these were not considered. Data augmentation increases the number of cases by adding slightly modified copies of already existing data or newly created synthetic data from existing data. This approach was used by Liao et al.10Liao S. Bokhari M. Chakraborty P. et al.Use of wearable technology and deep learning to improve the diagnosis of Brugada syndrome.JACC Clin Electrophysiol. 2022; 8: 1010-1020Crossref PubMed Scopus (9) Google Scholar when developing a CNN model to classify the type 1 Brugada pattern from 12-lead Holters because only a limited number of cases were available for model training. Another important consideration is selecting the optimal deep learning architecture for training and inference. Although CNNs have dominated the field until now, recent advancements in attention-based generative pretrained transformers and vision transformers offer promising avenues for improving machine learning–enhanced ECG classification.11Pan S.J. Yang Q. A survey on transfer learning.IEEE Trans Knowl Data Eng. 2010; 22: 1345-1359Crossref Scopus (16223) Google Scholar,12van Steenkiste G. van Loon G. Crevecoeur G. Transfer learning in ECG classification from human to horse using a novel parallel neural network architecture.Sci Rep. 2020; 10: 186Crossref PubMed Scopus (38) Google Scholar Pretraining transformer models on large unlabelled data sets may enable the creation of specialised models when there are limited cases or rare diseases. Using a similar approach with transfer learning, Liu et al.13Liu C.M. Liu C.L. Hu K.W. et al.A deep learning–enabled electrocardiogram model for the identification of a rare inherited arrhythmia: Brugada syndrome.Can J Cardiol. 2022; 38: 152-159Abstract Full Text Full Text PDF PubMed Scopus (23) Google Scholar trained a CNN to accurately classify the type 1 Brugada pattern from 12-lead ECGs. Their model parameters from a source model trained on 12-lead ECGs of right bundle branch block were transferred to a target model trained on type 1 Brugada ECGs to enhance its performance. Besides model development, the study has limitations in how clinical metrics and outcomes were evaluated. Although a large real-world ECG database was collected, there may still be referral bias among patients undergoing both ECG and echocardiography within 2 weeks. As such, the performance of the model in an unselected population for LAE screening may be quite different, and class imbalance would become even more significant. Second, detecting LAE based on LAD is far less sensitive than using LA volume, which is now more commonly reported. Third, LAE often does not exist in isolation, but rather coexists with structural heart disease (eg, mitral regurgitation, cardiomyopathy, left ventricular hypertrophy) and clinical conditions (eg, congestive heart failure, AF, hypertension). Without adjusting for these confounders, it is unknown whether LAE independently predicts cardiovascular events or is simply a marker of underlying disease that is driving adverse outcomes. Finally, the interpretation of the higher hazard ratio for cardiovascular events with the CNN ECG model–detected LAE ≥ 50 mm compared with echocardiography-detected LAE ≥ 50 mm is complex and needs clarification. The CNN ECG models classified 5- to 6-fold more patients with LAE > 50 mm compared with echocardiography, indicating many false positives. However, the cardiovascular event rate with CNN ECG–detected LAE < 50 mm was lower (high negative predictive value of 0.96) compared with echocardiography. This difference contributed to the higher hazard ratio with the CNN ECG model–detected LAE ≥ 50 mm, especially for AF and heart failure. Notwithstanding these limitations, the study findings provide a framework for future development in this field that may ultimately pave the way for ECG-enabled LAE screening and prognostication. Interestingly, the performance of the single leads I and V2 ECG models (accuracy ∼ 77%, ROC AUC ∼ 0.85) in discriminating LAE ≥ 50 mm was similar to that of the 12-lead ECG model (accuracy 78%, ROC AUC 0.88). However, single-lead recordings, for example, from wearables, may not be relevant in detecting LAE, unlike arrhythmias, and may compromise ECG signal quality. Model performance also did not improve significantly with inclusion of the QRS-T complex. In contrast, the machine learning–enabled single-lead II ECG model of Hsu et al.14Hsu Cy Liu P.Y. Liu S.H. et al.Machine learning for electrocardiographic features to identify left atrial enlargement in young adults: CHIEF Heart study.Front Cardiovasc Med. 2022; 9840585Crossref Scopus (10) Google Scholar identified LAE > 40 mm based on features from the QRS complex rather than P-wave. It is unexpected that a full 12-lead P-QRS-T disclosure was not more discriminatory for LAE compared with the single-lead P-wave. The QRS-T complex would be influenced by ventricular pathology (eg, left ventricular hypertrophy, cardiomyopathy), which is an important cause of LAE. Furthermore, P-wave morphology would be affected by LA fibrosis burden independently from LA volume, leading to changes in P-wave morphology in multiple leads. As such, deep learning–enabled 12-lead P-QRS-T ECG models may hold the greatest promise in discriminating LAE. The premise of this study is that deep learning ECG models will detect severe LAE in a relatively asymptomatic population without atrial arrhythmias or cardiovascular disease to inform early work-up and treatment. Although this is a rational premise, it may not be clinically relevant because the prevalence of this population is very low (∼ 3%), and it is uncertain whether early intervention will actually change cardiovascular outcomes. Identifying a cohort with a higher pretest probability of severe LAE, such as those with more cardiovascular risk factors, may improve diagnostic yield. Ultimately, echocardiography will still be required to confirm deep learning–ECG detection of severe LAE. Therefore, minimising false positives with appropriate patient selection and a robust deep learning model are essential for cost-effective screening. Chou et al.6Siontis K.C. Noseworthy P.A. Attia Z.I. Friedman P.A. Artificial intelligence-enhanced electrocardiography in cardiovascular disease management.Nate Rev. 2021; 18: 465-478Google Scholar are to be commended for their novel application of deep learning–enabled ECG analysis with inclusion of the P-wave to detect severe LAE. Future studies in diverse populations will be needed to confirm deep learning model performance and the patient populations best served by this screening tool. Although in its infancy, this point of care approach to characterising abnormal atrial electroanatomic substrate has the potential to enhance the prediction of heart failure, AF, stroke, and even response to AF catheter ablation. Whether this prompts early intervention and changes clinical outcomes remains to be determined.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
chenbin完成签到,获得积分10
3秒前
6秒前
Titi完成签到 ,获得积分10
8秒前
1002SHIB完成签到,获得积分10
12秒前
nihaolaojiu完成签到,获得积分10
12秒前
贰鸟应助科研通管家采纳,获得20
12秒前
cdercder应助科研通管家采纳,获得20
12秒前
cdercder应助科研通管家采纳,获得20
12秒前
sheetung完成签到,获得积分10
12秒前
美好灵寒完成签到 ,获得积分10
17秒前
guangshuang完成签到 ,获得积分10
20秒前
自然的含蕾完成签到 ,获得积分10
34秒前
YOLO完成签到 ,获得积分10
41秒前
四叶草完成签到 ,获得积分10
42秒前
roundtree完成签到 ,获得积分10
53秒前
wefor完成签到 ,获得积分10
1分钟前
动听的千萍关注了科研通微信公众号
1分钟前
GankhuyagJavzan完成签到,获得积分10
1分钟前
fzh完成签到,获得积分10
1分钟前
yinlao完成签到,获得积分10
1分钟前
1分钟前
氟锑酸完成签到 ,获得积分10
1分钟前
小贾爱喝冰美式完成签到 ,获得积分10
1分钟前
我是笨蛋完成签到 ,获得积分10
1分钟前
1分钟前
多边棱发布了新的文献求助10
2分钟前
cdercder应助科研通管家采纳,获得20
2分钟前
CodeCraft应助科研通管家采纳,获得10
2分钟前
cdercder应助科研通管家采纳,获得20
2分钟前
gmc完成签到 ,获得积分10
2分钟前
庄怀逸完成签到 ,获得积分10
2分钟前
健康的大船完成签到 ,获得积分10
2分钟前
啤酒人完成签到 ,获得积分10
2分钟前
小鳄鱼一只完成签到,获得积分10
2分钟前
震动的沉鱼完成签到 ,获得积分10
2分钟前
冷傲凝琴完成签到,获得积分10
2分钟前
笨笨完成签到 ,获得积分10
2分钟前
2分钟前
fanssw完成签到 ,获得积分10
2分钟前
2分钟前
高分求助中
Technologies supporting mass customization of apparel: A pilot project 600
Introduction to Strong Mixing Conditions Volumes 1-3 500
Tip60 complex regulates eggshell formation and oviposition in the white-backed planthopper, providing effective targets for pest control 400
A Field Guide to the Amphibians and Reptiles of Madagascar - Frank Glaw and Miguel Vences - 3rd Edition 400
China Gadabouts: New Frontiers of Humanitarian Nursing, 1941–51 400
The Healthy Socialist Life in Maoist China, 1949–1980 400
Walking a Tightrope: Memories of Wu Jieping, Personal Physician to China's Leaders 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 物理 生物化学 纳米技术 计算机科学 化学工程 内科学 复合材料 物理化学 电极 遗传学 量子力学 基因 冶金 催化作用
热门帖子
关注 科研通微信公众号,转发送积分 3798521
求助须知:如何正确求助?哪些是违规求助? 3344082
关于积分的说明 10318422
捐赠科研通 3060628
什么是DOI,文献DOI怎么找? 1679712
邀请新用户注册赠送积分活动 806761
科研通“疑难数据库(出版商)”最低求助积分说明 763353