Abstract 17760: Conversion of Dicom ECG Images to Tabular Format for Building Large Language Model in Diagnoses and Disease Progression of Cardiovascular Conditions

DICOM 医学 再现性 医学诊断 心电图 元数据 数据库 人工智能 核医学 心脏病学 内科学 放射科 计算机科学 统计 数学 操作系统
作者
Sujoy Kumar Kar,Shivkumar Jallepalli,Bharath Potla,Sai Praveen Haranath,Sangita Reddy
出处
期刊:Circulation [Ovid Technologies (Wolters Kluwer)]
卷期号:148 (Suppl_1)
标识
DOI:10.1161/circ.148.suppl_1.17760
摘要

Introduction: DICOM Electrocardiography (ECG) images from individuals are routinely stored at the institutional PACS server, including normal and abnormal findings. Hypothesis: The study’s objective is to create a database of ECGs as per Standard Communication Protocol (SCP) and to build an accurate Large Language Model (LLM) across different cardiovascular conditions Methods: DICOM ECGs are retrieved, anonymized and labelled to convert the signals to (x, y) coordinates with the help of PyDicom libraries. The tabular format is then stored with lead definition and dimensions of duration and amplitude, metadata, clinical conditions, and textual diagnoses. Results: In the pilot study, 1308 ECGs from individuals with different age (<45 & >45 years), gender (male & female), binary clinical categories (normal & abnormal) and heart rate (<90 & >90 per min) are selected for analysis and modeling. The DICOM image signal reproducibility following conversion (Figure 1 & 2) shows Cross Correlation Percentage of 0.85 (0.76 - 0.94) (Figure 3). An eXtreme Gradient Boosting (XGBoost) model (6) was trained to predict above variables. The results (Figure 4) show AUC highest for predicting binary clinical categories (normal vs abnormal) at 0.81 (0.7 - 0.9) and lowest for predicting age (<45 & >45 years) at 0.6 (0.46 - 0.7). The accuracies in the leads also show similar trend. The limitations include reproducibility of results in Lead VIII, IX and XI, which are sub average and hence being retrained with optimizers. These are initial results with relatively smaller database where multiclass disease or condition classifications are not performed. Conclusions: To conclude, the methodology possesses opportunities to improve the model with Deep Learning and Entity Disambiguation Techniques (NLP) to build Large Language ECG Models for cardiovascular diagnoses and disease progression trajectories.

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
畅响完成签到 ,获得积分10
3秒前
Jessic发布了新的文献求助10
4秒前
Morgenstern_ZH完成签到,获得积分10
5秒前
5秒前
zhangsan完成签到,获得积分10
6秒前
华仔应助守护星星采纳,获得10
7秒前
敛涌完成签到,获得积分10
9秒前
研友_Lw4Ngn完成签到,获得积分20
9秒前
13秒前
14秒前
高枫关注了科研通微信公众号
15秒前
Adrian发布了新的文献求助20
15秒前
852应助清都山水郎采纳,获得10
16秒前
Docgyj完成签到 ,获得积分10
19秒前
紫金大萝卜举报杏林居士求助涉嫌违规
23秒前
Jessic关注了科研通微信公众号
24秒前
24秒前
可靠铅笔发布了新的文献求助20
25秒前
27秒前
杨杨杨发布了新的文献求助30
29秒前
九月y9完成签到,获得积分10
31秒前
nicelily发布了新的文献求助10
33秒前
小陈住垃圾桶完成签到,获得积分10
35秒前
YY完成签到 ,获得积分10
35秒前
Lucas应助谨慎哈密瓜采纳,获得10
37秒前
杨杨杨完成签到,获得积分20
41秒前
大个应助Siwen采纳,获得10
42秒前
在水一方应助Siwen采纳,获得10
42秒前
英姑应助Siwen采纳,获得10
42秒前
优秀的离子键完成签到 ,获得积分10
43秒前
xyh完成签到 ,获得积分10
47秒前
天行健完成签到,获得积分10
47秒前
48秒前
会飞的猪完成签到,获得积分10
49秒前
Soche发布了新的文献求助10
50秒前
daijk发布了新的文献求助10
51秒前
13ejgjfdd完成签到 ,获得积分20
51秒前
52秒前
53秒前
小欣完成签到,获得积分10
54秒前
高分求助中
Manual of Clinical Microbiology, 4 Volume Set (ASM Books) 13th Edition 1000
We shall sing for the fatherland 500
Chinese-English Translation Lexicon Version 3.0 500
Electronic Structure Calculations and Structure-Property Relationships on Aromatic Nitro Compounds 500
マンネンタケ科植物由来メロテルペノイド類の網羅的全合成/Collective Synthesis of Meroterpenoids Derived from Ganoderma Family 500
[Lambert-Eaton syndrome without calcium channel autoantibodies] 400
Statistical Procedures for the Medical Device Industry 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 有机化学 工程类 生物化学 纳米技术 物理 内科学 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 电极 光电子学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 2378724
求助须知:如何正确求助?哪些是违规求助? 2086055
关于积分的说明 5235309
捐赠科研通 1813049
什么是DOI,文献DOI怎么找? 904706
版权声明 558574
科研通“疑难数据库(出版商)”最低求助积分说明 482984