人工智能
深度学习
计算机科学
生成模型
一般化
医学影像学
骨闪烁照相术
模式识别(心理学)
机器学习
核医学
医学
生成语法
数学
数学分析
作者
David Haberl,Jing Ning,Kilian Kluge,Katarina Kumpf,Josef Yu,Zewen Jiang,Cláudia S. Constantino,A. Monaci,Maria Starace,Alexander Haug,René Rettl,Luca Camoni,Francesco Bertagna,Katharina Mascherbauer,Felix Hofer,Domenico Albano,Roberto Sciagrà,Francisco P. M. Oliveira,Durval C. Costa,Christian Nitsche
标识
DOI:10.1007/s00259-025-07091-8
摘要
Abstract Purpose Advancements of deep learning in medical imaging are often constrained by the limited availability of large, annotated datasets, resulting in underperforming models when deployed under real-world conditions. This study investigated a generative artificial intelligence (AI) approach to create synthetic medical images taking the example of bone scintigraphy scans, to increase the data diversity of small-scale datasets for more effective model training and improved generalization. Methods We trained a generative model on 99m Tc-bone scintigraphy scans from 9,170 patients in one center to generate high-quality and fully anonymized annotated scans of patients representing two distinct disease patterns: abnormal uptake indicative of (i) bone metastases and (ii) cardiac uptake indicative of cardiac amyloidosis. A blinded reader study was performed to assess the clinical validity and quality of the generated data. We investigated the added value of the generated data by augmenting an independent small single-center dataset with synthetic data and by training a deep learning model to detect abnormal uptake in a downstream classification task. We tested this model on 7,472 scans from 6,448 patients across four external sites in a cross-tracer and cross-scanner setting and associated the resulting model predictions with clinical outcomes. Results The clinical value and high quality of the synthetic imaging data were confirmed by four readers, who were unable to distinguish synthetic scans from real scans (average accuracy: 0.48% [95% CI 0.46–0.51]), disagreeing in 239 (60%) of 400 cases (Fleiss’ kappa: 0.18). Adding synthetic data to the training set improved model performance by a mean (± SD) of 33(± 10)% AUC ( p < 0.0001) for detecting abnormal uptake indicative of bone metastases and by 5(± 4)% AUC ( p < 0.0001) for detecting uptake indicative of cardiac amyloidosis across both internal and external testing cohorts, compared to models without synthetic training data. Patients with predicted abnormal uptake had adverse clinical outcomes (log-rank: p < 0.0001). Conclusions Generative AI enables the targeted generation of bone scintigraphy images representing different clinical conditions. Our findings point to the potential of synthetic data to overcome challenges in data sharing and in developing reliable and prognostic deep learning models in data-limited environments. Graphical abstract
科研通智能强力驱动
Strongly Powered by AbleSci AI