计算机科学
人工智能
机器学习
合成数据
样品(材料)
葡萄膜炎
病历
数据挖掘
医学
眼科
外科
色谱法
化学
作者
Heithem Sliman,Imen Megdiche,Loay Alajramy,Adel Taweel,Sami Yangui,Aida Drira,Elyes Lamine
标识
DOI:10.1016/j.iswa.2023.200223
摘要
Clinical decision support based on artificial intelligence (AI) methods has increasingly been employed in medical applications to support medical diagnosis. Developing efficient AI methods, however, depends necessarily on the availability of sufficiently large amount of data to provide reliable results. But, in medicine, it is not always possible to find sufficient amount of real data on all pathologies, particularly, for rare diseases. This paper proposes a methodological framework for generating synthetic data using data augmentation techniques combined with epidemiological profiles. It focuses on Uveitis, a rare disease in ophthalmology, which is difficult to diagnose because of the disparity in prevalence of its etiologies. The generated synthetic data have been qualitatively validated by specialist ophthalmologists and quantitatively tested using machine learning methods. Results show that, of a randomly selected sample of the generated data, more than 55% were assessed as good or excellent, which is very promising for generating synthetic, validated as near-real, medical data for rare diseases. They also show that the proposed framework is consistent in generating synthetic data, for Uveitis pathology, of different dataset sizes, achieving more than 80% diagnosis prediction accuracy for 2000 patient records or larger.
科研通智能强力驱动
Strongly Powered by AbleSci AI