Pathological
Nephrology
Artificial intelligence
Medicine
Computer science
Psychology
Internal medicine
Authors
Eiichiro Uchino,Kanata Suzuki,Noriaki Sato,Ryosuke Kojima,Yoshinori Tamada,Shusuke Hiragi,Hideki Yokoi,Nobuhiro Yugami,Sachiko Minamiguchi,Hironori Haga,Motoko Yanagita,Yasushi Okuno
Identifier
DOI:10.1016/j.ijmedinf.2020.104231
Abstract
• Artificial intelligence models classified glomerular images of renal biopsy.
• Seven major pathological findings were automatically classified by deep learning.
• Majority decision among experts and our models can improve diagnostic performance.

Automated classification of glomerular pathological findings is potentially beneficial in establishing an efficient and objective diagnosis in renal pathology. While previous studies have verified artificial intelligence (AI) models for the classification of global sclerosis and glomerular cell proliferation, several other glomerular pathological findings are required for diagnosis, and comprehensive models for the classification of these major findings have not yet been reported. Whether cooperation between these AI models and clinicians improves diagnostic performance also remains unknown.

Here, we developed AI models to classify glomerular images for the major findings required for pathological diagnosis and investigated whether those models could improve the diagnostic performance of nephrologists.

We used a dataset of 283 kidney biopsy cases comprising 15,888 glomerular images that were annotated by a total of 25 nephrologists. AI models to classify seven pathological findings (global sclerosis, segmental sclerosis, endocapillary proliferation, mesangial matrix accumulation, mesangial cell proliferation, crescent, and basement membrane structural changes) were constructed using deep learning by fine-tuning an InceptionV3 convolutional neural network. Subsequently, we compared agreement with the truth labels between the majority decision among nephrologists with and without the AI model as a voter.

Our model for global sclerosis showed high performance (area under the curve: periodic acid-Schiff, 0.986; periodic acid methenamine silver, 0.983); the models for the other findings also showed performance close to that of nephrologists. When the AI model output was added to the majority decision among nephrologists, out of the 14 constructed models, the results of the majority decision showed improvement in sensitivity for 10 models (four statistically significant) and in specificity for eight models (five statistically significant).

Our study showed a proof-of-concept for the classification of multiple glomerular findings using a comprehensive deep learning approach and suggested its potential effectiveness in improving the diagnostic accuracy of clinicians.

Conclusion: AI models for classifying 7 major findings of glomeruli were developed, which may improve clinicians' diagnostic accuracy.
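The comparison of majority decisions with and without the AI model as a voter can be illustrated with a small sketch. This is not the authors' code: the function names, the toy labels, and the tie rule (a finding counts as present only when strictly more than half of the voters say so) are assumptions made for illustration, since the abstract does not state how many nephrologists voted per image or how ties were handled.

```python
from typing import List, Sequence, Tuple


def majority_vote(votes: Sequence[int]) -> int:
    """Return 1 if strictly more than half of the votes are positive, else 0.

    Assumption: ties are resolved as negative (finding absent).
    """
    return 1 if 2 * sum(votes) > len(votes) else 0


def sensitivity_specificity(pred: Sequence[int], truth: Sequence[int]) -> Tuple[float, float]:
    """Compute (sensitivity, specificity) of binary predictions against truth labels."""
    tp = sum(1 for p, t in zip(pred, truth) if p == 1 and t == 1)
    fn = sum(1 for p, t in zip(pred, truth) if p == 0 and t == 1)
    tn = sum(1 for p, t in zip(pred, truth) if p == 0 and t == 0)
    fp = sum(1 for p, t in zip(pred, truth) if p == 1 and t == 0)
    sens = tp / (tp + fn) if (tp + fn) else float("nan")
    spec = tn / (tn + fp) if (tn + fp) else float("nan")
    return sens, spec


# Hypothetical toy data for one finding: six images, two nephrologist votes each.
truth: List[int] = [1, 1, 0, 0, 1, 0]
nephrologist_votes: List[List[int]] = [[1, 1], [1, 0], [0, 0], [1, 0], [0, 0], [0, 1]]
ai_votes: List[int] = [1, 1, 0, 0, 1, 1]  # hypothetical thresholded model outputs

# Majority decision among nephrologists only.
pred_without = [majority_vote(v) for v in nephrologist_votes]

# Majority decision with the AI model added as one more voter.
pred_with = [majority_vote(v + [a]) for v, a in zip(nephrologist_votes, ai_votes)]

sens_without, spec_without = sensitivity_specificity(pred_without, truth)
sens_with, spec_with = sensitivity_specificity(pred_with, truth)

print(f"without AI: sensitivity={sens_without:.2f}, specificity={spec_without:.2f}")
print(f"with AI:    sensitivity={sens_with:.2f}, specificity={spec_with:.2f}")
```

In this toy data the added voter raises sensitivity and lowers specificity, which shows why the two metrics have to be reported separately for each model. The numbers here are invented and say nothing about the study's actual results.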