Enhancing the reliability and accuracy of AI-enabled diagnosis via complementarity-driven deferral to clinicians (CoDoC)

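Deferral; Complementarity (molecular biology); Computer science; Reliability (semiconductor); Medicine; Reliability engineering; Risk analysis (engineering); Business; Engineering; Biology; Accounting; Power (physics); Physics; Genetics; Quantum mechanics
Authors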
Krishnamurthy Dvijotham,Jim Winkens,Melih Barsbey,Sumedh Ghaisas,Nick Pawlowski,Robert Stanforth,Patricia MacWilliams,Zahra S. Ahmed,Shekoofeh Azizi,Yoram Bachrach,Laura Culp,Mayank Daswani,Jan Freyberg,Christopher Kelly,Atilla P. Kiraly,Scott McKinney,Basil Mustafa,Vivek Natarajan,Krzysztof J. Geras,Jan Witowski
Source
Journal: Research Square - Research Square  Citations: 4
Identifier
DOI:10.21203/rs.3.rs-2231672/v1
Abstract

Diagnostic AI systems trained using deep learning have been shown to achieve expert-level identification of diseases in multiple medical imaging settings [1,2]. However, such systems are not always reliable and can fail in cases diagnosed accurately by clinicians, and vice versa [3]. Mechanisms for leveraging this complementarity by learning to select optimally between discordant decisions of AIs and clinicians have remained largely unexplored in healthcare [4], yet have the potential to achieve levels of performance that exceed what is possible with either AI or clinician alone [4]. We develop a Complementarity-driven Deferral-to-Clinical Workflow (CoDoC) system that can learn to decide when to rely on a diagnostic AI model and when to defer to a clinician or their workflow. We show that our system is compatible with diagnostic AI models from multiple manufacturers, obtaining enhanced accuracy (sensitivity and/or specificity) relative to clinician-only or AI-only baselines in clinical workflows that screen for breast cancer or tuberculosis (TB). For breast cancer, we demonstrate the first system that exceeds the accuracy of double-reading with arbitration (the “gold standard” of care) in a large representative UK screening program, with a 25% reduction in false positives at equivalent true-positive detection, while achieving a 66% reduction in clinical workload. In two separate US datasets, CoDoC exceeds the accuracy of single-reading by board-certified radiologists and of two different standalone state-of-the-art AI systems, and this finding generalises across diagnostic AI manufacturers. For TB screening with chest X-rays, CoDoC improved specificity (while maintaining sensitivity) compared to standalone AI or clinicians for 3 of 5 commercially available diagnostic AI systems (5–15% reduction in false positives).
Further, we show the limits of confidence-score-based deferral systems for medical AI by demonstrating that no deferral strategy could have achieved significant improvement on the remaining two diagnostic AI systems. Our comprehensive assessment demonstrates that the superiority of CoDoC is sustained in multiple realistic stress tests for generalisation of medical AI tools along four axes: variation in the medical imaging modality; variation in clinical settings and human experts; different clinical deferral pathways within a given modality; and different AI software. Further, given the simplicity of CoDoC, we believe that practitioners can easily adapt it, and we provide an open-source implementation to encourage widespread further research and application.
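
The abstract does not spell out how the deferral rule is learned, so the following is only a minimal sketch of the general idea of confidence-score-based deferral, not the CoDoC implementation. It assumes the AI emits a score in [0, 1], that a clinician decision is available for every case, and that cases whose score falls inside a learned band are handed to the clinician. The function names, the grid search, the accuracy objective, and the toy data are all hypothetical and chosen for illustration.

```python
from itertools import combinations_with_replacement


def combined_decision(score, clinician, lo, hi, ai_threshold=0.5):
    """Defer to the clinician when lo <= score < hi; otherwise use the AI."""
    if lo <= score < hi:
        return clinician
    return int(score >= ai_threshold)


def accuracy(decisions, labels):
    """Fraction of decisions that match the ground-truth labels."""
    return sum(d == y for d, y in zip(decisions, labels)) / len(labels)


def fit_deferral_band(scores, clinician, labels, grid=None, ai_threshold=0.5):
    """Grid-search the band (lo, hi) that maximises accuracy on a tuning set.

    lo == hi is an empty band, i.e. the AI is never overridden.
    """
    grid = grid or [i / 10 for i in range(11)]
    best = (0.0, 0.0, -1.0)  # (lo, hi, accuracy)
    for lo, hi in combinations_with_replacement(grid, 2):
        decisions = [
            combined_decision(s, c, lo, hi, ai_threshold)
            for s, c in zip(scores, clinician)
        ]
        acc = accuracy(decisions, labels)
        if acc > best[2]:
            best = (lo, hi, acc)
    return best


# Made-up tuning set: the AI errs on mid-range scores, the clinician
# errs on two cases where the AI is confident and correct.
scores = [0.05, 0.10, 0.45, 0.55, 0.60, 0.90, 0.95, 0.40]
labels = [0, 0, 1, 0, 1, 1, 1, 0]
clinician = [0, 1, 1, 0, 1, 1, 0, 0]

ai_only = accuracy([int(s >= 0.5) for s in scores], labels)
clinician_only = accuracy(clinician, labels)
lo, hi, combined = fit_deferral_band(scores, clinician, labels)
print(f"AI only: {ai_only}, clinician only: {clinician_only}")
print(f"deferral band [{lo}, {hi}) gives accuracy {combined}")
```

On this toy set the errors of the two decision makers do not overlap, which is the complementarity the abstract refers to. A real system would tune sensitivity and specificity separately and evaluate the band on held-out data rather than on the tuning set.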