人工智能
计算机科学
前列腺癌
数据提取
磁共振成像
机器学习
模式识别(心理学)
特征提取
医学影像学
数据挖掘
患者数据
数据采集
计算机视觉
生成模型
基本事实
合成数据
人工神经网络
生成语法
作者
Anobel Y. Odisho,Andrew W. Liu,William A. Pace,Marvin N. Carlisle,Robert Krumm,Janet E. Cowan,Peter R. Carroll,Matthew R. Cooperberg
摘要
PURPOSE: Radiology reports are stored as plain text in most electronic health records, rendering the data computationally inaccessible. Large language models are powerful tools for analyzing unstructured text but relatively untested in urologic oncology. We aimed to develop a pipeline to extract data from plain text prostate magnetic resonance imaging (MRI) reports using GPT4.0 and compare the accuracy to manually abstracted data. METHODS: We developed a data pipeline using a secure, enterprise-wide deployment of OpenAI's GPT-4.0 to automatically extract data elements from prostate MRI report text when presented with prostate MRI reports. Identical prompts and reports were sent multiple times to determine response variability. We extracted 15 data elements per report and compared accuracy to a manually abstracted gold standard. RESULTS: < .001). In disagreements between manual and GPT-4.0 extracted data, GPT-4.0 responses were more often deemed correct by an additional reviewer. CONCLUSION: GPT-4.0 had high accuracy with low variability in extracting data points from prostate cancer MRI reports with low upfront programming requirements. This represents an effective tool to expedite medical data extraction for clinical and research use cases.
科研通智能强力驱动
Strongly Powered by AbleSci AI