计算机科学
腹水
混乱
短信
基本事实
管道(软件)
钥匙(锁)
肝硬化
人工智能
本体论
自然语言处理
放射科
医学
内科学
计算机安全
心理学
万维网
哲学
认识论
精神分析
程序设计语言
作者
Isabella C. Wiest,Dyke Ferber,Jiefu Zhu,Marko van Treeck,Sonja K. Meyer,Radhika Juglan,Zunamys I. Carrero,Daniel Paech,Jens Kleesiek,Matthias Ebert,Daniel Truhn,Jakob Nikolas Kather
标识
DOI:10.1038/s41746-024-01233-2
摘要
Most clinical information is encoded as free text, not accessible for quantitative analysis. This study presents an open-source pipeline using the local large language model (LLM) "Llama 2" to extract quantitative information from clinical text and evaluates its performance in identifying features of decompensated liver cirrhosis. The LLM identified five key clinical features in a zero- and one-shot manner from 500 patient medical histories in the MIMIC IV dataset. We compared LLMs of three sizes and various prompt engineering approaches, with predictions compared against ground truth from three blinded medical experts. Our pipeline achieved high accuracy, detecting liver cirrhosis with 100% sensitivity and 96% specificity. High sensitivities and specificities were also yielded for detecting ascites (95%, 95%), confusion (76%, 94%), abdominal pain (84%, 97%), and shortness of breath (87%, 97%) using the 70 billion parameter model, which outperformed smaller versions. Our study successfully demonstrates the capability of locally deployed LLMs to extract clinical information from free text with low hardware requirements.
科研通智能强力驱动
Strongly Powered by AbleSci AI