Standardizing Heterogeneous MRI Series Description Metadata Using Large Language Models

元数据 计算机科学 系列(地层学) 自然语言处理 情报检索 人工智能 万维网 地质学 古生物学
作者
Peter Kamel,Florence X. Doo,Dharmam Savani,Adway Kanhere,Paul H. Yi,Vishwa S. Parekh
标识
DOI:10.1007/s10278-025-01541-3
摘要

MRI metadata, particularly free-text series descriptions (SDs) used to identify sequences, are highly heterogeneous due to variable inputs by manufacturers and technologists. This variability poses challenges in correctly identifying series for hanging protocols and dataset curation. The purpose of this study was to evaluate the ability of large language models (LLMs) to automatically classify MRI SDs. We analyzed non-contrast brain MRIs performed between 2016 and 2022 at our institution, identifying all unique SDs in the metadata. A practicing neuroradiologist manually classified the SD text into: "T1," "T2," "T2/FLAIR," "SWI," "DWI," ADC," or "Other." Then, various LLMs, including GPT 3.5 Turbo, GPT-4, GPT-4o, Llama 3 8b, and Llama 3 70b, were asked to classify each SD into one of the sequence categories. Model performances were compared to ground truth classification using area under the curve (AUC) as the primary metric. Additionally, GPT-4o was tasked with generating regular expression templates to match each category. In 2510 MRI brain examinations, there were 1395 unique SDs, with 727/1395 (52.1%) appearing only once, indicating high variability. GPT-4o demonstrated the highest performance, achieving an average AUC of 0.983 ± 0.020 for all series with detailed prompting. GPT models significantly outperformed Llama models, with smaller differences within the GPT family. Regular expression generation was inconsistent, demonstrating an average AUC of 0.774 ± 0.161 for all sequences. Our findings suggest that LLMs are effective for interpreting and standardizing heterogeneous MRI SDs.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
1秒前
1秒前
King强完成签到,获得积分10
2秒前
科研通AI6.1应助赤足先森采纳,获得20
2秒前
天天完成签到 ,获得积分10
3秒前
4秒前
Lazyazy_完成签到 ,获得积分10
4秒前
搞怪绿柳完成签到,获得积分10
5秒前
山城的酒完成签到,获得积分10
6秒前
AcA发布了新的文献求助10
6秒前
贾明灵完成签到,获得积分10
6秒前
芬芬完成签到,获得积分10
7秒前
科研通AI2S应助zby采纳,获得10
9秒前
10秒前
邹一寡发布了新的文献求助10
11秒前
轻松书白完成签到,获得积分10
11秒前
12秒前
12秒前
小陀螺完成签到,获得积分10
12秒前
怪兽完成签到,获得积分10
12秒前
隐形曼青应助年华采纳,获得10
13秒前
温婉的从阳完成签到,获得积分10
13秒前
Fuckacdemic完成签到,获得积分10
14秒前
14秒前
林正英发布了新的文献求助10
14秒前
考研小白发布了新的文献求助10
16秒前
一棵草完成签到,获得积分10
16秒前
73Jennie123完成签到,获得积分10
16秒前
魏魏魏完成签到,获得积分10
16秒前
爱岗敬业牛马人完成签到,获得积分10
17秒前
17秒前
风清扬应助靤君采纳,获得30
19秒前
Juanjuan完成签到,获得积分10
19秒前
丘比特应助可耐的天菱采纳,获得10
19秒前
不知道发布了新的文献求助10
19秒前
雪白的依玉完成签到 ,获得积分10
21秒前
邹一寡完成签到,获得积分20
22秒前
磊2024完成签到,获得积分10
22秒前
orixero应助AcA采纳,获得10
23秒前
秦奥洋完成签到,获得积分10
24秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Development Across Adulthood 800
Chemistry and Physics of Carbon Volume 18 800
The Organometallic Chemistry of the Transition Metals 800
The formation of Australian attitudes towards China, 1918-1941 640
Signals, Systems, and Signal Processing 610
天津市智库成果选编 600
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6444891
求助须知:如何正确求助?哪些是违规求助? 8258720
关于积分的说明 17592459
捐赠科研通 5504695
什么是DOI,文献DOI怎么找? 2901611
邀请新用户注册赠送积分活动 1878590
关于科研通互助平台的介绍 1718245