Semantic-Oriented Visual Prompt Learning for Diabetic Retinopathy Grading on Fundus Images

计算机科学 分级(工程) 人工智能 眼底(子宫) 自然语言处理 机器学习 医学 放射科 土木工程 工程类
作者
Yuhan Zhang,Xiao Ma,Kun Huang,Mingchao Li,Pheng‐Ann Heng
出处
期刊:IEEE Transactions on Medical Imaging [Institute of Electrical and Electronics Engineers]
卷期号:43 (8): 2960-2969 被引量:10
标识
DOI:10.1109/tmi.2024.3383827
摘要

Diabetic retinopathy (DR) is a serious ocular condition that requires effective monitoring and treatment by ophthalmologists. However, constructing a reliable DR grading model remains a challenging and costly task, heavily reliant on high-quality training sets and adequate hardware resources. In this paper, we investigate the knowledge transferability of large-scale pre-trained models (LPMs) to fundus images based on prompt learning to construct a DR grading model efficiently. Unlike full-tuning which fine-tunes all parameters of LPMs, prompt learning only involves a minimal number of additional learnable parameters while achieving a competitive effect as full-tuning. Inspired by visual prompt tuning, we propose Semantic-oriented Visual Prompt Learning (SVPL) to enhance the semantic perception ability for better extracting task-specific knowledge from LPMs, without any additional annotations. Specifically, SVPL assigns a group of learnable prompts for each DR level to fit the complex pathological manifestations and then aligns each prompt group to task-specific semantic space via a contrastive group alignment (CGA) module. We also propose a plug-and-play adapter module, Hierarchical Semantic Delivery (HSD), which allows the semantic transition of prompt groups from shallow to deep layers to facilitate efficient knowledge mining and model convergence. Our extensive experiments on three public DR grading datasets demonstrate that SVPL achieves superior results compared to other transfer tuning and DR grading methods. Further analysis suggests that the generalized knowledge from LPMs is advantageous for constructing the DR grading model on fundus images.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
chiweiyoung完成签到,获得积分10
刚刚
wyg_gzed完成签到,获得积分10
刚刚
李静完成签到,获得积分10
1秒前
1秒前
实验大牛完成签到,获得积分10
1秒前
Derek0203完成签到,获得积分10
1秒前
就吃一小口完成签到 ,获得积分10
2秒前
RYYYYYYY233完成签到 ,获得积分10
3秒前
3秒前
WuCola完成签到 ,获得积分10
3秒前
ZYP发布了新的文献求助10
3秒前
脑袋空空完成签到,获得积分10
3秒前
chunagtr发布了新的文献求助10
4秒前
stan完成签到,获得积分10
4秒前
Changlu完成签到,获得积分10
4秒前
4秒前
酷炫小馒头完成签到,获得积分10
5秒前
龙1完成签到,获得积分10
5秒前
5秒前
Allen完成签到,获得积分10
5秒前
123完成签到 ,获得积分10
6秒前
6秒前
hlt完成签到 ,获得积分10
7秒前
chen完成签到,获得积分10
7秒前
孙长秀完成签到,获得积分20
7秒前
beautiful540完成签到,获得积分10
8秒前
8秒前
8秒前
爱笑的蘑菇完成签到,获得积分10
8秒前
41完成签到,获得积分10
9秒前
任伟超完成签到,获得积分10
9秒前
程雪完成签到,获得积分10
9秒前
9秒前
天天开心完成签到,获得积分10
9秒前
10秒前
高挑的葵阴完成签到,获得积分10
10秒前
10秒前
买瓜吗发布了新的文献求助10
10秒前
TYK应助ZYP采纳,获得10
10秒前
南宫士晋完成签到 ,获得积分10
10秒前
高分求助中
Adhesion Science: Principles & Practice 1234
Signals, Systems, and Signal Processing 610
Burger's Medicinal Chemistry and Drug Discovery 400
A Step-by-Step Guide to Qualitative Data Coding 2nd Edition 400
Impact of Storage Orientation and Duration on Prefilled Syringe Performance: Break-Loose and Glide Forces, and Injection Time Across Multiple Time Points 360
Programming for Chemical Engineers Using C, C++, and MATLAB 300
Upland Kenya wild flowers and ferns: a flora of the flowers, ferns, grasses, and sedges of highland Kenya 300
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6664602
求助须知:如何正确求助?哪些是违规求助? 8414341
关于积分的说明 17986794
捐赠科研通 5869877
什么是DOI,文献DOI怎么找? 2975520
邀请新用户注册赠送积分活动 1951399
关于科研通互助平台的介绍 1877945