Continual Self-supervised Learning: Towards Universal Multi-modal Medical Data Representation Learning

计算机科学 人工智能 模态(人机交互) 机器学习 模式 特征学习 情态动词 监督学习 人工神经网络 社会科学 社会学 化学 高分子化学
作者
Yiwen Ye,Yutong Xie,Jianpeng Zhang,Ziyang Chen,Qi Wu,Yong Xia
出处
期刊:Cornell University - arXiv 被引量:1
标识
DOI:10.48550/arxiv.2311.17597
摘要

Self-supervised learning is an efficient pre-training method for medical image analysis. However, current research is mostly confined to specific-modality data pre-training, consuming considerable time and resources without achieving universality across different modalities. A straightforward solution is combining all modality data for joint self-supervised pre-training, which poses practical challenges. Firstly, our experiments reveal conflicts in representation learning as the number of modalities increases. Secondly, multi-modal data collected in advance cannot cover all real-world scenarios. In this paper, we reconsider versatile self-supervised learning from the perspective of continual learning and propose MedCoSS, a continuous self-supervised learning approach for multi-modal medical data. Unlike joint self-supervised learning, MedCoSS assigns different modality data to different training stages, forming a multi-stage pre-training process. To balance modal conflicts and prevent catastrophic forgetting, we propose a rehearsal-based continual learning method. We introduce the k-means sampling strategy to retain data from previous modalities and rehearse it when learning new modalities. Instead of executing the pretext task on buffer data, a feature distillation strategy and an intra-modal mixup strategy are applied to these data for knowledge retention. We conduct continuous self-supervised pre-training on a large-scale multi-modal unlabeled dataset, including clinical reports, X-rays, CT scans, MRI scans, and pathological images. Experimental results demonstrate MedCoSS's exceptional generalization ability across nine downstream datasets and its significant scalability in integrating new modality data. Code and pre-trained weight are available at https://github.com/yeerwen/MedCoSS.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
阿怜完成签到 ,获得积分10
1秒前
独特的豌豆完成签到,获得积分10
2秒前
shichasss完成签到,获得积分10
2秒前
amber完成签到,获得积分10
2秒前
林烯完成签到,获得积分10
3秒前
霸王爱吃面完成签到,获得积分10
3秒前
Xieyusen发布了新的文献求助10
3秒前
今天也不想搬砖完成签到,获得积分10
3秒前
Aries发布了新的文献求助10
3秒前
xxxqf520完成签到,获得积分10
4秒前
吴灵完成签到,获得积分10
4秒前
L3完成签到,获得积分10
4秒前
...完成签到,获得积分10
4秒前
11完成签到,获得积分10
4秒前
siraotianya完成签到,获得积分10
4秒前
5秒前
美满的珠完成签到 ,获得积分20
5秒前
亲亲完成签到,获得积分10
6秒前
人工智能小配方完成签到,获得积分10
6秒前
李离子完成签到,获得积分20
7秒前
阿豪完成签到,获得积分10
7秒前
川荣李奈完成签到,获得积分10
7秒前
甜心猪面完成签到,获得积分10
7秒前
7秒前
2385697574完成签到,获得积分10
8秒前
8秒前
Lifel完成签到 ,获得积分10
8秒前
8秒前
楷.发布了新的文献求助30
8秒前
XuNan完成签到,获得积分10
9秒前
CX完成签到,获得积分10
9秒前
biu完成签到,获得积分10
10秒前
csx发布了新的文献求助10
10秒前
10秒前
端庄千山完成签到 ,获得积分10
10秒前
孤独丹秋完成签到,获得积分10
10秒前
QIU完成签到 ,获得积分10
11秒前
1526完成签到,获得积分10
11秒前
Jolin完成签到,获得积分10
12秒前
无可无不可完成签到,获得积分10
12秒前
高分求助中
Malcolm Fraser : a biography 680
Signals, Systems, and Signal Processing 610
天津市智库成果选编 600
Climate change and sports: Statistics report on climate change and sports 500
Forced degradation and stability indicating LC method for Letrozole: A stress testing guide 500
Organic Reactions Volume 118 400
A Foreign Missionary on the Long March: The Unpublished Memoirs of Arnolis Hayman of the China Inland Mission 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6459386
求助须知:如何正确求助?哪些是违规求助? 8268465
关于积分的说明 17622373
捐赠科研通 5528716
什么是DOI,文献DOI怎么找? 2905930
邀请新用户注册赠送积分活动 1882667
关于科研通互助平台的介绍 1727870