STKD: Distilling Knowledge From Synchronous Teaching for Efficient Model Compression

计算机科学 判别式 过程(计算) 编码(集合论) 方案(数学) 机器学习 特征(语言学) 人工智能 数学分析 语言学 哲学 数学 集合(抽象数据类型) 程序设计语言 操作系统
作者
Tongtong Su,Jinsong Zhang,Zou Yu,Gang Wang,Xiaoguang Liu
出处
期刊:IEEE transactions on neural networks and learning systems [Institute of Electrical and Electronics Engineers]
卷期号:34 (12): 10051-10064 被引量:8
标识
DOI:10.1109/tnnls.2022.3164264
摘要

Knowledge distillation (KD) transfers discriminative knowledge from a large and complex model (known as teacher) to a smaller and faster one (known as student). Existing advanced KD methods, limited to fixed feature extraction paradigms that capture teacher's structure knowledge to guide the training of the student, often fail to obtain comprehensive knowledge to the student. Toward this end, in this article, we propose a new approach, synchronous teaching knowledge distillation (STKD), to integrate online teaching and offline teaching for transferring rich and comprehensive knowledge to the student. In the online learning stage, a blockwise unit is designed to distill the intermediate-level knowledge and high-level knowledge, which can achieve bidirectional guidance of the teacher and student networks. Intermediate-level information interaction provides more supervisory information to the student network and is useful to enhance the quality of final predictions. In the offline learning stage, the STKD approach applies a pretrained teacher to further improve the performance and accelerate the training process by providing prior knowledge. Trained simultaneously, the student learns multilevel and comprehensive knowledge by incorporating online teaching and offline teaching, which combines the advantages of different KD strategies through our STKD method. Experimental results on the SVHN, CIFAR-10, CIFAR-100, and ImageNet ILSVRC 2012 real-world datasets show that the proposed method achieves significant performance improvements compared with the state-of-the-art methods, especially with satisfying accuracy and model size. Code for STKD is provided at https://github.com/nanxiaotong/STKD.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
赘婿应助周俊雄采纳,获得10
刚刚
distance完成签到,获得积分10
刚刚
刚刚
大模型应助YY采纳,获得10
1秒前
1秒前
iSeVen完成签到 ,获得积分10
1秒前
刻苦的坤发布了新的文献求助10
2秒前
Xc发布了新的文献求助10
2秒前
2秒前
NexusExplorer应助刘志采纳,获得10
2秒前
大个应助kevin采纳,获得10
2秒前
乐观雪青关注了科研通微信公众号
3秒前
Elm应助畅快的慕灵采纳,获得10
3秒前
0043发布了新的文献求助10
3秒前
4秒前
SCC发布了新的文献求助10
4秒前
4秒前
科研通AI6.4应助Diss采纳,获得10
4秒前
tanXX完成签到,获得积分10
5秒前
5秒前
Aloha发布了新的文献求助10
5秒前
爆米花应助Spike采纳,获得10
5秒前
雨琴发布了新的文献求助10
5秒前
香蕉觅云应助linkyi采纳,获得10
6秒前
灵巧的小笼包完成签到,获得积分10
7秒前
7秒前
7秒前
张昊宇完成签到,获得积分10
7秒前
清秀诗珊发布了新的文献求助10
7秒前
8秒前
rtmatrix完成签到,获得积分10
8秒前
李子青完成签到,获得积分10
8秒前
9秒前
高玉峰发布了新的文献求助10
9秒前
wzgkeyantong发布了新的文献求助10
9秒前
ljj发布了新的文献求助10
10秒前
华仔应助gkkk采纳,获得10
10秒前
leexiaoyang发布了新的文献求助10
10秒前
10秒前
Loong发布了新的文献求助10
11秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Chemistry and Physics of Carbon Volume 18 800
The Organometallic Chemistry of the Transition Metals 800
Leading Academic-Practice Partnerships in Nursing and Healthcare: A Paradigm for Change 800
The formation of Australian attitudes towards China, 1918-1941 640
Signals, Systems, and Signal Processing 610
全相对论原子结构与含时波包动力学的理论研究--清华大学 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6432143
求助须知:如何正确求助?哪些是违规求助? 8247821
关于积分的说明 17541082
捐赠科研通 5489293
什么是DOI,文献DOI怎么找? 2896490
邀请新用户注册赠送积分活动 1873020
关于科研通互助平台的介绍 1713159