Neural Network Compression Based on Tensor Ring Decomposition

计算复杂性理论 秩(图论) 压缩(物理) 因式分解 张量(固有定义) 人工神经网络 矩阵分解 算法 素数(序理论) 理论计算机科学 数学 计算机科学 人工智能 纯数学 材料科学 复合材料 特征向量 物理 组合数学 量子力学
作者
Kun Xie,Can Liu,Xin Wang,Xiaocan Li,Gaogang Xie,Jigang Wen,Kenli Li
出处
期刊:IEEE transactions on neural networks and learning systems [Institute of Electrical and Electronics Engineers]
卷期号:36 (3): 5388-5402 被引量:10
标识
DOI:10.1109/tnnls.2024.3383392
摘要

Deep neural networks (DNNs) have made great breakthroughs and seen applications in many domains. However, the incomparable accuracy of DNNs is achieved with the cost of considerable memory consumption and high computational complexity, which restricts their deployment on conventional desktops and portable devices. To address this issue, low-rank factorization, which decomposes the neural network parameters into smaller sized matrices or tensors, has emerged as a promising technique for network compression. In this article, we propose leveraging the emerging tensor ring (TR) factorization to compress the neural network. We investigate the impact of both parameter tensor reshaping and TR decomposition (TRD) on the total number of compressed parameters. To achieve the maximal parameter compression, we propose an algorithm based on prime factorization that simultaneously identifies the optimal tensor reshaping and TRD. In addition, we discover that different execution orders of the core tensors result in varying computational complexities. To identify the optimal execution order, we construct a novel tree structure. Based on this structure, we propose a top-to-bottom splitting algorithm to schedule the execution of core tensors, thereby minimizing computational complexity. We have performed extensive experiments using three kinds of neural networks with three different datasets. The experimental results demonstrate that, compared with the three state-of-the-art algorithms for low-rank factorization, our algorithm can achieve better performance with much lower memory consumption and lower computational complexity.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
刚刚
小太阳哈哈完成签到 ,获得积分10
1秒前
CodeCraft应助cherish采纳,获得10
1秒前
2秒前
3秒前
3秒前
cc完成签到,获得积分10
4秒前
Nj发布了新的文献求助10
5秒前
momo完成签到,获得积分10
5秒前
韩喵喵完成签到,获得积分10
5秒前
11发布了新的文献求助10
7秒前
Hupoo发布了新的文献求助10
7秒前
沉默的雪枫应助zzzzw采纳,获得10
8秒前
8秒前
无花果应助真实的书雪采纳,获得10
9秒前
优雅羽毛完成签到 ,获得积分10
9秒前
香蕉觅云应助irie采纳,获得30
10秒前
冬卿留完成签到,获得积分10
10秒前
NexusExplorer应助Nj采纳,获得10
11秒前
迷路的紫完成签到,获得积分10
11秒前
隐形曼青应助飞快的柔采纳,获得10
12秒前
灰太狼完成签到,获得积分10
12秒前
arron完成签到,获得积分10
12秒前
14秒前
15秒前
赘婿应助滕友桃采纳,获得10
15秒前
19秒前
太白君发布了新的文献求助10
21秒前
wddmj发布了新的文献求助10
22秒前
ZZ完成签到,获得积分10
24秒前
qq发布了新的文献求助10
25秒前
bella1201完成签到,获得积分10
25秒前
26秒前
27秒前
29秒前
研友_莫笑旋完成签到,获得积分10
30秒前
嘻嘻哈哈应助张之静采纳,获得10
30秒前
科研木头人完成签到 ,获得积分10
32秒前
许中天发布了新的文献求助10
32秒前
蘸糖冰美式完成签到,获得积分10
33秒前
高分求助中
Clinical Epidemiology: The Essentials, 6e 10000
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
The Graphene Handbook (2019 Edition) 800
Adhesion Science: Principles & Practice 800
Signals, Systems, and Signal Processing 610
Fundamentals of Pharmaceutical and Biologics Regulations: A Global Perspective, Second Edition 600
The Immune System (Fifth Edition) 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6559314
求助须知:如何正确求助?哪些是违规求助? 8342244
关于积分的说明 17873854
捐赠科研通 5679446
什么是DOI,文献DOI怎么找? 2941357
邀请新用户注册赠送积分活动 1917206
关于科研通互助平台的介绍 1789072