Keywords
Quantization (signal processing), Algorithm, Mathematics, Rounding, Ternary operation, Computer science, Discrete mathematics, Arithmetic, Operating system, Programming language
Authors
Weixiang Xu, Fanrong Li, Yingying Jiang, A Yong, Xiangyu He, Peisong Wang, Jian Cheng
Identifier
DOI: 10.1109/TCSVT.2022.3216389
Abstract
Deep neural networks executing with low precision at inference time can gain acceleration and compression advantages over their high-precision counterparts, but must overcome the challenge of accuracy degradation as the bit-width decreases. This work focuses on sub-4-bit quantization, where accuracy degradation is significant. We start with ternarization, a balance between efficiency and accuracy that quantizes both weights and activations into ternary values. We find that the hard threshold $\Delta$ introduced in previous ternary networks for determining quantization intervals, and the suboptimal solution of $\Delta$, limit the performance of the ternary model. To alleviate this, we present Soft Threshold Ternary Networks (STTN), which enables the model to determine ternarized values automatically instead of depending on a hard threshold. Building on this, we further generalize the idea of soft thresholds from ternarization to arbitrary bit-widths, named Soft Threshold Quantized Networks (STQN). We observe that previous quantization relies on the rounding-to-nearest function, constraining the quantization solution space and leading to significant accuracy degradation, especially in low-bit ($\leq 3$-bit) quantization. Instead of relying on the traditional rounding-to-nearest function, STQN determines quantization intervals adaptively. Accuracy experiments on image classification, object detection, and instance segmentation, as well as efficiency experiments on a field-programmable gate array (FPGA), demonstrate that the proposed framework achieves a prominent trade-off between accuracy and efficiency. Code is available at: https://github.com/WeixiangXu/STTN.
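For context, here is a minimal NumPy sketch of the two baselines the abstract argues against: hard-threshold ternarization (the $\Delta = 0.7 \cdot \mathrm{mean}(|w|)$ heuristic follows earlier Ternary Weight Networks, not this paper's method) and a uniform quantizer that snaps values to the nearest grid point. The function names and the symmetric per-tensor scaling are illustrative assumptions, not the paper's implementation; see the linked repository for the actual STTN/STQN code.

```python
import numpy as np

def ternarize_hard_threshold(w: np.ndarray) -> np.ndarray:
    """Baseline hard-threshold ternarization (Ternary Weight Networks style).

    Weights inside (-delta, delta) map to 0; the rest map to +/- alpha.
    The fixed heuristic delta = 0.7 * mean(|w|) is exactly the kind of
    hand-picked threshold the abstract says limits ternary models.
    """
    delta = 0.7 * np.mean(np.abs(w))           # hard threshold heuristic (assumption)
    mask = np.abs(w) > delta                   # which weights stay nonzero
    alpha = np.abs(w[mask]).mean() if mask.any() else 0.0
    return alpha * np.sign(w) * mask

def quantize_round_to_nearest(x: np.ndarray, bits: int = 3) -> np.ndarray:
    """Baseline uniform quantizer using rounding-to-nearest.

    The quantization intervals are fixed by the scale s; every value snaps
    to its nearest grid point. Per the abstract, STQN instead determines
    the interval boundaries adaptively.
    """
    qmax = 2 ** (bits - 1) - 1                 # e.g. 3 levels per sign for 3-bit
    s = np.max(np.abs(x)) / qmax               # symmetric per-tensor scale (assumption)
    q = np.clip(np.round(x / s), -qmax, qmax)  # round-to-nearest, then clip
    return q * s

# Usage example on random weights.
w = np.random.randn(4, 4).astype(np.float32)
print(ternarize_hard_threshold(w))
print(quantize_round_to_nearest(w, bits=3))
```

In both baselines the interval boundaries are fixed in advance, either by the $\Delta$ heuristic or by the rounding grid; the abstract's stated contribution is to let the network learn where those boundaries fall during training instead.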