发布文献求助

A Configurable Floating-Point Multiple-Precision Processing Element for HPC and AI Converged Computing

计算机科学乘法（音乐）乘数（经济学）吞吐量过程（计算）符号加速算术浮点型算法点（几何）计算机硬件并行计算数学电信几何学组合数学经济无线宏观经济学操作系统

作者

Wei Mao,Kai Li,Quan Cheng,Liuyao Dai,Boyu Li,Xinang Xie,He Li,Longyang Lin,Hao Yu

出处

期刊：IEEE Transactions on Very Large Scale Integration Systems [Institute of Electrical and Electronics Engineers]
日期：2021-12-01 卷期号：30 (2): 213-226 被引量：20

链接

ieee.orgdoi.org

标识

DOI：10.1109/tvlsi.2021.3128435

摘要

There is an emerging need to design configurable accelerators for the high-performance computing (HPC) and artificial intelligence (AI) applications in different precisions. Thus, the floating-point (FP) processing element (PE), which is the key basic unit of the accelerators, is necessary to meet multiple-precision requirements with energy-efficient operations. However, the existing structures by using high-precision-split (HPS) and low-precision-combination (LPC) methods result in low utilization rate of the multiplication array and long multiterm processing period, respectively. In this article, a configurable FP multiple-precision PE design is proposed with the LPC structure. Half precision, single precision, and double precision are supported. The 100% multiplier utilization rate of the multiplication array for all precisions is achieved with improved speed in the comparison and summation process. The proposed design is realized in a 28-nm process with 1.429-GHz clock frequency. Compared with the existing multiple-precision FP methods, the proposed structure achieves 63% and 88% area-saving performance for FP16 and FP32 operations, respectively. The

$4\times $

and

$20\times $

maximum throughput rates are obtained when compared with fixed FP32 and FP64 operations. Compared with the previous multiple-precision PEs, the proposed one achieves the best energy-efficiency performance with 975.13 GFLOPS/W.

求助该文献

最长约 10秒，即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI

我的文献求助列表浏览历史

一分钟了解求助规则 | 捐赠本站 | 历史今天

更新

2025年影响因子查询已上线 (2025-6-18)

更新

PDF的下载单位、IP信息已删除 (2025-6-4)

科研通是完全免费的文献互助平台，具备全网最快的应助速度，最高的求助完成率。对每一个文献求助，科研通都将尽心尽力，给求助人一个满意的交代。

实时播报: 司婷婷完成签到，获得积分10

刚刚; 可爱的函函的应助被虚拟采纳，获得10

刚刚; 斯文败类的应助被时尚战斗机采纳，获得20

1秒前; 赘婿上传了应助文件

1秒前; Xbro发布了新的文献求助10

1秒前; 赘婿上传了应助文件

2秒前; 成功Winy发布了新的文献求助10

3秒前; wanci上传了应助文件

3秒前; 汉堡包的应助被小小铱采纳，获得10

4秒前; 科研通AI5上传了应助文件

4秒前; 科研通AI5上传了应助文件

4秒前; www完成签到，获得积分10

4秒前; 务实羊完成签到，获得积分20

5秒前; Owen上传了应助文件

6秒前; Ni发布了新的文献求助10

6秒前; veblem发布了新的文献求助10

7秒前; 桐桐上传了应助文件

7秒前; 稻穗完成签到，获得积分10

7秒前; 李健上传了应助文件

7秒前; 华仔上传了应助文件

7秒前; Lucas上传了应助文件

8秒前; 清溪浅水XZ发布了新的文献求助10

9秒前; 成功Winy完成签到，获得积分10

10秒前; kkk发布了新的文献求助10

10秒前; 丘比特上传了应助文件

10秒前; 稻穗关注了科研通微信公众号

11秒前; 希望天下0贩的0上传了应助文件

11秒前; 韩立发布了新的文献求助10

12秒前; www发布了新的文献求助30

13秒前; 熬夜小猫发布了新的文献求助10

13秒前; 学术大咖发布了新的文献求助10

13秒前; veblem完成签到，获得积分10

13秒前; cxzdm发布了新的文献求助10

13秒前; ch发布了新的文献求助10

14秒前; 搜集达人上传了应助文件

14秒前; mjm完成签到，获得积分10

14秒前; 科研通AI2S的应助被zj3tears采纳，获得10

14秒前; 秦秦关注了科研通微信公众号

15秒前; Owen上传了应助文件

15秒前; 田様上传了应助文件

16秒前

高分求助中: (禁止应助)【重要！！请各位详细阅读】【科研通的精品贴汇总】 10000; International Code of Nomenclature for algae, fungi, and plants (Madrid Code) (Regnum Vegetabile) 1500; Linear and Nonlinear Functional Analysis with Applications, Second Edition 1200; Stereoelectronic Effects 1000; Robot-supported joining of reinforcement textiles with one-sided sewing heads 860; Nanosuspensions 500; Византийско-аланские отно- шения (VI–XII вв.) 500

热门求助领域（近24小时）

热门帖子: 关注科研通微信公众号，转发送积分 4194617; 求助须知：如何正确求助？哪些是违规求助？ 3730307; 关于积分的说明 11749255; 捐赠科研通 3405398; 什么是DOI，文献DOI怎么找？ 1868386; 邀请新用户注册赠送积分活动 924582; 科研通“疑难数据库（出版商）”最低求助积分说明 835466

今日热心研友

飲啖茶食個包

快乐的胖子

注：热心度 = 本日应助数 + 本日被采纳获取积分÷10

Copyright © 2020-2025 AbleSci.COM, 科研通, All Right Reserved

科研通是非营利科研互助平台，不忘初心，为科研助力

本站互助的所有文件仅供个人学习研究用，禁止任何人把求助的所得文献进行盈利或传播

皖ICP备2024041134号-1

皖公网安备34019202002308

科研通【文献互助QQ群】：如果您有特殊求助，或发布求助超过24小时未得到应助，可加群求助，群号：941272744【点击一键加群】

科研通【志愿服务QQ群】：如果您热爱文献互助，有热心愿意为更多人服务，请加入小伙伴群，点击申请加入

关注微信服务号

科研通