计算机科学
交错
平行性(语法)
延迟(音频)
推论
任务并行性
数据并行性
并行计算
操作员(生物学)
吞吐量
困境
分布式计算
无线
操作系统
人工智能
电信
哲学
化学
抑制因子
认识论
基因
转录因子
生物化学
作者
Jiangsu Du,Jinhui Wei,Jiazhi Jiang,Shenggan Cheng,Dan Huang,Zhiguang Chen,Yutong Lu
标识
DOI:10.1145/3627535.3638466
摘要
Distributed large model inference is still in a dilemma where balancing cost and effect. The online scenarios demand intraoperator parallelism to achieve low latency and intensive communications makes it costly. Conversely, the inter-operator parallelism can achieve high throughput with much fewer communications, but it fails to enhance the effectiveness.
科研通智能强力驱动
Strongly Powered by AbleSci AI