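Computer science
InfiniBand
Computer network
Network packet
Lossless compression
Queueing theory
Flow control (data)
Telecommunications link
Queueing
Ethernet
Remote direct memory access
Bandwidth (computing)
Real-time computing
Data compression
Computer vision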
Authors
Peiwen Yu,Fei Xue,Chen Tian,Xiaoliang Wang,Yanqing Chen,Tom Wu,Han Liu,Zhiyong Han,Bingquan Wang,Xiangyu Gong,Wanchun Dou,Guihai Chen
Identifier
DOI:10.1109/icnp59255.2023.10355634
Abstract
RDMA over Converged Ethernet (RoCEv2) has been widely deployed in data centers (DCs) because it is more compatible with Ethernet/IP than InfiniBand (IB). As cross-DC applications emerge, they also demand high throughput, low latency, and a lossless network for cross-DC data transmission. However, RoCEv2's underlying lossless mechanism, Priority-based Flow Control (PFC), does not fit the long-haul transmission scenario and degrades the performance of RoCEv2. PFC is myopic: it considers only queue length when pausing upstream senders, which leads to large queueing delays. This paper proposes Bifrost, a downstream-driven lossless flow control that supports long-distance cross-DC data transmission. Bifrost uses virtual incoming packets, which indicate the upper bound of in-flight packets, together with buffered packets to control the flow rate. It minimizes the buffer space requirement to one-hop bandwidth-delay product (BDP) and achieves low one-way latency. Real-world experiments are conducted with prototype switches and 80-kilometer cables. Evaluations demonstrate that, compared to PFC, Bifrost reduces the average and tail flow completion time (FCT) of inter-DC flows by up to 22.5% and 42.0%, respectively. Bifrost is compatible with existing infrastructure and can support distances of thousands of kilometers.
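To give a sense of scale for the one-hop BDP mentioned in the abstract, the following is a minimal back-of-the-envelope sketch, not taken from the paper. It assumes a 100 Gbps link (the abstract does not state the link rate) and a signal propagation speed in fiber of roughly 200,000 km/s. Whether the paper defines one-hop BDP over the one-way delay or the round-trip time is not stated in the abstract, so both are computed. The function names are hypothetical.

```python
# Back-of-the-envelope BDP estimate for a long-haul hop.
# Assumptions (not from the abstract): 100 Gbps link rate and
# ~200,000 km/s propagation speed in optical fiber.

FIBER_KM_PER_S = 200_000.0  # assumed propagation speed in fiber


def one_way_delay_s(distance_km: float, km_per_s: float = FIBER_KM_PER_S) -> float:
    """Propagation delay in seconds for one direction of the link."""
    return distance_km / km_per_s


def bdp_bytes(link_gbps: float, delay_s: float) -> float:
    """Bandwidth-delay product in bytes: link rate (bits/s) * delay (s) / 8."""
    return link_gbps * 1e9 * delay_s / 8


if __name__ == "__main__":
    distance_km = 80.0   # cable length used in the paper's experiments
    link_gbps = 100.0    # assumed link rate

    delay = one_way_delay_s(distance_km)
    print(f"one-way delay: {delay * 1e6:.0f} us")
    print(f"BDP over one-way delay: {bdp_bytes(link_gbps, delay) / 1e6:.1f} MB")
    print(f"BDP over round trip:    {bdp_bytes(link_gbps, 2 * delay) / 1e6:.1f} MB")
```

Under these assumptions, an 80 km hop holds on the order of megabytes of data in flight, which is why buffer requirements that scale with the delay matter for long-haul lossless transmission.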