远程直接内存访问
计算机科学
网络拥塞
英菲尼班德
计算机网络
重传
流量控制(数据)
网络数据包
数据包丢失
水准点(测量)
吞吐量
钥匙(锁)
分布式计算
操作系统
地理
无线
大地测量学
作者
Xiaolong Zhong,Jiao Zhang,Yali Zhang,Zixuan Guan,Zirui Wan
标识
DOI:10.1109/infocom48880.2022.9796803
摘要
The rapid upgrade of link speed and the prosperity of new applications in data center networks (DCNs) lead to a rigorous demand for ultra-low latency and high throughput. To mitigate the overhead of traditional software-based packet processing at end-hosts, RDMA (Remote Direct Memory Access) has been widely adopted in DCNs. Particularly, congestion control (CC) mechanisms designed for RDMA have attracted much attention to avoid performance deterioration when packets lose. However, through comprehensive analysis, we found that existing RDMA CC schemes have limitations of a sluggish response to congestion and unawareness of tiny microbursts due to the long end-to-end control loop. In this paper, we propose PACC, a switch-driven RDMA CC algorithm with easy deployability. PACC is driven by PI controller-based computation, threshold-based flow discrimination and weight-based allocation at the switch. It leverages real-time queue length to generate accurate congestion feedback proactively and piggybacks it to the corresponding source without modification to end-hosts. We theoretically analyze the stability and key parameter settings of PACC. Then, we conduct both micro-benchmark and large-scale simulations to evaluate the performance of PACC. The results show that PACC achieves fairness, fast reaction, high throughput, and 6~69% lower FCT (Flow Completion Time) than DCQCN, TIMELY and HPCC.
科研通智能强力驱动
Strongly Powered by AbleSci AI