计算机科学
强化学习
网络拥塞
带宽(计算)
循环神经网络
人工神经网络
数据包丢失
部分可观测马尔可夫决策过程
钥匙(锁)
人工智能
网络数据包
带宽分配
计算机网络
机器学习
实时计算
分布式计算
马尔可夫链
马尔可夫模型
计算机安全
作者
Jingshun Du,Chaokun Zhang,Shen He,Wenyu Qu
标识
DOI:10.1109/iscc58397.2023.10218019
摘要
In recent years, Real-Time Communication (RTC) has been widely used in many scenarios, and Congestion Control (CC) is one of the important ways to improve the experience of such applications. Accurate bandwidth prediction is the key to CC schemes. However, designing an efficient congestion control scheme with accurate bandwidth prediction is challenging, largely because it is essentially a Partially Observable MDP (POMDP) problem, making it difficult to use traditional hand-crafted methods to solve. We propose a novel hybrid CC scheme LRCC, which combines attention-based Long Short-Term Memory (LSTM) and Reinforcement Learning (RL), realizing more accurate bandwidth prediction and congestion control by adding bandwidth memory information provided by the recurrent neural network to the RL decision-making process. Trace-driven experiments show that our proposed method can significantly reduce packet loss and improve bandwidth utilization in various network scenarios, outperforming baseline methods on overall QoE.
科研通智能强力驱动
Strongly Powered by AbleSci AI