推论                        
                
                                
                        
                            计算机科学                        
                
                                
                        
                            强化学习                        
                
                                
                        
                            云计算                        
                
                                
                        
                            人工智能                        
                
                                
                        
                            水准点(测量)                        
                
                                
                        
                            资源配置                        
                
                                
                        
                            机器学习                        
                
                                
                        
                            资源管理(计算)                        
                
                                
                        
                            深度学习                        
                
                                
                        
                            整数规划                        
                
                                
                        
                            边缘计算                        
                
                                
                        
                            边缘设备                        
                
                                
                        
                            马尔可夫决策过程                        
                
                                
                        
                            GSM演进的增强数据速率                        
                
                                
                        
                            分布式计算                        
                
                                
                        
                            马尔可夫过程                        
                
                                
                        
                            计算机网络                        
                
                                
                        
                            算法                        
                
                                
                        
                            操作系统                        
                
                                
                        
                            地理                        
                
                                
                        
                            统计                        
                
                                
                        
                            数学                        
                
                                
                        
                            大地测量学                        
                
                        
                    
            作者
            
                Weiting Zhang,Dong Yang,Haixia Peng,Wen Wu,Wei Quan,Hongke Zhang,Xuemin Shen            
         
                    
            出处
            
                                    期刊:IEEE Transactions on Vehicular Technology
                                                         [Institute of Electrical and Electronics Engineers]
                                                        日期:2021-03-23
                                                        卷期号:70 (8): 7605-7618
                                                        被引量:112
                                
         
        
    
            
            标识
            
                                    DOI:10.1109/tvt.2021.3068255
                                    
                                
                                 
         
        
                
            摘要
            
            Performing deep neural network (DNN) inference in real time requires excessive network resources, which poses a big challenge to the resource-limited industrial Internet of things (IIoT) networks. To address the challenge, in this paper, we introduce an end-edge-cloud orchestration architecture, in which the inference task assignment and DNN model placement are flexibly coordinated. Specifically, the DNN models, trained and pre-stored in the cloud, are properly placed at the end and edge to perform DNN inference. To achieve efficient DNN inference, a multi-dimensional resource management problem is formulated to maximize the average inference accuracy while satisfying the strict delay requirements of inference tasks. Due to the mix-integer decision variables, it is difficult to solve the formulated problem directly. Thus, we transform the formulated problem into a Markov decision process which can be solved efficiently. Furthermore, a deep reinforcement learning based resource management scheme is proposed to make real-time optimal resource allocation decisions. Simulation results are provided to demonstrate that the proposed scheme can efficiently allocate the available spectrum, caching, and computing resources, and improve average inference accuracy by 31.4$\%$ compared with the deep deterministic policy gradient benchmark.
         
            
 
                 
                
                    
                    科研通智能强力驱动
Strongly Powered by AbleSci AI