计算机科学
马尔可夫决策过程
调度(生产过程)
概率逻辑
马尔可夫过程
强化学习
分布式计算
马尔可夫链
动态优先级调度
实时计算
数学优化
运筹学
计算机网络
服务质量
人工智能
机器学习
统计
工程类
数学
作者
Surya K. Murthy,Natasha Neogi,Suda Bharadwaj
出处
期刊:Cornell University - arXiv
日期:2022-09-26
卷期号:371: 86-102
摘要
This work considers the scheduling problem for Urban Air Mobility (UAM) vehicles travelling between origin-destination pairs with both hard and soft trip deadlines. Each route is described by a discrete probability distribution over trip completion times (or delay) and over inter-arrival times of requests (or demand) for the route along with a fixed hard or soft deadline. Soft deadlines carry a cost that is incurred when the deadline is missed. An online, safe scheduler is developed that ensures that hard deadlines are never missed, and that average cost of missing soft deadlines is minimized. The system is modelled as a Markov Decision Process (MDP) and safe model-based learning is used to find the probabilistic distributions over route delays and demand. Monte Carlo Tree Search (MCTS) Earliest Deadline First (EDF) is used to safely explore the learned models in an online fashion and develop a near-optimal non-preemptive scheduling policy. These results are compared with Value Iteration (VI) and MCTS (Random) scheduling solutions.
科研通智能强力驱动
Strongly Powered by AbleSci AI