计算机科学
任务(项目管理)
强化学习
过程(计算)
弹道
功能(生物学)
适应性
任务分析
理论(学习稳定性)
马尔可夫决策过程
吞吐量
分布式计算
实时计算
人工智能
机器学习
工程类
马尔可夫过程
系统工程
天文
数学
无线
物理
操作系统
统计
生态学
生物
进化生物学
电信
摘要
Light and small UAVs have attracted widespread attention due to their good adaptability, low cost, and high temporal resolution. With the continuous development of technology, multi-UAV cooperation has become a research hot spot. The route planning problem of multi-UAV cooperation can be decomposed into two sub-problems: task allocation and route planning. In this paper, a task allocation method based on reinforcement learning is proposed for multi-UAV cooperation. Considering the task requirements, the capabilities of the UAV, the influence of the environment and the conflict of the task, we construct a MDP process include the state space, action space, reward function and discount factor with the constraints and optimization functions. In this paper, the task allocation process is combined with the trajectory planning based on maximizing information throughput, and a large number of simulation tests are carried out to verify the stability of the method.
科研通智能强力驱动
Strongly Powered by AbleSci AI