强化学习
计算机科学
运动规划
稳健性(进化)
人工智能
工作区
地形
机器学习
反事实思维
钥匙(锁)
实时计算
机器人
分布式计算
生态学
生物化学
化学
哲学
计算机安全
认识论
生物
基因
作者
Jonas Westheider,Julius Rückin,Marija Popović
标识
DOI:10.1109/iros55552.2023.10342516
摘要
Efficient aerial data collection is important in many remote sensing applications. In large-scale monitoring scenarios, deploying a team of unmanned aerial vehicles (UAVs) offers improved spatial coverage and robustness against individual failures. However, a key challenge is cooperative path planning for the UAVs to efficiently achieve a joint mission goal. We propose a novel multi-agent informative path planning approach based on deep reinforcement learning for adaptive terrain monitoring scenarios using UAV teams. We introduce new network feature representations to effectively learn path planning in a 3D workspace. By leveraging a counterfactual baseline, our approach explicitly addresses credit assignment to learn cooperative behaviour. Our experimental evaluation shows improved planning performance, i.e. maps regions of interest more quickly, with respect to non-counterfactual variants. Results on synthetic and real-world data show that our approach has superior performance compared to state-of-the-art non-learning-based methods, while being transferable to varying team sizes and communication constraints.
科研通智能强力驱动
Strongly Powered by AbleSci AI