会合
强化学习
航天器
钢筋
计算机科学
人工智能
自主学习
航空航天工程
工程类
控制工程
控制理论(社会学)
数学
控制(管理)
结构工程
数学教育
作者
Cheng Huang,Aijia Xing,Quanli Zeng,Fangyu Xiong
摘要
Abstract In this paper, an autonomous decision‐making method for close‐range rendezvous and approaching to noncooperative target with strong maneuvering is proposed based on the distributed distributional deep determined policy gradient (D4PG), and ground simulation system of rendezvous and approaching is constructed by the improved YOLOv5s and universal robotic arm to verify the effectiveness and feasibility. The contributions of this paper are as follows: (1) an improved nearest neighbor exploration mechanism including random constant value and logarithmic constant value is proposed to stabilize the convergence for the decision‐making under the uncertainty; (2) a discrete wavelet transform (DWT)‐based preprocessing, a fusion of generalized additive model (GAM), and an optimization scheme are proposed for the dark background and false detection of black objects; and (3) a comparison analysis of simulation result between the traditional noise and proposed state‐based exploration is presented for the superiority and some robot‐based ground experiments are done to verify feasibility of close‐range space rendezvous and approaching.
科研通智能强力驱动
Strongly Powered by AbleSci AI