Reinforcement Learning-based control using Q-learning and gravitational search algorithm with experimental validation on a nonlinear servo system

强化学习 计算机科学 初始化 人工神经网络 超调(微波通信) 粒子群优化 沉降时间 算法 控制器(灌溉) 非线性系统 适应度函数 人工智能 控制理论(社会学) 机器学习 阶跃响应 遗传算法 控制(管理) 控制工程 工程类 物理 生物 程序设计语言 电信 量子力学 农学
作者
Iuliu Alexandru Zamfirache,Radu Precup,Raul‐Cristian Roman,Emil M. Petriu
出处
期刊:Information Sciences [Elsevier BV]
卷期号:583: 99-120 被引量:89
标识
DOI:10.1016/j.ins.2021.10.070
摘要

This paper presents a novel Reinforcement Learning (RL)-based control approach that uses a combination of a Deep Q-Learning (DQL) algorithm and a metaheuristic Gravitational Search Algorithm (GSA). The GSA is employed to initialize the weights and the biases of the Neural Network (NN) involved in DQL in order to avoid the instability, which is the main drawback of the traditional randomly initialized NNs. The quality of a particular set of weights and biases is measured at each iteration of the GSA-based initialization using a fitness function aiming to achieve the predefined optimal control or learning objective. The data generated during the RL process is used in training a NN-based controller that will be able to autonomously achieve the optimal reference tracking control objective. The proposed approach is compared with other similar techniques which use different algorithms in the initialization step, namely the traditional random algorithm, the Grey Wolf Optimizer algorithm, and the Particle Swarm Optimization algorithm. The NN-based controllers based on each of these techniques are compared using performance indices specific to optimal control as settling time, rise time, peak time, overshoot, and minimum cost function value. Real-time experiments are conducted in order to validate and test the proposed new approach in the framework of the optimal reference tracking control of a nonlinear position servo system. The experimental results show the superiority of this approach versus the other three competing approaches.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
搜集达人应助科研通管家采纳,获得10
刚刚
852应助科研通管家采纳,获得10
刚刚
香蕉觅云应助科研通管家采纳,获得30
刚刚
Hello应助科研通管家采纳,获得10
1秒前
美丽小甜瓜完成签到,获得积分20
1秒前
CipherSage应助科研通管家采纳,获得10
1秒前
科目三应助科研通管家采纳,获得10
1秒前
Jasper应助科研通管家采纳,获得10
1秒前
1秒前
可可可11完成签到,获得积分10
1秒前
1秒前
曾经的朝雪完成签到,获得积分10
1秒前
1秒前
呜呜呜发布了新的文献求助10
1秒前
星辰大海应助孙伟健采纳,获得10
1秒前
1秒前
文刀发布了新的文献求助10
1秒前
打打应助东方采纳,获得10
2秒前
科研后腿发布了新的文献求助10
3秒前
3秒前
4秒前
随机昵称完成签到,获得积分10
4秒前
pluto应助小熊西采纳,获得10
5秒前
leng发布了新的文献求助30
5秒前
all完成签到,获得积分10
6秒前
6秒前
6秒前
Jasper应助圆圆方方采纳,获得10
7秒前
7秒前
7秒前
杨e完成签到,获得积分10
8秒前
大模型应助健忘的板凳采纳,获得10
8秒前
读研小白完成签到,获得积分10
8秒前
cathyyyyyy发布了新的文献求助30
8秒前
Yang完成签到,获得积分10
8秒前
hanL完成签到,获得积分10
9秒前
陈思思完成签到 ,获得积分10
9秒前
9秒前
10秒前
颜鸿发布了新的文献求助10
10秒前
高分求助中
Inorganic Chemistry Eighth Edition 1200
Free parameter models in liquid scintillation counting 1000
Standards for Molecular Testing for Red Cell, Platelet, and Neutrophil Antigens, 7th edition 1000
HANDBOOK OF CHEMISTRY AND PHYSICS 106th edition 1000
ASPEN Adult Nutrition Support Core Curriculum, Fourth Edition 1000
The Psychological Quest for Meaning 800
Signals, Systems, and Signal Processing 610
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6303852
求助须知:如何正确求助?哪些是违规求助? 8120487
关于积分的说明 17006797
捐赠科研通 5363537
什么是DOI,文献DOI怎么找? 2848597
邀请新用户注册赠送积分活动 1826072
关于科研通互助平台的介绍 1679863