Reinforcement learning for batch process control: Review and perspectives

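Reinforcement learning, Process (computing), Control (management), Computer science, Model predictive control, Optimal control, Process control, Batch processing, Control engineering, Industrial engineering, Artificial intelligence, Engineering, Mathematical optimization, Mathematics, Operating system, Programming language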
Authors
Haeun Yoo, Ha-Eun Byun, Dongho Han, Jay H. Lee
Source
Journal: Annual Reviews in Control [Elsevier BV]
Volume/issue: 52: 108-119 Citations: 73
Identifier
DOI:10.1016/j.arcontrol.2021.10.006
Abstract

Batch or semi-batch processing is becoming more prevalent in industrial chemical manufacturing, but it has not benefited from advanced control technologies to the same degree as continuous processing. This is due to several unique aspects that pose challenges to implementing model-based optimal control, such as its highly nonstationary operation and significant run-to-run variabilities. While existing advanced control methods like model predictive control (MPC) have been extended to address some of the challenges, they still suffer from certain limitations which have prevented their widespread industrial adoption. Reinforcement learning (RL), where the agent learns the optimal policy by interacting with the system, offers an alternative to the existing model-based methods and has potential for bringing significant improvements to industrial batch process control practice. With such motivation, this paper examines the advantages that RL offers over the traditional model-based optimal control methods and how it can be tailored to better address the characteristics of industrial batch process control problems. After a brief review of the existing batch control methods, the basic concepts and algorithms of RL are introduced and issues in applying them to batch process control problems are discussed. The nascent literature on the use of RL in batch process control is briefly reviewed, both in recipe optimization and tracking control, and our perspectives on future research directions are shared.