亲爱的研友该休息了!由于当前在线用户较少,发布求助请尽量完整地填写文献信息,科研通机器人24小时在线,伴您度过漫漫科研夜!身体可是革命的本钱,早点休息,好梦!

Training effective deep reinforcement learning agents for real-time life-cycle production optimization

强化学习 马尔可夫决策过程 数学优化 计算机科学 增强学习 时间范围 最优控制 生产(经济) 贝尔曼方程 动态规划 任务(项目管理) 人工智能 马尔可夫过程 工程类 数学 统计 宏观经济学 经济 系统工程
作者
Kai Zhang,Zhongzheng Wang,Guodong Chen,Liming Zhang,Yongfei Yang,Chuanjin Yao,Jian Wang,Jun Yao
出处
期刊:Journal of Petroleum Science and Engineering [Elsevier]
卷期号:208: 109766-109766 被引量:143
标识
DOI:10.1016/j.petrol.2021.109766
摘要

Life-cycle production optimization aims to obtain the optimal well control scheme at each time control step to maximize financial profit and hydrocarbon production. However, searching for the optimal policy under the limited number of simulation evaluations is a challenging task. In this paper, a novel production optimization method is presented, which maximizes the net present value (NPV) over the entire life-cycle and achieves real-time well control scheme adjustment. The proposed method models the life-cycle production optimization problem as a finite-horizon Markov decision process (MDP), where the well control scheme can be viewed as sequence decisions. Soft actor-critic, known as the state-of-the-art model-free deep reinforcement learning (DRL) algorithm, is subsequently utilized to train DRL agents that can solve the above MDP. The DRL agent strives to maximize long-term NPV rewards as well as the control scheme randomness by training a stochastic policy that maps reservoir states to well control variables and an action-value function that estimates the objective value of the current policy. Since the trained policy is an explicit function structure, the DRL agent can adjust the well control scheme in real-time under different reservoir states. Different from most existing methods that introduce task-specific sensitive parameters or construct complex supplementary structures, the DRL agent learns adaptively by executing goal-directed interactions with an uncertain reservoir environment and making use of accumulated well control experience, which is similar to the actual field well control mode. The key insight here is that the DRL method's ability to utilize gradients information (well-control experience) for higher sample efficiency. The simulation results based on two reservoir models indicate that compared to other optimization methods, the proposed method can attain higher NPV and access excellent performance in terms of oil displacement. • A novel production optimization framework that incorporating advanced deep reinforcement leaning technologies is presented. • The proposed method models the life-cycle production optimization problem as a finite-horizon Markov decision process. • The trained policy is an explicit function structure that utilizing powerful gradient information for higher sample efficiency. • The proposed method achieves excellent performance on one classic control task and two reservoir models.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
余人关注了科研通微信公众号
3秒前
5秒前
小白加油完成签到 ,获得积分10
7秒前
13秒前
风茠住发布了新的文献求助10
21秒前
余人完成签到,获得积分10
30秒前
科研通AI6.3应助风茠住采纳,获得10
31秒前
40秒前
42秒前
糊涂的彬彬完成签到,获得积分10
52秒前
张贵虎完成签到 ,获得积分10
54秒前
56秒前
cy完成签到 ,获得积分10
57秒前
云瑾发布了新的文献求助10
1分钟前
鸽子发布了新的文献求助10
1分钟前
1分钟前
rengar完成签到,获得积分10
1分钟前
尊敬彩虹发布了新的文献求助10
1分钟前
1分钟前
ajing完成签到,获得积分10
1分钟前
在水一方应助科研通管家采纳,获得10
1分钟前
赘婿应助科研通管家采纳,获得10
1分钟前
在水一方应助科研通管家采纳,获得10
1分钟前
赘婿应助科研通管家采纳,获得10
1分钟前
1分钟前
1分钟前
岳莹晓发布了新的文献求助10
2分钟前
香蕉觅云应助烂漫春天采纳,获得10
2分钟前
Hansheng完成签到,获得积分10
2分钟前
2分钟前
957完成签到 ,获得积分10
2分钟前
xiaolizi发布了新的文献求助10
2分钟前
orixero应助庾稀采纳,获得10
2分钟前
充电宝应助xiaolizi采纳,获得10
2分钟前
风茠住发布了新的文献求助10
2分钟前
科研通AI6.2应助风茠住采纳,获得10
2分钟前
研友_5Y9775发布了新的文献求助10
2分钟前
2分钟前
研友_5Y9775完成签到,获得积分10
2分钟前
阮小小完成签到 ,获得积分10
2分钟前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Molecular Biology of Cancer: Mechanisms, Targets, and Therapeutics 3000
Kinesiophobia : a new view of chronic pain behavior 3000
Les Mantodea de guyane 2500
CCRN 的官方教材 《AACN Core Curriculum for High Acuity, Progressive, and Critical Care Nursing》第8版 1000
《Marino's The ICU Book》第五版,电子书 1000
Feldspar inclusion dating of ceramics and burnt stones 1000
热门求助领域 (近24小时)
化学 材料科学 生物 医学 工程类 计算机科学 有机化学 物理 生物化学 纳米技术 复合材料 内科学 化学工程 人工智能 催化作用 遗传学 数学 基因 量子力学 物理化学
热门帖子
关注 科研通微信公众号,转发送积分 5965984
求助须知:如何正确求助?哪些是违规求助? 7243921
关于积分的说明 15974124
捐赠科研通 5102651
什么是DOI,文献DOI怎么找? 2741064
邀请新用户注册赠送积分活动 1704740
关于科研通互助平台的介绍 1620117