Reinforcement learning for continuous-time mean-variance portfolio selection in a regime-switching market

强化学习 贝尔曼方程 不可见的 文件夹 数学优化 随机控制 最优控制 数学 计算机科学 经济 计量经济学 人工智能 财务
作者
Bo Wu,Lingfei Li
出处
期刊:Journal of Economic Dynamics and Control [Elsevier BV]
卷期号:158: 104787-104787
标识
DOI:10.1016/j.jedc.2023.104787
摘要

We propose a reinforcement learning (RL) approach to solve the continuous-time mean-variance portfolio selection problem in a regime-switching market, where the market regime is unobservable. To encourage exploration for learning, we formulate an exploratory stochastic control problem with an entropy-regularized mean-variance objective. We obtain semi-analytical representations of the optimal value function and optimal policy, which involve unknown solutions to two linear parabolic partial differential equations (PDEs). We utilize these representations to parametrize the value function and policy for learning with the unknown solutions to the PDEs approximated based on polynomials. We develop an actor-critic RL algorithm to learn the optimal policy through interactions with the market environment. The algorithm carries out filtering to obtain the belief probability of the market regime and performs policy evaluation and policy gradient updates alternately. Empirical results demonstrate the advantages of our RL algorithm in relatively long-term investment problems over the classical control approach and an RL algorithm developed for the continuous-time mean-variance problem without considering regime switches.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
Owen应助睡教早祈两年半采纳,获得10
刚刚
研友_n0DG7n完成签到,获得积分20
1秒前
1秒前
1秒前
可爱的函函应助哈哈哈采纳,获得10
1秒前
XiangsWei应助hrz采纳,获得10
2秒前
3秒前
4秒前
sxs发布了新的文献求助10
4秒前
888发布了新的文献求助30
4秒前
Oliver完成签到,获得积分10
4秒前
4秒前
研友_n0DG7n发布了新的文献求助10
4秒前
盛龙发布了新的文献求助10
4秒前
7z发布了新的文献求助10
4秒前
shao完成签到,获得积分10
5秒前
zzz关闭了zzz文献求助
5秒前
molihuakai应助rowam采纳,获得10
5秒前
杨佳虹完成签到,获得积分10
6秒前
6秒前
7秒前
OR发布了新的文献求助10
7秒前
7秒前
脑洞疼应助Fxy采纳,获得10
7秒前
cc发布了新的文献求助10
7秒前
7秒前
古地无明完成签到,获得积分10
7秒前
sherlin完成签到,获得积分20
8秒前
Baraka完成签到,获得积分10
8秒前
8秒前
spark发布了新的文献求助10
8秒前
jeery发布了新的文献求助10
8秒前
liu完成签到,获得积分10
9秒前
盐好甜完成签到,获得积分10
9秒前
9秒前
zhouyunan完成签到,获得积分10
9秒前
纸飞机完成签到,获得积分10
10秒前
10秒前
小希完成签到,获得积分10
10秒前
10秒前
高分求助中
Adhesion Science: Principles & Practice 1234
Signals, Systems, and Signal Processing 610
Fundamentals of Pharmaceutical and Biologics Regulations: A Global Perspective, Second Edition 600
The Resilient Mindset 400
Impact of Storage Orientation and Duration on Prefilled Syringe Performance: Break-Loose and Glide Forces, and Injection Time Across Multiple Time Points 360
Programming for Chemical Engineers Using C, C++, and MATLAB 300
Upland Kenya wild flowers and ferns: a flora of the flowers, ferns, grasses, and sedges of highland Kenya 300
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6647084
求助须知:如何正确求助?哪些是违规求助? 8402840
关于积分的说明 17967268
捐赠科研通 5839755
什么是DOI,文献DOI怎么找? 2969962
邀请新用户注册赠送积分活动 1945150
关于科研通互助平台的介绍 1864065