强化学习
文件夹
差异(会计)
背景(考古学)
计算机科学
选择(遗传算法)
一般化
数学优化
现代投资组合理论
高斯分布
随机控制
最优控制
数学
人工智能
经济
金融经济学
物理
会计
数学分析
古生物学
生物
量子力学
标识
DOI:10.1109/ccdc52312.2021.9602795
摘要
This article studies a continuous-time mean-variance (MV) portfolio selection problem with reinforcement learning (RL). The MV problem is described by an exploratory, relaxed stochastic control problem, which is first proposed and developed by Wang et al. (2020) and Wang and Zhou (2020). Different from Wang and Zhou (2020), we minimize the variance and maximize the expected terminal return at the same time in the context of reinforcement learning, which is a generalization of Wang and Zhou (2020). By constructing an auxiliary problem, the optimal feedback control (or “policy” or “law”) for our problem is obtained, which is Gaussian with time-decaying variance.
科研通智能强力驱动
Strongly Powered by AbleSci AI