发布文献求助

Robust Risk-Aware Reinforcement Learning

强化学习数学优化下行风险计算机科学稳健优化投资组合优化文件夹套利最优化问题人工智能稳健性（进化）数学经济财务化学基因生物化学

作者

Sebastian Jaimungal,Silvana M. Pesenti,Ye Sheng Wang,Hariom Tatsat

出处

期刊：Siam Journal on Financial Mathematics [Society for Industrial and Applied Mathematics]
日期：2022-03-01 卷期号：13 (1): 213-226 被引量：11

链接

arxiv.org arxiv.orgdoi.org

标识

DOI：10.1137/21m144640x

摘要

We present a reinforcement learning (RL) approach for robust optimization of risk-aware performance criteria. To allow agents to express a wide variety of risk-reward profiles, we assess the value of a policy using rank dependent expected utility (RDEU). RDEU allows agents to seek gains, while simultaneously protecting themselves against downside risk. To robustify optimal policies against model uncertainty, we assess a policy not by its distribution but rather by the worst possible distribution that lies within a Wasserstein ball around it. Thus, our problem formulation may be viewed as an actor/agent choosing a policy (the outer problem) and the adversary then acting to worsen the performance of that strategy (the inner problem). We develop explicit policy gradient formulae for the inner and outer problems and show their efficacy on three prototypical financial problems: robust portfolio allocation, benchmark optimization, and statistical arbitrage.

求助该文献

最长约 10秒，即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI

我的文献求助列表浏览历史

一分钟了解求助规则 | 捐赠本站 | 历史今天

活动

『应助活动周』获奖名单已公布 🔥 (2025-4-2)

更新

『中科院2025期刊分区』已更新 (2025-3-23)

更新

『即时热点』模块已上线 (2025-2-28)

科研通是完全免费的文献互助平台，具备全网最快的应助速度，最高的求助完成率。对每一个文献求助，科研通都将尽心尽力，给求助人一个满意的交代。

实时播报: 可爱的函函上传了应助文件

刚刚; 蔡蔡完成签到，获得积分10

刚刚; 英姑的应助被奶冻采纳，获得10

2秒前; satchzhao发布了新的文献求助10

2秒前; Duckseid完成签到，获得积分10

3秒前; ZY完成签到，获得积分10

5秒前; 吴老四发布了新的文献求助10

5秒前; Amy完成签到，获得积分10

5秒前; ymxlcfc完成签到，获得积分10

9秒前; zhuzhu完成签到，获得积分10

9秒前; FashionBoy的应助被yoozii采纳，获得10

11秒前; 与淇完成签到，获得积分10

11秒前; 经纲完成签到，获得积分0

13秒前; jojo完成签到，获得积分10

15秒前; maclogos发布了新的文献求助10

15秒前; 无尘完成签到，获得积分10

17秒前; 进击的研狗完成签到，获得积分10

17秒前; 细雨听风完成签到，获得积分10

19秒前; 略略略完成签到，获得积分10

19秒前; only完成签到，获得积分10

20秒前; 油麦菜完成签到，获得积分10

21秒前; MAKEYF完成签到，获得积分10

21秒前; 黑粉头头完成签到，获得积分10

22秒前; 小嚣张完成签到，获得积分10

24秒前; 修水县1个科研人完成签到，获得积分10

25秒前; 乐乐驳回了科研通AI5的应助

27秒前; 冰冰橙上传了应助文件

27秒前; Springgg完成签到，获得积分10

28秒前; 完美世界上传了应助文件

28秒前; JHGG上传了应助文件

29秒前; 慧喆完成签到，获得积分10

29秒前; 知非完成签到，获得积分10

30秒前; FashionBoy上传了应助文件

31秒前; knn完成签到，获得积分10

32秒前; Solar energy发布了新的文献求助10

32秒前; 谢雷XIELei上传了应助文件

32秒前; 铭名洺完成签到，获得积分10

33秒前; 柠檬完成签到，获得积分10

33秒前; 不认识发布了新的文献求助10

33秒前; 铜锣湾小研仔上传了应助文件

33秒前

高分求助中: Technologies supporting mass customization of apparel: A pilot project 600; Introduction to Strong Mixing Conditions Volumes 1-3 500; Tip60 complex regulates eggshell formation and oviposition in the white-backed planthopper, providing effective targets for pest control 400; A Field Guide to the Amphibians and Reptiles of Madagascar - Frank Glaw and Miguel Vences - 3rd Edition 400; China Gadabouts: New Frontiers of Humanitarian Nursing, 1941–51 400; The Healthy Socialist Life in Maoist China, 1949–1980 400; Walking a Tightrope: Memories of Wu Jieping, Personal Physician to China's Leaders 400

热门求助领域（近24小时）

热门帖子: 关注科研通微信公众号，转发送积分 3798557; 求助须知：如何正确求助？哪些是违规求助？ 3344128; 关于积分的说明 10318663; 捐赠科研通 3060696; 什么是DOI，文献DOI怎么找？ 1679782; 邀请新用户注册赠送积分活动 806769; 科研通“疑难数据库（出版商）”最低求助积分说明 763353

今日热心研友

平常的毛豆

星辰坠于海

剑指东方是为谁

繁荣的心情

注：热心度 = 本日应助数 + 本日被采纳获取积分÷10

Copyright © 2020-2025 AbleSci.COM, 科研通, All Right Reserved

科研通是非营利科研互助平台，不忘初心，为科研助力

本站互助的所有文件仅供个人学习研究用，禁止任何人把求助的所得文献进行盈利或传播

皖ICP备2024041134号-1

皖公网安备34019202002308

科研通【文献互助QQ群】：如果您有特殊求助，或发布求助超过24小时未得到应助，可加群求助，群号：941272744【点击一键加群】

科研通【志愿服务QQ群】：如果您热爱文献互助，有热心愿意为更多人服务，请加入小伙伴群，点击申请加入

关注微信服务号

科研通