Contextual Learning with Online Convex Optimization: Theory and Application to Medical Decision-Making

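Computer science, Artificial intelligence, Convex optimization, Mathematical optimization, Management science, Regular polygon, Machine learning, Psychology, Knowledge management, Economics, Mathematics, Geometry
Authors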
Esmaeil Keyvanshokooh,Mohammad Zhalechian,Cong Shi,Mark P. Van Oyen,Pooyan Kazemian
Source
Journal: Management Science [Institute for Operations Research and the Management Sciences]
Volume (Issue): 71 (12): 10442-10464 Citations: 5
Identifier
DOI:10.1287/mnsc.2019.03211
Abstract

Optimizing the treatment regimen is a fundamental medical decision-making problem. This can be thought of as a two-dimensional decision-making problem with a nested structure because it involves determining both the optimal medication and its optimal dose. Identifying the most effective medication for an individual often poses considerable difficulty, and even when a suitable medication is ascertained, dosing it optimally remains a significant challenge. Making these two nested decisions necessitates the adaptive learning of a personalized disease progression control model. To address this problem, we propose a novel contextual multiarmed bandit model under a two-dimensional control with a nested structure. For this model, we develop a new joint contextual learning and optimization algorithm, termed the stochastic subgradient descent atop contextual multiarmed bandit (SGD-MAB) algorithm. It sequentially selects for a patient (i) the best medication based on their contextual information and (ii) the corresponding dose optimized over the prior history of those patients who received the same medication. We prove that it admits a sublinear regret, which is tight up to a logarithmic factor. Our regret analysis leverages the strengths of both contextual bandit approaches and online convex optimization techniques in a seamless fashion. We substantiate the practicality of SGD-MAB using clinical data on patients with hypertension and heightened cardiovascular risks. Our analysis indicates that SGD-MAB has the potential to surpass current practices. We benchmark several policies to show the advantages of our approach and offer critical insights. Our framework holds promise for various applications beyond healthcare that require nested decision-making.
This paper was accepted by J. George Shanthikumar, data science.
Funding: This work was supported by the National Science Foundation (CMMI-1548201, CMMI-1634505) and the National Eye Institute (NIH Grant R01EY026641).
Supplemental Material: The online appendix and data files are available at https://doi.org/10.1287/mnsc.2019.03211 .
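
The nested decision described in the abstract (first choose a medication from patient context, then tune the dose for that medication from the history of patients who received it) can be illustrated with a small toy sketch. This is not the authors' SGD-MAB algorithm. It assumes an epsilon-greedy rule over per-arm linear reward estimates in place of the paper's contextual bandit component, a projected stochastic subgradient step with step size 1/sqrt(n) for the dose, and a fully simulated environment. All names (`NestedBanditSketch`, `simulate`, `ideal_dose`) and all numeric settings are hypothetical.

```python
import math
import random


def sign(x):
    """Return -1, 0, or 1: a subgradient of |x| at x."""
    return (x > 0) - (x < 0)


class NestedBanditSketch:
    """Toy two-level learner: pick an arm from context, then tune that arm's dose."""

    def __init__(self, n_arms, dim, dose_bounds=(0.0, 1.0), epsilon=0.1, seed=0):
        self.n_arms = n_arms
        self.dose_lo, self.dose_hi = dose_bounds
        self.epsilon = epsilon
        self.rng = random.Random(seed)
        # Per-arm linear reward weights, learned by online gradient steps.
        self.weights = [[0.0] * dim for _ in range(n_arms)]
        # Per-arm dose iterate, started at the midpoint of the feasible interval.
        mid = 0.5 * (self.dose_lo + self.dose_hi)
        self.dose = [mid] * n_arms
        self.pulls = [0] * n_arms

    def _predict(self, arm, context):
        return sum(w * x for w, x in zip(self.weights[arm], context))

    def select(self, context):
        """Epsilon-greedy arm choice (an assumption), paired with that arm's current dose."""
        if self.rng.random() < self.epsilon:
            arm = self.rng.randrange(self.n_arms)
        else:
            arm = max(range(self.n_arms), key=lambda a: self._predict(a, context))
        return arm, self.dose[arm]

    def update(self, arm, context, reward, dose_subgradient, lr=0.05):
        """Update only the chosen arm: its reward model and its dose iterate."""
        self.pulls[arm] += 1
        # Gradient step on the squared prediction error of the reward model.
        err = self._predict(arm, context) - reward
        self.weights[arm] = [w - lr * err * x
                             for w, x in zip(self.weights[arm], context)]
        # Projected subgradient step on the dose, step size 1/sqrt(n) for n pulls.
        step = 1.0 / math.sqrt(self.pulls[arm])
        new_dose = self.dose[arm] - step * dose_subgradient
        self.dose[arm] = min(self.dose_hi, max(self.dose_lo, new_dose))


def simulate(rounds=2000, seed=1):
    """Run the sketch on a made-up environment with two patient types and two arms."""
    rng = random.Random(seed)
    ideal_dose = [0.3, 0.7]  # hypothetical per-arm optimal doses
    learner = NestedBanditSketch(n_arms=2, dim=2, seed=seed + 1)
    for _ in range(rounds):
        kind = rng.randrange(2)  # patient type; arm `kind` suits this type
        context = [1.0, 0.0] if kind == 0 else [0.0, 1.0]
        arm, dose = learner.select(context)
        # Convex, nonsmooth dose loss |dose - ideal| for the chosen arm.
        loss = abs(dose - ideal_dose[arm])
        reward = (1.0 if arm == kind else 0.0) - loss + rng.gauss(0.0, 0.05)
        # Noisy subgradient of the dose loss (the "stochastic" part).
        noisy_subgrad = sign(dose - ideal_dose[arm]) + rng.gauss(0.0, 0.1)
        learner.update(arm, context, reward, noisy_subgrad)
    return learner


if __name__ == "__main__":
    learner = simulate()
    print("learned doses:", [round(d, 3) for d in learner.dose])
```

In this sketch each arm keeps its own dose iterate and is updated only when that arm is chosen, which mirrors the abstract's description of optimizing the dose over patients who received the same medication. The exploration rule, loss function, and step-size schedule are placeholders and carry none of the paper's regret guarantees.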