
Online Stochastic Optimization with Wasserstein-Based Nonstationarity

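Keywords: Stochastic optimization; Computer science; Mathematical optimization; Mathematics; Econometrics
Authors
Jiashuo Jiang, Xiaocheng Li, Jiawei Zhang
Source
Journal: Management Science [Institute for Operations Research and the Management Sciences]
Identifier
DOI: 10.1287/mnsc.2020.03850
Abstract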

We consider a general online stochastic optimization problem with multiple resource constraints over a horizon of finite time periods. In each time period, a reward function and multiple cost functions are revealed, and the decision maker needs to specify an action from a convex and compact action set to collect the reward and consume the resources. Each cost function corresponds to the consumption of one resource. The reward function and the cost functions of each time period are drawn from an unknown distribution, which is nonstationary across time. The objective of the decision maker is to maximize the cumulative reward subject to the resource constraints. This formulation captures a wide range of applications including online linear programming and network revenue management, among others. In this paper, we consider two settings: (i) a data-driven setting where the true distribution is unknown but a prior estimate (possibly inaccurate) is available and (ii) an uninformative setting where the true distribution is completely unknown. We propose a unified Wasserstein distance–based measure to quantify the inaccuracy of the prior estimate in setting (i) and the nonstationarity of the environment in setting (ii). We show that the proposed measure leads to a necessary and sufficient condition for the attainability of a sublinear regret in both settings. For setting (i), we propose an informative gradient descent algorithm. The algorithm takes a primal-dual perspective, and it integrates the prior information of the underlying distributions into an online gradient descent procedure in the dual space. The algorithm also naturally extends to the uninformative setting (ii). Under both settings, we show the corresponding algorithm achieves a regret of optimal order. We illustrate the algorithm’s performance through numerical experiments.

This paper was accepted by Chung Piaw Teo, optimization.
Supplemental Material: The online appendix and data files are available at https://doi.org/10.1287/mnsc.2020.03850.
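
The abstract describes a primal-dual scheme that runs online gradient descent in the dual space. The following is a minimal sketch of that general idea for online linear programming, not the paper's informative gradient descent algorithm. It rests on several assumptions that do not come from the abstract: binary accept/reject actions, nonnegative costs, a uniform per-period budget `budgets / T` (no prior estimate is used), and a step size of `1 / sqrt(T)`. The function name `dual_ogd_online_lp` and all of its parameters are hypothetical.

```python
import numpy as np


def dual_ogd_online_lp(rewards, costs, budgets, step_size=None):
    """Generic dual online gradient descent for online LP (illustrative only).

    rewards: shape (T,)   reward revealed in each period
    costs:   shape (T, m) nonnegative resource consumption per period
    budgets: shape (m,)   total budget of each resource
    """
    rewards = np.asarray(rewards, dtype=float)
    costs = np.asarray(costs, dtype=float)
    budgets = np.asarray(budgets, dtype=float)
    T, m = costs.shape

    # Assumption: spread each budget evenly over the horizon.
    rho = budgets / T
    # Assumption: a standard 1/sqrt(T) step size.
    eta = step_size if step_size is not None else 1.0 / np.sqrt(T)

    dual = np.zeros(m)            # one dual price per resource
    remaining = budgets.copy()    # budget left for each resource
    actions = np.zeros(T)

    for t in range(T):
        r, c = rewards[t], costs[t]
        # Primal step: accept when the reward beats the dual-priced cost.
        x = 1.0 if r > dual @ c else 0.0
        # Reject if accepting would overdraw any resource.
        if np.any(x * c > remaining):
            x = 0.0
        remaining -= x * c
        actions[t] = x
        # Dual step: projected gradient update; a price rises when the
        # period's consumption exceeds the per-period budget rho.
        dual = np.maximum(dual + eta * (x * c - rho), 0.0)

    return actions, float(rewards @ actions), remaining
```

A call such as `dual_ogd_online_lp(rewards, costs, budgets)` returns the accept/reject decisions, the total collected reward, and the leftover budget. Using a prior estimate of the distributions, as in the data-driven setting (i), would change how the per-period budget is chosen; that step is not attempted here.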
