Resource Allocation for Sequential Decision Making Under Uncertainaty : Studies in Vehicular Traffic Control, Service Systems, Sensor Networks and Mechanism Design

计算机科学 计算机网络 资源配置 服务质量 分布式计算 控制(管理) 实时计算 服务(商务)
作者
L A Prashanth
链接
摘要

A fundamental question in a sequential decision making setting under uncertainty is “how to allocate resources amongst competing entities so as to maximize the rewards accumulated in the long run?”. The resources allocated may be either abstract quantities such as time or concrete quantities such as manpower. The sequential decision making setting involves one or more agents interacting with an environment to procure rewards at every time instant and the goal is to find an optimal policy for choosing actions. Most of these problems involve multiple (infinite) stages and the objective function is usually a long-run performance objective. The problem is further complicated by the uncertainties in the sys-tem, for instance, the stochastic noise and partial observability in a single-agent setting or private information of the agents in a multi-agent setting. The dimensionality of the problem also plays an important role in the solution methodology adopted. Most of the real-world problems involve high-dimensional state and action spaces and an important design aspect of the solution is the choice of knowledge representation. The aim of this thesis is to answer important resource allocation related questions in different real-world application contexts and in the process contribute novel algorithms to the theory as well. The resource allocation algorithms considered include those from stochastic optimization, stochastic control and reinforcement learning. A number of new algorithms are developed as well. The application contexts selected encompass both single and multi-agent systems, abstract and concrete resources and contain high-dimensional state and control spaces. The empirical results from the various studies performed indicate that the algorithms presented here perform significantly better than those previously proposed in the literature. Further, the algorithms presented here are also shown to theoretically converge, hence guaranteeing optimal performance. We now briefly describe the various studies conducted here to investigate problems of resource allocation under uncertainties of different kinds: Vehicular Traffic Control The aim here is to optimize the ‘green time’ resource of the individual lanes in road networks that maximizes a certain long-term performance objective. We develop several reinforcement learning based algorithms for solving this problem. In the infinite horizon discounted Markov decision process setting, a Q-learning based traffic light control (TLC) algorithm that incorporates feature based representations and function approximation to handle large road networks is proposed, see Prashanth and Bhatnagar [2011b]. This TLC algorithm works with coarse information, obtained via graded thresholds, about the congestion level on the lanes of the road network. However, the graded threshold values used in the above Q-learning based TLC algorithm as well as several other graded threshold-based TLC algorithms that we propose, may not be optimal for all traffic conditions. We therefore also develop a…

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
Zing完成签到 ,获得积分10
1秒前
自由从筠完成签到 ,获得积分10
5秒前
14秒前
灵溪宗完成签到,获得积分0
26秒前
就算雨也不会停完成签到 ,获得积分10
35秒前
SOLOMON应助科研通管家采纳,获得10
44秒前
赘婿应助科研通管家采纳,获得10
44秒前
Last炫神丶完成签到,获得积分10
45秒前
唐然然完成签到 ,获得积分10
1分钟前
斯文的天奇完成签到 ,获得积分10
1分钟前
云帆沧海完成签到,获得积分10
1分钟前
Lesterem完成签到 ,获得积分10
1分钟前
啊一啾完成签到 ,获得积分10
1分钟前
颜林林发布了新的文献求助10
1分钟前
笨笨忘幽完成签到,获得积分10
1分钟前
CLTTT完成签到,获得积分10
1分钟前
阳光的凝冬完成签到 ,获得积分10
1分钟前
gmc完成签到 ,获得积分10
1分钟前
1分钟前
1分钟前
2分钟前
JACK发布了新的文献求助10
2分钟前
寻道图强应助一群小怪采纳,获得50
2分钟前
tx完成签到,获得积分10
2分钟前
大模型应助Singularity采纳,获得10
2分钟前
2分钟前
JACK完成签到 ,获得积分20
2分钟前
kvkill发布了新的文献求助10
2分钟前
Ava应助Singularity采纳,获得10
2分钟前
kvkill完成签到,获得积分10
2分钟前
迷人囧完成签到 ,获得积分10
3分钟前
3分钟前
梦想去广州当靓仔完成签到 ,获得积分10
3分钟前
苦行僧完成签到 ,获得积分10
3分钟前
LZC完成签到 ,获得积分10
3分钟前
我就是KKKK完成签到 ,获得积分10
4分钟前
chen完成签到,获得积分10
4分钟前
陈米花完成签到,获得积分10
4分钟前
yyjl31完成签到,获得积分10
4分钟前
Simon_chat完成签到,获得积分10
4分钟前
高分求助中
请在求助之前详细阅读求助说明!!!! 20000
The Three Stars Each: The Astrolabes and Related Texts 900
Yuwu Song, Biographical Dictionary of the People's Republic of China 700
Bernd Ziesemer - Maos deutscher Topagent: Wie China die Bundesrepublik eroberte 500
A radiographic standard of reference for the growing knee 400
Glossary of Geology 400
Additive Manufacturing Design and Applications 320
热门求助领域 (近24小时)
化学 材料科学 医学 生物 有机化学 工程类 生物化学 纳米技术 物理 内科学 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 电极 光电子学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 2473593
求助须知:如何正确求助?哪些是违规求助? 2138800
关于积分的说明 5450839
捐赠科研通 1862817
什么是DOI,文献DOI怎么找? 926240
版权声明 562817
科研通“疑难数据库(出版商)”最低求助积分说明 495463