Deep Reinforcement Learning for Online Assortment Customization: A Data-Driven Approach

个性化 强化学习 计算机科学 钢筋 人工智能 万维网 心理学 社会心理学
作者
Tao Li,Chenhao Wang,Yao Wang,Shaojie Tang,Ningyuan Chen
出处
期刊:Production and Operations Management [Wiley]
卷期号:35 (2): 665-684 被引量:1
标识
DOI:10.1177/10591478251351737
摘要

When a platform has limited inventory, it is important to have a variety of products available for each customer while managing the remaining stock. To maximize revenue over the long term, the assortment policy needs to take into account the complex purchasing behavior of customers whose arrival orders and preferences may be unknown. We propose a data-driven approach for dynamic assortment planning that utilizes historical customer arrivals and transaction data. To address the challenge of online assortment customization, we use a Markov decision process framework and employ a model-free deep reinforcement learning (DRL) approach to solve the online assortment policy because of the computational challenge. Our method uses a specially designed deep neural network (DNN) model to create assortments while observing the inventory constraints, and an advantage actor-critic algorithm to update the parameters of the DNN model, with the help of a simulator built from the historical transaction data. To evaluate the effectiveness of our approach, we conduct simulations using both a synthetic data set generated with a pre-determined customer type distribution and ground-truth choice model, as well as a real-world data set. Our extensive experiments demonstrate that our approach produces significantly higher long-term revenue compared to some existing methods and remains robust under various practical conditions. We also demonstrate that our approach can be easily adapted to a more general problem that includes reusable products, where customers might return purchased items. In this setting, we find that our approach performs well under various usage time distributions.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
省静霞发布了新的文献求助30
刚刚
脑洞疼应助何以采纳,获得10
刚刚
yangts2021发布了新的文献求助10
1秒前
小小的玛卡吧卡完成签到,获得积分10
1秒前
期天应助烂漫的半梅采纳,获得10
1秒前
1秒前
所所应助银河采纳,获得10
2秒前
2秒前
3秒前
3秒前
烟花应助你真是那个啊采纳,获得10
3秒前
情怀应助deardorff采纳,获得10
3秒前
curry123完成签到,获得积分10
3秒前
CipherSage应助ZHQ采纳,获得10
3秒前
专注的胡萝卜完成签到 ,获得积分10
3秒前
FashionBoy应助亿眼万年采纳,获得10
4秒前
Skyline完成签到,获得积分10
4秒前
情怀应助noahxinny采纳,获得10
4秒前
潇潇完成签到 ,获得积分10
5秒前
5秒前
ljc完成签到,获得积分10
5秒前
5秒前
lezard发布了新的文献求助10
5秒前
大胆孤菱关注了科研通微信公众号
5秒前
Huguizhou完成签到,获得积分20
6秒前
丰富的绮波完成签到,获得积分10
6秒前
科研通AI6.4应助张思睿采纳,获得10
6秒前
6秒前
503503_发布了新的文献求助10
7秒前
西门子云完成签到,获得积分10
8秒前
rxn824发布了新的文献求助10
8秒前
8秒前
8秒前
Forken完成签到,获得积分10
9秒前
爱科研的小张完成签到 ,获得积分10
9秒前
zhuxf完成签到 ,获得积分10
10秒前
思源应助人美心善大野驴采纳,获得10
10秒前
烟花应助Huguizhou采纳,获得10
10秒前
11秒前
nn完成签到,获得积分10
11秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
The Cambridge History of China: Volume 4, Sui and T'ang China, 589–906 AD, Part Two 1500
Cowries - A Guide to the Gastropod Family Cypraeidae 1200
Quality by Design - An Indispensable Approach to Accelerate Biopharmaceutical Product Development 800
Pulse width control of a 3-phase inverter with non sinusoidal phase voltages 777
Signals, Systems, and Signal Processing 610
Research Methods for Applied Linguistics: A Practical Guide 600
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6400657
求助须知:如何正确求助?哪些是违规求助? 8217487
关于积分的说明 17413940
捐赠科研通 5453723
什么是DOI,文献DOI怎么找? 2882234
邀请新用户注册赠送积分活动 1858795
关于科研通互助平台的介绍 1700558