计算机科学
多项式logistic回归
后悔
选择(遗传算法)
职位(财务)
数学优化
产品(数学)
运筹学
机器学习
经济
数学
几何学
财务
出处
期刊:Operations Research
[Institute for Operations Research and the Management Sciences]
日期:2025-08-11
标识
DOI:10.1287/opre.2024.1556
摘要
This study addresses a key challenge in online retail: product positioning. The authors propose a novel online learning framework called dynamic assortment selection with positioning (DAP). Unlike traditional models that focus solely on item selection, DAP also learns optimal product placement to maximize revenue. The researchers model customer choices using a multinomial logit framework, where item appeal depends on both intrinsic preference and display position. They demonstrate that ignoring position effects leads to suboptimal performance and introduce a new algorithm, TLR-UCB, which effectively incorporates adaptive position-dependent feedback through a geometric linear bandit structure and truncated linear regression techniques. Theoretical analysis confirms that TLR-UCB achieves optimal learning efficiency. To handle unknown position effects, they further develop EI-TLR, a two-stage policy that jointly estimates customer preferences and positioning impacts before applying a generalized TLR-UCB procedure. Extensive simulations show that both TLR-UCB and EI-TLR significantly outperform existing benchmarks, offering powerful tools for dynamic, data-driven assortment and layout optimization in online marketplaces.
科研通智能强力驱动
Strongly Powered by AbleSci AI