强化学习
马尔可夫决策过程
计算机科学
背景(考古学)
订单(交换)
运筹学
比例(比率)
可靠性(半导体)
订单履行
过程(计算)
马尔可夫过程
工业工程
数学优化
供应链
人工智能
工程类
经济
业务
营销
数学
功率(物理)
统计
物理
财务
量子力学
操作系统
古生物学
生物
作者
Zhiwei Qin,Xiaocheng Tang,Yan Jiao,Fan Zhang,Zhe Xu,Hongtu Zhu,Jieping Ye
标识
DOI:10.1287/inte.2020.1047
摘要
Order dispatching is instrumental to the marketplace engine of a large-scale ride-hailing platform, such as the DiDi platform, which continuously matches passenger trip requests to drivers at a scale of tens of millions per day. Because of the dynamic and stochastic nature of supply and demand in this context, the ride-hailing order-dispatching problem is challenging to solve for an optimal solution. Added to the complexity are considerations of system response time, reliability, and multiple objectives. In this paper, we describe how our approach to this optimization problem has evolved from a combinatorial optimization approach to one that encompasses a semi-Markov decision-process model and deep reinforcement learning. We discuss the various practical considerations of our solution development and real-world impact to the business.
科研通智能强力驱动
Strongly Powered by AbleSci AI