强化学习
管道(软件)
计算机科学
水准点(测量)
动态定价
数学优化
动态规划
选择(遗传算法)
动态贝叶斯网络
人工智能
贝叶斯优化
自动化
机器学习
贝叶斯概率
工程类
算法
数学
营销
业务
程序设计语言
地理
机械工程
大地测量学
作者
Reza Afshar,Jason Rhuggenaath,Yingqian Zhang,Uzay Kaymak
出处
期刊:IEEE transactions on artificial intelligence
[Institute of Electrical and Electronics Engineers]
日期:2022-01-01
卷期号:: 1-10
被引量:1
标识
DOI:10.1109/tai.2022.3186292
摘要
Dynamic pricing problem is difficult due to the highly dynamic environment and unknown demand distributions. In this paper, we propose a Deep Reinforcement Learning (DRL) framework, which is a pipeline that automatically defines the DRL components for solving a Dynamic Pricing problem. The automated DRL pipeline is necessary because the DRL framework can be designed in numerous ways, and manually finding optimal configurations is tedious. The levels of automation make non-experts capable of using DRL for dynamic pricing. Our DRL pipeline contains three steps of DRL design, including MDP modeling, algorithm selection, and hyper-parameter optimization. It starts with transforming available information to state representation and defining reward function using a reward shaping approach. Then, the hyper-parameters are tuned using a novel hyper-parameters optimization method that integrates Bayesian Optimization and the selection operator of the Genetic algorithm. We employ our DRL pipeline on reserve price optimization problems in online advertising as a case study. We show that using the DRL configuration obtained by our DRL pipeline, a pricing policy is obtained whose revenue is significantly higher than the benchmark methods. The evaluation is performed by developing a simulation for the RTB environment that makes exploration possible for the RL agent.
科研通智能强力驱动
Strongly Powered by AbleSci AI