Mixed Negative Sampling for Learning Two-tower Neural Networks in Recommendations
采样(信号处理)
深度学习
作者
Ji Yang,Xinyang Yi,Derek Zhiyuan Cheng,Lichan Hong,Yang Li,Simon Wang,Taibai Xu,Ed H. Chi
出处
期刊:Companion Proceedings of the Web Conference 2020日期:2020-04-20卷期号:: 441-447被引量:16
标识
DOI:10.1145/3366424.3386195
摘要
Learning query and item representations is important for building large scale recommendation systems. In many real applications where there is a huge catalog of items to recommend, the problem of efficiently retrieving top k items given user’s query from deep corpus leads to a family of factorized modeling approaches where queries and items are jointly embedded into a low-dimensional space. In this paper, we first showcase how to apply a two-tower neural network framework, which is also known as dual encoder in the natural language community, to improve a large-scale, production app recommendation system. Furthermore, we offer a novel negative sampling approach called Mixed Negative Sampling (MNS). In particular, different from commonly used batch or unigram sampling methods, MNS uses a mixture of batch and uniformly sampled negatives to tackle the selection bias of implicit user feedback. We conduct extensive offline experiments using large-scale production dataset and show that MNS outperforms other baseline sampling methods. We also conduct online A/B testing and demonstrate that the two-tower retrieval model based on MNS significantly improves retrieval quality by encouraging more high-quality app installs.