亲爱的研友该休息了!由于当前在线用户较少,发布求助请尽量完整地填写文献信息,科研通机器人24小时在线,伴您度过漫漫科研夜!身体可是革命的本钱,早点休息,好梦!

Utilizing Non-click Samples via Semi-supervised Learning for Conversion Rate Prediction

计算机科学 点击率 人工智能 机器学习 情报检索
作者
Jiahui Huang,Lan Zhang,Junhao Wang,Shanyang Jiang,Dongbo Huang,Cheng Ding,Lan Xu
标识
DOI:10.1145/3640457.3688151
摘要

Conversion rate (CVR) prediction is essential in recommender systems, facilitating precise matching between recommended items and users' preferences. However, the sample selection bias (SSB) and data sparsity (DS) issues pose challenges to accurate prediction. Existing works have proposed the click-through and conversion rate (CTCVR) prediction task which models samples from exposure to ``click and conversion" in entire space and incorporates multi-task learning. This approach has shown efficacy in mitigating these challenges. Nevertheless, it intensifies the false negative sample (FNS) problem. To be more specific, the CTCVR task implicitly treats all the CVR labels of non-click samples as negative, overlooking the possibility that some samples might convert if clicked. This oversight can negatively impact CVR model performance, as empirical analysis has confirmed. To this end, we advocate for discarding the CTCVR task and proposing a Non-click samples Improved Semi-supErvised (NISE) method for conversion rate prediction, where the non-click samples are treated as unlabeled. Our approach aims to predict their probabilities of conversion if clicked, utilizing these predictions as pseudo-labels for further model training. This strategy can help alleviate the FNS problem, and direct modeling of the CVR task across the entire space also mitigates the SSB and DS challenges. Additionally, we conduct multi-task learning by introducing an auxiliary click-through rate prediction task, thereby enhancing embedding layer representations. Our approach is applicable to various multi-task architectures. Comprehensive experiments are conducted on both public and production datasets, demonstrating the superiority of our proposed method in mitigating the FNS challenge and improving the CVR estimation. The implementation code is available at https://github.com/Hjh233/NISE.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
Chloe发布了新的文献求助10
1秒前
5秒前
lianmeiliu发布了新的文献求助10
10秒前
18秒前
19秒前
20秒前
英俊的铭应助飞快的从菡采纳,获得20
20秒前
Akim应助科研通管家采纳,获得10
20秒前
20秒前
思源应助科研通管家采纳,获得30
20秒前
今后应助科研通管家采纳,获得10
20秒前
lianmeiliu完成签到,获得积分10
23秒前
ray发布了新的文献求助10
24秒前
2589完成签到,获得积分10
27秒前
31秒前
桐桐应助2589采纳,获得10
34秒前
41秒前
ding应助酷炫灰狼采纳,获得10
51秒前
52秒前
56秒前
852应助幸福的战斗机采纳,获得10
1分钟前
科研通AI6.4应助小鲤鱼采纳,获得10
1分钟前
丘比特应助积极的皮卡丘采纳,获得10
1分钟前
1分钟前
正弦交流电完成签到,获得积分10
1分钟前
1分钟前
1分钟前
执着雨发布了新的文献求助10
1分钟前
jane发布了新的文献求助10
1分钟前
Jerry发布了新的文献求助10
1分钟前
1分钟前
1分钟前
pastel发布了新的文献求助10
1分钟前
动物园小科畜完成签到,获得积分10
1分钟前
酷炫灰狼发布了新的文献求助10
1分钟前
1分钟前
Moomba完成签到 ,获得积分10
1分钟前
Barista发布了新的文献求助10
1分钟前
1分钟前
香蕉觅云应助Barista采纳,获得10
1分钟前
高分求助中
Malcolm Fraser : a biography 680
Signals, Systems, and Signal Processing 610
天津市智库成果选编 600
Climate change and sports: Statistics report on climate change and sports 500
Forced degradation and stability indicating LC method for Letrozole: A stress testing guide 500
Organic Reactions Volume 118 400
A Foreign Missionary on the Long March: The Unpublished Memoirs of Arnolis Hayman of the China Inland Mission 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6457500
求助须知:如何正确求助?哪些是违规求助? 8267369
关于积分的说明 17620590
捐赠科研通 5525232
什么是DOI,文献DOI怎么找? 2905445
邀请新用户注册赠送积分活动 1882141
关于科研通互助平台的介绍 1726141