计算机科学
人工智能
任务(项目管理)
瓶颈
机器学习
事件(粒子物理)
鉴定(生物学)
标记数据
自然语言处理
物理
量子力学
植物
管理
经济
生物
嵌入式系统
作者
Bo Zhou,Yubo Chen,Kang Liu,Jun Zhao,Jiexin Xu,Xiao-Jian Jiang,Jinlong Li
标识
DOI:10.1145/3459637.3482150
摘要
Most existing approaches to script event prediction rely on manually labeled data heavily, which is often expensive to obtain. To cope with the training data bottleneck, we investigate methods of combining multiple self-supervised tasks, i.e. tasks where models are explicitly trained with automatically generated labels. We propose two self-supervised pre-training tasks:one is End Identification and the other is Contrastive Scoring. Multi-task learning framework is then leveraged to combine these two tasks to jointly train the model. The pre-trained model is then fine-tuned using human-annotated script event prediction training data. Experimental results on the commonly used dataset show that our approach can achieve competitive performance compared to the previous models which are trained with the whole dataset by using just 10% of the training data, and our model trained on the whole dataset outperforms previous models significantly.
科研通智能强力驱动
Strongly Powered by AbleSci AI