事件(粒子物理)
计算机科学
人工智能
安全性令牌
特征(语言学)
自然语言处理
相似性(几何)
代表(政治)
预处理器
嵌入
数据挖掘
机器学习
政治
图像(数学)
物理
哲学
量子力学
法学
语言学
计算机安全
政治学
作者
Zhenyu Huang,Yongjun Wang,Hongzuo Xu,Songlei Jian,Zhongyang Wang
标识
DOI:10.1145/3507548.3507585
摘要
Script event prediction is a big challenge and its goal is to predict the subsequent event based on the observed events. Since an event is described by text, the pre-trained models have been applied for event representation. However, the embedding based on the pre-trained models is sensitive to the short text format of events, and the existing works do not handle it well. In addition, previous models pay more attention to the semantic similarity but ignore the factors of emergencies. The turning event at the tail of the event chain can easily affect the follow-up direction. This paper proposes a new preprocessing method: cleaning, alignment, and connection, which helps to obtain richer event representations. On this basis, we concatenate the embedding of the CLS token and event sequence to integrate the semantic and temporal features of the event chain. To deal with the problem of event turning, we propose a tail event enhancement module. It adds the transition probability of tail events and candidate events into prediction layer, so as to avoid pay only attention to the semantic feature. The results of a large number of comparative experiments and ablation experiments confirm the superiority of our model compared with the baselines.
科研通智能强力驱动
Strongly Powered by AbleSci AI