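Computer science
Disjoint sets
Coronavirus disease 2019 (COVID-19)
Set (abstract data type)
Scalability
Artificial neural network
Artificial intelligence
Natural language processing
Mathematics
Medicine
Chemistry
Disease
Food science
Pathology
Combinatorics
Infectious disease (medical specialty)
Programming language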
Authors
Karlo Babić, Milan Petrović, Slobodan Beliga, Sanda Martinčić-Ipšić, Marko Pranjić, Ana Meštrović
Identifier
DOI:10.23919/mipro52101.2021.9596693
Abstract
In this paper, we explore how COVID-19 related content in tweets influences their spreadability. The experiment is performed in two steps on a dataset of Croatian-language tweets posted during the COVID-19 pandemic. In the first step, we train a feedforward neural network model to predict whether a tweet is highly spreadable or not. The trained model achieves 62.5% accuracy on this binary classification problem. In the second step, we use this model in a set of experiments to predict the average spreadability of tweets. In these experiments, we separate the original dataset into two disjoint subsets: one composed of tweets filtered using COVID-19 related keywords, and the other containing the rest of the tweets. Additionally, we modify these two subsets by adding tokens to or removing tokens from the tweets, thus making them artificially COVID-19 related or unrelated. Our preliminary results indicate that tweets that are semantically related to COVID-19 have, on average, higher spreadability than tweets that are not.
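The second step described in the abstract (a keyword-based split into two disjoint subsets, followed by adding or removing tokens) can be illustrated with a minimal Python sketch. This is not the authors' code: the keyword list, the whitespace tokenization, and all function names below are assumptions made for illustration only, and the trained spreadability model itself is not reproduced here.

```python
# Illustrative sketch only. The keyword set and all helper names are
# hypothetical; the paper's actual keyword list is not given in the abstract.
COVID_KEYWORDS = {"covid", "korona", "koronavirus", "pandemija"}

PUNCT = ".,!?#"


def is_covid_related(tweet, keywords=COVID_KEYWORDS):
    """Return True if any whitespace-separated token matches a keyword."""
    return any(tok.lower().strip(PUNCT) in keywords for tok in tweet.split())


def split_disjoint(tweets, keywords=COVID_KEYWORDS):
    """Partition tweets into (related, unrelated); every tweet lands in exactly one list."""
    related, unrelated = [], []
    for tweet in tweets:
        (related if is_covid_related(tweet, keywords) else unrelated).append(tweet)
    return related, unrelated


def remove_keywords(tweet, keywords=COVID_KEYWORDS):
    """Drop keyword tokens, making a tweet artificially unrelated to COVID-19."""
    kept = [tok for tok in tweet.split() if tok.lower().strip(PUNCT) not in keywords]
    return " ".join(kept)


def add_keyword(tweet, keyword="covid"):
    """Append a keyword token, making a tweet artificially COVID-19 related."""
    return f"{tweet} {keyword}"
```

Under these assumptions, each original or modified subset would then be passed to the trained classifier, and the predicted spreadability averaged per subset for comparison.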