Prediction of COVID-19 related information spreading on Twitter

2021 
In this paper, we explore the influence of COVID-19 related content in tweets on their spreadability. The experiment is performed in two steps on the dataset of tweets in the Croatian language posted during the COVID-19 pandemics. In the first step, we train a feedforward neural network model to predict if a tweet is highly-spreadable or not. The trained model achieves 62.5% accuracy on the binary classification problem. In the second step, we use this model in a set of experiments for predicting the average spreadability of tweets. In these experiments, we separate the original dataset into two disjoint subsets: one composed of tweets filtered using COVID-19 related keywords and the other that contains the rest of the tweets. Additionally, we modified these two subsets by adding and removing tokens into tweets and thus making them artificially COVID-19 related or not related. Our preliminary results indicate that tweets that are semantically related to COVID-19 have on average higher spreadability than the tweets that are not semantically related to COVID-19.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    4
    References
    1
    Citations
    NaN
    KQI
    []