Conference paper Open Access
Longhini, Jacopo; Rossi, Claudio; Casetti, Claudio; Angaramo, Federico
In this paper, we propose a machine learning approach to automatically classify non-informative and informative contents shared on Twitter during disasters caused by natural hazards. In particular, we leverage on previously sampled and labeled datasets of messages posted on Twitter during or in the aftermath of natural disasters. Starting from results obtained in previous studies, we propose a language-agnostic model. We define a base feature set considering only Twitter-specific metadata of each tweet, using classification results from this set as a reference. We introduce an additional feature, called the Source Feature, which is computed considering the device or platform used to post a tweet, and we evaluate its contribution in improving the classifier accuracy.