JUCS - Journal of Universal Computer Science 27(10): 1128-1148, doi: 10.3897/jucs.65918
Adapting Pre-trained Language Models to Rumor Detection on Twitter
Hamda Slimi, Ibrahim Bounhas, Yahya Slimani
‡ Laboratory of Computer Science for Industrial Systems (LISI), INSAT, Carthage University, Tunis, Tunisia
Open Access

Fake news has invaded social media platforms, where false information propagates rapidly and with malicious intent. These circumstances call for solutions that monitor and detect rumors in a timely manner. In this paper, we propose an approach that seeks to detect emerging and previously unseen rumors on Twitter by adapting a pre-trained language model, namely RoBERTa, to the task of rumor detection. A comparison against content-based characteristics shows that the model surpasses handcrafted features. Experimental results show that our approach outperforms state-of-the-art methods on all metrics, and that fine-tuning RoBERTa yields richer word embeddings that consistently and significantly improve the precision of rumor recognition.

Twitter, Rumor Detection, RoBERTa, Pre-trained Language Models, Fine Tuning
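To make the adaptation step in the abstract concrete, the following is a minimal sketch of fine-tuning `roberta-base` with a binary classification head using the Hugging Face Transformers library. The toy tweets, the label names (`rumor` / `non-rumor`), and all hyperparameters are illustrative assumptions on our part, not the paper's actual configuration or dataset.

```python
# Sketch: adapting pre-trained RoBERTa to binary rumor detection.
# The two example tweets and hyperparameters below are assumptions
# for illustration only, not the paper's experimental setup.
import torch

LABELS = {"non-rumor": 0, "rumor": 1}  # hypothetical label names

def encode_labels(names):
    """Map string labels to the integer ids the classifier head expects."""
    return [LABELS[n] for n in names]

def fine_tune(tweets, labels, epochs=2, lr=2e-5):
    from transformers import RobertaTokenizer, RobertaForSequenceClassification
    tokenizer = RobertaTokenizer.from_pretrained("roberta-base")
    # Pre-trained encoder body plus a freshly initialized 2-way head.
    model = RobertaForSequenceClassification.from_pretrained(
        "roberta-base", num_labels=2)
    enc = tokenizer(tweets, truncation=True, padding=True, return_tensors="pt")
    targets = torch.tensor(encode_labels(labels))
    optimizer = torch.optim.AdamW(model.parameters(), lr=lr)
    model.train()
    for _ in range(epochs):
        optimizer.zero_grad()
        # Passing labels makes the model compute cross-entropy loss internally.
        out = model(**enc, labels=targets)
        out.loss.backward()
        optimizer.step()
    return model, tokenizer

if __name__ == "__main__":
    tweets = ["BREAKING: celebrity X found dead, no source given",
              "Official statement released by the ministry today"]
    model, tok = fine_tune(tweets, ["rumor", "non-rumor"])
```

In practice the fine-tuned model would be trained on a labeled rumor corpus and evaluated on events unseen during training, which is the scenario the abstract targets.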