JUCS - Journal of Universal Computer Science 27(10): 1128-1148, doi: 10.3897/jucs.65918
Adapting Pre-trained Language Models to Rumor Detection on Twitter
Hamda Slimi, Ibrahim Bounhas, Yahya Slimani
‡ Laboratory of Computer Science for Industrial Systems (LISI), INSAT, Carthage University, Tunis, Tunisia
Open Access

Fake news has invaded social media platforms, where false information propagates rapidly and with malicious intent. These circumstances call for solutions that monitor and detect rumors in a timely manner. In this paper, we propose an approach that seeks to detect emerging and previously unseen rumors on Twitter by adapting a pre-trained language model, namely RoBERTa, to the task of rumor detection. A comparison against content-based characteristics shows that the model surpasses handcrafted features. Experimental results show that our approach outperforms state-of-the-art methods on all metrics, and that fine-tuning RoBERTa yields richer word embeddings that consistently and significantly improve the precision of rumor recognition.

Twitter, Rumor Detection, RoBERTa, Pre-trained Language Models, Fine Tuning
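To make the adaptation step in the abstract concrete, the following is a minimal sketch of fine-tuning `roberta-base` with a binary classification head using the Hugging Face Transformers library. The toy tweets, the label names (`rumor` / `non-rumor`), and all hyperparameters are illustrative assumptions on our part, not the paper's actual configuration or dataset.

```python
# Sketch: adapting pre-trained RoBERTa to binary rumor detection.
# The two example tweets and hyperparameters below are assumptions
# for illustration only, not the paper's experimental setup.
import torch

LABELS = {"non-rumor": 0, "rumor": 1}  # hypothetical label names

def encode_labels(names):
    """Map string labels to the integer ids the classifier head expects."""
    return [LABELS[n] for n in names]

def fine_tune(tweets, labels, epochs=2, lr=2e-5):
    from transformers import RobertaTokenizer, RobertaForSequenceClassification
    tokenizer = RobertaTokenizer.from_pretrained("roberta-base")
    # Pre-trained encoder body plus a freshly initialized 2-way head.
    model = RobertaForSequenceClassification.from_pretrained(
        "roberta-base", num_labels=2)
    enc = tokenizer(tweets, truncation=True, padding=True, return_tensors="pt")
    targets = torch.tensor(encode_labels(labels))
    optimizer = torch.optim.AdamW(model.parameters(), lr=lr)
    model.train()
    for _ in range(epochs):
        optimizer.zero_grad()
        # Passing labels makes the model compute cross-entropy loss internally.
        out = model(**enc, labels=targets)
        out.loss.backward()
        optimizer.step()
    return model, tokenizer

if __name__ == "__main__":
    tweets = ["BREAKING: celebrity X found dead, no source given",
              "Official statement released by the ministry today"]
    model, tok = fine_tune(tweets, ["rumor", "non-rumor"])
```

In practice the fine-tuned model would be trained on a labeled rumor corpus and evaluated on events unseen during training, which is the scenario the abstract targets.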