JUCS - Journal of Universal Computer Science 28(4): 378-395, doi: 10.3897/jucs.70941
Myers-Briggs personality classification from social media text using pre-trained language models
expand article infoVitor dos Santos, Ivandre Paraboni
‡ University of Sao Paulo, Sao Paulo, Brazil
Open Access

In Natural Language Processing, the use of pre-trained language models has been shown to obtain state-of-the-art results in many downstream tasks such as sentiment analysis, author identification and others. In this work, we address the use of these methods for personality classification from text. Focusing on the Myers-Briggs (MBTI) personality model, we describe a series of experiments in which the well-known Bidirectional Encoder Representations from Transformers (BERT) model is fine-tuned to perform MBTI classification. Our main findings suggest that the current approach significantly outperforms well-known text classification models based on bag-of-words and static word embeddings alike across multiple evaluation scenarios, and generally outperforms previous work in the field.

Natural language processing, text classification, Myers-Briggs, MBTI, personality, author profiling