Social media platforms have been expanding their user bases. For example, LinkedIn counts 917 million monthly visitors, while Twitter has 3.62 billion monthly visitors. YouTube has 22.77 billion monthly visitors, and Instagram has 2.86 billion monthly visitors. Reports confirm data size increase of the social media networks above by 20–30% every day. With the spread of COVID-19, the same platforms have been broadly used by the worldwide collectiveness to socialize and stay amongst people. Analyzing text from Social Networking sites helps recognize individuals' personality traits automatically. A person's personality refers to their unique characteristics that shape their habits, behaviour, attitude, and cognitive tendencies. In this work, several machine learning techniques are surveyed to estimate personality traits from input text using the Myers-Briggs Type Indicator (MBTI) model. Experiments are run over a freely accessible dataset from Kaggle. In addition, techniques such as tokenization, word stemming, stop word elimination, and feature selection, utilizing TF-IDF, are used to analyze personality traits further.

Personality Traits Prediction from Text via Machine Learning, 2022.

Personality Traits Prediction from Text via Machine Learning

Bruno, Alessandro
;
2022-01-01

Abstract

Social media platforms have been expanding their user bases. For example, LinkedIn counts 917 million monthly visitors, while Twitter has 3.62 billion monthly visitors. YouTube has 22.77 billion monthly visitors, and Instagram has 2.86 billion monthly visitors. Reports confirm data size increase of the social media networks above by 20–30% every day. With the spread of COVID-19, the same platforms have been broadly used by the worldwide collectiveness to socialize and stay amongst people. Analyzing text from Social Networking sites helps recognize individuals' personality traits automatically. A person's personality refers to their unique characteristics that shape their habits, behaviour, attitude, and cognitive tendencies. In this work, several machine learning techniques are surveyed to estimate personality traits from input text using the Myers-Briggs Type Indicator (MBTI) model. Experiments are run over a freely accessible dataset from Kaggle. In addition, techniques such as tokenization, word stemming, stop word elimination, and feature selection, utilizing TF-IDF, are used to analyze personality traits further.
Inglese
2022
https://ieeexplore.ieee.org/abstract/document/9848937
2022 IEEE World Conference on Applied Intelligence and Computing (AIC)
internazionale
contributo
588
594
978-1-6654-7988-2
United States
IEEE
esperti anonimi
Online
Settore INF/01 - Informatica
2
File in questo prodotto:
File Dimensione Formato  
camera_ready_AIC2022.pdf

Accessibile solo dagli utenti con account Apeiron

Tipologia: Documento in Pre-print
Dimensione 370.45 kB
Formato Adobe PDF
370.45 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10808/50964
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 2
  • ???jsp.display-item.citation.isi??? ND
social impact