Celebrity profiling on twitter using sociolinguistic features notebook for PAN at CLEF 2019

Luis Gabriel Moreno-Sandoval, Edwin Puertas, Flor Miriam Plaza-Del-Arco, Alexandra Pomares-Quimbaya, Jorge Andres Alvarado-Valencia, L. Alfonso Ureña-López

Research output: Contribution to journalConference articlepeer-review

2 Scopus citations

Abstract

Social networks have been a revolutionary scenario for celebrities because they allow them to reach a wider audience with much higher frequency than using traditional means. These platforms enable them to improve or sometimes deteriorate, their careers through the construction of closer relationships with their fans and the acquisition of new ones. Indeed, networks have promoted the emergence of a new type of celebrities that exists only in the digital world. Being able to characterize the celebrities that are more active on social networks, such as Twitter, gives an enormous opportunity to identify what is their real level of fame, what is their relevance for an age group, or a specific gender or occupation. These facts may enrich decision making, especially in advertising and marketing. To achieve this aim, this paper presents a novel strategy for the characterization of celebrities profile on Twitter based on the generation of socio-linguistic features from their posts that serve as input to a set of classifiers. Specifically, we produced four classifiers that describe the level of fame, the gender, the birth date, and the possible occupation of a celebrity. We obtained the training and test data sets as part of our participation at PAN 2019 at CLEF. Results of each classifier are reported including the analysis of which features are more relevant, which classification techniques were more useful and which were the final precision and recall results.

Original languageEnglish
JournalCEUR Workshop Proceedings
Volume2380
StatePublished - 2019
Event20th Working Notes of CLEF Conference and Labs of the Evaluation Forum, CLEF 2019 - Lugano, Switzerland
Duration: 09 Sep 201912 Sep 2019

Keywords

  • Author profiling
  • Celebrity profiling
  • Computational linguistic
  • Natural language processing
  • Socio-linguistic feature
  • Twitter
  • User profiling

Fingerprint

Dive into the research topics of 'Celebrity profiling on twitter using sociolinguistic features notebook for PAN at CLEF 2019'. Together they form a unique fingerprint.

Cite this