Age classification from Spanish tweets: the variable age analyzed by using linear classifiers

Luis G. Moreno-Sandoval, Joan Felipe Mendoza-Molina, Edwin Alexander Puertas, Arturo Duque-Marín, Alexandra Pomares-Quimbaya, Jorge A. Alvarado-Valencia

Research output: Chapter in book/report/conference proceeding, conference contribution, peer-reviewed

3 Citations (Scopus)

Abstract

Text classification (or text categorization) in social networks such as Twitter has gained great importance as applications of this process have grown across diverse domains of society. The literature on text classifiers is extensive, especially for languages such as English; however, this is not the case for age classification, where studies have focused mainly on image recognition and analysis. This paper presents the results of testing the performance of linear classifiers in the task of identifying Twitter users' age from their profile descriptions and tweets. For this purpose, a Spanish lexicon of 45 words around the concept "cumpleaños" (birthday) was created, and a gold standard of 1541 users with correctly identified age was obtained. The experiments are presented together with a description of the algorithms used, leading to the seven best models, which identify the user's age with accuracy between 66% and 69%. When the information-retrieval layer is taken into account, the new results show that accuracy increased from 69.09% to 72.96%.

Original language: English
Title of host publication: ICEIS 2018 - Proceedings of the 20th International Conference on Enterprise Information Systems
Editors: Slimane Hammoudi, Michal Smialek, Olivier Camp, Joaquim Filipe
Publisher: SciTePress
Pages: 275-281
Number of pages: 7
ISBN (electronic): 9789897582981
DOI
Status: Published - 2018
Event: 20th International Conference on Enterprise Information Systems, ICEIS 2018 - Funchal, Madeira, Portugal
Duration: 21 Mar 2018 to 24 Mar 2018

Publication series

Name: ICEIS 2018 - Proceedings of the 20th International Conference on Enterprise Information Systems
Volume: 1

Conference

Conference: 20th International Conference on Enterprise Information Systems, ICEIS 2018
Country/Territory: Portugal
City: Funchal, Madeira
Period: 21/03/18 to 24/03/18
