TY - GEN
T1 - Analysis of social networks publications for stock market movement prediction
T2 - 2022 Congreso Internacional de Innovacion y Tendencias en Ingenieria, CONIITI 2022
AU - Bustos, Oscar
AU - Pomares, Alexandra
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - Predicting the stock market has been a problem that has caught the attention of the scientific community since any new information is quickly incorporated into the share price. Then, it has been sought to find new sources of information that can be useful for machine learning models and allow to predict with greater precision the movement of the action. One of these novel data sources has been the estimation of the mood of the population, approximating it by means of polarity analysis on Twitter. The scientific community has focused on the study of this relationship in the stock markets of the United States and China, but no work has been reported in Spanish-speaking markets. This paper presents a methodology for capturing, cleaning, and creating indexes derived from the polarity in Twitter messages, adapted to the Spanish language. For the calculation of indicators, more than 8 million tweets published in Colombia were analyzed during the period 8-2020 to 8-2021, their polarity was calculated using the set of words from the AFINN lexicon translated into Spanish. There were calculated 5 social indicators derived from the Twitter message's polarity. The Logistic Regression, Support Vector Machines, and Artificial Neural Networks models were trained, where the latter had the best performance, with an accuracy of 58%. Then this article opens the discussion of the applicability of this type of technique in the Spanish-speaking markets.
AB - Predicting the stock market has been a problem that has caught the attention of the scientific community since any new information is quickly incorporated into the share price. Then, it has been sought to find new sources of information that can be useful for machine learning models and allow to predict with greater precision the movement of the action. One of these novel data sources has been the estimation of the mood of the population, approximating it by means of polarity analysis on Twitter. The scientific community has focused on the study of this relationship in the stock markets of the United States and China, but no work has been reported in Spanish-speaking markets. This paper presents a methodology for capturing, cleaning, and creating indexes derived from the polarity in Twitter messages, adapted to the Spanish language. For the calculation of indicators, more than 8 million tweets published in Colombia were analyzed during the period 8-2020 to 8-2021, their polarity was calculated using the set of words from the AFINN lexicon translated into Spanish. There were calculated 5 social indicators derived from the Twitter message's polarity. The Logistic Regression, Support Vector Machines, and Artificial Neural Networks models were trained, where the latter had the best performance, with an accuracy of 58%. Then this article opens the discussion of the applicability of this type of technique in the Spanish-speaking markets.
KW - Machine Learning
KW - Natural Language Processing
KW - Stock Market Forecasting
UR - http://www.scopus.com/inward/record.url?scp=85143694709&partnerID=8YFLogxK
U2 - 10.1109/CONIITI57704.2022.9953669
DO - 10.1109/CONIITI57704.2022.9953669
M3 - Conference contribution
AN - SCOPUS:85143694709
T3 - 2022 Congreso Internacional de Innovacion y Tendencias en Ingenieria, CONIITI 2022 - Conference Proceedings
BT - 2022 Congreso Internacional de Innovacion y Tendencias en Ingenieria, CONIITI 2022 - Conference Proceedings
A2 - Morales, Victor Manuel Fontalvo
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 5 October 2022 through 7 October 2022
ER -