TY - JOUR
T1 - Sentiment Analysis of Tweets using Unsupervised Learning Techniques and the K-Means Algorithm
AU - Iparraguirre-Villanueva, Orlando
AU - Guevara-Ponce, Victor
AU - Sierra-Liñan, Fernando
AU - Beltozar-Clemente, Saul
AU - Cabanillas-Carbonell, Michael
N1 - Publisher Copyright:
© 2022. International Journal of Advanced Computer Science and Applications.All Rights Reserved.
PY - 2022
Y1 - 2022
N2 - Today, web content such as images, text, speeches, and videos are user-generated, and social networks have become increasingly popular as a means for people to share their ideas and opinions. One of the most popular social media for expressing their feelings towards events that occur is Twitter. The main objective of this study is to classify and analyze the content of the affiliates of the Pension and Funds Administration (AFP) published on Twitter. This study incorporates machine learning techniques for data mining, cleaning, tokenization, exploratory analysis, classification, and sentiment analysis. To apply the study and examine the data, Twitter was used with the hashtag #afp, followed by descriptive and exploratory analysis, including metrics of the tweets. Finally, a content analysis was carried out, including word frequency calculation, lemmatization, and classification of words by sentiment, emotions, and word cloud. The study uses tweets published in the month of May 2022. Sentiment distribution was also performed in three polarity classes: positive, neutral, and negative, representing 22%, 4%, and 74% respectively. Supported by the unsupervised learning method and the K-Means algorithm, we were able to determine the number of clusters using the elbow method.
AB - Today, web content such as images, text, speeches, and videos are user-generated, and social networks have become increasingly popular as a means for people to share their ideas and opinions. One of the most popular social media for expressing their feelings towards events that occur is Twitter. The main objective of this study is to classify and analyze the content of the affiliates of the Pension and Funds Administration (AFP) published on Twitter. This study incorporates machine learning techniques for data mining, cleaning, tokenization, exploratory analysis, classification, and sentiment analysis. To apply the study and examine the data, Twitter was used with the hashtag #afp, followed by descriptive and exploratory analysis, including metrics of the tweets. Finally, a content analysis was carried out, including word frequency calculation, lemmatization, and classification of words by sentiment, emotions, and word cloud. The study uses tweets published in the month of May 2022. Sentiment distribution was also performed in three polarity classes: positive, neutral, and negative, representing 22%, 4%, and 74% respectively. Supported by the unsupervised learning method and the K-Means algorithm, we were able to determine the number of clusters using the elbow method.
KW - Classification
KW - Machine learning
KW - Techniques
KW - Twitter
UR - http://www.scopus.com/inward/record.url?scp=85133369433&partnerID=8YFLogxK
U2 - 10.14569/IJACSA.2022.0130669
DO - 10.14569/IJACSA.2022.0130669
M3 - Article
AN - SCOPUS:85133369433
SN - 2158-107X
VL - 13
SP - 571
EP - 578
JO - International Journal of Advanced Computer Science and Applications
JF - International Journal of Advanced Computer Science and Applications
IS - 6
ER -