Comparing Machine Learning Algorithms for Sentiment Analysis in Movie Reviews
DOI:
https://doi.org/10.13052/jgeu0975-1416.13110Keywords:
Sentiment analysis, machine learning, Naive Bayes, logistic regression, support vector machinesAbstract
Sentiment analysis is a crucial natural language processing component that helps businesses to understand public opinion, customer feedback, and market trends. This study evaluates the performance of three machine learning models, namely Naive Bayes, Logistic Regression and SVM in classifying the sentiment of movie reviews. We analyzed 50,000 IMDB reviews from Kaggle using TF-IDF with three ML models using the following key metrics: accuracy, precision, recall, and F1-score. We further analyzed the confusion matrix for each model, which provided insights into the number of correctly and incorrectly classified reviews, helping identify potential areas for improvement. The findings of this study can guide the selection and optimization of algorithms for sentiment analysis, ultimately improving marketing, customer engagement, and social media strategies.
Downloads
References
Q. A. Xu, V. Chang, and C. Jayne, “A systematic review of social media-based sentiment analysis: Emerging trends and challenges,” Decision Analytics Journal, vol. 3, p. 100073, 2022.
C. Sahoo, M. Wankhade, and B. K. Singh, “Sentiment analysis using deep learning techniques: a comprehensive review,” Int J Multimed Info Retr, vol. 12, no. 2, p. 41, Dec. 2023.
Md. S. Islam et al., “Challenges and future in deep learning for sentiment analysis: a comprehensive review and a proposed novel hybrid approach,” Artif Intell Rev, vol. 57, no. 3, p. 62, Mar. 2024.
R. B. Shamantha, S. M. Shetty, and P. Rai, “Sentiment analysis using machine learning classifiers: evaluation of performance,” in 2019 IEEE 4th international conference on computer and communication systems (ICCCS), IEEE, 2019, pp. 21–25.
M. D. Devika, C. Sunitha, and A. Ganesh, “Sentiment analysis: a comparative study on different approaches,” Procedia Computer Science, vol. 87, pp. 44–49, 2016.
M. Dorothy and S. Rajini, “The Various Approaches for Sentiment Analysis: A Survey,” vol, vol. 5, pp. 2014–2016, 2016.
M. Avinash and E. Sivasankar, “A Study of Feature Extraction Techniques for Sentiment Analysis,” in Emerging Technologies in Data Mining and Information Security, vol. 814, A. Abraham, P. Dutta, J. K. Mandal, A. Bhattacharya, and S. Dutta, Eds., in Advances in Intelligent Systems and Computing, vol. 814, Singapore: Springer Singapore, 2019, pp. 475–486.
M. Kabir, M. Md. J. Kabir, S. Xu, and B. Badhon, “An empirical research on sentiment analysis using machine learning approaches,” International Journal of Computers and Applications, vol. 43, no. 10, pp. 1011–1019, Nov. 2021.
P. P. Surya and B. Subbulakshmi, “Sentimental analysis using Naive Bayes classifier,” in 2019 International conference on vision towards emerging trends in communication and networking (ViTECoN), IEEE, 2019, pp. 1–5.
P. B. Pajila, B. G. Sheena, A. Gayathri, J. Aswini, and M. Nalini, “A comprehensive survey on naive bayes algorithm: Advantages, limitations and applications,” in 2023 4th International Conference on Smart Electronics and Communication (ICOSEC), IEEE, 2023, pp. 1228–1234. Accessed: Mar. 08, 2025.
R. Swathi, A. Sri, and P. Roshini, “Sentiment Classification of Movie Reviews with Logistic Regression,” in 2024 International Conference on Sustainable Communication Networks and Application (ICSCNA), IEEE, 2024, pp. 1534–1538.
D. N. Devi, C. K. Kumar, and S. Prasad, “A feature based approach for sentiment analysis by using support vector machine,” in 2016 IEEE 6th international conference on advanced computing (IACC), IEEE, 2016, pp. 3–8.