Sentiment Analysis on IMDB Review Dataset

Authors

  • Shubham Kumar Singh, Department of Computer Science and Engineering, School of Engineering, The NorthCap University, Gurugram, Haryana, India 122017
  • Neetu Singla, Department of Computer Science and Engineering, School of Engineering, The NorthCap University, Gurugram, Haryana, India 122017

DOI:

https://doi.org/10.57159/gadl.jcmm.2.6.230108

Keywords:

Sentiment Analysis, IMDB Review Dataset, Machine Learning Models, Data Preprocessing, Model Performance Evaluation

Abstract

Sentiment analysis is a computational method for determining the emotional tone or attitude expressed in a text document, such as a review, tweet, or news story. Using machine learning models, deep neural network models, and natural language processing, the method examines the text to determine whether it expresses positive or negative sentiment. In this study, Naive Bayes, Logistic Regression, LSVM, Decision Tree, LSTM, and BiLSTM models are used to conduct a sentiment analysis (SA) study on the IMDB dataset. The goal of the investigation is to evaluate how well these models classify movie reviews as positive or negative. The study also investigates the effects of data pre-processing methods and hyperparameter tuning on the models' accuracy. The final results show that the BiLSTM model outperforms the other models in terms of recall, precision, and accuracy, followed by the LSTM, Logistic Regression, LSVM, Decision Tree, and Naive Bayes models. The research highlights the potential of deep learning models, BiLSTM in particular, for sentiment analysis tasks, as well as the importance of hyperparameter tuning and pre-processing methods in achieving high accuracy.
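
As an illustration of the kind of pipeline the abstract describes, the following is a minimal sketch of a binary sentiment classifier: text pre-processing, a multinomial Naive Bayes model with add-one smoothing, and the accuracy, precision, and recall metrics. It is not the authors' implementation. The function names, the specific pre-processing steps (lowercasing, HTML tag removal, alphabetic tokenization), and the four-review toy corpus are all assumptions made for illustration; the study itself uses the IMDB dataset.

```python
import math
import re
from collections import Counter


def preprocess(text):
    """Lowercase, strip HTML tags, and keep alphabetic tokens (assumed steps)."""
    text = re.sub(r"<[^>]+>", " ", text.lower())
    return re.findall(r"[a-z']+", text)


def train_naive_bayes(docs, labels):
    """Count words per class (0 = negative, 1 = positive).

    Assumes both classes appear at least once in the training labels.
    """
    word_counts = {0: Counter(), 1: Counter()}
    class_counts = Counter(labels)
    for doc, label in zip(docs, labels):
        word_counts[label].update(preprocess(doc))
    vocab = set(word_counts[0]) | set(word_counts[1])
    return word_counts, class_counts, vocab


def predict(model, doc):
    """Return the class with the highest log-posterior under add-one smoothing."""
    word_counts, class_counts, vocab = model
    total_docs = sum(class_counts.values())
    scores = {}
    for label in (0, 1):
        total_words = sum(word_counts[label].values())
        score = math.log(class_counts[label] / total_docs)
        for token in preprocess(doc):
            if token in vocab:  # tokens unseen in training are ignored
                count = word_counts[label][token]
                score += math.log((count + 1) / (total_words + len(vocab)))
        scores[label] = score
    return max(scores, key=scores.get)


def evaluate(y_true, y_pred):
    """Accuracy, precision, and recall, treating label 1 as the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == 1 and p == 1 for t, p in pairs)
    fp = sum(t == 0 and p == 1 for t, p in pairs)
    fn = sum(t == 1 and p == 0 for t, p in pairs)
    return {
        "accuracy": sum(t == p for t, p in pairs) / len(pairs),
        "precision": tp / (tp + fp) if tp + fp else 0.0,
        "recall": tp / (tp + fn) if tp + fn else 0.0,
    }


# Toy corpus (hypothetical reviews, not taken from the IMDB dataset).
train_docs = [
    "A wonderful, moving film<br />with great acting",
    "Great story and wonderful cast",
    "Terrible plot and awful acting",
    "Boring, awful, a waste of time",
]
train_labels = [1, 1, 0, 0]
test_docs = ["wonderful acting and great story", "awful boring plot"]
test_labels = [1, 0]

model = train_naive_bayes(train_docs, train_labels)
predictions = [predict(model, doc) for doc in test_docs]
metrics = evaluate(test_labels, predictions)
print(predictions, metrics)
```

The same `evaluate` function could be applied to the predictions of any of the other classifiers, which is what makes a like-for-like comparison across models possible.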

Published

31-12-2023

How to Cite

Singh, S. K., & Singla, N. (2023). Sentiment Analysis on IMDB Review Dataset. Journal of Computers, Mechanical and Management, 2(6), 18–29. https://doi.org/10.57159/gadl.jcmm.2.6.230108

Section

Original Articles

Received 2023-11-14
Accepted 2023-12-04
Published 2023-12-31