Evaluating machine learning algorithms for sentiment analysis: a comparative study to support data-driven decision making

Mohamad Daud, Nur Hafiza and Shafii, Nor Hayati and Md Nasir, Diana Sirmayunie and Fauzi, Nur Fatihah (2025) Evaluating machine learning algorithms for sentiment analysis: a comparative study to support data-driven decision making. Jurnal Intelek, 20 (2): 32. pp. 374-385. ISSN 2231-7716

Official URL: https://journal.uitm.edu.my/ojs/index.php/JI

Abstract

This research investigates the accuracy and robustness of sentiment analysis models through a comparative analysis of three distinct machine learning algorithms: Bernoulli Naive Bayes, Linear Support Vector Machines, and Logistic Regression. The primary objective is to assess the performance of these models across various domains and datasets in sentiment analysis tasks. The study employs data from the IMDb 500k movie reviews dataset, utilizing machine learning techniques for sentiment classification. Specifically, the selected algorithms—Bernoulli Naive Bayes, Linear Support Vector Machines, and Logistic Regression—are employed to train the dataset. Upon evaluating the models, the findings reveal notable differences in accuracy. Both LinearSVM and Bernoulli Naive Bayes achieved the highest accuracy, with each recording 89% when rounded to the nearest hundredth. However, LinearSVM slightly outperforms Bernoulli Naive Bayes in other performance metrics. In contrast, Logistic Regression records the lowest accuracy among the three algorithms. These results highlight the significance of algorithm choice in sentiment analysis tasks, with LinearSVM and Bernoulli Naive Bayes outperforming Logistic Regression. The research contributes valuable insights into the comparative performance of these algorithms, providing guidance for practitioners and researchers in choosing effective models for sentiment analysis across diverse datasets and domains.

Metadata

Item Type: Article
Creators:
Creators
Email / ID Num.
Mohamad Daud, Nur Hafiza
UNSPECIFIED
Shafii, Nor Hayati
UNSPECIFIED
Md Nasir, Diana Sirmayunie
UNSPECIFIED
Fauzi, Nur Fatihah
UNSPECIFIED
Subjects: Q Science > QA Mathematics > Instruments and machines > Electronic Computers. Computer Science > Algorithms
Divisions: Universiti Teknologi MARA, Perlis > Arau Campus
Journal or Publication Title: Jurnal Intelek
UiTM Journal Collections: UiTM Journals > Jurnal Intelek (JI)
ISSN: 2231-7716
Volume: 20
Number: 2
Page Range: pp. 374-385
Keywords: accuracy, Bernoulli Naïve Bayes, machine learning, sentiment analysis, Support Vector Machine
Date: August 2025
URI: https://ir.uitm.edu.my/id/eprint/126930
Edit Item
Edit Item

Download

[thumbnail of 126930.pdf] Text
126930.pdf

Download (622kB)

ID Number

126930

Indexing

Statistic

Statistic details