Sentiment analysis of domestic violence prediction using Naive Bayes algorithm / Nurulizzah Mohd Rahiman

Mohd Rahiman, Nurulizzah (2024) Sentiment analysis of domestic violence prediction using Naive Bayes algorithm / Nurulizzah Mohd Rahiman. Degree thesis, Universiti Teknologi MARA, Terengganu.

Abstract

This research delves into the widespread issue of domestic violence, emphasizing its severe impact on individuals and society globally. The surge in domestic violence during the COVID-19 pandemic, as highlighted by UN Women's survey, particularly in countries like Kenya, sets the stage for the research problem. Recognizing the lack of public awareness and understanding of attitudes towards domestic violence, the study proposes using sentiment analysis on Twitter data to monitor real-time public sentiment. The research objectives focus on studying and applying the Naive Bayes algorithm for sentiment analysis on tweets related to domestic violence, aiming to provide insights for researchers, government agencies, policymakers, and the public and develop a prediction model using Naive Bayes algorithm to evaluate its performance. The scope involves using English language tweets collected from March 2021 to November 2023, limiting the data to the topic of domestic violence. Few Naive Bayes classifiers are used to compare the accuracy of the Naive Bayes algorithm and parameter tuning also done on the classifiers. Resampling is used to handle the imbalance dataset. This research also compares using VADER and SentiWordNet lexicon to compare which has the best accuracy. The evaluation of algorithms consists of comparing the accuracy, specificity, and other evaluation metrics. Based on the results, Bernoulli classifier has the best accuracy of 94% while Multinomial has an accuracy of 93%. The best ratio of data to be used are 80:20 with VADER lexicon approach.

Metadata

Item Type: Thesis (Degree)
Creators:
Creators
Email / ID Num.
Mohd Rahiman, Nurulizzah
2022755587
Contributors:
Contribution
Name
Email / ID Num.
Thesis advisor
Mohamed Yusoff, Syarifah Adilah
UNSPECIFIED
Subjects: Q Science > QA Mathematics > Instruments and machines > Electronic Computers. Computer Science > Algorithms
Divisions: Universiti Teknologi MARA, Terengganu > Kuala Terengganu Campus
Programme: Bachelor of Computer Science (Hons)
Keywords: Domestic Violence, Sentiment Analysis, Prediction Model, Naive Bayes Algorithm, Multinomial Classifier, Bernoulli Classifier, Accuracy, Specificity, Evaluation Metrics, Ratio, Lexicon Approach, VADER, Sentiwordnet
Date: 2024
URI: https://ir.uitm.edu.my/id/eprint/96468
Edit Item
Edit Item

Download

[thumbnail of 96468.pdf] Text
96468.pdf

Download (92kB)

Digital Copy

Digital (fulltext) is available at:

Physical Copy

Physical status and holdings:
Item Status:

ID Number

96468

Indexing

Statistic

Statistic details