Abstract
This research delves into the widespread issue of domestic violence, emphasizing its severe impact on individuals and society globally. The surge in domestic violence during the COVID-19 pandemic, as highlighted by UN Women's survey, particularly in countries like Kenya, sets the stage for the research problem. Recognizing the lack of public awareness and understanding of attitudes towards domestic violence, the study proposes using sentiment analysis on Twitter data to monitor real-time public sentiment. The research objectives focus on studying and applying the Naive Bayes algorithm for sentiment analysis on tweets related to domestic violence, aiming to provide insights for researchers, government agencies, policymakers, and the public and develop a prediction model using Naive Bayes algorithm to evaluate its performance. The scope involves using English language tweets collected from March 2021 to November 2023, limiting the data to the topic of domestic violence. Few Naive Bayes classifiers are used to compare the accuracy of the Naive Bayes algorithm and parameter tuning also done on the classifiers. Resampling is used to handle the imbalance dataset. This research also compares using VADER and SentiWordNet lexicon to compare which has the best accuracy. The evaluation of algorithms consists of comparing the accuracy, specificity, and other evaluation metrics. Based on the results, Bernoulli classifier has the best accuracy of 94% while Multinomial has an accuracy of 93%. The best ratio of data to be used are 80:20 with VADER lexicon approach.
Metadata
Item Type: | Thesis (Degree) |
---|---|
Creators: | Creators Email / ID Num. Mohd Rahiman, Nurulizzah 2022755587 |
Contributors: | Contribution Name Email / ID Num. Thesis advisor Mohamed Yusoff, Syarifah Adilah UNSPECIFIED |
Subjects: | Q Science > QA Mathematics > Instruments and machines > Electronic Computers. Computer Science > Algorithms |
Divisions: | Universiti Teknologi MARA, Terengganu > Kuala Terengganu Campus |
Programme: | Bachelor of Computer Science (Hons) |
Keywords: | Domestic Violence, Sentiment Analysis, Prediction Model, Naive Bayes Algorithm, Multinomial Classifier, Bernoulli Classifier, Accuracy, Specificity, Evaluation Metrics, Ratio, Lexicon Approach, VADER, Sentiwordnet |
Date: | 2024 |
URI: | https://ir.uitm.edu.my/id/eprint/96468 |
Download
96468.pdf
Download (92kB)