Sentiment analysis regarding childcare issues using Naive Bayes Algorithm / Alis Farhana Zulkipeli

Zulkipeli, Alis Farhana (2025) Sentiment analysis regarding childcare issues using Naive Bayes Algorithm / Alis Farhana Zulkipeli. Degree thesis, Universiti Teknologi MARA, Terengganu.

Abstract

This study applies the Naive Bayes algorithm for sentiment analysis to assess public perceptions of childcare issues, particularly child abandonment and accidents. With the growing volume of childcare-related discussions on social media, efficient sentiment analysis tools are essential for extracting insights. However, the lack of comprehensive methodologies poses challenges for policymakers, childcare providers, and researchers in understanding public concerns and developing effective interventions. To address this, a dataset of 1,079 tweets from X (formerly Twitter) is analyzed. The data undergoes preprocessing steps such as stop-word and emoji removal, tokenization, and feature extraction using Term Frequency-Inverse Document Frequency (TF-IDF). VADER is used for initial sentiment labeling, and the Naive Bayes classifier categorizes sentiments into positive and negative classes. The motivation behind this project is to leverage sentiment analysis for enhancing childcare policies and public awareness. The project aims to enhance childcare policies and public awareness by leveraging sentiment analysis to bridge the gap between public sentiment and policy decisions. The Naive Bayes model achieves 87% accuracy with high precision, recall, and F1 scores using 10-fold crossvalidation, demonstrating its effectiveness in classifying social media sentiments. Future research could explore advanced techniques like Bidirectional Encoder Representations from Transformers (BERT) or Recurrent Neural Networks (RNNs) to improve classification accuracy and contextual understanding. Expanding the dataset to include multilingual content and incorporating topic modeling techniques would further enhance sentiment analysis in childcare-related discourse.

Metadata

Item Type: Thesis (Degree)
Creators:
Creators
Email / ID Num.
Zulkipeli, Alis Farhana
2023165061
Contributors:
Contribution
Name
Email / ID Num.
Thesis advisor
Mohamed@Omar, Hasiah
UNSPECIFIED
Subjects: Q Science > QA Mathematics > Mathematical statistics. Probabilities > Decision theory > Bayesian statistics
Divisions: Universiti Teknologi MARA, Terengganu > Kuala Terengganu Campus > Faculty of Computer and Mathematical Sciences
Programme: Bachelor of Computer Science (Hons)
Keywords: Naive Bayes Algorithm, Term Frequency-Inverse Document Frequency
Date: 2025
URI: https://ir.uitm.edu.my/id/eprint/114924
Edit Item
Edit Item

Download

[thumbnail of 114924.pdf] Text
114924.pdf

Download (95kB)

Digital Copy

Digital (fulltext) is available at:

Physical Copy

Physical status and holdings:
Item Status:

ID Number

114924

Indexing

Statistic

Statistic details