Abstract
On the surface, the introduction of PADU may be met with varying degrees of acceptance among Malaysians, but gauging the actual sentiment without bias is difficult. Sentiment analysis of a given topic, in this study PADU, is a complex task that involves scraping datasets and classifying them with high accuracy; doing this manually would inevitably introduce some bias into the results. This project addresses the problem by developing a sentiment analysis model and visualising the data and results appropriately. The dataset was scraped from X using Tweet Harvest and consists of 88 data points, which were further augmented to 440 data points. The model is built on Bidirectional Encoder Representations from Transformers (BERT) and trained on the gathered dataset. Development followed the waterfall software development methodology, and the model is released on a web platform. The model trained on the combination of collected and augmented data achieved 87% accuracy, 87% precision, 87% recall, and an F1-score of 87% when compared with the model trained using only the collected dataset. Future improvements to this project include broader language support for the model and the collection of data from a wider variety of social media platforms.
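For readers unfamiliar with the four metrics reported above, the following is a minimal sketch of how accuracy, precision, recall, and F1-score can be computed for a multi-class sentiment classifier. The abstract does not state the sentiment classes or the averaging scheme used, so the three labels, the sample predictions, and the choice of macro averaging are all assumptions made for illustration; the function name `classification_report` is likewise hypothetical.

```python
from collections import Counter


def classification_report(y_true, y_pred):
    """Return accuracy and macro-averaged precision, recall, and F1.

    Macro averaging is an assumption; the study may have used another scheme.
    """
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1   # correct prediction for this class
        else:
            fp[pred] += 1    # predicted class was wrong
            fn[truth] += 1   # true class was missed

    def safe_div(num, den):
        return num / den if den else 0.0

    precisions = [safe_div(tp[c], tp[c] + fp[c]) for c in labels]
    recalls = [safe_div(tp[c], tp[c] + fn[c]) for c in labels]
    f1s = [safe_div(2 * p * r, p + r) for p, r in zip(precisions, recalls)]
    n = len(labels)
    return {
        "accuracy": safe_div(sum(tp.values()), len(y_true)),
        "precision": sum(precisions) / n,
        "recall": sum(recalls) / n,
        "f1": sum(f1s) / n,
    }


# Hypothetical labels for illustration only; not data from the study.
y_true = ["positive", "negative", "neutral", "positive", "negative", "neutral"]
y_pred = ["positive", "negative", "positive", "positive", "neutral", "neutral"]
report = classification_report(y_true, y_pred)
```

Under macro averaging, each class contributes equally to the final score regardless of how many examples it has, which matters when a small dataset is unevenly split across sentiment classes.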
Metadata
| Item Type: | Article |
|---|---|
| Creators (Email / ID Num.): | Mohd Hosni, Ahmad Ishraf Imran (2023185765@student.uimt.edu.my); Jasmis, Jamaluddin (jamaluddinjasmis@uitm.edu.my) |
| Subjects: | Q Science > Q Science (General) > Machine learning; Q Science > Q Science (General) > Back propagation (Artificial intelligence); Q Science > QA Mathematics > Instruments and machines > Electronic Computers. Computer Science > Data mining |
| Divisions: | Universiti Teknologi MARA, Melaka > Jasin Campus > Faculty of Computer and Mathematical Sciences |
| Journal or Publication Title: | Progress in Computer and Mathematics Journal (PCMJ) |
| ISSN: | 3030-6728 |
| Volume: | 3 |
| Page Range: | pp. 109-118 |
| Keywords: | PADU, Sentiment analysis, BERT, Data augmentation |
| Date: | November 2025 |
| URI: | https://ir.uitm.edu.my/id/eprint/127573 |
