Sentiment classification for malay newspaper using clonal selection algorithm / Nur Fitri Nabila Mohamad Nasir

Mohamad Nasir, Nur Fitri Nabila (2013) Sentiment classification for malay newspaper using clonal selection algorithm / Nur Fitri Nabila Mohamad Nasir. Degree thesis, Universiti Teknologi MARA Terengganu.

Abstract

Sentiment classification is technique to analyze the subjective information in the text then mine the opinion. Mostly people are using blog or twitter to collect the sentiment data but not frequently used newspaper because not so many researchers are using newspaper to classify sentiment data as the main source. In this study, sentiment classifier using clonal algorithm selection was developed to categorize sentiment in Malay newspaper (Berita Harian). Another objective was to evaluate the proposed model effectiveness in classifying Malay newspaper’s data. In our method, the training of clonal selection algorithm (CSA) is first used to teach algorithm which is intelligent to categorize the sentiment in newspaper’s sentences into the polarity (positive, negative and neutraljfrom the data are collected and the testing was implemented after did the training to test whether a word should be taught correctly or not. Firstly, the data was dividing by ratio 80:20 from 1000 sentences. Therefore, 80% from 1000 sentences will use for training and 20% from 1000 sentences use for testing. Secondly, the data was dividing by ratio 70:30 which are 700 newspaper’s sentences as the training data and 300 newspaper’s sentences as the testing data. The experimental results show that our method can achieve better performance in clonal selection algorithm sentiment classification and the data collected cannot be used at once in this model because training data is very time-consuming if using all the data. The experiment achieves the best accuracy at 89.0%for ratio 70:30.This model was built with capability to help user in classifying newspaper sentence in easy way.

Metadata

Item Type: Thesis (Degree)
Creators:
Creators
Email / ID Num.
Mohamad Nasir, Nur Fitri Nabila
2010270306
Contributors:
Contribution
Name
Email / ID Num.
Thesis advisor
Isa, Norulhidayah
UNSPECIFIED
Thesis advisor
Jantan, Hamidah
UNSPECIFIED
Subjects: Q Science > QA Mathematics > Analysis
Q Science > QA Mathematics > Instruments and machines > Electronic Computers. Computer Science
Q Science > QA Mathematics > Instruments and machines > Electronic Computers. Computer Science > Algorithms
Divisions: Universiti Teknologi MARA, Terengganu > Kuala Terengganu Campus > Faculty of Computer and Mathematical Sciences
Programme: Bachelor of Computer Science (Hons)
Keywords: Clonal Selection Algorithm ; Newspaper Data ; Sentiment Classification
Date: January 2013
URI: https://ir.uitm.edu.my/id/eprint/35333
Edit Item
Edit Item

Download

[thumbnail of 35333.pdf] Text
35333.pdf

Download (141kB)

Digital Copy

Digital (fulltext) is available at:

Physical Copy

Physical status and holdings:
Item Status:

ID Number

35333

Indexing

Statistic

Statistic details