Using text mining algorithm to detect gender deception based on Malaysian chat room lingo / Dianne L. M. Cheong and Nur Atiqah Sia Abdullah @ Sia Sze Yieng

Cheong, Dianne L.M. and Sia Abdullah, Nur Atiqah (2006) Using text mining algorithm to detect gender deception based on Malaysian chat room lingo / Dianne L. M. Cheong and Nur Atiqah Sia Abdullah @ Sia Sze Yieng. Social and Management Research Journal (SMRJ), 3 (1). pp. 11-23. ISSN 1675-7017

Abstract

E-mail can be a fantasy playground for identity experimentations where players take on an imaginary persona and interact with each other in the virtual world. Therefore. gender deception is difficult. risky and it can be abandoned at will. Inference can be made both from writing style and from clues hidden in the posting data. A text-mining algorithm was designed to detect gender deception based on gender-preferential features at the word or clause level of Malaysian e-mail users. Based on this algorithm. a prototype in Visual Basic is developed It was tested with /6 documents; each consists of 5 e-mails
exchanges of respective individuals. The tests shown the prototype have 8/.3% of accuracy level. This is consistent with a human reader of the documents. This prototype can be a tool to assist interested parties such as the Criminology
and Forensic Department. e-mail users and virtual communities to successfully identify gender deception.

Metadata

Item Type: Article
Creators:
Creators
Email / ID Num.
Cheong, Dianne L.M.
dianne@tmsk.uitm.edu.my
Sia Abdullah, Nur Atiqah
atiqah@tmsk.uitm.edu.my
Subjects: Q Science > QA Mathematics > Instruments and machines > Electronic Computers. Computer Science > Data mining
T Technology > TK Electrical engineering. Electronics. Nuclear engineering > Telecommunication > Electronic mail systems. General works. Text messages (Cell phone systems)
Divisions:
Journal or Publication Title: Social and Management Research Journal (SMRJ)
UiTM Journal Collections: UiTM Journal > Social and Management Research Journal (SMRJ)
ISSN: 1675-7017
Volume: 3
Number: 1
Page Range: pp. 11-23
Keywords: Gender detection, gender of e-mail author, text-mining algorithm, program to detect gender, gender deception.
Date: 2006
URI: https://ir.uitm.edu.my/id/eprint/12967
Edit Item
Edit Item

Download

[thumbnail of AJ_DIANNE L.M. CHEONG SMRJ 06 1.pdf] Text
AJ_DIANNE L.M. CHEONG SMRJ 06 1.pdf

Download (1MB)

ID Number

12967

Indexing

Statistic

Statistic details