Detection of adjective compound word in Malay language using enhanced syntactic rules / Zamri Abu Bakar ...[et al.]

Abu Bakar, Zamri and Kamal Ismail, Normaly and Anuar, Nurhilyana and Idris, Aminatul Solehah (2021) Detection of adjective compound word in Malay language using enhanced syntactic rules / Zamri Abu Bakar ...[et al.]. Journal of Computing Research and Innovation (JCRINN), 6 (2): 8. pp. 63-83. ISSN 2600-8793

Abstract

A compound word is a combination of two or more words that produces a new meaning. Compound words exist in many languages, such as English, Mandarin, and Arabic. Although existing methods for detecting compound words have been discussed, they have limitations in detecting Malay compound words. This study therefore aims to improve the accuracy of detecting adjective compound words. The training data were digitized Malay story books. Pre-processing, consisting of tokenization, stemming, bi-gram generation, and part-of-speech (POS) tagging, was applied to produce candidate compound words. Applying the enhanced syntactic rules yielded a precision of 70.3%. This study contributes to academic research on improving search and document summarization applications.
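
The pipeline named in the abstract (tokenization, bi-gram generation, POS tagging, rule matching, precision) can be illustrated with a minimal sketch. This is not the authors' implementation: the toy lexicon `TOY_POS`, the tag names, the patterns in `CANDIDATE_PATTERNS`, and the gold set are all hypothetical, chosen only for illustration, and the stemming step is omitted for brevity. The record does not list the paper's actual enhanced syntactic rules.

```python
import re

# Hypothetical toy POS lexicon (assumption); the paper's tagger and tag set
# are not described in this record.
TOY_POS = {
    "dia": "PRON",
    "sangat": "ADV",
    "panjang": "ADJ",
    "tangan": "NOUN",
    "dan": "CONJ",
    "keras": "ADJ",
    "kepala": "NOUN",
}

# Hypothetical tag patterns treated as adjective-compound candidates
# (assumption; not the paper's enhanced syntactic rules).
CANDIDATE_PATTERNS = {("ADJ", "NOUN"), ("ADJ", "ADJ")}


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def bigrams(items):
    """Return every pair of adjacent items."""
    return list(zip(items, items[1:]))


def find_candidates(text):
    """Tag each token, then keep bi-grams whose tag pair matches a pattern."""
    tagged = [(tok, TOY_POS.get(tok, "UNK")) for tok in tokenize(text)]
    return [
        (w1, w2)
        for (w1, t1), (w2, t2) in bigrams(tagged)
        if (t1, t2) in CANDIDATE_PATTERNS
    ]


def precision(predicted, gold):
    """Fraction of predicted candidates that appear in the gold set."""
    if not predicted:
        return 0.0
    true_positives = sum(1 for pair in predicted if pair in gold)
    return true_positives / len(predicted)


found = find_candidates("Dia sangat panjang tangan dan keras kepala.")
# Hypothetical gold annotation, used only to show how precision is computed.
gold = {("keras", "kepala")}
score = precision(found, gold)
```

Here precision is the number of correctly detected compounds divided by the number of candidates the rules returned, which is the measure the abstract reports as 70.3%.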

Metadata

Item Type: Article
Creators (Email / ID Num.):
Abu Bakar, Zamri (UNSPECIFIED)
Kamal Ismail, Normaly (UNSPECIFIED)
Anuar, Nurhilyana (UNSPECIFIED)
Idris, Aminatul Solehah (UNSPECIFIED)
Subjects: P Language and Literature > P Philology. Linguistics > Language. Linguistic theory. Comparative grammar > Style. Composition. Rhetoric
Z Bibliography. Library Science. Information Resources > Books (General). Writing. Paleography
Divisions: Universiti Teknologi MARA, Selangor
Journal or Publication Title: Journal of Computing Research and Innovation (JCRINN)
UiTM Journal Collections: UiTM Journal > Journal of Computing Research and Innovation (JCRINN)
ISSN: 2600-8793
Volume: 6
Number: 2
Page Range: pp. 63-83
Keywords: Compound word, Malay Language, Syntactic rules, Language
Date: 2021
URI: https://ir.uitm.edu.my/id/eprint/60190

ID Number

60190
