Abstract
Compound word is defined as combination two or more words and it will produce a new meaning. Generally, compound word is existed in many languages such as English, Mandarin, Arabic and others. Although, there are discussion of existing methods to detect compound word yet some limitations on detecting Malay compound word. Thus, this study is done to improve accuracy towards adjective compound words. Training data is used in this study was Malay story books. Digitization data of Malay story book is used in this study. Then, the pre-processing method involved tokenization, stemming, bi-gram and part-of-speech (POS) tagging has been applied to produce the candidate compound word. Applying the enhanced syntactic rules shown the precision result is 70.3% through this study. Thus, this study will contribute to the academic research in improvise the issues on searching and document summarization application.
Metadata
Item Type: | Article |
---|---|
Creators: | Creators Email / ID Num. Abu Bakar, Zamri UNSPECIFIED Kamal Ismail, Normaly UNSPECIFIED Anuar, Nurhilyana UNSPECIFIED Idris, Aminatul Solehah UNSPECIFIED |
Subjects: | P Language and Literature > P Philology. Linguistics > Language. Linguistic theory. Comparative grammar > Style. Composition. Rhetoric Z Bibliography. Library Science. Information Resources > Books (General). Writing. Paleography |
Divisions: | Universiti Teknologi MARA, Selangor |
Journal or Publication Title: | Journal of Computing Research and Innovation (JCRINN) |
UiTM Journal Collections: | UiTM Journal > Journal of Computing Research and Innovation (JCRINN) |
ISSN: | 2600-8793 |
Volume: | 6 |
Number: | 2 |
Page Range: | pp. 63-83 |
Keywords: | Compound word, Malay Language, Syntactic rules, Language |
Date: | 2021 |
URI: | https://ir.uitm.edu.my/id/eprint/60190 |