Abstract
A lexicon is the source of specific knowledge about individual words in the language which also known as the heart of language processing system. This research will focus on construction of a computational lexicon model for Malay Language that involved computational study of the form and behaviour of words. This research also includes study on morphological arrangement of Malay affixation process which includes prefixes, suffixes, circumfixes and infixes with the intention of constructing a collection of new Malay lexicons that will be automatically constructed from a single root word. This research conducted experiments on 2101 root words found in the Malay translated Quranic documents. The words then experimented with Malay affixation rules using the affixed words analyser. Numerous new words are constructed from a single root word with the word classes using the affixed words analyser by adding 52 affixes rules which consists of 20 prefixes, 3 suffixes, 25 circumfixes, and 4 infixes to the root word. Nevertheless, proper names need to be extracted from the list and this is done by recognising and removing particular name entities. Finally, each new word is then compared with current Malay dictionary to ensure whether the word generated is currently being used or it is a new generated new word. Results from this analysis open opportunity to construct new Malay word variant to enrich the Malay lexicon and may help to support more efficient method for any related Malay language computer linguistic analysis particularly any research on Malay Quranic translation documents.
Metadata
Item Type: | Thesis (Masters) |
---|---|
Creators: | Creators Email / ID Num. Hasmy, Harshida UNSPECIFIED |
Divisions: | Universiti Teknologi MARA, Shah Alam > Faculty of Computer and Mathematical Sciences |
Keywords: | Malay language lexicons; Morphological affixed rules |
Date: | 2016 |
URI: | https://ir.uitm.edu.my/id/eprint/17764 |
Download
TM_HARSHIDA HASMY CS 16_5.pdf
Download (3MB)