Abstract
Information is growing rapidly; anyone is able to get the information easily without any restriction especially using the World Wide Web. However, cause of too many information, sometimes readers cannot get the important value of that information. Therefore, it will leads to wrong information and waste of time on reading. The research proposes an algorithm that will automatically extract the Malay documents to improve access to information. Content words extraction techniques is explored and used as possible content and value for the text document. In the process of development the prototype, Bigram technique is introduce to assists on searching the related word of content word. As a result, the prototype will display all related sentences with content words.
Metadata
Item Type: | Thesis (Degree) |
---|---|
Creators: | Creators Email / ID Num. Samshudin, Nurfarahidayu 2009407068 |
Contributors: | Contribution Name Email / ID Num. Thesis advisor Md Hanum, Haslizatul Fairuz UNSPECIFIED |
Subjects: | Q Science > QA Mathematics > Analysis |
Divisions: | Universiti Teknologi MARA, Shah Alam > Faculty of Computer and Mathematical Sciences |
Programme: | Bachelor of Science |
Keywords: | World Wide Web, algorithm, prototype |
Date: | 2011 |
URI: | https://ir.uitm.edu.my/id/eprint/98082 |
Download
98082.pdf
Download (124kB)