Abstract
Hadith is the second source of reference for Muslims; it records the sayings and acts of the Prophet concerning a particular matter or action. The emergence of information technology has offered many applications and systems for accessing the content of the hadith via indexing and information retrieval (IR). Nonetheless, searching by index is inconvenient because hadith texts are themselves disordered and scattered. Meanwhile, existing IR systems support only simple, single-keyword searching, which does not highlight the relevance of the retrieved results. In addition, a large number of search results can cause cognitive strain. This project uses the Agile methodology, which has five phases: requirement analysis, design, development, testing and maintenance. To address the problems above, a progressive web-based information retrieval visualization system that uses the Term Frequency-Inverse Document Frequency (TF-IDF) algorithm and data visualization is proposed. First, the hadith data and the user query are pre-processed. Next, the TF-IDF algorithm is applied to the processed data to find the hadith relevant to the user’s query. The results of this stage are then used to generate visualizations with the D3.js library. A word cloud chart is used to display the important words in a text: the more often a word appears in a given text, the more important it is taken to be. For future work, sentence-based searching and a stemming algorithm will be considered to produce more meaningful results.
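The retrieval steps described in the abstract (pre-processing, TF-IDF scoring against a query, and word-frequency weights for a word cloud) can be illustrated with a minimal Python sketch. This is not the thesis implementation: the stopword list, the function names `preprocess`, `tfidf_rank` and `word_cloud_weights`, the unsmoothed `log(N / df)` weighting, and the placeholder documents are all assumptions made for illustration.

```python
import math
import re
from collections import Counter

# Hypothetical, tiny stopword list used only for this illustration.
STOPWORDS = {"the", "a", "an", "of", "in", "is", "and", "to"}


def preprocess(text):
    """Lowercase the text, keep alphabetic tokens, and drop stopwords."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return [t for t in tokens if t not in STOPWORDS]


def tfidf_rank(documents, query):
    """Return (document index, score) pairs, highest TF-IDF score first.

    Assumed weighting: tf = count / document length, idf = log(N / df).
    Documents that score zero are omitted from the result.
    """
    docs = [preprocess(d) for d in documents]
    n_docs = len(docs)

    # Document frequency: in how many documents each term occurs.
    df = Counter()
    for tokens in docs:
        df.update(set(tokens))

    query_terms = set(preprocess(query))
    scores = []
    for idx, tokens in enumerate(docs):
        counts = Counter(tokens)
        score = 0.0
        for term in query_terms:
            if counts[term] == 0:
                continue  # term absent from this document
            tf = counts[term] / len(tokens)
            idf = math.log(n_docs / df[term])
            score += tf * idf
        if score > 0:
            scores.append((idx, score))
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


def word_cloud_weights(documents):
    """Count word occurrences; a word cloud would size words by these counts."""
    counts = Counter()
    for d in documents:
        counts.update(preprocess(d))
    return counts


# Placeholder texts for illustration only, not actual hadith content.
sample_docs = [
    "charity given in secret",
    "fasting and charity in the month",
    "travel and prayer",
]
ranking = tfidf_rank(sample_docs, "charity fasting")
weights = word_cloud_weights(sample_docs)
```

In this sketch a term that appears in every document receives an IDF of zero and so contributes nothing to the score; a smoothed IDF variant would be a reasonable alternative. The counts returned by `word_cloud_weights` are the kind of word-to-weight mapping that a word cloud layout, such as one rendered with D3.js, could take as input.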
Metadata
Item Type: | Thesis (Degree) |
---|---|
Creators: | Norhisam, Nor Faezahtul Salme (ID Num. 2017412282) |
Contributors: | Thesis advisor: Abu Samah, Khyrina Airin Fariza (Email / ID Num.: UNSPECIFIED) |
Subjects: | Q Science > QA Mathematics > Instruments and machines > Electronic Computers. Computer Science; Q Science > QA Mathematics > Instruments and machines > Electronic Computers. Computer Science > Algorithms |
Divisions: | Universiti Teknologi MARA, Melaka > Jasin Campus > Faculty of Computer and Mathematical Sciences |
Programme: | Bachelor of Computer Science (Hons) (CS230) |
Keywords: | Term Frequency-Inverse Document Frequency (TFIDF); Algorithm; Data visualization; Hadith scriptures |
Date: | 2020 |
URI: | https://ir.uitm.edu.my/id/eprint/35520 |