Cross language information retrieval (Malay-Arabic) for hadith document using stemming and exact matching technique / Farhana Hasan

Hasan, Farhana (2010) Cross language information retrieval (Malay-Arabic) for hadith document using stemming and exact matching technique / Farhana Hasan. Degree thesis, Universiti Teknologi MARA (UiTM).

Abstract

Classical Information Retrieval (IR) is the sifting out of the documents most relevant to a user's information requirement expressed as a "query", from a large electronic store of documents. A search engine performs IR by retrieving relevant web pages from the internet. Cross Language Information Retrieval (CLIR) allows the user to state their query in one language, and retrieve documents in another. Some CLIR systems use language resources such as bilingual dictionaries to translate the user's original query. Generally, Hadith directory provide facility to search Hadith, but the main problem is translation between Malay to Arabic Hadith document is rarely found and it use Arabic as lingual franca. Thus mean, only people who have master on Arabic or at least have basic Arabic can use that system. As effect from this situation, it will create language barrier for the non-Arabic because only a few people especially Malay people can use this facility. Therefore, Cross Language Information Retrieval (CLIR) is use to overcome this problem. The objectives of this project are to develop a Cross Language Information Retrieval CLIR (Malay-Arabic) search engine for Hadith (Sahih Bukhari & Sahih Muslim) text documents using stemming and exact match and to create a digitized dictionary (Malay-Arabic) with a limited scope. In investigate the retrieval effectiveness by using Recall and Precision formula, there are five experiments are conducted based on the queries on that language (Roslan, 2008).

Metadata

Item Type: Thesis (Degree)
Creators:
Creators
Email / ID Num.
Hasan, Farhana
2008711525
Contributors:
Contribution
Name
Email / ID Num.
Thesis advisor
Abdul Rahman, Nurazzah
UNSPECIFIED
Divisions: Universiti Teknologi MARA, Shah Alam > Faculty of Computer and Mathematical Sciences
Programme: Bachelor of Computer Science (hons.)
Keywords: Cross Language Information Retrieval (CLIR), Hadith, Retrieval Effectiveness
Date: 2010
URI: https://ir.uitm.edu.my/id/eprint/87107
Edit Item
Edit Item

Download

[thumbnail of 87107.pdf] Text
87107.pdf

Download (142kB)

Digital Copy

Digital (fulltext) is available at:

Physical Copy

Physical status and holdings:
Item Status:
On Shelf

ID Number

87107

Indexing

Statistic

Statistic details