Document clustering: comparison of ward's clustering and Kohonen network performance / Noor Suriana Abu Bakar and Nurul Nisa Mohd Nasir

Abu Bakar, Noor Suriana and Mohd Nasir, Nurul Nisa (2010) Document clustering: comparison of ward's clustering and Kohonen network performance / Noor Suriana Abu Bakar and Nurul Nisa Mohd Nasir. [Research Reports] (Unpublished)

Abstract

Document clustering has been investigated for use in a number of different areas of information retrieval. The aimed of research in the field is to improve efficiency and effectiveness of retrieval. Since the clusters perform best quality, Hierarchical clustering is most commonly used in document clustering. Recently, there exist researches that apply NN in IR. However, research in NN based document clustering still less frequent. Therefore, this study will apply hierarchical based document clustering and NN based document clustering in terms of suggestion supervisor and examiner for thesis. The results from these two techniques will then compare with manual system to find out whether hierarchical based or NN based performed better. The collection of theses will be used and employed the pre-processing including stop word removal and stemming further measure the document similarity before apply the clustering techniques. The result will give some insight whether NN is better for suggestion supervisor and examiner.

Metadata

Item Type: Research Reports
Creators:
Creators
Email / ID Num.
Abu Bakar, Noor Suriana
UNSPECIFIED
Mohd Nasir, Nurul Nisa
UNSPECIFIED
Subjects: Z Bibliography. Library Science. Information Resources > Library Science. Information Science > Information storage and retrieval systems > Information filtering systems
Z Bibliography. Library Science. Information Resources > Information in specific formats or media > Electronic information resources
Divisions: Universiti Teknologi MARA, Melaka > Alor Gajah Campus
Keywords: Document clustering; Information retrieval; Network performance
Date: 2010
URI: https://ir.uitm.edu.my/id/eprint/42623
Edit Item
Edit Item

Download

[thumbnail of 42623.pdf] Text
42623.pdf

Download (2MB)

Digital Copy

Digital (fulltext) is available at:

Physical Copy

Physical status and holdings:
Item Status:

ID Number

42623

Indexing

Statistic

Statistic details