Integrated the different data from the variety database for querying motif sequence / Mohd Taufik Mishan, Zanariah Idrus and Jasmin Ilyani Ahmad

Mishan, Mohd Taufik and Idrus, Zanariah and Ahmad, Jasmin Ilyani (2014) Integrated the different data from the variety database for querying motif sequence / Mohd Taufik Mishan, Zanariah Idrus and Jasmin Ilyani Ahmad. Research Reports. Research Management Institute, Sungai Petani, Kedah. (Unpublished)

Abstract

In performing protein secondary structure prediction procedures, biologists need to use variety types of sequence data from multiple biological repositories which are available publicly in the Internet. A lot of researches have been done in minimizing the numbers of repositories needed for the prediction procedures. However, due to the size complexity and numbers of repositories used has created a major challenge in integrating all different data into one repository or database. This challenge is known as syntactic heterogeneity problem. The aim of this research is to overcome the problem by transforming all the different data form variety of databases such as Prosite, Blast, Print and PDB into flat file format and other format into relational form using XML and asp dot net. From studies that have been conducted, XML approach is considered as a better choice for biological data integration. And this research has reveals that query made from relational database incorporating XML schema gives better query performance after integrating the variety data into one repository or relational database using metadata framework. As a result, this research showed some tool can search different data and different sizes of protein secondary structure data stored in the relational database and the result can be retrieved faster and reliable.

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year