Understanding the occurrence of metastatic breast cancer through clinical, phenotype and genotype data, and the employment of machine learning / Nadia Jalaludin

Jalaludin, Nadia (2023) Understanding the occurrence of metastatic breast cancer through clinical, phenotype and genotype data, and the employment of machine learning / Nadia Jalaludin. PhD thesis, Universiti Teknologi MARA (UiTM).

Abstract

Approximately 2.3 million women were diagnosed with breast cancer (BC) in 2020 and nearly 30% of women diagnosed with early-stage breast cancer will later develop metastatic disease. Despite the development and discovery of drugs and pharmacotherapy for breast cancer, the 5-year survival rate for people with metastatic breast cancer (MBC) remains low. Therefore, the objective of this study is to: a) mine and integrate clinical, phenotype and genotype data that contributes to the occurrence of MBC, b) build a prediction model that can predict possibility of occurrence to metastatic state of breast cancer based on factors previously determined in (a), and c) to validate findings from (a) and (b) through systematic review of randomised controlled trials of MBC. For objective (a), genotype and clinical data was mined from databases such as cBioportal and Genomic Data Common (GDC) portal, and was analysed using principal component analysis (PCA; after feature selection) and multiple correspondence analysis (MCA) in R. The data was then subjected to subsequent pathway mapping, Gene Ontology (GO) mapping and protein-protein interaction (PPI) to investigate its connection to the metastatic phenotype. The odds ratio of mutated gene, disease similarities and hierarchical clustering were also done before all the result was consolidated. For objective (b), prediction model was generated based on the outcome of (a) by using the Random Forest (RF) algorithm and validated by 5-fold cross validation. Additionally, the sensitivity and specificity of each model was also calculated. Meanwhile, for objective (c), six keywords: “metastatic breast cancer, chemotherapy, hormonal therapy, targeted therapy, gene and progression free survival” were used in PubMed, Scopus, Web of Science, Cochrane and Science Direct to further validate the previous findings. Based on all of these evaluations, the findings suggest that mRNA and genetic profiling can differentiate between breast cancer and metastatic breast cancer patients and more attention should be paid to YAP1 and SP7 genes.

Metadata

Item Type: Thesis (PhD)
Creators:
Creators
Email / ID Num.
Jalaludin, Nadia
2014274662
Contributors:
Contribution
Name
Email / ID Num.
Thesis advisor
Mohd Fauzi, Fazlin
UNSPECIFIED
Subjects: R Medicine > RC Internal Medicine > Cancer
Divisions: Universiti Teknologi MARA, Shah Alam > Faculty of Pharmacy
Programme: Doctor of Philosophy – PH990
Keywords: Breast cancer, diagnosis, occurrence of metastatic
Date: 2023
URI: https://ir.uitm.edu.my/id/eprint/84271
Edit Item
Edit Item

Download

[thumbnail of 84271.pdf] Text
84271.pdf

Download (3MB)

Digital Copy

Digital (fulltext) is available at:

Physical Copy

Physical status and holdings:
Item Status:

ID Number

84271

Indexing

Statistic

Statistic details