Impact of feature standardization on heart disease prediction: a comparative analysis of logistic regression and support vector machine models

Muhammad Noor Mathivanan, Norsyela and Foo, Eric Zhi Xian and Foo, Debbie Yong Xi and Chua, Hiang Kiat (2025) Impact of feature standardization on heart disease prediction: a comparative analysis of logistic regression and support vector machine models. Malaysian Journal of Computing (MJoC), 10 (2): 1. pp. 2159-2175. ISSN 2600-8238

Identification Number (DOI): 10.24191/mjoc.v10i1.6835

Abstract

Cardiovascular diseases are among the leading causes of global mortality. Heart disease, in particular, remains a major contributor to this burden, highlighting the need for effective predictive models to enable early detection. This study investigates the impact of feature standardization using StandardScaler on the performance of two prominent machine learning models involving Logistic Regression (LR) and Support Vector Machine (SVM) for predicting heart disease. The research utilizes a dataset comprising demographic and clinical attributes of patients, focusing on the role of feature standardization in enhancing model performance. The study compares models trained on raw data and standardized data, applying performance metrics such as accuracy, precision, recall, and F1-score. Results indicate that feature standardization significantly improves the performance of both models. LR showed a clear enhancement in macro F1-score on the testing set, rising from 0.82 without standardization to 0.87 with standardization. SVM was slightly superior in its raw form but still improved after standardization, with the macro F1-score increasing from 0.85 to 0.86. These findings highlight the importance of data pre-processing and demonstrate how feature scaling can optimize machine learning models for heart disease prediction. This research contributes to the growing field of predictive healthcare, offering valuable insights for clinicians seeking reliable early detection tools for cardiovascular conditions.

Metadata

Item Type: Article
Creators:
Creators
Email / ID Num.
Muhammad Noor Mathivanan, Norsyela
norsyela.m@uow.edu.my
Foo, Eric Zhi Xian
0133702@student.uow.edu.my
Foo, Debbie Yong Xi
0136275@student.uow.edu.my
Chua, Hiang Kiat
hk.chua @uow.edu.my
Subjects: H Social Sciences > HA Statistics > Regression. Correlation
R Medicine > RC Internal Medicine > Specialties of internal medicine > Diseases of the circulatory (Cardiovascular) system
Divisions: Universiti Teknologi MARA, Shah Alam > College of Computing, Informatics and Mathematics
Journal or Publication Title: Malaysian Journal of Computing (MJoC)
UiTM Journal Collections: UiTM Journals > Malaysian Journal of Computing (MJoC)
ISSN: 2600-8238
Volume: 10
Number: 2
Page Range: pp. 2159-2175
Keywords: Cardiovascular diseases, Feature standardization, Heart disease, Logistic regression, Machine learning model, Support vector machine
Date: October 2025
URI: https://ir.uitm.edu.my/id/eprint/125988
Edit Item
Edit Item

Download

[thumbnail of 125988.pdf] Text
125988.pdf

Download (1MB)

ID Number

125988

Indexing

Altmetric
PlumX
Dimensions

Statistic

Statistic details