Che Muhammad, Ummi Asyiqin and Mohd Razali, Muhammad Hasbullah (2023) Classification of diabetic patients with imbalanced class distribution by using a Cost-Sensitive forest algorithm / Ummi Asyiqin Che Muhammad and Muhammad Hasbullah Mohd Razali. In: Research Exhibition in Mathematics and Computer Sciences (REMACS 5.0). College of Computing, Informatics and Media, UiTM Perlis, pp. 147-148. ISBN 978-629-97934-0-3
Abstract
In medical data sets, the majority class typically consists of healthy patients, whereas the minority class consists of a few sick patients. Although researchers have developed many machine learning algorithms, an imbalanced class distribution still makes it difficult for classifiers to learn and distinguish between the minority and majority classes. This study fitted a cost-sensitive forest (CSForest) algorithm to an imbalanced diabetic data set and compared its accuracy with that of a random forest (RForest). RForest achieved an accuracy of 76.70%, while CSForest achieved 78.72%, indicating that CSForest classified diabetic patients more accurately than RForest on this data set.
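The general idea behind cost-sensitive classification can be illustrated with a small sketch. This is not the CSForest implementation used in the study: the vote fractions, the labels, and the cost values (`cost_fn`, `cost_fp`) are all hypothetical and chosen only to show how a minimum-expected-cost rule can differ from a plain majority vote on an imbalanced sample.

```python
# Illustrative sketch only: NOT the CSForest algorithm from the study.
# All data and cost values below are made-up assumptions.

def majority_vote(p_sick):
    """Plain forest-style decision: pick the class with the larger vote share."""
    return "sick" if p_sick > 0.5 else "healthy"

def min_expected_cost(p_sick, cost_fn=5.0, cost_fp=1.0):
    """Cost-sensitive decision: pick the label with the lower expected cost.

    cost_fn: assumed cost of labelling a sick patient healthy (false negative)
    cost_fp: assumed cost of labelling a healthy patient sick (false positive)
    """
    cost_if_healthy = p_sick * cost_fn        # wrong only if truly sick
    cost_if_sick = (1.0 - p_sick) * cost_fp   # wrong only if truly healthy
    return "sick" if cost_if_sick < cost_if_healthy else "healthy"

def accuracy(y_true, y_pred):
    """Fraction of samples whose predicted label matches the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

def recall(y_true, y_pred, label="sick"):
    """Fraction of samples with the given true label that were predicted as such."""
    hits = sum(t == label and p == label for t, p in zip(y_true, y_pred))
    total = sum(t == label for t in y_true)
    return hits / total

# Hypothetical imbalanced sample: 8 healthy patients, 2 sick patients.
y_true = ["healthy"] * 8 + ["sick"] * 2
# Hypothetical fraction of trees voting "sick" for each patient.
p_sick = [0.05, 0.05, 0.10, 0.10, 0.10, 0.15, 0.15, 0.30, 0.35, 0.60]

vote_preds = [majority_vote(p) for p in p_sick]
cost_preds = [min_expected_cost(p) for p in p_sick]

print("majority vote :", accuracy(y_true, vote_preds), recall(y_true, vote_preds))
print("cost-sensitive:", accuracy(y_true, cost_preds), recall(y_true, cost_preds))
```

With these assumed costs, the cost-sensitive rule labels a patient as sick whenever more than one sixth of the votes say so, which shifts predictions toward the minority class. On this toy sample both rules reach the same accuracy, but they differ in how many sick patients they identify.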
Metadata
Item Type: | Book Section |
---|---|
Creators: | Che Muhammad, Ummi Asyiqin (Email / ID Num.: UNSPECIFIED); Mohd Razali, Muhammad Hasbullah (Email / ID Num.: UNSPECIFIED) |
Subjects: | Q Science > QA Mathematics > Instruments and machines > Electronic Computers. Computer Science > Algorithms |
Divisions: | Universiti Teknologi MARA, Perlis > Arau Campus > Faculty of Computer and Mathematical Sciences |
Page Range: | pp. 147-148 |
Keywords: | Imbalanced class, cost-sensitive forest, random forest, diabetic patients |
Date: | 2023 |
URI: | https://ir.uitm.edu.my/id/eprint/100154 |