Comparative analysis of PCA and ANOVA for assessing the subset feature selection of the geomagnetic Disturbance Storm Time / Ain Dzarah Nafisah Mazlan … [et al.]

Mazlan, Ain Dzarah Nafisah and Hairuddin, Muhammad Asraf and Md Tahir, Nooritawati and Khirul Ashar, Nur Dalila and Jusoh, Mohamad Huzaimy (2020) Comparative analysis of PCA and ANOVA for assessing the subset feature selection of the geomagnetic Disturbance Storm Time / Ain Dzarah Nafisah Mazlan … [et al.]. Journal of Electrical and Electronic Systems Research (JEESR), 17. pp. 8-16. ISSN 1985-5389

Abstract

A Disturbance Storm Time (Dst) index represents the geomagnetic storm strength due to interaction of the Sun towards Earth in the space weather. Formation of the Dst contributed by the total of nine (9) input features namely interplanetary magnetic field (IMF), solar wind density (SWD), solar wind speed (SWS), solar wind input energy (SWIP) and also Earth’s magnetic field components comprise of the horizontal intensity component (H), declination component (D), north component (N), east component (E), and vertical intensity component (Z). Large datasets which comprise of 157896 number of data have existed for all features thus require pre-processing and subset feature selection for reducing data dimensionality in order to reduce the data processing time and enhance the performance of the learning algorithm. In this paper two methods of analyzing the features were compared: Principal Component Analysis (PCA) and one-way Analysis of Variance (ANOVA). The main aims for this works are to reduce a large set of input parameters from the Dst index and to compare the subset feature using the proposed methods for acquiring the reduced features. Prior to analyse the features, an independent-samples t-test is used to evaluate if there is a large difference between the mean of two groups that can be correlated with certain characteristics. The results for the features analyzed demonstrated that one-way ANOVA performed better in eliminating seven (7) components out of nine (9) components of features as compared to PCA. This finding was validated with a dendrogram to support that one-way ANOVA outperformed the PCA in reducing the subset features.

Metadata

Item Type: Article
Creators:
Creators
Email / ID Num.
Mazlan, Ain Dzarah Nafisah
UNSPECIFIED
Hairuddin, Muhammad Asraf
masraf@uitm.edu.my
Md Tahir, Nooritawati
noori425@uitm.edu.my
Khirul Ashar, Nur Dalila
nurdalila306@uitm.edu.my
Jusoh, Mohamad Huzaimy
UNSPECIFIED
Subjects: H Social Sciences > HA Statistics > Theory and method of social science statistics
H Social Sciences > HA Statistics > Theory and method of social science statistics > Surveys. Sampling. Statistical survey methodology
Divisions: Universiti Teknologi MARA, Shah Alam > Faculty of Electrical Engineering
Journal or Publication Title: Journal of Electrical and Electronic Systems Research (JEESR)
UiTM Journal Collections: UiTM Journal > Journal of Electrical and Electronic Systems Research (JEESR)
ISSN: 1985-5389
Volume: 17
Page Range: pp. 8-16
Keywords: Dst index, Geomagnetic storm, Principal Component Analysis (PCA), One-way ANOVA
Date: 2020
URI: https://ir.uitm.edu.my/id/eprint/42379
Edit Item
Edit Item

Download

[thumbnail of 42379.pdf] Text
42379.pdf

Download (610kB)

ID Number

42379

Indexing

Statistic

Statistic details