BukaGini: A Stability-Aware Gini Index Feature Selection Algorithm for Robust Model Performance
dc.authorid | Salah, Bashir/0000-0003-2709-760X | |
dc.authorid | Bouke, Mohamed Aly/0000-0003-3264-601X | |
dc.authorid | Frnda, Jaroslav/0000-0001-6065-3087 | |
dc.authorid | Cengiz, Korhan/0000-0001-6594-8861 | |
dc.authorwosid | Salah, Bashir/ABC-5845-2020 | |
dc.authorwosid | Bouke, Mohamed Aly/HZJ-0305-2023 | |
dc.authorwosid | Frnda, Jaroslav/M-1454-2019 | |
dc.contributor.author | Bouke, Mohamed Aly | |
dc.contributor.author | Abdullah, Azizol | |
dc.contributor.author | Frnda, Jaroslav | |
dc.contributor.author | Cengiz, Korhan | |
dc.contributor.author | Salah, Bashir | |
dc.date.accessioned | 2024-05-19T14:39:41Z | |
dc.date.available | 2024-05-19T14:39:41Z | |
dc.date.issued | 2023 | |
dc.department | İstinye Üniversitesi | en_US |
dc.description.abstract | Feature interaction is a vital aspect of Machine Learning (ML) algorithms, and gaining a deep understanding of these interactions can significantly enhance model performance. This paper introduces the BukaGini algorithm, an innovative and robust approach for feature interaction analysis that capitalizes on the Gini impurity index. By exploiting the unique properties of the BukaGini index, our proposed algorithm effectively captures both linear and nonlinear feature interactions, providing a richer and more comprehensive representation of the underlying data. We thoroughly evaluate the BukaGini algorithm against traditional Gini index-based methods on various real-world datasets. These datasets include the High School Students' Performance (HSSP) dataset, which examines factors affecting student performance; Cancer Data, which focuses on identifying cancer types based on gene expression; Spambase, which targets spam email classification; and the UNSW-NB15 dataset, which addresses network intrusion detection. Our experimental results demonstrate that the BukaGini algorithm consistently outperforms traditional Gini index-based methods in terms of accuracy. Across the tested datasets, the BukaGini algorithm achieves improvements ranging from 0.32% to 2.50%, underscoring its effectiveness in handling diverse data types and problem domains. This performance gain highlights the potential of the BukaGini algorithm as a valuable tool for feature interaction analysis in various ML applications. | en_US |
dc.description.sponsorship | King Saud University, Saudi Arabia [RSP2023R145]; Ministry of Education, Youth and Sports of the Czech Republic [2/KKMHI/2022]; Institutional research of the Faculty of Operation and Economics of Transport and Communications-University of Zilina; [SP2023/007] | en_US |
dc.description.sponsorship | This study received funding from King Saud University, Saudi Arabia through researchers supporting project number (RSP2023R145). And was supported by the Ministry of Education, Youth and Sports of the Czech Republic under the grant SP2023/007 conducted by VSB -Technical University of Ostrava, Czechia, and partially supported by Institutional research of the Faculty of Operation and Economics of Transport and Communications-University of Zilina, no. 2/KKMHI/2022.& nbsp; | en_US |
dc.identifier.doi | 10.1109/ACCESS.2023.3284975 | |
dc.identifier.endpage | 59396 | en_US |
dc.identifier.issn | 2169-3536 | |
dc.identifier.scopus | 2-s2.0-85162645716 | en_US |
dc.identifier.scopusquality | Q1 | en_US |
dc.identifier.startpage | 59386 | en_US |
dc.identifier.uri | https://doi.org10.1109/ACCESS.2023.3284975 | |
dc.identifier.uri | https://hdl.handle.net/20.500.12713/4829 | |
dc.identifier.volume | 11 | en_US |
dc.identifier.wos | WOS:001016812900001 | en_US |
dc.identifier.wosquality | N/A | en_US |
dc.indekslendigikaynak | Web of Science | en_US |
dc.indekslendigikaynak | Scopus | en_US |
dc.language.iso | en | en_US |
dc.publisher | Ieee-Inst Electrical Electronics Engineers Inc | en_US |
dc.relation.ispartof | Ieee Access | en_US |
dc.relation.publicationcategory | Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı | en_US |
dc.rights | info:eu-repo/semantics/openAccess | en_US |
dc.snmz | 20240519_ka | en_US |
dc.subject | Bukagini Algorithm | en_US |
dc.subject | Gini Index | en_US |
dc.subject | Ensemble Learning | en_US |
dc.subject | Feature Interaction Analysis | en_US |
dc.subject | Data Mining | en_US |
dc.title | BukaGini: A Stability-Aware Gini Index Feature Selection Algorithm for Robust Model Performance | en_US |
dc.type | Article | en_US |