Optimization of tree-based machine learning algorithms for improving the predictive accuracy of hepatitis C disease
dc.authorscopusid | Femilda Josephin Joseph Shobana Bai / 57810685700 | |
dc.authorwosid | Femilda Josephin Joseph Shobana Bai / AGG-4255-2022 | |
dc.contributor.author | Bai, Femilda Josephin Joseph Shobana | |
dc.contributor.author | Jasmine, R. Anita | |
dc.date.accessioned | 2025-04-18T10:08:20Z | |
dc.date.available | 2025-04-18T10:08:20Z | |
dc.date.issued | 2024 | |
dc.department | İstinye Üniversitesi, Mühendislik ve Doğa Bilimleri Fakültesi, Bilgisayar Mühendisliği Bölümü | |
dc.description.abstract | Hepatitis C is a globally prevalent viral infection that has the potential to cause significant liver-related complications if not appropriately managed. The timely and precise identification of the medical condition is imperative for the efficient administration of patient care and therapy. One of the precise and potential diagnosis methods in the identification of hepatitis C is the utilization of machine learning (ML) algorithms. The present investigation focuses on the optimization of four ML algorithms which are tree-based algorithms, namely, random forest (RF), gradient boosting machines (GBMs), light gradient boosting machines (LGBMs), and extreme gradient boosting (XGBoost) with the aim of enhancing the predictive accuracy of hepatitis C disease. The investigation utilized a reliable dataset from the University of California, Irvine (UCI) Machine Learning Repository. The research methodology encompasses various stages, including data preprocessing, feature selection, hyperparameter tuning, and model evaluation. Optimization techniques, including the synthetic minority oversampling technique (SMOTE) for data balancing and grid search optimization for hyperparameter tuning, were utilized to improve the models’ performance. The optimized models were assessed through the utilization of stratified k-fold cross-validation and performance metrics, which comprise accuracy, precision, recall, F1-score, and area under the receiver operating characteristic (ROC) curve. The findings of our study indicate that the optimized tree-based algorithms exhibit superior performance compared to their nonoptimized counterparts. Specifically, LGBM demonstrated the highest level of predictive accuracy at 98.91%, followed by XGBoost at 98.70%, GBM at 97.83%, and RF at 97.29%. The LGBM learning approach has the potential to be broadly applied and extended to diverse medical datasets and use cases, thus advancing ML in the healthcare domain. The study highlights the importance of optimizing tree-based algorithms to improve the accuracy of early prediction of the prevalence of hepatitis C disease and promote patient health. This underscores the capacity of ML to improve healthcare outcomes. © 2024 Elsevier Inc. All rights reserved. | |
dc.identifier.citation | Bai, F. J. J. S., & Jasmine, R. A. (2024). Optimization of tree-based machine learning algorithms for improving the predictive accuracy of hepatitis C disease. In Decision-Making Models (pp. 523-545). Academic Press. | |
dc.identifier.doi | 10.1016/B978-0-443-16147-6.00015-3 | |
dc.identifier.endpage | 545 | |
dc.identifier.isbn | 978-044316147-6, 978-044316148-3 | |
dc.identifier.scopus | 2-s2.0-85202870219 | |
dc.identifier.scopusquality | N/A | |
dc.identifier.startpage | 523 | |
dc.identifier.uri | https://hdl.handle.net/20.500.12713/6956 | |
dc.indekslendigikaynak | Scopus | |
dc.institutionauthor | Bai, Femilda Josephin Joseph Shobana | |
dc.institutionauthorid | Femilda Josephin Joseph Shobana Bai / 0000-0003-0249-9506 | |
dc.language.iso | en | |
dc.publisher | Elsevier | |
dc.relation.ispartof | Decision-Making Models: A Perspective of Fuzzy Logic and Machine Learning | |
dc.relation.publicationcategory | Kitap Bölümü - Uluslararası | |
dc.rights | info:eu-repo/semantics/closedAccess | |
dc.subject | Extreme Gradient Boosting Machines | |
dc.subject | Gradient Boosting Machines | |
dc.subject | Hepatitis C Disease Prediction | |
dc.subject | Hyperparameter Optimization | |
dc.subject | Light Gradient Boosting Machines | |
dc.subject | Machine Learning | |
dc.subject | Random Forest | |
dc.subject | SMOTE | |
dc.title | Optimization of tree-based machine learning algorithms for improving the predictive accuracy of hepatitis C disease | |
dc.type | Book Chapter |
Dosyalar
Lisans paketi
1 - 1 / 1
Küçük Resim Yok
- İsim:
- license.txt
- Boyut:
- 1.17 KB
- Biçim:
- Item-specific license agreed upon to submission
- Açıklama: