Advanced hybrid and preprocessing models for diagnosis challenges in data classification
dc.contributor.author | Fayez, Mustafa Adil | |
dc.contributor.author | Kurnaz, Sefer | |
dc.date.accessioned | 2024-12-05T06:53:08Z | |
dc.date.available | 2024-12-05T06:53:08Z | |
dc.date.issued | 2024 | en_US |
dc.department | Enstitüler, Lisansüstü Eğitim Enstitüsü, Elektrik ve Bilgisayar Mühendisliği Ana Bilim Dalı | en_US |
dc.description.abstract | Machine Learning (ML), often viewed as a cutting-edge technology best suited for qualified specialists, presents limited access for other physicians and scientists in the medical profession. In this work, we provide a new, sophisticated, and highly successful technology for medical applications, especially cardiac diagnostics. We propose a novel advanced hybrid optimization model with two essential parts. Initially, we apply a high-performance hybrid resampling technique for feature engineering and pre-processing. This approach, which combines Synthetic Minority Oversampling Technique Edited Nearest Neighbors (SMOTEENN) with Neighborhood Cleaning Rules (NCL), addresses class imbalance in the data. We developed a complex hybrid optimization model that incorporates hyper-parameter optimization, advanced Application Programming Interface (API) functions, and a super-learner ensemble model to enhance diagnosis accuracy in cases where datasets lack balance. Furthermore, we developed high-performance prediction models using sophisticated Support Vector Machines (SVMs). We show that, with re-sampled Cardiovascular Disease (CVD) data, the advanced hybrid optimization model attained an astounding accuracy of 98%. By comparison, an advanced SVM model obtained 96% accuracy, while an advanced deep learning model produced 95.5% accuracy. Our new sophisticated hybrid optimization machine learning models may significantly improve physicians’ interpretation of ML results. This strategy could make it easier to apply AI methods on a large scale in the clinic, which would eventually raise patient outcomes and diagnostic accuracy. | en_US |
dc.identifier.citation | Fayez, M. A., Kurnaz, S. (2024). Advanced hybrid and preprocessing models for diagnosis challenges in data classification. Journal of Advances in Information Technology, 12(11), 1264-1272. 10.12720/jait.15.11.1264-1272 | en_US |
dc.identifier.endpage | 1272 | en_US |
dc.identifier.issue | 11 | en_US |
dc.identifier.scopus | 2-s2.0-85210263799 | |
dc.identifier.scopusquality | Q2 | |
dc.identifier.startpage | 1264 | en_US |
dc.identifier.uri | https://hdl.handle.net/20.500.12939/5075 | |
dc.identifier.volume | 15 | en_US |
dc.identifier.wos | WOS:001374446300002 | |
dc.identifier.wosquality | N/A | |
dc.indekslendigikaynak | Web of Science | |
dc.indekslendigikaynak | Scopus | |
dc.institutionauthor | Fayez, Mustafa Adil | |
dc.institutionauthor | Kurnaz, Sefer | |
dc.language.iso | en | |
dc.publisher | Engineering and Technology Publishing | en_US |
dc.relation.ispartof | Journal of Advances in Information Technology | |
dc.relation.isversionof | 10.12720/jait.15.11.1264-1272 | en_US |
dc.relation.publicationcategory | Makale - Uluslararası Hakemli Dergi - İdari Personel ve Öğrenci | en_US |
dc.rights | info:eu-repo/semantics/openAccess | en_US |
dc.subject | Application Programming Interface (API) function | en_US |
dc.subject | Cardiovascular Disease (CVD) | en_US |
dc.subject | Hybrid advanced models | en_US |
dc.subject | Neighborhood Cleaning Rules (NCL) | en_US |
dc.subject | Optimization | en_US |
dc.subject | Synthetic Minority Oversampling Technique Edited Nearest Neighbors (SMOTEENN) | en_US |
dc.title | Advanced hybrid and preprocessing models for diagnosis challenges in data classification | |
dc.type | Article |