An Optimized Ensemble Learning Framework using Boosted Random Subspace SVMs and Stacked Generalization for Cardiovascular Disease Diagnosis

International Journal of Electronics and Communication Engineering
© 2025 by SSRG - IJECE Journal
Volume 12 Issue 6
Year of Publication : 2025
Authors : J. Raghunath, S. Kiran
pdf
How to Cite?

J. Raghunath, S. Kiran, "An Optimized Ensemble Learning Framework using Boosted Random Subspace SVMs and Stacked Generalization for Cardiovascular Disease Diagnosis," SSRG International Journal of Electronics and Communication Engineering, vol. 12,  no. 6, pp. 326-339, 2025. Crossref, https://doi.org/10.14445/23488549/IJECE-V12I6P126

Abstract:

Cardiovascular disease causes most deaths worldwide, so health organizations seek to produce dependable automated medical diagnosis tools. Our proposed method combines a Support Vector Machine with boosted random subspace and stacked generalization to make more accurate CVD diagnosis predictions. The framework begins by normalizing data as part of preprocessing to normalize feature scales from clinical data. Several SVM-based learners gain training via the diverse feature subsets that the Random Subspace Method (RSM) generates. The learners achieve optimized kernel parameters by performing a grid search optimization. The voting scheme in bootstrap aggregation methods improves diversity while controlling overfitting to generate predictions. The model generalization requires stacked generalization that integrates base learner outputs into a second-level logistic regression prediction system. The assessment method involves checking accuracy rates together with precision values and recall rates and includes F1-score and Area Under the Receiver Operating Characteristics Curve (AUC) measurements. Experimental benchmark results validate that the ensemble model reaches an accuracy rate of 96.39%, surpassing standard single classifiers together with standard ensemble techniques in predicting heart disease, thus proving its clinical value for cardiovascular assistance. The proposed diagnostic framework demonstrates strength and expandability for medical diagnosis procedures that require outstanding interpretive capabilities along with specific prediction accuracy.

Keywords:

Accuracy, Boosted SVM, Cardio Vascular Disease, F1-score, Precision, Random subspace, Recall.

References:

[1] Cardiovascular Diseases, Key Facts, World Health Statistics, 2021. [Online]. Available: https://www.who.int/news-room/fact-sheets/detail/cardiovascular-diseases-(cvds)
[2] Robin P Choudhury, and Naveed Akbar, “Beyond Diabetes: A Relationship between Cardiovascular Outcomes and Glycaemic Index,” Cardiovascular Research, vol. 117, no. 8, pp. E97-E98, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[3] World Heart Report 2023 Confronting the World's Number One Killer, World Heart Federation, 2023. [Online]. Available: https://world-heart-federation.org/resource/world-heart-report-2023/
[4] Zakaria K.D. Alkayyali, Syahril Anuar Bin Idris, and Samy S. Abu-Naser, “A Systematic Literature Review of Deep and Machine Learning Algorithms in Cardiovascular Diseases Diagnosis,” Journal of Theoretical and Applied Information Technology, vol. 101, no. 4, pp. 1353-1365, 2023.
[Google Scholar] [Publisher Link]
[5] Rahma Atallah, and Amjed Al-Mousa, “Heart Disease Detection Using Machine Learning Majority Voting Ensemble Method,” 2019 2nd International Conference on new Trends in Computing Sciences, Amman, Jordan, pp. 1-6, 2019.
[CrossRef] [Google Scholar] [Publisher Link]
[6] Vivek Pandey, Umesh Kumar Lilhore, and Ranjan Walia, “A Systematic Review on Cardiovascular Disease Detection and Classification,” Biomedical Signal Processing and Control, vol. 102, 2025.
[CrossRef] [Google Scholar] [Publisher Link]
[7] Xuewei Du et al., “Progress, Opportunities, and Challenges of Troponin Analysis in the Early Diagnosis of Cardiovascular Diseases,” Analytical Chemistry, vol. 94, no. 1, pp. 442-463, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[8] Yongqiang Weng et al., “Application of Support Vector Machines in Medical Data,” 2016 4th International Conference on Cloud Computing and Intelligence Systems, Beijing, China, pp. 200-204, 2016.
[CrossRef] [Google Scholar] [Publisher Link]
[9] Rosita Guido et al., “An Overview on the Advancements of Support Vector Machine Models in Healthcare Applications: A Review,” Information, vol. 15, no. 4, pp. 1-36, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[10] Dakhaz Mustafa Abdullah et al., “Machine Learning Applications Based on SVM Classification a Review,” Qubahan Academic Journal, vol. 1, no. 2, pp. 81-90, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[11] Muralidharan Jayaraman, and Shanmugavadivu Pichai, “Improving the Prediction Accuracy of Onset of Cardiovascular Diseases, using Ensemble Learning,” Proceedings of the 1st International Conference on Artificial Intelligence, Communication, IoT, Data Engineering and Security, Lavasa, Pune, India, pp. 1-6, 2023.
[CrossRef] [Publisher Link]
[12] Shahid Mohammad Ganie et al., “An Improved Ensemble Learning Approach for Heart Disease Prediction Using Boosting Algorithms,” Computer Systems Science and Engineering, vol. 46, no. 3, pp. 3993-4006, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[13] Achyut Tiwari, Aryan Chugh, and Aman Sharma, “Ensemble Framework for Cardiovascular Disease Prediction,” Computers in Biology and Medicine, vol. 146, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[14] Narendra Kumar Sharma et al., “Enhancing Heart Disease Diagnosis: Leveraging Classification and Ensemble Machine Learning Techniques in Healthcare Decision-Making,” Journal of Integrated Science and Technology, vol. 13, no. 1, pp. 1-8, 2025.
[CrossRef] [Google Scholar] [Publisher Link]
[15] Wenhao Chi et al., “Alleviating Hyperparameter-Tuning Burden in SVM Classifiers for Pulmonary Nodules Diagnosis with Multi-Task Bayesian Optimization,” Arxiv, pp. 1-12, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[16] Francesco Girlanda et al., “Enhancing Cardiovascular Disease Prediction through Multi-Modal Self-Supervised Learning,” Arxiv, pp. 1-18, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[17] Subasish Mohapatra et al., “A Stacking Classifiers Model for Detecting Heart Irregularities and Predicting Cardiovascular Disease,” Healthcare Analytics, vol. 3, pp. 1-10, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[18] Qusay Shihab Hamad, Hussein Samma, and Shahrel Azmin Suandi, “Optimization of Convolutional Neural Network Hyperparameter for Medical Image Diagnosis Using Metaheuristic Algorithms: A Short Recent Review (2019-2022),” Arxiv, pp. 1-8, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[19] Mert Ozcan, and Serhat Peker, “A Classification and Regression Tree Algorithm for Heart Disease Modeling and Prediction,” Healthcare Analytics, vol. 3, pp. 1-9, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[20] V.K. Sudha, and D. Kumar, “Hybrid CNN and LSTM Network for Heart Disease Prediction,” SN Computer Science, vol. 4, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[21] Ezekiel Adebayo Ogundepo, and Waheed Babatunde Yahya, “Performance Analysis of Supervised Classification Models on Heart Disease Prediction,” Innovative Systems and Software Engineering, vol. 19, pp. 129-144, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[22] G. Manikandan et al., “Classification Models Combined with Boruta Feature Selection for Heart Disease Prediction,” Informatics in Medicine Unlocked, vol. 44, pp. 1-12, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[23] Md Maruf Hossain et al., “Cardiovascular Disease Identification Using a Hybrid CNN-LSTM Model with Explainable AI,” Informatics in Medicine Unlocked, vol. 42, pp. 1-15, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[24] Ye Tian, and Yang Feng, “RaSE: Random Subspace Ensemble Classification,” Journal of Machine Learning Research, vol. 22, no. 45, pp. 1-93, 2021.
[Google Scholar] [Publisher Link]
[25] Peiwen Tan, “Ensemble-Based Hybrid Optimization of Bayesian Neural Networks and Traditional Machine Learning Algorithms,” Arxiv, pp. 1-23, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[26] Ahmed M. Elshewey et al., “Bayesian Optimization with Support Vector Machine Model for Parkinson Disease Classification,” Sensors, vol. 23, no. 4, pp. 1-21, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[27] Alaa M. Elsayad, Ahmed M. Nassef, and Mujahed Al-Dhaifallah, “Bayesian Optimization of Multiclass SVM for Efficient Diagnosis of Erythemato-Squamous Diseases,” Biomedical Signal Processing and Control, vol. 71, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[28] Alexandros Vamvakas et al., “Breast Cancer Classification on Multiparametric MRI – Increased Performance of Boosting Ensemble Methods,” Technology in Cancer Research & Treatment, vol. 21, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[29] Sruthi Nair et al., “Combining Varied Learners for Binary Classification Using Stacked Generalization,” Arxiv, pp. 1-9, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[30] T.R. Mahesh et al., “An Efficient Ensemble Method Using K-Fold Cross Validation for the Early Detection of Benign and Malignant Breast Cancer,” International Journal of Integrated Engineering, vol. 14, no. 7, pp. 204-216, 2022.
[Google Scholar] [Publisher Link]
[31] Heart Disease, IEEE Dataport. [Online] Available: https://ieee-dataport.org/keywords/heart-disease
[32] Gireen Naidu, Tranos Zuva, and Elias Mmbongeni Sibanda, “A Review of Evaluation Metrics in Machine Learning Algorithms,” Artificial Intelligence Application in Networks and Systems, pp. 15-25, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[33] Oona Rainio, Jarmo Teuho, and Riku Klén, “Evaluation Metrics and Statistical Tests for Machine Learning,” Scientific Reports, vol. 14, no. 1, pp. 1-14, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[34] S. Sathyanarayanan, and B. Roopashri Tantri, “Confusion Matrix-Based Performance Evaluation Metrics,” African Journal of Biomedical Research, vol. 27, no. 4S, pp. 4023-4031, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[35] Rüstem Yılmaz, and Fatma Hilal Yağın, “Early Detection of Coronary Heart Disease Based on Machine Learning Methods,” Medical Records, vol. 4, no. 1, pp. 1-6, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[36] Bhanu Prakash Doppala et al., “A Reliable Machine Intelligence Model for Accurate Identification of Cardiovascular Diseases Using Ensemble Techniques,” Journal of Healthcare Engineering, vol. 2022, no. 1, pp. 1-13, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[37] Najmu Nissa, Sanjay Jamwal, and Mehdi Neshat, “A Technical Comparative Heart Disease Prediction Framework Using Boosting Ensemble Techniques,” Computation, vol. 12, no. 1, pp. 1-22, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[38] Nadikatla Chandrasekhar, and Samineni Peddakrishna, “Enhancing Heart Disease Prediction Accuracy through Machine Learning Techniques and Optimization,” Processes, vol. 11, no. 4, pp. 1-31, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[39] Subhash Mondal et al., “An Efficient Computational Risk Prediction Model of Heart Diseases Based on Dual-Stage Stacked Machine Learning Approaches,” IEEE Access, vol. 12, pp. 7255-7270, 2024.
[CrossRef] [Google Scholar] [Publisher Link]