An Explainable and Accurate Machine Learning Approach for Early Heart Disease Prediction Using Feature Selection and Ensemble Techniques

Authors

  • Khaliq Ahmed Department of computer science Iqra university Karachi, Pakistan.
  • Khalid bin Muhammad Faculty of engineering sciences and technology, department of computer science, Ziauddin university Karachi, Pakistan.
  • Malik zohaib Hussain CCSIS, Institute of Business Management, Karachi, Pakistan.
  • Abdul Khaliq CCSIS, Institute of Business Management, Karachi, Pakistan.

DOI:

https://doi.org/10.62019/r0anxk68

Keywords:

Machine learning, Heart diseases, ML algorithms, SMOTE, SHAP

Abstract

This research introduces a holistic machine learning-based approach to early heart disease prediction, utilizing state-of-the-art ensemble methods and explainable artificial intelligence (XAI). The envisioned model pipeline includes feature selection processes to improve prediction performance and interpretability. Ensemble techniques outperformed more classical models such as Logistic Regression and SVM, with different classifiers such as Random Forest, Gradient Boosting, and XGBoost being thoroughly compared. The best accuracy of 98.54% was attained by Random Forest, Gradient Boosting, and XGBoost, showing the effectiveness of ensemble methods in working with healthcare datasets. The precision and recall measures also drifted close to 1.0, indicating very few false negatives and false positives—essential for medical diagnoses. The AUC measures also supported the strength of the classifiers, with Random Forest showing a perfect 1.0. Visual outcomes confirm the adherence and performance of the suggested methodology in the most significant key performance indicators. This work prioritizes not just predictive performance but also explainability, so the model's choice can be understood by doctors. By combining accuracy with interpretability, this framework offers an accurate decision support system for cardiologists that allows for the early diagnosis and customized treatment plan. The visual analytics offered in the proposed work section also further support the practical relevance and clinical promise of the introduced method.

 

Author Biographies

  • Khaliq Ahmed, Department of computer science Iqra university Karachi, Pakistan.

    Assistant professor

  • Khalid bin Muhammad , Faculty of engineering sciences and technology, department of computer science, Ziauddin university Karachi, Pakistan.

    Associate professor

  • Malik zohaib Hussain, CCSIS, Institute of Business Management, Karachi, Pakistan.

    Senior lecturer

  • Abdul Khaliq, CCSIS, Institute of Business Management, Karachi, Pakistan.

    Senior lecturer

Downloads

Published

2025-05-30

How to Cite

An Explainable and Accurate Machine Learning Approach for Early Heart Disease Prediction Using Feature Selection and Ensemble Techniques. (2025). The Asian Bulletin of Big Data Management , 5(2), 101-114. https://doi.org/10.62019/r0anxk68

Similar Articles

1-10 of 96

You may also start an advanced similarity search for this article.

Most read articles by the same author(s)