Professional Documents
Culture Documents
Heart Disease Prediction
Heart Disease Prediction
Abstract
Heart disease, a leading cause of global mortality, Heart disease weakens the heart’s ability to pump
necessitates accurate prediction for timely blood, leading to health issues like shortness of
intervention. This study proposes a hybrid model breath and fatigue [5]. It is prevalent globally,
amalgamating LR, DT and ANN algorithms to especially in developing countries, where limited
enhance heart disease prediction. Using a Kaggle diagnostic tools pose challenges for early diagnosis
dataset comprising 1025 patient records with 14 and treatment [5]. Accurate early prediction enables
features, including age, sex, chest pain, and effective intervention, improving outcomes.
cholesterol levels, the hybrid model achieved an Machine learning techniques like logistic regression
impressive 88% precision. This outperforms [6], random forests [7] and deep belief networks [9]
individual models, with DT achieving 99% accuracy, have been explored for heart infection prediction
LR with 80%, and ANN with 86%. Evaluation using datasets from UCI repository and hospitals [5]
metrics demonstrate competitive performance, [10]. Classification accuracy ranges widely, from
affirming the hybrid model as a robust tool for 77% [6] to 100% [3], reflecting differences in
cardiovascular ailment prediction. The study algorithms and data used. More complex ensemble
underscores the efficacy of combining diverse approaches tend to outperform individual models. For
algorithms, leveraging their strengths for more example, Alizadehsani [11] combined multiple
effective predictive modeling in cardiovascular health classifiers using an ensemble method, achieving 90%
assessment. accuracy. Ruzzo-Tompa’s [9] optimally configured
deep belief network attained 94.61% accuracy.
Keywords : Decision Tree, Logistic Regression, Feature engineering and selection techniques also
Artificial Neural Network, hybrid model boost performance [10].
However, issues like overfitting [13] and suboptimal
1.Introduction model tuning [17] persist. Most studies test
individual algorithms rather than hybrid approaches
The heart circulates blood, supplying nutrients and that could mitigate limitations. As Jabbar [13]
oxygen to bodily organs [1]. Heart disease impairs demonstrated, minimizing training error sometimes
this function, affecting the brain, lungs, liver, and increases testing errors. Ahmed [17] achieved only
kidneys [1]. It causes cognitive and respiratory issues 80.32% validation accuracy despite testing multiple
[2]. As per the WHO, heart disease accounts for 31% models.
of annual deaths globally [3]. Delayed diagnosis Opportunities exist to advance accuracy further
leads to poor outcomes and mortality [3]. Machine through larger, more diverse datasets [5], ensembles
learning models show promise for early prediction to and hybrid methods [11], improved feature selection
improve prognosis [4]. However, traditional models [10], and better hyper parameter optimization and
struggle with complexity while deep learning models cross-validation [10]. Combining complementary
require extensive data and risk overfitting [4]. A techniques in a tuned, thoroughly validated hybrid
hybrid model combining both approaches could approach could yield both high accuracy and
enable accurate, cost-effective early prediction. reliability in diagnosis.
2. Related Work
3. Methodology
The learning aims to predict heart disease possibility START
using computerized prediction, benefiting medical
professionals and patients. To accomplish this
objective, a systematic approach was adopted. This Data
process is illustrated in Figure 1 below:
Preprocessing
3.1 Data Collection
This study uses a Kaggle dataset of 1025 patient
records with 14 distinct features related to heart
Model selection
disease risk factors.