Title Name School Supervisor SRN Date Progress Report Number Duration of The Project

You might also like

Download as pptx, pdf, or txt
Download as pptx, pdf, or txt
You are on page 1of 8

TITLE : HEART DISEASE PREDICTION USING ML

NAME : T HARSHA VARDHAN REDDY


SCHOOL : SCHOOL OF APPLIED SCIENCES
SUPERVISOR : POONAM YADAV
SRN : R21SC043
DATE : 20/01/2024
PROGRESS REPORT NUMBER :
DURATION OF THE PROJECT :
Progress
Review
Meeting
CONTENT

DATASET AND FEATURES


METHODS/MODELS
CONCLUSIONS /FUTURE WORK
WEBAPP/USER INTERFACE

02
DATASET AND FEATURES
• Our dataset is based on UCI heart Disease Data Set [6] and we have 303 instances. According to UCI, “This database
contains 76 attributes, but all published experiments refer to using a subset of 14 of them”.We guess too many features will
bring too much noise, so people has done feature extraction and reduce 76 features to 14 features. To better understand the
meaning of the features, we have the responsibility to explain some of the attributes of original dataset from UCI as
follows:
• age: age in years
• sex: sex (1 = male; 0 = female)
• cp: chest pain type
-- Value 0: typical angina
-- Value 1: atypical angina
-- Value 2: non-anginal pain
-- Value 3: asymptomatic
• trestbps: resting blood pressure (in mm Hg on admission to the hospital)
• Chol: serum cholesterol in mg/dl
• fbs: (fasting blood sugar > 120 mg/dl) (1 = true; 0 = false)
• target: Heart disease (0 = no, 1 = yes) 03
METHODS/MODELS
• During this project, we have tried 5 algorithms for experiments, and
they are Logistic Regression , SVC, DecisionTreeClassifier,
KNeighborsClassifier and Random Forest .
1. Logistic Regression
2. SVC (Support Vector Classifier)
3. DecisionTreeClassifier
4. KNeighborsClassifier
5. Random Forest

04
CONCLUSIONS /FUTURE WORK

We use some libraries provided by Python to implement this project.


After the experiments, the algorithm of Random Forest gives us the best
test accuracy, which is 91.8%. The reason why it outperforms others is
that it is not limited to the property of the dataset. Logistic Regress
requires the features to be linearly separable. Though we get a good
result of 91.8% accuracy, that is not enough because it cannot
guarantee that no wrong diagnosis happens. To improve accuracy, we
hope to require more dataset because 260+ instances of dataset are not
sufficient to do an excellent job.

05
WEBAPP/USER INTERFACE

We have created a webapp or user interface to predict the Heart


Disease or Cardiovascular Diseases by predicting using the attributes
or characteristics that are provided by user that we mentioned to input
through a user interface we created. In that page it will predict the heart
disease and display the results whether person or user is having heart
disease or not.
We used Streamlit app to create web interface for users with a machine
learning model connected to it .

05
Thank You

You might also like