Download as pptx, pdf, or txt
Download as pptx, pdf, or txt
You are on page 1of 11

FIRST PROGRESS REPORT

Heart disease
prediction

ASHMITA CHADHA
RIPUL DOGRA
PARTH SHARMA
LIBRARIES USED

1. NUMPY - numpy offers comprehensive mathematical functions,


random number generators, linear algebra routines, Fourier transforms, and more
2. PANDAS - pandas is a software library written for the Python
programming language for data manipulation and analysis
3. MATPLOTLIB - Matplotlib is a comprehensive library for
creating static, animated, and interactive visualizations in Python.
4. SEABORN- Seaborn is a library that uses Matplotlib underneath to
plot graphs. It will be used to visualize random distributions.
IMPORTING AND UNDERSTANDING OUR DATASET

Verifying it as a
Understanding our John Smith
'dataframe' object in
ATTRIBUTES better Despite being red,
pandas. Mars is cold

ANALYSING THE CHECKING Maria Wilson


TARGET CORRELATION Neptune is far away
VARIABLE BETWEEN COLUMNS from Earth
Understanding our ATTRIBUTES better
01 LOGISTIC REGRESSION

NAIVE BAYES
02
Model fitting
03 Support vector machine

04 K-nearest neighbour
ACCURACY: 85.25%
01
LOGISTIC
REGRESSION
Logistic regression estimates the probability of an event
occurring, such as voted or didn't vote, based on a given
dataset of independent variables. Since the outcome is a
probability, the dependent variable is bounded between 0 and
1.
ACCURACY: 85.25%
02
NAIVE BAYES
Naïve Bayes is a probabilistic machine learning algorithm based
on the Bayes Theorem, used in a wide variety of classification
tasks. Naive Bayes algorithms are mostly used in sentiment
analysis, spam filtering, recommendation systems etc. They are
fast and easy to implement but their biggest disadvantage is that
the requirement of predictors to be independent.
ACCURACY: 81.97%
03
SUPPORT VECTOR
MACHINE
The goal of the SVM algorithm is to create the best line or decision
boundary that can segregate n-dimensional space into classes so that we
can easily put the new data point in the correct category in the future. This
best decision boundary is called a hyperplane.
ACCURACY: 61.21%
04
K-NEAREST
NEIGHBOUR
The k-nearest neighbors algorithm, also known as
KNN or k-NN, is a non-parametric, supervised
learning classifier, which uses proximity to make
classifications or predictions about the grouping
of an individual data point.
THANK
YOU

You might also like