
Heart failure prediction using PCA as a dimension-reduction technique
What my dataset looks like!

As you can see, I can't feed this dataset into my machine learning model yet because it contains a lot of categorical variables.
Data preparation

I explored my dataset to check for missing values and, fortunately, there are none.

● So, next, I explored each column with categorical variables.

● This gave me insight into how I can handle them.
● For columns with just two unique values, I used label encoding, while I used one-hot encoding for the rest.

NB: When using one-hot encoding, remember to drop the first category (the drop-first argument), as this helps to avoid multicollinearity.
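A minimal sketch of both encoding steps, using a hypothetical mini-frame (the column names are illustrative assumptions, not the actual dataset's):

```python
import pandas as pd

# Hypothetical stand-in for the heart-failure dataset.
df = pd.DataFrame({
    "Sex": ["M", "F", "M", "F"],
    "ChestPainType": ["ATA", "NAP", "ATA", "ASY"],
})

# Binary column -> label encoding (map the two values to 0/1).
df["Sex"] = df["Sex"].map({"M": 1, "F": 0})

# Multi-category column -> one-hot encoding; drop_first=True removes
# one dummy per column to avoid the multicollinearity issue noted above.
df = pd.get_dummies(df, columns=["ChestPainType"], drop_first=True)
```

With three categories, `drop_first=True` leaves only two dummy columns; the dropped category is implied when both dummies are zero.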
Scaling of my dataset

There are two reasons why I scaled my dataset:

● PCA is sensitive to the scale of the features: unscaled variables with large variances dominate the principal components.

● The variables in my dataset are terribly lopsided. For example, my dummies are zeros and ones, while other variables are far larger; throwing them into the model without scaling might confuse it.
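The scaling step can be sketched with `StandardScaler` on a toy matrix that mimics the lopsided ranges described (a 0/1 dummy next to a large-valued column; the numbers are made up):

```python
import numpy as np
from sklearn.preprocessing import StandardScaler

# Toy mixed-scale matrix: a 0/1 dummy column beside a much larger one.
X = np.array([[0.0, 210.0],
              [1.0, 340.0],
              [0.0, 180.0],
              [1.0, 260.0]])

# Standardize each column to mean 0 and unit variance so no single
# feature dominates the PCA variance computation.
scaler = StandardScaler()
X_scaled = scaler.fit_transform(X)
```

After scaling, both columns contribute on equal footing to the covariance matrix that PCA decomposes.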
Challenges
Before I use PCA, I want to find out how much accuracy I can get with different machine learning models, but that would mean too many iterations to repeat, especially when searching for the best parameters for each model. So I will be using GridSearchCV. In the next slide, I will show you how I used GridSearchCV to find the model with the best accuracy.
How I used
GridSearchCV

Firstly, I created a dictionary of all the models I wanted to use and their parameters.
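The dictionary-of-models approach can be sketched as follows. The model choices, parameter grids, and synthetic data here are illustrative assumptions, not the actual setup from the slides:

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV

# Hypothetical stand-in for the prepared heart-failure features.
X, y = make_classification(n_samples=200, n_features=16, random_state=42)

# One entry per candidate model: the estimator plus its parameter grid.
model_params = {
    "logistic_regression": {
        "model": LogisticRegression(max_iter=1000),
        "params": {"C": [0.1, 1, 10]},
    },
    "random_forest": {
        "model": RandomForestClassifier(random_state=42),
        "params": {"n_estimators": [50, 100]},
    },
}

# GridSearchCV cross-validates every parameter combination for each
# model, so one loop replaces many manual iterations.
scores = []
for name, mp in model_params.items():
    clf = GridSearchCV(mp["model"], mp["params"], cv=5)
    clf.fit(X, y)
    scores.append({"model": name,
                   "best_score": clf.best_score_,
                   "best_params": clf.best_params_})

best = max(scores, key=lambda s: s["best_score"])
```

`best` then identifies the model and parameter combination with the highest cross-validated accuracy.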
Best scores without PCA
I need to say that using PCA doesn't mean the accuracy of our model will increase; usually it decreases, but the computation is much lighter, and this is one of the trade-offs we accept in the industry.
PCA reduces the number of variables in a dataset while
maintaining as much information as possible. It transforms the
original variables into a new set of variables, which are called
principal components. These components are ordered so that
the first few retain most of the variation present in all of the
original variables.
How I implemented PCA
Model accuracy significantly improved after PCA, to about 89.67%. PCA reduced my features from the initial 16 down to 13 principal components.
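A minimal sketch of the PCA step, assuming standardized data and the 16-to-13 reduction mentioned above (the synthetic matrix stands in for the real features):

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)
# Stand-in for the 16 encoded features; the real dataset differs.
X = rng.normal(size=(100, 16))
X_scaled = StandardScaler().fit_transform(X)

# Keep 13 components, as in the slides; alternatively, passing a float
# such as n_components=0.95 keeps enough components for 95% of the variance.
pca = PCA(n_components=13)
X_pca = pca.fit_transform(X_scaled)
```

The transformed matrix `X_pca` feeds into the same GridSearchCV comparison, now over 13 columns instead of 16.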
Thank you
