Professional Documents
Culture Documents
Predictive Analytics in Healthcare: An Engineering Project in Community Service
Predictive Analytics in Healthcare: An Engineering Project in Community Service
Submitted by
NAMAN AGARWAL
(19BCE10395)
VAGHASIA AUM
(19BCE10437)
YASH SAXENA
(19BCE10115)
BOMMAKANTY SATYA SAI ANIRUDH
(19BCE10045)
YASH GUPTA
(19BME10026)
PUNEET GOYAL
(19BEC10007)
JATIN DIDWANIA
(19BCE10237)
DEVANSH RANA
(19BAI10052)
BACHELOR OF TECHNOLOGY
This project report (Phase II) is submitted for the Project Viva-Voce examination
held on 22th April 2022.
Supervisor
1. Introduction 4
1.1 Motivation 4
1.2 Objective 5
2. Existing Work 6
4. Individual Contribution 19
5. Conclusion 22
6. References 22
1. Introduction
In today’s world health and well-being is a major concern for all. Millions of people living in this
world deal with some kind of heart disease. They experience difficulties in breathing due to this
disease which often turns fatal. Moreover, people nowadays mostly sit and work on their computers
every day and not exercising which in turn making the person obese and prone to many heart
problems. Not just the common people but the patients with existing heart problems are increasing
and all of them need constant monitoring. It is not possible for a doctor or nurse to always monitor
their patients so a there is need for computer to start tracking and predicting through AI basic heart
problems. The device which we have made will warn people before hand if their heart rates have
increased significantly without them doing any heavy activities. By knowing this information prior,
they can go the hospital and get necessary medication before things get serious.
1.1 Motivation
Have we ever considered how much stress we are giving on our heart without even knowing? But
even by knowing this have we ever taken any action to prevent us from serious ailments? Our main
aim of this project is to warn people beforehand about the condition about their heart incase their
BPM rises without a warning. This is will also help the patient know if they are suffering from any
heart conditions due to the irregular BPMs. The necessary information from the object detector can
be used for avoiding fatal scenarios and can be used to prevent before it gets serious to the point
where not even a medical treatment can save the person. This will even help the hospitals to know
well about what is happening with the patient as they will have the data to understand and act
accordingly to aid the person.
Using the technology, sensors and materials to create a working model to help the people know the
condition of their heart so that they could perform any necessary medication if needed.
1.2 Objective
Health is always the number one priority to everyone in the world. Without proper health one
cannot perform their day-to-day activities with ease. The objective of this project is to develop a
working model using sensors and AI technology to predict the diseases which the person has by
monitoring their heart rate. Hardware technology that could be designed for the at time use
without requiring apps to activate. Developing such wristband for the people who are prone to
heart diseases will help them by alerting if their hear rate is increasing or there is any
abnormality in the heart beat.
2. Existing Works/ Literature review
Many tools and techniques are used to support the predictive analytics in the analysis of
healthcare data such and logistic and linear regression to train our AI model. Correlational
Matrix and Data Visualization is also used to ML. In order to predict the level of gravity and
risk of certain disease in patients some algorithms are used such as Logistic Regression, Genetic
and decision tree algorithms based on medical data such as the patients’ symptoms and
laboratory measures. Moreover, to predict the early stages of certain diseases in patients some
algorithms are used such as decision tree and logistic regressions are used. For the frontend part
of our website where we post all the reports, HTML, CSS, Bootstrap and JavaScript is used.
For the backend part where we store all the data of our website and data collected from it, we
used Django, PHP and SQL Database. As for the IoT part we used AD8232 Heart Beat Node
which will help us to record the pulse and through these pulses we are able to predict the disease.
The predictive analytics have been widely used by researcher to solve the readmission problems.
Thus, to enhance the quality of care provided to patients with less cost there have been a strategy
to emerge the hospitals readmissions as a quality indicator to prevent it. The predictive models
have been developed to predict the risk of readmission by utilizing the mentioned datasets. The
algorithms used in the prediction are multiple classification algorithms especially logistic
regression and the random forest algorithm due to its ability to perform with all the types of
predictor variables and the analytical platform used. Moreover, the predictive analytics have
been used in health care to be able to prevent high risk from many chronic diseases which will
help in saving patient lives and providing better quality of care.
2.3 Existing work and their limitations
A lot of smartphones manufacturers have made their own versions of this device in form of a
wrist band or by embedding it in the smartphone itself. The main problem of these devices is
lack of effective data. The used data by them is not clean, precise or correctly formatted. Often
times they will fail to alert you about whether you are good or not.
These devices also cost a lot of money and generally oriented for business purpose instead of
helping people.
3 Topic of Work
The project involved analysis of the heart disease patient dataset with proper data processing. Then,
different models were trained and predictions are made with different algorithms KNN, Decision
Tree, Random Forest, SVM, Logistic Regression etc. This is the Jupyter notebook code and dataset
We’ve used for my Kaggle kernel 'Binary Classification with Sklearn and Keras’.
A variety of Machine Learning algorithms, implemented in Python, to predict the presence of heart
disease in a patient. This is a classification problem, with input features as a variety of parameters,
and the target variable as a binary variable, predicting whether heart disease is present or not.
Machine Learning algorithms used:
1. Logistic Regression (Scikit-learn)
2. Naive Bayes (Scikit-learn)
3. Support Vector Machine (Linear) (Scikit-learn)
4. K-Nearest Neighbors (Scikit-learn)
5. Decision Tree (Scikit-learn)
6. Random Forest (Scikit-learn)
7. XGBoost (Scikit-learn)
Artificial Neural Network with 1 Hidden layer (Keras)
Firstly, we trained our model with the preexisting data found on the internet to check its functioning.
We were able to achieve accurate information from this session and the results were good. After that
we had that device installed on ourselves and it was able to tell accurately our current heart beat rate.
We used the following models to train the data
1. age
2. sex
3. chest pain type (4 values)
4. resting blood pressure
5. serum cholesterol in mg/dl
6. fasting blood sugar > 120 mg/dl
7. resting electrocardiographic results (values 0,1,2)
8. maximum heart rate achieved
9. exercise induced angina
10. oldpeak = ST depression induced by exercise relative to rest
11. the slope of the peak exercise ST segment
12. number of major vessels (0-3) colored by flourosopythal: 3 = normal; 6 = fixed defect; 7 =
reversable defect
The prediction as it names implied is one of the data mining techniques that discovers relationship
between independent variables and relationship between dependent and independent variables.
For instance, prediction analysis technique can be used in blood donors to predict the behavior for
the future if we consider donor is an independent variable, blood could be a dependent variable.
Then based on the historical data, we can draw a fitted regression curve that is used for donor’s
behavior prediction.
Regression technique can be adapted for predication.
Disease Prediction: - Patient will specify the symptoms caused due to his illness. The system will
ask certain symptoms regarding his illness and then predict the disease based on the symptoms
specified by patient.
Result: - The result will include user’s BMI, disease, what you should do, exercise and a diet chart,
blogroll. All the tuples have different diseases and related to that we have different sets of exercises
and diet chart.
Model's accuracy is 79.6 +- 1.4%. The following are the results of analysis done on the available
heart disease dataset. Each graph shows the result based on different attributes. Green box
indicates No Disease. Red box indicates Disease.
GRAPHS -
Fig 4 Age group Fig 5 Exercise Induced Angina
6. REFERENES
https://how2electronics.com/iot-ecg-monitoring-ad8232-ecg-sensor-esp8266/
https://www.ijeat.org/wp-content/uploads/papers/v8i4/D6137048419.pdf
https://link.springer.com/content/pdf/10.1007/s42979-020-00365-y.pdf
https://www.quora.com/Why-is-data-analytics-important-in-healthcare
https://en.wikipedia.org/wiki/Health_care_analytics
https://github.com/topics/healthcare-analysis
https://github.com/coreycoole/heart_disease_analysis
https://github.com/NamiF/The-cost-of-healcare-related-to-longevity
UCI Dataset link : http://archive.ics.uci.edu/ml/datasets/Heart+Disease
https://github.com/Byte7/Heart-Disease-Analysis-and-Prediction.git
https://www2.deloitte.com/content/dam/Deloitte/ec/Documents/life-sciences-health-care/
DI_Predictive-analytics-in-health-care%20(2).pdf