Professional Documents
Culture Documents
Covid-19 Analysis MLUP
Covid-19 Analysis MLUP
A PROJECT REPORT
Submitted by
Babul Sahoo - 220720100110
Sarbajit Patra -220720100137
Durga madhab Acharya - 220720100353
Soumya Ranjan Jena -220720100147
DEPARTMENT OF MCA
1
DECLARATION
We hereby declare that the project work being presented in this
projectentitled “COVID-19 DEATH CASE ANALYSIS”has been
done by us under the guidance of Mr. TRILOCHAN SAHOO the
Department of C.S.E, Centurion University of Technology and
Management, Odisha, India. We further declare that this work has not been
submitted elsewhere for the award of any other degree.
Date: Name:
Place: CUTM, BHUBANESWAR, Odisha Babul Sahoo - 220720100110
Sarbajit Patra -220720100137
Durga madhab Acharya - 220720100353
Soumya Ranjan Jena -220720100147
MCA
3rd sem
Dept. of Computer
Science CUTM
2
BONAFIDE CERTIFICATE
SIGNAT
URE
(RAKESH KUMAR RAY)
DEPARMENTAL SEAL
3
ACKNOWLEDGEMENT
Date :
Place: CUTM Bhubaneswar
Dept. of Computer Science
CUTM
4
Contents
1. ABSTRACT..............................................................................06
2. INTRODUCTION...................................................................07
3. INFORMATION ABOUT DATASET....................................08
4. LIBRARIES USED...........................................................09-11
5. CLASSIFICATION AND VISUALIZATION PART........12-26
6. CONCLUSION...................................................................27-28
REFERNCE
5
1. ABSTRACT
The COVID-19
pandemic has had a
profound impact on
global health, with
a significant focus
on understanding
and mitigating the
effects of the virus
on mortality rates.
This study delves
into the realm of
COVID-19 Death
Case Analysis,
employing data-
driven
methodologies to
assess the patterns,
implications, and
trends associated
with fatalities
caused by the virus.
The analysis aims
to provide a
comprehensive
understanding of
the demographic
and regional
variations in
COVID-19-related
mortality, enabling
informed decision-
making for public
health
interventions. 6
Utilizing a data-
centric approach,
2. Introduction
7
3.INFORMATION ABOUT DATASET:
8
4.Library
used
Pandas
NumPy
Matplotlib
Word cloud
Tokenize
Skleran
Pandas is a popular Python library for data analysis. It is not directly related to
Machine Learning. As we know that the dataset must be prepared before training. In
this case, Pandas comes handy as it was developed specifically for data extraction
and preparation. It provides high-level data structures and wide variety tools for data
analysis. It provides many inbuilt methods for grouping, combining and filtering
data.
NumPy is a very popular python library for large multi-dimensional array and matrix
processing, with the help of a large collection of high-level mathematical functions. It is
very useful for fundamental scientific computations in Machine Learning. It is
particularly useful for linear algebra, Fourier transform, and random number capabilities.
High-end libraries like TensorFlow uses NumPy internally for manipulation of Tensors.
9
Matplotlib is a very popular Python library for data visualization. Like Pandas, it
is not directly related to Machine Learning. It particularly comes in handy when
a programmer wants to visualize the patterns in the data. It is a 2D plotting
library used for creating 2D graphs and plots. A module named pyplot makes it
easy for programmers for plotting as it provides features to control line styles,
font properties, formatting axes, etc. It provides various kinds of graphs and plots
for data visualization, viz., histogram, error charts, bar chats, etc,
Word Cloud
is a data visualization technique used for representing text data in which
the size of each word indicates its frequency or importance. Significant
textual data points can be highlighted using a word cloud. Word clouds
are widely used for analyzing data from social network websites.
For generating word cloud in Python, modules needed are –
matplotlib, pandas and wordcloud. To install these packages, run the
following commands :
pip install matplotlib
pip install pandas
pip install wordcloud
The dataset used for generating word cloud is collected from UCI Machine Learning
Repository. It consists of YouTube comments on videos of popular artists.
Tokenize
In Python tokenization basically refers to splitting up a larger body of text
into smaller lines, words or even creating words for a non-English language.
The various tokenization functions in-built into the nltk module itself and
10
can be used in programs as shown below.
211.95
11
5.CLASSIFI
CATION
AND
SCREEN SHOT DATA SET:- CONFORMED DATA SET
VISUALIZ
ATION
PART
12
SCREEN SHOT DATA SET: - RECOVERED DATA SET
13
14
15
16
17
18
19
20
OUTPUT
21
22
OUTPUT
23
OUTPUT
24
OUTPUT
25
OUTPUT
OUTPUT
26
OUTPUT
OUTPUT
27
OUTPUT
OUTPUT
CONCLUSI
ON 28
The world is under
the grasp of COVID-
19 virus. Early
prediction of the
transmission can
help to take
necessary actions.
This article
proposed to utilize
the machine
learning and deep
learning
models for epidemi
c.Future prediction
of potential
infections will
enable authorities
to tackle the
consequences
effectively.
Furthermore, it is
necessary to keep
up with the
number of infected
people by
performing regular
check-ups, and it is
often vital to
quarantine infected
people and adopt
medical measures
Prediction models
such as the PA, 29
ARIMA, and LSTM
algorithms were
REFERENCE
30
https://www.slideshare.net/irjetjournal/role-of-
machine-learning-techniques-in-covid19-prediction-
and-detection
[1] Wang L, Wong A
(2020) COVID-Net:
a tailored deep
convolutional
neural network
design for
detection of
COVID-19 cases
from chest
radiography
images.
[2] Beck BR, Shin B,
Choi Y, Park S, Kang
K. Predicting
commercially
available antiviral
drugs that may act
on the novel
coronavirus (SARS-
CoV-2) through a
drug- target
interaction deep
learning model
[3] Cohen et al
(2020) COVID-19
image data
collection
[4] Ting, Daniel Shu
Wei, Lawrence 31
Carin, Victor Dzau,
and Tien Y. Wong.