Download as docx, pdf, or txt
Download as docx, pdf, or txt
You are on page 1of 31

COVID-19 DEATH CASE ANALYSIS

A PROJECT REPORT
Submitted by
Babul Sahoo - 220720100110
Sarbajit Patra -220720100137
Durga madhab Acharya - 220720100353
Soumya Ranjan Jena -220720100147

in partial fulfilment for the award of the degree


of
MASTER IN COMPUTER APPLICATION
3rd semester

Under the Guidance of


Mr. TRILOCHAN SAHOO

DEPARTMENT OF MCA

CENTURION UNIVERSITY OF TECHNOLOGY AND


MANAGEMENT ,BHUBANESWAR, ODISHA, JAN, 2023

1
DECLARATION
We hereby declare that the project work being presented in this
projectentitled “COVID-19 DEATH CASE ANALYSIS”has been
done by us under the guidance of Mr. TRILOCHAN SAHOO the
Department of C.S.E, Centurion University of Technology and
Management, Odisha, India. We further declare that this work has not been
submitted elsewhere for the award of any other degree.

Date: Name:
Place: CUTM, BHUBANESWAR, Odisha Babul Sahoo - 220720100110
Sarbajit Patra -220720100137
Durga madhab Acharya - 220720100353
Soumya Ranjan Jena -220720100147

MCA
3rd sem
Dept. of Computer
Science CUTM

2
BONAFIDE CERTIFICATE

This is to certify that the dissertation entitled “Fake News Classification”


being submitted by “Babul Sahoo, Sarbajit Patra, Durga madhab
Acharya, Soumya Ranjan Jena” for the award of the degree of MCA are
carried out by our guidance Mr. TRILOCHAN SAHOO in the Department
C.S.E, Centurion University of Technology and Management, Odisha. The
results presented in this SRS have not been submitted elsewhere for the
award of any other degree to the best of my knowledge. In our opinion, this
work is fulfilling the requirements for the award of the degree mentioned.

Signature of the supervisor


Mr. TRILOCHAN SAHOO
Department C.S.E, Centurion University of
Technology and Management, Odisha

Certified that the above mentioned project has been duly


carried out as per the norm of the college statutes of the
university

SIGNAT
URE
(RAKESH KUMAR RAY)

PROFESSOR & HOD

DEPT. OF CSE, CUTM

DEPARMENTAL SEAL

3
ACKNOWLEDGEMENT

We would like to Acknowledgement our thanks to all academic committee


for active support in all the time during our dissertation work. We would
like to express our special thanks to gratitude to our professor
Mr.TRILOCHAN SAHOO for their able guidance and support is
complecting our dissertation.

Babul Sahoo - 220720100110


Sarbajit Patra -220720100137
Durga madhab Acharya - 220720100353
Soumya Ranjan Jena -220720100147

MCA 3RD SEM

Date :
Place: CUTM Bhubaneswar
Dept. of Computer Science
CUTM

4
Contents

1. ABSTRACT..............................................................................06
2. INTRODUCTION...................................................................07
3. INFORMATION ABOUT DATASET....................................08
4. LIBRARIES USED...........................................................09-11
5. CLASSIFICATION AND VISUALIZATION PART........12-26
6. CONCLUSION...................................................................27-28
REFERNCE

5
1. ABSTRACT

The COVID-19
pandemic has had a
profound impact on
global health, with
a significant focus
on understanding
and mitigating the
effects of the virus
on mortality rates.
This study delves
into the realm of
COVID-19 Death
Case Analysis,
employing data-
driven
methodologies to
assess the patterns,
implications, and
trends associated
with fatalities
caused by the virus.
The analysis aims
to provide a
comprehensive
understanding of
the demographic
and regional
variations in
COVID-19-related
mortality, enabling
informed decision-
making for public
health
interventions. 6
Utilizing a data-
centric approach,
2. Introduction

 The issue of "COVID-19 Death Case Analysis" has been a prominent


concern in recent times, reflecting the gravity of understanding and
interpreting the impact of the pandemic on mortality rates.

 At its core, we define "COVID-19 Death Case Analysis" as the systematic


examination of fatalities attributed to the virus. This involves scrutinizing
data to discern patterns, demographic variations, and regional trends
related to COVID-19-induced deaths. The analysis aims to provide a
comprehensive understanding of the severity of the pandemic, guiding
effective resource allocation, and informing targeted interventions in
healthcare management.

 However, it is imperative to acknowledge the complexity and nuances


inherent in COVID-19 death case analysis. The term itself has taken on
various connotations and interpretations, extending beyond the narrow
definition of mortality statistics. The analysis considers not only the direct
health implications but also the broader societal, economic, and political
ramifications of the pandemic. Similar to the politicization of "fake news,"
COVID-19 death case analysis is susceptible to being leveraged for
political agendas or to cast doubt on opposing viewpoints. The advent of
technological advances, particularly in the realm of data dissemination
through platforms such as social media, further complicates the analysis.

7
3.INFORMATION ABOUT DATASET:

• Data used:-Covid-19 Classification


• Size :-110 MB
• SOURCEOFOLLECTION
KAGGLE https://www.kaggle.com/clmentbisaillon/fake-and- real-
news-dataset
• TOTAL SIZES 3 FILES
• CONFIRMED FILE 57 OF 4 CLOUMNS 62.79 KB
• DEATH FILE 57 OF 4 CLOUMNS 53.58 KB
• RECOVERY FILE 57 COLUMNS 62.58 KB

8
4.Library
used
 Pandas
 NumPy
 Matplotlib
 Word cloud
 Tokenize
 Skleran

Pandas is a popular Python library for data analysis. It is not directly related to
Machine Learning. As we know that the dataset must be prepared before training. In
this case, Pandas comes handy as it was developed specifically for data extraction
and preparation. It provides high-level data structures and wide variety tools for data
analysis. It provides many inbuilt methods for grouping, combining and filtering
data.

NumPy is a very popular python library for large multi-dimensional array and matrix
processing, with the help of a large collection of high-level mathematical functions. It is
very useful for fundamental scientific computations in Machine Learning. It is
particularly useful for linear algebra, Fourier transform, and random number capabilities.
High-end libraries like TensorFlow uses NumPy internally for manipulation of Tensors.

9
Matplotlib is a very popular Python library for data visualization. Like Pandas, it
is not directly related to Machine Learning. It particularly comes in handy when
a programmer wants to visualize the patterns in the data. It is a 2D plotting
library used for creating 2D graphs and plots. A module named pyplot makes it
easy for programmers for plotting as it provides features to control line styles,
font properties, formatting axes, etc. It provides various kinds of graphs and plots
for data visualization, viz., histogram, error charts, bar chats, etc,

Word Cloud
is a data visualization technique used for representing text data in which
the size of each word indicates its frequency or importance. Significant
textual data points can be highlighted using a word cloud. Word clouds
are widely used for analyzing data from social network websites.
For generating word cloud in Python, modules needed are –
matplotlib, pandas and wordcloud. To install these packages, run the
following commands :
pip install matplotlib
pip install pandas
pip install wordcloud

The dataset used for generating word cloud is collected from UCI Machine Learning
Repository. It consists of YouTube comments on videos of popular artists.

Tokenize
In Python tokenization basically refers to splitting up a larger body of text
into smaller lines, words or even creating words for a non-English language.
The various tokenization functions in-built into the nltk module itself and

10
can be used in programs as shown below.

Scikit-learn is one of the most popular ML libraries for classical ML algorithms. It


is built on top of two basic Python libraries, viz., NumPy and SciPy. Scikit-learn
supports most of the supervised and unsupervised learning algorithms. Scikit-learn
can also be used for data-mining and data- analysis, which makes it a great tool
who is starting out with ML.

211.95

11
5.CLASSIFI
CATION
AND
SCREEN SHOT DATA SET:- CONFORMED DATA SET
VISUALIZ
ATION
PART

SCREEN SHOT DATA SET: - DEATHS DATA SET

12
SCREEN SHOT DATA SET: - RECOVERED DATA SET

13
14
15
16
17
18
19
20
OUTPUT

21
22
OUTPUT

23
OUTPUT

24
OUTPUT

25
OUTPUT

OUTPUT

26
OUTPUT

OUTPUT

27
OUTPUT

OUTPUT

CONCLUSI
ON 28
The world is under
the grasp of COVID-
19 virus. Early
prediction of the
transmission can
help to take
necessary actions.
This article
proposed to utilize
the machine
learning and deep
learning
models for epidemi
c.Future prediction
of potential
infections will
enable authorities
to tackle the
consequences
effectively.
Furthermore, it is
necessary to keep
up with the
number of infected
people by
performing regular
check-ups, and it is
often vital to
quarantine infected
people and adopt
medical measures
Prediction models
such as the PA, 29
ARIMA, and LSTM
algorithms were
REFERENCE

30
https://www.slideshare.net/irjetjournal/role-of-
machine-learning-techniques-in-covid19-prediction-
and-detection
[1] Wang L, Wong A
(2020) COVID-Net:
a tailored deep
convolutional
neural network
design for
detection of
COVID-19 cases
from chest
radiography
images.
[2] Beck BR, Shin B,
Choi Y, Park S, Kang
K. Predicting
commercially
available antiviral
drugs that may act
on the novel
coronavirus (SARS-
CoV-2) through a
drug- target
interaction deep
learning model
[3] Cohen et al
(2020) COVID-19
image data
collection
[4] Ting, Daniel Shu
Wei, Lawrence 31
Carin, Victor Dzau,
and Tien Y. Wong.

You might also like