Download as docx, pdf, or txt
Download as docx, pdf, or txt
You are on page 1of 14

A REPORT OF MINIPROJECT

SUBMITTED TO THE SAVITRIBAI PHULE PUNE UNIVERSITY,


PUNE IN PARTIAL FULFILLMENT OF THE REQUIREMENTS
FOR THE THIRD YEAR OF THE DEGREE

BACHELOR OF ENGINEERING
In

COMPUTER ENGINEERING

By

PIYUSH BANDAL ROLL NO: 305A024

TEJAS BANKAR ROLL NO: 305A025

SHARAYU BHAD ROLL NO: 305A028


SOHAM BHASKARWAR ROLL NO: 305A031

Under the guidance of

Prof. S. S. Peerzade

DEPARTMENT OF COMPUTER ENGINEERING

SINHGAD COLLEGE OF ENGINEERING, PUNE-41

Accredited by NAAC

2023-24
Sinhgad Technical Education Society,
Sinhgad College of Engineering, Pune-41 Department
of Computer Engineering

Date:

CERTIFICATE
This is to certify that the project

Submitted by

PIYUSH BANDAL ROLL NO: 305A024

TEJAS BANKAR ROLL NO: 305A025

SHARAYU BHAD ROLL NO: 305A028


SOHAM BHASKARWAR ROLL NO: 305A031

have successfully completed the work associated with Data Science and Big Data Analytics
Laboratory titled as “To Analyze the Covid Vaccine Data” and has submitted the work book
associated under my supervision, in the partial fulfillment of Third Year Bachelor of
Engineering of Savitribai Phule Pune University

Prof. S. S. Peerzade Dr. M. P. Wankhade


Department of Head
Computer Engineering Department of Computer
Engineering

Dr. S. D. Lokhande
Principal
Sinhgad College of Engineering

I
ACKNOWLEDGEMENT

Words are often less to reveal once deep regards, with an understanding that
words like this can never be the outcome of a single person. I take this
opportunity to express our profound sense of gratitude and respect to all those
who directly or indirectly help us through the duration of this work.

It gives me great pleasure to present a dissertation on “To Analyze the Covid


Vaccine Data". Therefore, it become my duty to express my gratitude towards

themes. I am extremely obliged to our seminar guide Prof. S. S. Peerzade in


the department of Computer Engineering for helping and providing us with
proper guidance. Timely suggestions by this professor made it possible for us to
complete this seminar successfully.

I will fail in my duty if I don't acknowledge a great sense of gratitude to the


head of the computer engineering department Dr. M. P. Wankhade the entire
staff members of our department for their cooperation.

II
TABLE OF CONTENT

Page no.
Title Page
Certificate I
Acknowledgement II

1. Introduction 1

2. System Overview 4

3. Software and Hardware Requirements 5

4. Analytics 6

5. Conclusion 9

References
To Analyze the Covid Vaccine

Chapter-1
Introduction

1.1 Introduction

This pandemic has caused global social and economic disruption, including
the largest global recession since the Great Depression. we are going to
analyses the Covid-19 data using Python and some graphing libraries,
project the total number of confirmed cases of COVID19, and also display
the total number of deaths for a country (this article uses India as an
example) on a given date. Humans sometimes need help interpreting and
processing the meaning of data, so this Project also demonstrates how to
create a graph for various parameters. Data visualization is a graphical
representation of any data or information. Visual elements such as charts,
graphs, and maps are the few data visualization tools that provide the
viewers with an easy and accessible way of understanding the represented
information. We didn't just stop at displaying the data but went ahead and
made predictions for values of total confirmed cases, deaths and recovered
for both India and the world. We obtained our predictions by using a
Facebook Prophet model. Facebook Prophet is an excellent time series
model developed by Facebook. It is fast and simple it also automatically
takes into account the crucial time points in the data provided. Using this
model, we have obtained appreciable accuracy in India predictions and
excellent accuracy in world predictions. The predictions are displayed under
Predictions.

1 TE(Computer Engineering)
To Analyze the Covid Vaccine

1.2 Motivation

Analyzing COVID-19 vaccine data using data science techniques serves


several important motivations: Efficacy Assessment, Safety Monitoring,
Vaccine Distribution and Allocation, Modeling and Forecasting, Vaccine
Adverse Event Signal Detection. Overall, analyzing COVID-19 vaccine data
in data science serves the larger goal of gaining insights into vaccine
efficacy, safety, distribution, and impact. These insights are essential for
informing public health policies, optimizing vaccination campaigns, and
ultimately mitigating the impact of the COVID-19 pandemic.

1.3 Problem Definition

The problem definition for analyzing COVID vaccine data in data science
can be formulated as follows: Given a dataset containing information about
COVID-19 vaccinations, including attributes such as vaccine type,
administration dates, demographic information, and reported 7 adverse
events, the goal is to apply data science techniques and methodologies to
gain insights and understanding from the data. This involves exploring the
dataset, performing data cleaning and preprocessing, conducting descriptive
and inferential analysis, and potentially building predictive models.

1.4 Objectives

1. To learn the concepts of data analytics and data visualization


2. To develop in depth understanding and implementation of the key
technologies in data science and big data analytics.

2 TE(Computer Engineering)
To Analyze the Covid Vaccine

1.5 Scope

1. Vaccine Efficacy: Data science techniques can help determine the


efficacy of different COVID-19 vaccines by analyzing large-scale clinical
trial data. This involves assessing the effectiveness of the vaccines in
preventing infection, reducing symptoms, and preventing severe illness.
2. Safety Monitoring: Analyzing vaccine data can help identify and
monitor any potential adverse events or side effects associated with COVID-
19 vaccines. Data scientists can use statistical methods and machine learning
algorithms to identify patterns and trends in adverse events data and assess
their significance.
3. Vaccine Distribution and Impact: Data science can aid in optimizing
the distribution of COVID-19 vaccines. By analyzing demographic and
geographic data, data scientists can identify high-risk populations and areas
that require priority vaccination. They can also assess the impact of
vaccination campaigns on infection rates, hospitalizations, and mortality.

3 TE(Computer Engineering)
To Analyze the Covid Vaccine

Chapter-2
System Overview

2.1 Introduction of Libraries

1. Numpy: It is a very important library on which almost every data


science or machine learning Python packages such as SciPy (Scientific Python),
Mat−plotlib (plotting library), Scikit-learn, etc. depends on to a reasonable
extent. NumPy is very useful for performing mathematical and logical
operations on Arrays. e.g.: import numpy as np.

2. Pandas: Pandas is a game-changer for data science and analytics,


particularly if you came to Python because you were searching for something
more powerful than Excel and VBA. Pandas uses fast, flexible, and expressive
data structures designed to make working with relational or labeled data both
easy and intuitive. e.g.: import pandas as pd.

3. Matplotlib: Matplotlib is a Python library for data visualizations. It


helps you to mostly plot 2- D dimensional graphs. It is built upon Numpy and
Scipy framework. There are various types of graph plotting can be done using
Matplotlib. The following graph plots can be done.

4. Seaborn: Seaborn helps to visualize the statistical relationships, To


understand how variables in a dataset are related to one another and how that
relationship is dependent on other variables, we perform statistical analysis.
This Statistical analysis helps to visualize the trends and identify various
patterns in the dataset. E.g.: import seaborn as sns.

4 TE(Computer Engineering)
To Analyze the Covid Vaccine

Chapter-3
Software And Hardware Requirements

4.1 Software Requirements

1. Anaconda Navigator (Jupyter Notepad).


2. Google Colab.

4.2 Hardware Requirements


1. Operating System: - Windows 10
2. RAM: - 8GB
3. Hard Disk: - 1TB
4. Processor: - Intel core i3

5 TE(Computer Engineering)
To Analyze the Covid Vaccine

Chapter-4
Analytics

Following Analytics performed for given dataset:

1. Describe the dataset:

2. Top 10 states with most active cases in India:

6 TE(Computer Engineering)
To Analyze the Covid Vaccine

3. Top 10 States with most deaths:

4. Male and Female Vaccination:

7 TE(Computer Engineering)
To Analyze the Covid Vaccine

5. Top 5 states vaccinated in India:

6. Full Dataset using Barplot and Rugplot:

8 TE(Computer Engineering)
To Analyze the Covid Vaccine

Chapter-5
Conclusion

In this way we can analyses the dataset using Python and can estimate
other the factors as well. I hope that now it would be easy for you to
create, analyses, and monitor the consequences and effects of COVID-19.
We have implemented covid vaccine data report for every state.

9 TE(Computer Engineering)
To Analyze the Covid Vaccine

References

[1] https://www.kaggle.com/sudalairajkumar/covid19india?select=covid_v
accine_state wise.csv
It is used for download the Covid vaccine dataset.

[2] Books: - Data science and big data analytics Publication:


Technical. To learn how to analytics the data and visualization data in
this book, syntax and how to retrieve data

1 TE(Computer Engineering)

You might also like