Professional Documents
Culture Documents
An Application of Spatial Data Mining in The Study of Corona Virus (COVID-19) Pandemic Through Statistical Approach
An Application of Spatial Data Mining in The Study of Corona Virus (COVID-19) Pandemic Through Statistical Approach
Abstract
In the technological world Spatial Database Management System (SDBMS) has been a vital role
to study neighbourhood relation. The core concept of spatial data mining we need to investigate the
neighbors of many objects in the single run of typical data mining algorithm. This means that in spatial data
mining algorithm we have to efficiently process the neighborhood relation. An integration of spatial data
mining algorithms and the potential of spatial database management system (SDBMS) will help efficiently
providing general concept of neighborhood relation and its implementation. This paper focusses that the
neighbouhood relation of the Corona virus (COVID-19) pandamic in India as on 30th March 2020. For the
significant study of the corona virus SPSS and SQL query has been used.
Keywords: Spatial Data Base, COVID-19, SPSS, Pandemic
1. Introduction
The explosive growth of spatial database has far outpaced the human potential to
interpret this statistics. This creates an urgent need for new technology and equipment that
support the human in transforming the facts into useful facts and knowledge. Spatial
Database Management System (SDBMS) is the database structures for the control of
spatial information [2]. Spatial Data Mining (SDM) is the technique to locate the implicit
regularities, rules or patterns hidden in massive spatial database [1][4][5][6].
Spatial database framework is a database framework which offers spatial realities
types (SDT) in its data form and shape question language. We utilize set of components
as a general outline of spatial things [3]. The spatial database control gadget need to be
able to retrieve from a massive collection of objects in some space the ones lying within a
specific area with out scanning the whole set. The spatial database control system should
be capable of retrieve from a massive series of gadgets in a few area those existing in a
specific locale without examining the entire set. For that spatial ordering is obligatory.
The DBMS enables numerous spatial file to shape e.g. R-tree [8]. They are used in
rushing up the processing of spatial queries or nearest queries [7]. The SDBMS musty
also have the function to connect object from special classes. The business RDBMS (e.g.
Oracle) can be used to combine the fundamental operations for spatial records mining
[10]. The topological relations [9] among two items A and B are gotten from the nine
meeing points of the insides, the limits and the supplements of A and B with each other.
In addition, these devices are commonly intended to see client purchasing behaviors in
advertise crate records [11]. The SDBMS smelly likewise have the trademark to interface
object from uncommon classes' druing a couple of spatial relationship.
The 2019 corona virus disease (COVID-19) epidemic in Chinaisa global health
care threat [12] andisby a ways the largest outbreak of a standard pneumonia considering
the fact that the Severe Acute Respiration Syndrome (SARS) outbreak in 2003. Within
weeks of the initial outbreak the entire number of instances and deaths passed the ones of
SARS [13]. A Statistical Study on the Impact of Dengue Fever in Thanjavur District
Using SPSS was made by Dr.R.Arumugam et. al 2019 [17].
The outbreak was first discovered in late December 2019 while clusters of
pneumonia cases of unknown etiology were found to be associated with
epidemiologically related publicity to a seafood market and untraced exposures inside the
city of Wuhan of Hubei Province [14]. Since, the wide variety of cases has persevered to
escalate exponentially within and past Wuhan, spreading to all 34 regions of China by 30th
January 2020. On the identical day, the World Health Organization (WHO) declared the
COVID-19 outbreak a public health emergency of worldwide concern [15]. In this paper a
set of database queries of the corona virus (COVID-19) pandamic has been tested and
introduced for mining the spatial database the use of SPSS tool.
2. Methods and Meterials
2.1 Spatial Data and Spatial Database System
In different fields there's a need to manipulate spatial data i.e. data associated with
space. One distinguished instance of spatial information is the satellite informations for
the corona virus (COVID-19). To extract information from a satellite it needs to be
processed w.r.t spatial body of reference, in all likelihood our Earth’s surface. But the
satellite information is n't the simplest the spatial records and our Earth surface are not the
only body of reference. Since the advent of relational database device there were tries to
manipulate such facts in database.
The necessities and techniques for dealing with objects in area that have identity
and well defined active cases, recovaries, and deaths. Here we are discussing spatial
database systems in the constrained sense. The queries or command that we execute on
spatial records is known as spatial query.
For example, the queries are given for the following questions,
1. Which states are afftected by means of Corona?
2. How many peoples are affected?
3. How many peoples are recoverd? And
4. How many deaths are occurred?
Like that lot of queries are decribed in this paper. And list out all information the
use of quary language based on the spatial query.
Figure 1: Timeline of the pandemic spread across India (As on 30th March 2020)
Secondly, if our database keeps the detail of a country name, affected areas,
recovories, deaths and total active cases are list out here based on the spatial information.
Then the question like, list the top five corona affcted states, in which state greater than
five is a non- spatial query and database management system with spatial data and spatial
query is essentially required.
3. Analysis
3.1 Modeling the Spatial Database
Corona afftected place (ie active cases at different states in India), recoveries
from the corona virus (COVID-19) and deaths can be designed using sql query language
and queries of the outputs are displayed in the following table1, table 2 and table 3.
Query Model:
3.2 Neighborhood relation of the Corona States
The neighborhood relation says that the mutual influence among more than two states
(i.e., objects) depends on factors such as the topology, the maintaing social distancing and
practice respiratory hygiene. For example a population pond can cause different degree
and different levels of Corona pandemic in the neighborhood location. The topological,
social distancing and respiratory hygiene relation are the binary relation.
Query3
State Active cases
Chandigarh 9
Figure 3: Freq level in active case Figure 4: Freq level in recoveries Figure 5: Freq level in death stage
study the statistical software SPSS is used. Also we are presented the ANOVA table for
the study of the significant at 5% level.
REFERENCES
[1] Agrawal, R., T.Imielinski and A.Swami, Database mining: A overall performance
perspective. IEEE Transactions on Knowledge and Data Engineering, five
(6):914–925, 1993.
[3] Egenhofer, M.J. Reasoning about binary topological relations. In Proc. second
Int. Symp. On huge spatial Databases, Zurich, Switzerland, PP.143-160, 1991.
[4] Ester, M., H.P.Kriegel., and J. Sander., Spatial statistics mining: A database
method. In Proc. fifth Int. Symp. On Large Spatial Databases, Berlin, Germany,
pp. 47–66, 1997.
[5] ]Ester, M., A. Frommelt, H.P Kriegel, and J. Sander, Algorithms for
characterization and fashion detection in spatial databases. In Proc. 4th Int.
Conf. On Knowledge Discovery and Data Mining, New York City, NY, pp. 44–50,
1998.
[7] Gueting, R.H. An advent to spatial database systems. VLDB Journal Special Issue
on Spatial Database Systems, three (four). 1994.
[8] Guttman, A. R-trees: A dynamic index shape for spatial searching. In Proc. ACM
SIGMOD Int. Conf. On Management of Data, pp. 47–54. 1984.
[10] Koperski, K., J.Adhikary, and J. Han, Knowledge discovery in spatial databases:
Progress and challenges. In Proc. SIGMOD Workshop on Research Issues in
Data Mining and Knowledge Discovery. Technical Report 96-08, University of
British Columbia, Vancouver, Canada. 1996.
[11] Ladnee R, F E Petry, M A Cobb, Fuzzy Set Approaches to Spatial Data Mining of
Association Rules[J].Transin GIS, 2003,7(1).123-138.