Professional Documents
Culture Documents
Report On Data Visualization: Done By: Vigneshwaran.K
Report On Data Visualization: Done By: Vigneshwaran.K
JULY 2020
Done By:
VIGNESHWARAN.K
SECTION – K
ROLL NUMBER – 2027824
DATA
1. QUALITATIVE DATA (Categorical data)
a) NOMINAL DATA
It is defined as data that is used for naming or labelling variables, without any
quantitative value. There is no intrinsic ordering to nominal data.
STUDENT'S
STUDENT
PLACE OF BIRTH
16 Kuwait
1 Kuwait
17 Saudi Arabia
2 Kuwait
18 Lebanon
3 Jordan
19 Lebanon
4 Kuwait
20 Jordan
5 Kuwait
21 Lebanon
6 Kuwait
22 Saudi Arabia
7 Egypt
23 USA
8 USA
24 USA
9 USA
25 USA
10 Egypt
26 USA
11 USA
27 USA
12 Tunis
28 USA
13 Tunis
29 Tunis
14 Tunis
30 Kuwait
15 Kuwait
SOURCE: https://www.kaggle.com/aljarah/xAPI-Edu-Data
COUNTRY FREQUENCY
Kuwait 8
Jordan 2
Egypt 2
USA 9
Tunis 4
Saudi Arabia 2
Lebanon 3 ANALYSIS:
From the 30 data provided it is
understood that students from USA
are the highest and then Kuwait students second highest. The students from
Jordan, Egypt, Saudi Arabia are the least.
b) ORDINAL DATA
It is a data where the variables have natural, ordered categories and the
distance between the categories is not known.
PARENT'S
STUDEN
SCHOOL
T
SATISFACTION
16 Bad
1 Good
17 Bad
2 Bad
18 Good
3 Good
19 Bad
4 Bad
20 Bad
5 Good
21 Bad
6 Good
22 Good
7 Good
23 Good
8 Good
24 Bad
9 Bad
25 Bad
10 Bad
26 Bad
11 Bad
27 Bad
12 Bad
28 Good
13 Bad
29 Good
14 Good
30 Bad
15 Good
SOURCE: https://www.kaggle.com/aljarah/xAPI-Edu-Data
PARENT'S SCHOOL
FREQUENCY
SATISFACTION
Good 13
Bad 17
30
PARENT'S SCHOOL SATISFACTION
Good
43% Bad
57%
ANALYSIS:
From the data provided it is understood that there are more parents who is not
satisfied with the performance of the school than who is satisfied.
2. QUANTITATIVE DATA
a) INTERVAL DATA
It is defined as a data type which is measured along a scale, in which each
point is placed at equal distance from one another. Interval data always
appears in the form of numbers.
RESOURCE: https://data.worldbank.org/indicator/IT.CEL.SETS
Number of classes = 7
Class width = (Largest data value-smallest data value)/ Number of classes
Actual class width = 94.1857847
Class width taken = 100
NEW
S. NO FREQUENCY
SUBSCRIBERS
1 0-100 12
2 100-200 3
3 200-300 4
4 300-400 3
5 400-500 2
6 500-600 2
7 600-700 4
30
MOBILE CELLULAR SUBSCRIPTION IN WORLD
14
12
10
8
Frequency
Frequency
6
0
0-100 100-200 200-300 300-400 400-500 500-600 600-700
NEW SUBSCRIBERS
ANALYSIS:
The histogram obtained is highly skewed towards right. From the histogram
it is understandable that between 1986 to 2015 exactly for 12 years, new
subscribers between 0 to 100 million ventured into mobile cellular
subscription.
b) RATIO DATA
It has the same properties as interval data, with equal ratio between each data
and absolute zero being treated as a point if origin. In other words, there can
be no negative numerical value in ratio data.
TOTAL GREEN
HOUSE GAS
YEAR EMISSION (in
million kt of co2
equivalent)
1996 39.26
1997 43.38
1998 44.05
1999 40.89
2000 40.56
2001 40.37
2002 43.07
2003 44.42
2004 45.66
2005 47.22
2006 48.64
2007 49.98
2008 48.66
2009 48.15
2010 50.91
RESOURCE: https://data.worldbank.org/indicator/EN.ATM.GHGT.KT.CE?
end=2000&start=1985
50.00
40.00
30.00
20.00
10.00
0.00
1994 1996 1998 2000 2002 2004 2006 2008 2010 2012
ANALYSIS
From the scatter diagram it is understandable that due to the industrial boom
and raise of usage of transport vehicle in the world, increased the greenhouse
gas emission between the provided time frame. And decrease in greenhouse
gas can be understandable as the human effort to reduce this emission.