Professional Documents
Culture Documents
Statistics - Sessional Assignment
Statistics - Sessional Assignment
Statistics - Sessional Assignment
(Sessional Assignment)
Student Name:
“Fahad Naved Shaikh”
ID Number:
“1945-2019”
Semester:
“3rd Semester”
Faculty:
“Faculty of Management Sciences”
Submission Date:
“27-01-2021”
Content:
DEFINITIONS:
Statistics
Types of statistics
Sample
Population
Raw Material
Presentation of Data
New definition;
Statistics is a scientific way of assembling, classifying and presenting the data as well as
analyzing the data in order to make generalizations and decisions.
TYPES OF STATISTICS:
There are two types of statistics which are as following,
1. Descriptive statistics:
It is consisted of methods for organizing and summarizing data or information.
2. Inferential statistics:
It is consisted of methods for drawing and measuring the reliability of conclusion about the
population based on information obtained from a sample of that population.
Sample:
It is that part of population from which information is collected.
Population:
It is the collection of all individuals, items and data under consideration in a statistical study.
Raw Data:
It is a collection of facts which is not organized in any way, such data has mixed information.
Presentation of Data:
i. Ascending and descending order
ii. Frequency distribution
iii. Graphical representation
Page 2 of 34
QUESTIONS AND SOLUTIONS:
Data of Age:
17 21 22 17 21 23 18 21 24 19
22 24 19 22 25 19 22 25 20 22
25 20 22 26 21 22 26 21 22 26
Solution:
Ascending order of Age;
17 17 18 19 19 19 20 20 21 21
21 21 21 22 22 22 22 22 22 22
22 23 24 24 25 25 25 26 26 26
26 26 26 25 25 25 24 24 23 22
22 22 22 22 22 22 22 21 21 21
21 21 20 20 19 19 19 18 17 17
Data of Height:
4.2 5.0 5.5 5.9 6.2 5.5 6.0 6.2 4.6 5.2
4.6 5.2 5.6 6.2 5.6 4.8 5.4 5.7 6.2 5.4
5.8 6.2 5.0 5.4 5.8 6.2 5.5 5.9 6.2 6.2
Page 3 of 34
Solution:
Ascending order of Height;
4.2 4.6 4.6 4.8 5.0 5.0 5.2 5.2 5.4 5.4
5.4 5.5 5.5 5.5 5.6 5.6 5.7 5.8 5.8 5.9
5.9 6.0 6.2 6.2 6.2 6.2 6.2 6.2 6.2 6.2
6.2 6.2 6.2 6.2 6.2 6.2 6.2 6.2 6.0 5.9
5.9 5.8 5.8 5.7 5.6 5.6 5.5 5.5 5.5 5.4
5.4 5.4 5.2 5.2 5.0 5.0 4.8 4.6 4.6 4.2
FREQUENCY DISTRIBUTION:
Organization of set of data in a table showing the breakup of data into classes or groups together
with the number of observations in each class or group is called a frequency distribution.
17 21 22 17 21 23 18 21 24 19
22 24 19 22 25 19 22 25 20 22
25 20 22 26 21 22 26 21 22 26
Solution:
Finding Range;
Page 4 of 34
26−17
¿
5
9
¿
5
¿ 1.8 ↑ = 2
17 – 18 ||| 3
19 – 20 |||| 5
21 – 22 ||||||||||| 13
23 – 24 ||| 3
25 – 26 ||||| 6
∑ f =30
4.2 5.0 5.5 5.9 6.2 5.5 6.0 6.2 4.6 5.2
4.6 5.2 5.6 6.2 5.6 4.8 5.4 5.7 6.2 5.4
5.8 6.2 5.0 5.4 5.8 6.2 5.5 5.9 6.2 6.2
Solution:
Finding Range;
6.2−4.2
¿
3
Page 5 of 34
2
¿
3
¿ 0.66 ↑ = 1.0
17 21 22 17 21 23 18 21 24 19
22 24 19 22 25 19 22 25 20 22
25 20 22 26 21 22 26 21 22 26
Solution:
19 – 20 |||| 5 8 27
21 – 22 ||||||||||| 13 21 22
23 – 24 ||| 3 24 9
25 – 26 ||||| 6 30 6
Page 6 of 34
∑ f =30
4.2 5.0 5.5 5.9 6.2 5.5 6.0 6.2 4.6 5.2
4.6 5.2 5.6 6.2 5.6 4.8 5.4 5.7 6.2 5.4
5.8 6.2 5.0 5.4 5.8 6.2 5.5 5.9 6.2 6.2
Solution:
∑ f =30
GRAPHICAL PRESENTATION:
There are two types of graphical presentation,
Histogram
Frequency Polygon
HISTOGRAM;
A histogram is a virtual representation of a group frequency distribution. It is consisted of set of
adjacent rectangles by taking bases along x-axis with areas proportional to the frequencies of the
class intervals.
Histogram of Age:
Question:
Page 7 of 34
Data of Age:
CLASS 17 – 18 19 – 20 21 – 22 23 – 24 25 – 26
INTERVALS
FREQUENCY 3 5 13 3 6
Solution:
19 – 20 5 18.5 – 20.5
21 – 22 13 20.5 – 22.5
23 – 24 3 22.5 – 24.5
25 – 26 6 24.5 – 26.5
Histogram of Age;
14
13
12
10
Frequency
8
6
6
5
4
3 3
0
16.5 18.5 20.5 22.5 24.5 26.5
Class Boundaries
Histogram of Height:
Question:
Data of Height:
Page 8 of 34
CLASS 4.2 – 5.1 5.2 – 6.1 6.2 – 7.1
INTERVALS
FREQUENCY 6 16 8
Solution:
Histogram of Height;
18
5
16
14
12
Frequency
10
13
8
3
6
0
4.15 5.15 6.15 7.15
Class Boundaries
FREQUENCY POLYGON:
A frequency polygon is an alternative to histogram that presents overall shape of the distribution.
Page 9 of 34
Frequency polygon of Age:
Question;
Data of Age:
CLASS 17 – 18 19 – 20 21 – 22 23 – 24 25 – 26
INTERVALS
FREQUENCY 3 5 13 3 6
Solution:
Polygon of Age;
14
12
10
Frequency
8
6
4
2
0
17.5 19.5 21.5 23.5 25.5
Mid Point
Page 10 of 34
INTERVALS
FREQUENCY 6 16 8
Solution:
Polygon of Height;
18
16
14
12
Frequency
10
0
4.65 5.65 6.65
Mid Point
Measure of location
Measure of dispersion
Page 11 of 34
MEASURE OF LOCATION:
The measures of location are,
Arithmetic Mean (Grouped, Ungrouped)
Median (Grouped, Ungrouped)
Mode (Grouped, Ungrouped)
Geometric Mean (Grouped, Ungrouped)
Harmonic Mean (Grouped, Ungrouped)
QUESTIONS AND SOLUTIONS:
GROUPED:
{Median, Mode, Arithmetic Mean (A.M), Geometric Mean (G.M), Harmonic
Mean (H.M)}
FOR AGE:
CLASS 17 – 18 19 – 20 21 – 22 23 – 24 25 – 26
INTERVALS
FREQUENCY 3 5 13 3 6
Solution,
{ (∑ ) }
f
−C . F
~ 2
x=L+ h
F. M
Page 12 of 34
( 2)
{ }
30
−8
~
x=20.5+ 2
13
~
x=20.5+(0.53)(2)
~
x=20.5+(1.07)
~
x=21.57
^x =L+
{ ( F . M −F 1 )
( F . M −F 1 ) + ( F . M −F 2 )
h
}
^x =20.5+
{ ( 13−5 )
(13−5 ) + ( 13−3 )
(2) }
^x =20.5+
{ (8 )
}
( 8 ) + ( 10 )
(2)
^x =20.5+(0.44)(2)
^x =20.5+(0.88)
^x =21.38
x=
∑ fx
∑f
653
x=
30
x=21.76
G . M =Antilog
( ∑ flogx
∑f )
G . M =Antilog ( 40.0525
30 )
G . M =Antilog ( 1.3350 )
Page 13 of 34
G . M =21.63
H .M =
{ ∑f
∑ (f / x) }
H .M = ( 1.396
30
)
H . M =21.49
H . M =21.49
Thus,
A . M >G . M > H . M
FOR HEIGHT:
Solution,
5.2 – 6.1 16 (F.M) 5.65 5.15 (L) – 6.15 90.4 22 0.7520 12.032 2.8319
Page 14 of 34
6.2 – 7.1 8 (F2) 6.65 6.15 – 7.15 53.2 30 08228 6.5824 1.2030
{ (∑ ) }
f
−C . F
~ 2
x=L+ h
F. M
( 2)
{ }
30
−6
~
x=5.15+ (1)
16
~
x=5.15+ ( 0.5625 )( 1 )
~
x=5.15+ ( 0.5625 )
~
x=5.713
^x =L+
{ ( F . M −F 1 )
( F . M −F 1 ) + ( F . M −F 2 )
h
}
^x =5.15+
{ ( 16−6 )
(16−6 )+ (16−8 )
(1)
}
^x =5.15+
{ ( 10 )
(10 )+ ( 8 ) }
( 1)
^x =5.15+ ( 0.555 )( 1 )
^x =5.15+ ( 0.555 )
^x =5.705
x=
∑ fx
∑f
Page 15 of 34
171.5
x=
30
x=5.716
G . M =Antilog
( ∑∑ ) flogx
f
G . M =Antilog ( 22.6188
30 )
G . M =Antilog ( 0.75396 )
G . M =5.6749
H .M =
{ ∑f
∑ (f / x) }
H .M = ( 5.3252
30
)
H . M =5.6336
UNGROUPED:
{Median, Mode, Arithmetic Mean (A.M), Geometric Mean (G.M), Harmonic Mean (H.M)}
FOR AGE;
17 21 22 17 21 23 18 21 24 19
22 24 19 22 25 19 22 25 20 22
25 20 22 26 21 22 26 21 22 26
Solution:
Page 16 of 34
8 20 1.3010 0.05
9 21 1.3222 0.0476
10 21 1.3222 0.0476
11 21 1.3222 0.0476
12 21 1.3222 0.0476
13 21 1.3222 0.0476
14 22 1.3424 0.0454
15 22 1.3424 0.0454
16 22 1.3424 0.0454
17 22 1.3424 0.0454
18 22 1.3424 0.0454
19 22 1.3424 0.0454
20 22 1.3424 0.0454
21 22 1.3424 0.0454
22 23 1.3617 0.0435
23 24 1.3802 0.0417
24 24 1.3802 0.0417
25 25 1.3979 0.04
26 25 1.3979 0.04
27 25 1.3979 0.04
28 26 1.4150 0.0385
29 26 1.4150 0.0385
30 26 1.4150 0.0385
∑ x =¿ ¿ ∑ logx=¿ ¿40.0
654 652 ()
∑ 1x =¿ ¿1.39
46
Arithmetic Mean (A.M) of Age; (Ungrouped)
x=
∑x
n
654
x=
30
x=21.8
() ( ) value
th th
n n+2
value+
~ 2 2
x=
2
( ) ( ) value
th th
30 32
value+
~ 2 2
x=
2
Page 17 of 34
th th
~ 15 value +16 value
x=
2
~ 22+22
x=
2
~
x=22
G . M =Antilog ( ∑ logx
n )
G . M =Antilog ( 40.0652
30 )
G . M =Antilog ( 1.3355 )
G . M =21.65
H .M =
{ n
∑ (1 / x ) }
H .M = ( 1.3946
30
)
H . M =21.51
FOR HEIGHT;
4.2 5.0 5.5 5.9 6.2 5.5 6.0 6.2 4.6 5.2
4.6 5.2 5.6 6.2 5.6 4.8 5.4 5.7 6.2 5.4
5.8 6.2 5.0 5.4 5.8 6.2 5.5 5.9 6.2 6.2
Solution:
Page 18 of 34
Observations (n)
1 4.2 0.6232 0.2381
2 4.6 0.6628 0.2174
3 4.6 0.6628 0.2174
4 4.8 0.6812 0.2083
5 5 0.6990 0.2
6 5 0.6990 0.2
7 5.2 0.7160 0.1923
8 5.2 0.7160 0.1923
9 5.4 0.7324 0.1858
10 5.4 0.7324 0.1858
11 5.4 0.7324 0.1858
12 5.5 0.7404 0.1818
13 5.5 0.7404 0.1818
14 5.5 0.7404 0.1818
15 5.6 0.7482 0.1786
16 5.6 0.7482 0.1786
17 5.7 0.7559 0.1754
18 5.8 0.7634 0.1724
19 5.8 0.7634 0.1724
20 5.9 0.7708 0.1695
21 5.9 0.7708 0.1695
22 6 0.7781 0.1667
23 6.2 0.7924 0.1613
24 6.2 0.7924 0.1613
25 6.2 0.7924 0.1613
26 6.2 0.7924 0.1613
27 6.2 0.7924 0.1613
28 6.2 0.7924 0.1613
29 6.2 0.7924 0.1613
30 6.2 0.7924 0.1613
∑ x =¿ ¿1 ∑ logx=¿ ¿22.3
67.2 164 ()
∑ 1x =¿ 5.4421¿
Arithmetic Mean (A.M) of Height; (Ungrouped)
x=
∑x
n
167.2
x=
30
x=5.573
Page 19 of 34
() ( ) value
th th
n n+2
value+
~ 2 2
x=
2
( ) ( ) value
th th
30 32
value+
~ 2 2
x=
2
~ 5.6+5.6
x=
2
~
x=5.6
G . M =Antilog (∑ ) logx
n
G . M =Antilog ( 22.3164
30 )
G . M =Antilog ( 0.74388 )
G . M =5.5448
H .M =
{∑ } n
(1 / x )
H .M = ( 5.4421
30
)
H . M =5.5126
MEASURE OF DISPERSION:
Following are the measures of dispersion,
Page 20 of 34
Variance (Grouped, Ungrouped)
Standard deviation (Grouped, Ungrouped)
Quartile Deviation (Grouped, Ungrouped)
QUESTIONS AND SOLUTIONS:
GROUPED:
{Variance and Standard Deviation (S.D)}
FOR AGE:
CLASS 17 – 18 19 – 20 21 – 22 23 – 24 25 – 26
INTERVALS
FREQUENCY 3 5 13 3 6
Solution,
x=
∑ fx
∑f
653
x=
30
x=21.76
σ =
2
{ ∑ f ( x −x )2
∑f }
σ =
2
( 173.868
30 )
Page 21 of 34
2
σ =5.7956
σ=
√{ ∑ f ( x−x )2
∑f }
σ =√ 5.7956
σ =2.4074
FOR HEIGHT:
FREQUENCY 6 16 8
Solution,
Page 22 of 34
x=
∑ fx
∑f
171.5
x=
30
x=5.716
σ = 2
{ ∑ f ( x −x )2
∑f }
σ =
2
( 13.8665
30 )
σ 2=0.4622
σ=
√{ ∑ f ( x−x )2
∑f }
σ =√ 0.4622
σ =0.6798
UNGROUPED:
{Variance and Standard deviation (S.D)}
FOR AGE;
17 21 22 17 21 23 18 21 24 19
22 24 19 22 25 19 22 25 20 22
25 20 22 26 21 22 26 21 22 26
Page 23 of 34
Solution:
Number of x ( x−x )2
Observations (n)
1 17 23.04
2 17 23.04
3 18 14.44
4 19 7.84
5 19 7.84
6 19 7.84
7 20 3.24
8 20 3.24
9 21 0.64
10 21 0.64
11 21 0.64
12 21 0.64
13 21 0.64
14 22 0.04
15 22 0.04
16 22 0.04
17 22 0.04
18 22 0.04
19 22 0.04
20 22 0.04
21 22 0.04
22 23 1.44
23 24 4.84
24 24 4.84
25 25 10.24
26 25 10.24
27 25 10.24
28 26 17.64
29 26 17.64
30 26 17.64
∑ x =¿ ¿6 ∑ ( x− x )2=¿188.8 ¿
54
x=
∑x
n
Page 24 of 34
654
x=
30
x=21.8
2
σ = { ∑ ( x−x )2
n }
2
σ = ( 188.8
30 )
2
σ =6.293
σ=
√{ ∑ ( x−x )2
n }
σ =√ 6.293
σ =2.508
FOR HEIGHT;
4.2 5.0 5.5 5.9 6.2 5.5 6.0 6.2 4.6 5.2
4.6 5.2 5.6 6.2 5.6 4.8 5.4 5.7 6.2 5.4
5.8 6.2 5.0 5.4 5.8 6.2 5.5 5.9 6.2 6.2
Solution:
Number of x ( x−x )2
Observations (n)
1 4.2 1.88512
2 4.6 0.94672
3 4.6 0.94672
4 4.8 0.59752
5 5 0.32832
6 5 0.32832
7 5.2 0.13912
8 5.2 0.13912
Page 25 of 34
9 5.4 0.02992
10 5.4 0.02992
11 5.4 0.02992
12 5.5 0.00532
13 5.5 0.00532
14 5.5 0.00532
15 5.6 0.00072
16 5.6 0.00072
17 5.7 0.01612
18 5.8 0.05152
19 5.8 0.05152
20 5.9 0.10692
21 5.9 0.10692
22 6 0.18232
23 6.2 0.39312
24 6.2 0.39312
25 6.2 0.39312
26 6.2 0.39312
27 6.2 0.39312
28 6.2 0.39312
29 6.2 0.39312
30 6.2 0.39312
∑ x =¿ ¿1 ∑ ( x− x )2=¿ ¿9.0784
67.2
x=
∑x
n
167.2
x=
30
x=5.573
2
σ = { ∑ ( x−x )2
n }
2
σ = ( 9.0784
30 )
2
σ =0.302
Page 26 of 34
Standard Deviation (S.D) of Height; (Ungrouped)
σ=
√{ ∑ ( x−x )2
n }
σ =√ 0.302
σ =0.550
22 24 19 22 25 19 22 25 20 22
25 20 22 26 21 22 26 21 22 26
Solution,
17 17 18 19 19 19 20 20 21 21
21 21 21 22 22 22 22 22 22 22
22 23 24 24 25 25 25 26 26 26
For Q1;
So,
7.75th value is between the 7th and 8th value,
Q 1= ( 7 th+82 th )
Page 27 of 34
Q 1= ( 20+ 20
2 )
Q 1=20
For Q3;
Q 3=23.25 th Value
So,
23.25th value is between the 23th and 24th value,
Q 3= ( 23th+24
2
th
)
Q 3= ( 24+24
2 )
Q 3=24
Now Q.D,
Q . D= ( Q 3−Q
2
1
)
Q . D= ( 24−20
2 )
Q . D=2
Grouped:
CLASS 17 – 18 19 – 20 21 – 22 23 – 24 25 – 26
INTERVALS
FREQUENCY 3 5 13 3 6
Page 28 of 34
Solution,
∑ f =30
So,
∑ f = 30 =7.5
4 4
For Q1;
h
Q 1=L+ ×
f (∑ 4
f
−C . f )
2
Q 1=18.5+ × (7.5−3 )
5
Q 1=20.3
For Q3;
h
Q 3=L+ × 3
f ( (∑ ) ) 4
f
−C . f
2
Q 3=22.5+ × ( 22.5−21 )
3
Q 3=23.5
Now Q.D,
Q 3−Q1
Q . D=
2
Page 29 of 34
23.5−20.3
Q . D=
2
Q . D=1.6
Coefficient of correlation:
If two sets of variable vary in such a way that the change of one set is related by changes in the
other then these sets are said to correlation.
N X Y XY X2 Y2
Page 30 of 34
24 24 6.2 148.8 576 38.44
25 25 6.2 155 625 38.44
26 25 6.2 155 625 38.44
27 25 6.2 155 625 38.44
28 26 6.2 161.2 676 38.44
29 26 6.2 161.2 676 38.44
30 26 6.2 161.2 676 38.44
∑ x =¿ ¿654 ∑ y=¿ ¿1 ∑ xy =¿ 3684.6∑
¿ ( x ) =14446 ∑ ( y 2) =940.94
2
67.2
Finding b,
n ∑ xy−∑ x ∑ y
b= 2
n ∑ x2 + ( ∑ x )
Finding a,
a=Y +b X
Finding Y and X ,
Y=
∑y
n
167.2
Y=
30
Y =5.573
X=
∑x
n
654
X=
30
X =21.8
Page 31 of 34
Now,
a=(5.573)+( 0.0013)( 21.8)
a=5.60134
(X on Y)
X =c +dY → B
Finding d,
n ∑ xy −∑ x ∑ y
d= 2
n ∑ y +( ∑ y )
2
( 30 ) ( 3684.6 )−(654)(167.2)
d=
( 30 ) ( 940.94 ) + ( 167.2 )2
d=0.0211
Finding c,
c= X +d Y
Finding Y and X ,
X=
∑x
n
654
X=
30
X =21.8
Y=
∑y
n
167.2
Y=
30
Y =5.573
Now,
c=(21.8)+(0.0211)(5.573)
Page 32 of 34
c=21.9175
( 30 ) ( 3684.6 )−(654)(167.2)
r=
√ ( 30 ) ( 14446 ) +( 654 ) √ ( 30 )( 940.94 ) +( 167.2 )
2 2
1189.2
r=
(927.95)(237.03)
r =0.0054 (Negligible)
Coefficient of Determination,
¿ r 2 ×(100)
2
¿( 0.0054) ×(100)
¿ 0.0029 %
Problem Error:
P . E=0.6745 ( )
1−r 2
√n
P . E=0.6745 ( √
1−(0.0054)2
30 )
P . E=0.123
Hence,
6 × P . E=6 ( 0.123 )=0.738
So “r” is not greater than P.E but more than 0.3. Thus, there is good evidence of
correlation.
Page 33 of 34