BS Classwork
Statistics:
It is a science of collecting, collating, analysing and interpreti
Ex 1
Simple data: Calculate Mean and standard deviation
Sl Age (x) Deviation (xi - x-bar)
1 20 -5.50
2 23 -2.50
3 24 -1.50
4 21 -4.50
5 25 -0.50
6 22 -3.50
7 24 -1.50
8 45 19.50
SUM 204 0.00
Mean 25.50
SD 7.53
Mean (SAM) (x-bar) = SUM(xi) / N
Advantage of calculating MEAN (SAM): It helps us to have a
Basically it gives us a central representative value
It ignores the individuality and the diversity
Sum of Deviations about MEAN is always ZERO
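A minimal Python sketch of the mean calculation for the Ex 1 ages, which also checks that the deviations about the mean sum to zero. The names ages, simple_mean and deviations are illustrative only and do not come from the worksheet.

```python
# Ages (x) from the Ex 1 table.
ages = [20, 23, 24, 21, 25, 22, 24, 45]


def simple_mean(values):
    """Simple arithmetic mean (SAM): SUM(xi) / N."""
    return sum(values) / len(values)


mean_age = simple_mean(ages)

# Deviation of every observation from the mean: (xi - x-bar).
deviations = [x - mean_age for x in ages]

print(f"Mean = {mean_age:.2f}")
for x, d in zip(ages, deviations):
    print(f"{x:>3} {d:>7.2f}")
print(f"Sum of deviations = {sum(deviations):.2f}")
```

The single large deviation for age 45 is what balances the seven small negative deviations, which is why the sum comes out to zero.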
Simple Variance (Measure of Dispersion) = [SUM(xi - x-bar)^2] / N
Dispersion measures can broadly be classified into two categories: DEVIATION FAMILY and RANGE FAMILY
Mean 25.50
Variance 56.75
Standard Deviation (σ) 7.53
Coefficient of Variation = SD/Mean 29.54%
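The figures above can be reproduced with a short Python sketch. It assumes the population form of the variance (divide by N), which is what the worksheet values are consistent with. Variable names are illustrative.

```python
import math

# Ages (x) from the Ex 1 table.
ages = [20, 23, 24, 21, 25, 22, 24, 45]

n = len(ages)
mean_age = sum(ages) / n

# SSD: sum of squared deviations about the mean.
ssd = sum((x - mean_age) ** 2 for x in ages)

# MSD or variance: population form, dividing by N (not N - 1).
variance = ssd / n
sd = math.sqrt(variance)

# Coefficient of variation: SD relative to the mean, unit-free.
cv = sd / mean_age

print(f"Mean     = {mean_age:.2f} years")
print(f"SSD      = {ssd:.2f}")
print(f"Variance = {variance:.2f} years^2")
print(f"SD       = {sd:.2f} years")
print(f"CV       = {cv:.2%}")
```

Because the coefficient of variation has no unit, it can be used to compare the spread of data sets measured on different scales.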
Task 2
Grade 1
Employees 2262
Mean salary LPA 3.95
Std Deviation 0.61
Coeff of Variation 15.4%
Example 2
To compute Mean and SD from WEIGHTED DATA
Weighted data is a kind of data set where individual data points carry a corresponding weight / priority
Sl Equity Shares Investment (Rs. Lakhs) (w)
1 A 25
2 B 20
3 C 40
4 D 30
5 E 15
SUM 130
Task 2
TFR Socio-Economic Survey:
Sl Category No of respondents (w)
1 Metro 5400
2 Tier I 7800
3 Tier II + III 12100
4 Sub-urban 15300
5 Rural 20400
SUM 61000
Q1 Average no of children in the entire sample, considering all
Q2 Corresponding SD
Mean 2.477
Variance 0.126
SD 0.356
coeff of variation 14.4%
Example 3
Categorise raw data into a Frequency Distribution Table
508 537
530 553
579 503
577 563
529 543
527 564
540 549
549 526
535 587
522 522
554 504
565 572
504 558
599 539
579 508
518 555
553 553
514 532
553 519
503 520
Class Boundaries
LCB UCB
500 510
510 520
520 530
530 540
540 550
550 560
560 570
570 580
580 590
590 600
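A minimal Python sketch of the categorisation step, using only the two raw-data columns listed above (40 values), so the counts are for that subset and not for the full data set. It assumes each class includes its lower boundary and excludes its upper boundary, which the worksheet does not state. The names raw, lower_bounds and freq are illustrative.

```python
# The two raw-data columns shown above (40 values).
raw = [
    508, 530, 579, 577, 529, 527, 540, 549, 535, 522,
    554, 565, 504, 599, 579, 518, 553, 514, 553, 503,
    537, 553, 503, 563, 543, 564, 549, 526, 587, 522,
    504, 572, 558, 539, 508, 555, 553, 532, 519, 520,
]

# Lower class boundaries 500, 510, ..., 590; class width is 10.
lower_bounds = list(range(500, 600, 10))
freq = {lcb: 0 for lcb in lower_bounds}

for value in raw:
    # ASSUMPTION: LCB <= value < UCB, so 510 falls in the 510-520 class.
    lcb = (value // 10) * 10
    freq[lcb] += 1

print("LCB UCB  f")
for lcb in lower_bounds:
    print(f"{lcb} {lcb + 10} {freq[lcb]:>2}")
print(f"Total {sum(freq.values())}")
```

Under this convention a value equal to the last upper boundary (600) would need a rule of its own, for example closing the final class on both sides.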
Advantages:
a) It helps us to visualise the density of data points in various p
b) It helps us to construct a Probability Distribution, which in t
Sq of Deviation (xi - x-bar)^2
30.25
6.25
2.25
20.25
0.25
12.25
2.25
380.25
454.00 SSD
56.75 MSD or Variance
Pop quiz
Qs 1
Mean
SD
Qs 2
Mean
SD
...tify the volatility inherently present in the data. The measures of Dispersion
Units: Mean in Years, Variance in Years^2, Standard Deviation in Years
...people in this group are within a range of 25.5 years ± 7.53 years
...and we can call it an OUTLIER
Grade 2
Employees 1754
Mean salary LPA 5.88
Std Deviation 0.92
Coeff of Variation 15.6%
Combined Mean 4.7929
Combined SD 0.7611
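A hedged Python sketch of the combined calculation for Task 2, using the Grade 1 and Grade 2 figures from the worksheet. The combined mean is the employee-weighted average of the grade means. For the SD, the worksheet value 0.7611 matches the employee-weighted average of the two grade variances alone. The usual combined-variance formula also adds the spread of the grade means around the combined mean, so the sketch computes both for comparison. Variable names are illustrative.

```python
import math

# (employees, mean salary in LPA, standard deviation) per grade, from Task 2.
grades = [
    (2262, 3.95, 0.61),  # Grade 1
    (1754, 5.88, 0.92),  # Grade 2
]

total_n = sum(n for n, _, _ in grades)

# Combined mean: each grade mean weighted by its number of employees.
combined_mean = sum(n * m for n, m, _ in grades) / total_n

# Within-grade part: employee-weighted average of the grade variances.
within_var = sum(n * s ** 2 for n, _, s in grades) / total_n

# Between-grade part: weighted squared distance of each grade mean
# from the combined mean.
between_var = sum(n * (m - combined_mean) ** 2 for n, m, _ in grades) / total_n

sd_within_only = math.sqrt(within_var)
sd_with_between = math.sqrt(within_var + between_var)

print(f"Combined mean             = {combined_mean:.4f}")
print(f"SD, within-grade part only = {sd_within_only:.4f}")
print(f"SD, within + between       = {sd_with_between:.4f}")
```

The second SD is larger because the two grade means sit well apart, so pooling the grades adds spread that neither grade shows on its own.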
Return (%) (x)   w*x      w*(xi - x-bar)^2
12.60%           3.15     0.06770003698
15.80%           3.16     0.008030798817
20.0%            8        0.01929236686
18.40%           5.52     0.001066198225
22.10%           3.315    0.0276854068
SUM              23.145   0.1237748077
Weighted Mean = SUM(w*x) / SUM(w) 17.80%
Weighted Variance = [SUM(w*(xi - x-bar)^2)] / SUM(w) 0.0009521139053
Weighted SD 3.09%
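The weighted calculation for Example 2 can be sketched in Python as follows. Returns are entered as fractions (12.60% as 0.126) so that the results line up with the worksheet values. The variable names are illustrative.

```python
import math

# Investment in Rs. Lakhs (the weights w) and return (x) for shares A to E.
weights = [25, 20, 40, 30, 15]
returns = [0.126, 0.158, 0.200, 0.184, 0.221]

total_w = sum(weights)

# Weighted mean: SUM(w*x) / SUM(w).
w_mean = sum(w * x for w, x in zip(weights, returns)) / total_w

# Weighted variance: SUM(w*(xi - x-bar)^2) / SUM(w).
w_var = sum(w * (x - w_mean) ** 2 for w, x in zip(weights, returns)) / total_w
w_sd = math.sqrt(w_var)

print(f"SUM(w)            = {total_w}")
print(f"Weighted mean     = {w_mean:.2%}")
print(f"Weighted variance = {w_var:.10f}")
print(f"Weighted SD       = {w_sd:.2%}")
```

Share C carries the largest weight, so its 20.0% return pulls the weighted mean above what an unweighted average of the five returns would suggest only slightly, while the heavily weighted low return of share A pulls it down.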
Continuous Distns
g, Regression and ANOVA
up or go down?
Max 600
Min 501
504
567
505
534
579
580
552
514
540
565
546
559
572
531
562
552
530
562
538
527
Class representative (x)   f*(x - MEAN)^2
505                        22680.875
515                        10980.75
525                        4510.6875
535                        2101.25
545                        0.625
555                        1330.875
565                        5070.8125
575                        11505.8125
585                        11060.4375
595                        14850.375
SUM                        84092.5
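A hedged Python sketch of the grouped-data mean and SD. The frequency column does not appear in the text above, so the frequencies used here are an assumption: they were back-calculated as the counts that reproduce every entry of the f*(x - MEAN)^2 column, which implies 120 observations and a mean of 545.25. Treat them as a reconstruction, not as worksheet data. Dividing by N (population form) is also assumed.

```python
import math

# Class representatives (mid-points) from the table above.
midpoints = [505, 515, 525, 535, 545, 555, 565, 575, 585, 595]

# ASSUMPTION: frequencies are not shown in the text; these counts are
# back-calculated so that f*(x - mean)^2 matches the column above.
freqs = [14, 12, 11, 20, 10, 14, 13, 13, 7, 6]

n = sum(freqs)

# Grouped mean: SUM(f*x) / SUM(f).
mean = sum(f * x for f, x in zip(freqs, midpoints)) / n

# One f*(x - mean)^2 term per class, as in the worksheet column.
weighted_sq = [f * (x - mean) ** 2 for f, x in zip(freqs, midpoints)]

variance = sum(weighted_sq) / n
sd = math.sqrt(variance)

print(f"N = {n}, mean = {mean}")
for x, term in zip(midpoints, weighted_sq):
    print(f"{x} {term}")
print(f"SUM = {sum(weighted_sq)}")
print(f"Variance = {variance:.4f}, SD = {sd:.4f}")
```

This is the same weighted formula used for weighted data, with the class frequency playing the role of the weight and the class mid-point standing in for every value in the class.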
Column1
Mean 25.5
Standard Error 2.847
Median 23.5
Mode 24
Standard Deviation 8.053
Sample Variance 64.86
Kurtosis 7.009
Skewness 2.586
Range 25
Minimum 20
Maximum 45
Sum 204
Count 8
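The summary above looks like the output of a spreadsheet descriptive-statistics tool run on the Ex 1 ages. A minimal Python sketch using the standard statistics module reproduces most of it. Note that these are sample statistics (divide by N - 1), which is why the standard deviation here is 8.053 and not the population value 7.53 computed earlier. The skewness line uses the adjusted Fisher-Pearson formula, which is an assumption about how the tool computes it. Kurtosis is left out.

```python
import math
import statistics

# Ages (x) from the Ex 1 table.
ages = [20, 23, 24, 21, 25, 22, 24, 45]
n = len(ages)

mean = statistics.mean(ages)
median = statistics.median(ages)
mode = statistics.mode(ages)

# Sample versions: divide by N - 1 instead of N.
sample_var = statistics.variance(ages)
sample_sd = statistics.stdev(ages)

# Standard error of the mean: sample SD / sqrt(N).
std_error = sample_sd / math.sqrt(n)

data_range = max(ages) - min(ages)

# ASSUMPTION: adjusted Fisher-Pearson sample skewness.
skewness = n / ((n - 1) * (n - 2)) * sum(
    ((x - mean) / sample_sd) ** 3 for x in ages
)

print(f"Mean               {mean}")
print(f"Standard Error     {std_error:.3f}")
print(f"Median             {median}")
print(f"Mode               {mode}")
print(f"Standard Deviation {sample_sd:.3f}")
print(f"Sample Variance    {sample_var:.2f}")
print(f"Skewness           {skewness:.3f}")
print(f"Range              {data_range}")
print(f"Sum                {sum(ages)}")
print(f"Count              {n}")
```

The strongly positive skewness comes from the single age of 45, the same observation that stands out in the deviation column.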