Professional Documents
Culture Documents
Basic Concepts On Statistics
Basic Concepts On Statistics
Basic Concepts On Statistics
By
Umesh Raj Aryal
Lecturer
Department of Community Medicine
Kathmandu Medical College
Affiliated to Kathmandu University
What is Statistics?
Descriptive Statistics
Inferential Statistics
Descriptive Statistics
Numerical Data 41, 24, 32, 26, 27, 27, 30, 24, 38, 21
Frequency Distributions
Ordered Array
Cumulative Distributions
21, 24, 24, 26, 27, 27, 30, 32, 38, 41
Tables
Polygons
Histograms Ogive
Frequency
7
O g iv e
6 7
5 6 120
4 5
100
4
3
3 80
2
2 60
1 1
40
0 0
10 20 30 40 50 60 5 15 25 35 45 55 More 20
Tabulating Numerical Data: Frequency
Distributions
Sort Raw Data in Ascending Order
12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44,
46, 53, 58
Find Range: 58 - 12 = 46
Select Number of Classes: 5 (usually between 5 and 15)
Compute Class Interval (Width): 10 (46/5 then round up)
Determine Class Boundaries (Limits):10, 20, 30, 40, 50, 60
Compute Class Midpoints: 15, 25, 35, 45, 55
Count Observations & Assign to Classes
Frequency Distributions, Relative Frequency
Distributions and Percentage Distributions
Data in Ordered Array:
12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44, 46,
53, 58
Class Frequency Relative Frequency Percentage
0-10 0 0 0
10-20 3 0.15 15
20-30 6 0.3 30
30-40 5 0.25 25
40-50 4 0.2 20
50-60 2 0.1 10
Total 20 1 100
Tabulating Numerical Data:
Cumulative Frequency
Categorical Data
Tabulating Data
Graphing Data The Summary Table
The Contingency Table
158669
100000 84897
0
Male Female
Sex
Summary Table
(for occupation of the population )
Lung Cancer
Yes 92 8 100
No 10 90 100
Standard Deviation
Summary Measures
Summary Measures
Median Mode
Quartile Percentile
Mean
Max. repeated
value
x i (n 1) i (n 1) i ( n 1)
x Md
2 Qi Pi
n 4 100
The Shape
Mode 24.00
VARIABLE
N Valid 9
Missing
0
Percentiles 25 24.00
40 26.00
50 27.00
70 32.00
75 35.00
Measures of Variation
Variation
Interquartile Range
X i X
2
Xlargest - Xsmallest S
S i 1 CV 100%
n 1 X
Summary Statistics (Variation)
(Based on noncentral values)
VARIABLE
N Valid 9
Missing 0
Range 20
Minimum
21
Maximum
41
Quartiles 25 24.00
50 27.00
75 35.00
IQR Q Q
3 1
35 24
11
7, 4, 9, 7, 3, 12
Summary Statistics (Variation)
(Based on central values)
VARIABLE
N Valid 6
Missing 0
Mean 7.00
Median 7.00
Mode 7
Std. Deviation
3.29
Variance
10.8
Comparing Coefficient
of Variation
College A: College B:
Average Height 155 cm Average Height = 160 cm
Standard deviation = 10 cm Standard deviation = 5 cm
s 10 s 5
CV 100%
100% 6.45% CV 100% 100% 3.12%
x 155 x 160
Xi
2
X
2
i
2 i 1 Population variance
N
S2 = Sample Variance
Thank you