Introduction To Statistics
STATISTICS
Adapted from
Dr. S. Ahamed
LECTURE OUTLINE:
Definition of Statistics
Types of data
Frequency distribution of data
Graphical representation of data
“Statistics is the science which deals
with collection, classification and
tabulation of numerical facts as the
basis for explanation, description
and comparison of phenomenon”.
------ Lovitt
Statistics explores the collection,
organization, analysis and interpretation of
data.
WHAT DOES STATISTICS COVER?
Planning
Design
Execution (Data collection)
Data Processing
Data analysis
Presentation
Interpretation
Publication
INVESTIGATION
Data Collection
Data Presentation
- Tabulation
- Diagrams
- Graphs
Descriptive Statistics
- Measures of Location
- Measures of Dispersion
- Measures of Skewness & Kurtosis
Inferential Statistics
- Estimation: point estimate, interval estimate
- Hypothesis Testing
- Univariate analysis
- Multivariate analysis
TYPES OF DATA
QUALITATIVE DATA
DISCRETE QUANTITATIVE
CONTINUOUS QUANTITATIVE
QUALITATIVE
Nominal
Example: Sex ( M, F)
Exam result (P, F)
Blood Group (A,B, O or AB)
Color of Eyes (blue, green,
brown, black)
ORDINAL
Example:
Response to treatment
(poor, fair, good)
Severity of disease
(mild, moderate, severe)
Income status (low, middle,
high)
QUANTITATIVE (DISCRETE)
Example: Number of children
QUANTITATIVE (CONTINUOUS)
Example: Hb (haemoglobin level)
Interval scale:
Data are placed in meaningful intervals and order. The units of
measurement are arbitrary.
Frequency Distributions
Class interval   Group 1   Group 2   Total
<9.0                   0         2       2
9.0 – 9.9              1         3       4
10.0 – 10.9            3         5       8
11.0 – 11.9            6         8      14
12.0 – 12.9           10         6      16
13.0 – 13.9            5         4       9
14.0 – 14.9            3         2       5
15.0 – 15.9            2         0       2
Total                 30        30      60
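The row and column totals of a two-way frequency table like the one above can be checked mechanically, and relative frequencies derived from them. The following is a minimal sketch in Python using the counts from the table; the names `group_1` and `group_2` are placeholders, since the original column headings are not available.

```python
# Counts copied from the frequency table above.
# The two group names are placeholders (the source headings are unknown).
classes = ["<9.0", "9.0-9.9", "10.0-10.9", "11.0-11.9",
           "12.0-12.9", "13.0-13.9", "14.0-14.9", "15.0-15.9"]
group_1 = [0, 1, 3, 6, 10, 5, 3, 2]
group_2 = [2, 3, 5, 8, 6, 4, 2, 0]

# Row totals: add the two group frequencies class by class.
row_totals = [a + b for a, b in zip(group_1, group_2)]

# Relative frequency (%) of each class out of the grand total.
grand_total = sum(row_totals)
relative = [round(100 * t / grand_total, 1) for t in row_totals]

for label, a, b, t, r in zip(classes, group_1, group_2, row_totals, relative):
    print(f"{label:>11} {a:>3} {b:>3} {t:>3} {r:>5}%")
print(f"{'Total':>11} {sum(group_1):>3} {sum(group_2):>3} {grand_total:>3}")
```

The computed row totals and the grand total should agree with the last column and last row of the table.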
Elements of a Table
An ideal table should have:
- Number
- Title
- Column headings
- Foot-notes
Number: the table number, used to identify the table in a report.
[Figure: bar chart of the distribution of risk factors among cases with cardiovascular diseases. X axis: Risk factor (Smo, Alc, Chol, DM, HTN, No Exer, F-H); Y axis: Number.]
The bars should be of equal width and should not touch the other bars.
HIV cases enrollment in USA by gender
Bar chart
[Figure: bars for Men and Women by year. X axis: Year (1986 to 1992); Y axis: Enrollment (hundred).]
HIV cases enrollment in USA by gender
Stacked bar chart
[Figure: stacked bars for Men and Women by year. X axis: Year (1986 to 1992); Y axis: Enrollment (Thousands).]
Pie Chart
• Circular diagram; the total is 100%
• Divided into segments, each representing a category
• Decide adjacent category
[Figure: pie chart with segments including Mild and Moderate; visible labels 10% and 20%.]
68 63 42 27 30 36 28 32
79 27 22 28 24 25 44 65
43 25 74 51 36 42 28 31
28 25 45 12 57 51 12 32
49 38 42 27 31 50 38 21
16 24 64 47 23 22 43 27
49 28 23 19 11 52 46 31
30 43 49 12
Histogram
[Figure: two histogram panels. Y axis: Frequency.]
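A histogram is drawn from a frequency distribution over class intervals. As a rough sketch, the 60 ages listed earlier can be tallied in Python; the class width of 10 years is an assumption made for illustration and is not necessarily the width used in the original figure.

```python
from collections import Counter

# The 60 age values from the data listing.
ages = [68, 63, 42, 27, 30, 36, 28, 32,
        79, 27, 22, 28, 24, 25, 44, 65,
        43, 25, 74, 51, 36, 42, 28, 31,
        28, 25, 45, 12, 57, 51, 12, 32,
        49, 38, 42, 27, 31, 50, 38, 21,
        16, 24, 64, 47, 23, 22, 43, 27,
        49, 28, 23, 19, 11, 52, 46, 31,
        30, 43, 49, 12]

width = 10  # assumed class width (years)

# Map each age to the lower limit of its class, then count per class.
counts = Counter((age // width) * width for age in ages)

# Text histogram: one '#' per observation in the class.
for lower in sorted(counts):
    upper = lower + width - 1
    print(f"{lower}-{upper}: {'#' * counts[lower]} ({counts[lower]})")
```

Each bar's height is the class frequency; with equal class widths the bars touch, because the underlying variable is continuous.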
Stem and leaf plot
Stem-and-leaf of Age N = 60
Leaf Unit = 1.0
6 1 122269
19 2 1223344555777788888
(11) 3 00111226688
13 4 2223334567999
5 5 01127
4 6 3458
2 7 49
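The display above can be reproduced from the raw ages by splitting each value into a tens digit (stem) and a units digit (leaf). The following Python sketch illustrates the idea; the function name `stem_and_leaf` is a hypothetical helper, not part of any library.

```python
from collections import defaultdict

# The 60 age values from the data listing.
ages = [68, 63, 42, 27, 30, 36, 28, 32,
        79, 27, 22, 28, 24, 25, 44, 65,
        43, 25, 74, 51, 36, 42, 28, 31,
        28, 25, 45, 12, 57, 51, 12, 32,
        49, 38, 42, 27, 31, 50, 38, 21,
        16, 24, 64, 47, 23, 22, 43, 27,
        49, 28, 23, 19, 11, 52, 46, 31,
        30, 43, 49, 12]


def stem_and_leaf(values):
    """Return {stem: leaves} using the tens digit as stem, units digit as leaf."""
    rows = defaultdict(list)
    for v in sorted(values):          # sorting keeps the leaves in order
        rows[v // 10].append(v % 10)
    return {stem: "".join(str(d) for d in leaves)
            for stem, leaves in rows.items()}


plot = stem_and_leaf(ages)
for stem, leaves in plot.items():
    # Print: number of leaves, stem, leaves
    print(f"{len(leaves):>3} {stem} {leaves}")
```

Unlike a histogram, the stem-and-leaf plot keeps every individual value visible while still showing the shape of the distribution.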
Box plot
[Figure: box plot of Age. Y axis: Age (10 to 80).]
Descriptive statistics report:
Boxplot
- minimum score
- maximum score
- lower quartile
- upper quartile
- median
- mean
Histogram (quantitative data)
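The summary values listed for the box plot can be computed for the age data with Python's standard `statistics` module. This is a sketch only: quartile values depend on the interpolation method, and the one used here is simply the default of `statistics.quantiles` (Python 3.8 or later), which may differ from the method behind the original figure.

```python
import statistics

# The 60 age values from the data listing.
ages = [68, 63, 42, 27, 30, 36, 28, 32,
        79, 27, 22, 28, 24, 25, 44, 65,
        43, 25, 74, 51, 36, 42, 28, 31,
        28, 25, 45, 12, 57, 51, 12, 32,
        49, 38, 42, 27, 31, 50, 38, 21,
        16, 24, 64, 47, 23, 22, 43, 27,
        49, 28, 23, 19, 11, 52, 46, 31,
        30, 43, 49, 12]

minimum = min(ages)                 # 11
maximum = max(ages)                 # 79
median = statistics.median(ages)    # 31.5
mean = statistics.mean(ages)        # 2187 / 60 = 36.45

# Quartiles: cut points dividing the sorted data into four equal parts.
# The result depends on the method; the library default is used here.
lower_q, middle_q, upper_q = statistics.quantiles(ages, n=4)

print(f"minimum        {minimum}")
print(f"lower quartile {lower_q}")
print(f"median         {median}")
print(f"mean           {mean:.2f}")
print(f"upper quartile {upper_q}")
print(f"maximum        {maximum}")
```

The mean lies above the median here, which is consistent with the longer upper tail visible in the stem-and-leaf plot.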