Professional Documents
Culture Documents
CH 1-2 Basic Statistics-2
CH 1-2 Basic Statistics-2
CH 1-2 Basic Statistics-2
Introduction
Definition of statistics
Definition:
1. Plural sense (lay man definition):
Statistics is defined as the collection of numerical facts or figures
( or the raw data themselves).
Examples: Statistics of births, deaths, students, imports & exports,
etc.
2. Singular sense (formal definition):
Statistics is the subject that deals with the methods of collecting,
organizing, presenting, analyzing and interpreting of numerical
Compiled by: Bacha E., Applied Mathematics, ASTU
data.
Statistical methods can be used to find answers to the
questions like:
• What kind and how much data need to be collected?
• How should we organize and summarize the data?
• How can we analyze the data and draw conclusions from
it?
• How can we assess the strength of the conclusions and
evaluate their uncertainty?
Classifications of Statistics
Depending on how data can be used statistics is some times divided
in to two main areas or branches.
Note that quantitative variables are either discrete (which can assume only certain
values, and there are usually "gaps" between the values, such as the number of
bedrooms in your house) or continuous (which can assume any value within a specific
range, such as the air pressure in a tire.)
Limitations of Statistics
It does not deal with individual values
etc
Compiled by: Bacha E., Applied Mathematics, ASTU
Level of Measurements
Proper knowledge about the nature and type of data to be dealt with is essential in
order to specify and apply the proper statistical method for their analysis and
inferences.
Four levels of measurement scales are commonly distinguished:
1. Nominal scale
-no ranking or ordering
-all arithmetic & relational operations are not applicable
- no numerical or quantitative value
Example : -Sex (Male or Female),
-Marital Status (married, single, widow, divorce)
2. Ordinal Scale
Can be arranged in some order, but the differences between the data values are
meaningless.
All arithmetic operations are not applicable
All relational operations are applicable
Example:- latter grading (A, B, C, D, F)
- Rating
Compiled by: Bachascales (excellent,
E., Applied very
Mathematics, good,
ASTU good, fair, poor)
- military status (general, colonel, lieutenant, etc).
3. Interval Scale
All relational operations are applicable
All arithmetic operations except division are applicable
There is no true zero, or starting point.
That is, zero on the scale is arbitrary (artificial origin)
Example: - 𝑇𝑒𝑚𝑝𝑒𝑟𝑎𝑡𝑢𝑟𝑒(℃)
- Intelligence Quotient (IQ)
4. Ratio Scale
All arithmetic & relational operations are applicable
Zero on the scales implies absolute absence of the
characteristics under considered
Example: - Weight, age, number of students, number of children
per family etc
Compiled by: Bacha E., Applied Mathematics, ASTU
CHAPTER TWO
Methods of Data Collection and
Presentation
I. Methods Data Collection Collection
Based on their sources data can be classified into two.
i. Primary data
ii. Secondary data.
Primary data are those collected by the investigator for the purpose a
specific study, whereas
Secondary data are obtained from available data already collected by
some other agency for the same or different purpose.
Telephone interview
V) Observation
Frequency (fi) 25 20 8 7 60
No. of 2 3 4 5 6 7 8 Total
Children
Frequency 5 7 8 4 1 2 3 30
(fi)
Compiled by: Bacha E., Applied Mathematics, ASTU
3. Continuous Frequency Distribution:- Continuous FD’s
arise from continuous variables.
- When the range of the data is large, the data must be grouped
in to classes that are more than one unit in width.
Basic Terms in a continuous frequency distribution
Class Frequency (or frequency):- refers to the number of
items belonging to a class.
Class limits (C.L.):- It divided into two.
i) Lower class limit (LCL)
ii) Upper class limit (UCL)
Example: Consider the mark of 40 students out of 60 given
below.
𝐿𝐶𝐿1 = 6 , 𝐿𝐶𝐿2 = 12, … , 𝐿𝐶𝐿6 = 36
𝑈𝐶𝐿
Compiled1by:= 11,
Bacha 𝑈𝐶𝐿
E., Applied 2 = 17,
Mathematics, ASTU … , 𝑈𝐶𝐿6 = 41
Table 1: The mark of 40 students out of 60
Class No of Class Class Relative % R.F L.C.F M.C.F
limits Students Boundary Mark frequency
(Mark) (fi) (C.B) (C.M) (R.F)
𝑈 = 𝐿𝐶𝐿𝑖+1 − 𝑈𝐶𝐿𝑖
e.g. Calculate the class width of the above example. Table one.pptx
Solution:
fi
%R.F 100%
n
Example: Table one.pptx
Compiled by: Bacha E., Applied Mathematics, ASTU
Cumulative Frequency Distribution
1. Less than cumulative frequency (L.C.F):- is obtained by adding
the frequency of all the preceding classes including the frequency
of that class.
- In other word L.C.F is the total number of observations less than the
UCB of that class.
2. More than cumulative frequency (M.C.F) :- is also obtained
by adding the frequency of all the succeeding classes including the
frequency of that class
- M.C.F is the total number of observations greater than the LCB of
that class.
Example: Using the above example find the L.C.F and M.C.F Table
one.pptx
Compiled by: Bacha E., Applied Mathematics, ASTU
Constructing a continuous frequency distribution
Practical steps in constructing continuous frequency distribution
1. Determine the number of classes (k)
Using Sturges‟ rule-of-thumb:
k = 1 + 3.322 log n
where k is the number of classes,
log is common logarithm
n is the total number observations in our sample
2. Determine the Class Width (w)
𝑅𝑎𝑛𝑔𝑒
𝑤= 𝑘
and rounded to the nearest integer.
Range = largest value – smallest value
R = L – S and rounded to the nearest integer
Compiled by: Bacha E., Applied Mathematics, ASTU
3. Determine the Class Limits
The lower class limit of the first class should be less than
or equal to the smallest value of the observations collected from
the field
Add the class width on the lower class limit to obtain the lower
class limit of the next higher class.
Subtract the unit of measure from 2nd LCL to obtain the 1st UCL.
Then Add the class width on the UCL to obtain the upper class
limit of the next higher class.
Example:- Construct a continuous frequency distribution for the
following raw data on marks (out of 100) obtained by 50
students in Statistics course.
57, 53, 65, 55, 50, 45, 64, 52, 15, 46, 42, 63, 33, 64, 53, 25, 54, 35,
48, 55, 70, 47, 39, 58, 52, 36, 65, 75, 26, 20, 55, 60, 83, 61, 45,
63, 49, 42, 35, 18, 51, 45, 42, 65, 39, 59, 45, 41, 30, 40.
Compiled by: Bacha E., Applied Mathematics, ASTU
Cont…
Solution: n = 50, L = 83, S = 15. Then
k = 1 + 3.322 log n= 1+3.322 (log50) = 6.64 ≈ 7
R = L – S =83 – 15 = 68
w = R/k = 68/7 = 9.71 ≈ 10
Table 2: The marks of 50 students (out of 100) obtained in
Statistics course.
Class 15 -24 25-34 35-44 45-54 55-64 65-74 75-84 Total
limits
fi 3 4 10 15 12 4 2 50
C.B
C.M
R.Compiled
F by: Bacha E., Applied Mathematics, ASTU
L.C.F
Diagrammatic and Graphical Method of Data Presentation
They are thick lines (narrow rectangles) having the same breadth (size).
600
400 450 600
400 Frequency 400 Male
200
200 200
0 0
Phys Maths Chem Bio Phys Maths Chem Bio
Deprtment Department
Compiled by: Bacha E., Applied Mathematics, ASTU
Pie Chart
Pie chart:- is a circle that is divided in to sections or wedges
according to the percentage of frequencies in each category of the
distribution.
Example: Draw a pie chart to represent the following population data
in a town.
Men Women Girls Boys
fi
% 100%
n
Compiled by: Bacha E., Applied Mathematics, ASTU
Pie chart …..
Class Frequence (fi) percentage
Boys Men
15% 25%
Girls Women
40% 20%
data.
1. Histogram
2. Frequency polygon