Statistics For Finance
TABLE OF CONTENTS
BLOCK 6: INTRODUCTION TO PROBABILITY DISTRIBUTIONS
BLOCK 1: INTRODUCTION
UNIT 1: INTRODUCTION TO STATISTICS
INTRODUCTION
When did the practice of Statistics start? It is not possible to tell the exact point
at which Statistics started. Although it is not done consciously, the practice of
Statistics is one of the things human beings perform in their day-to-day activities. All the
estimations, forecasts, comparisons, averaging, and so on are part of Statistics. Statistics
is part of our lives. We encounter it in all walks of life, but most of the
time we don't recognize it.
It was around the 16th century that Statistics came to be considered a formal discipline. After
passing through continuous stages of development, Statistics is now more advanced than at any
earlier point in its history.
Nevertheless, Statistical results are not ends by themselves. They are used to develop and
strengthen ideas or theories that we have in other fields like economics, social and natural
science, business, medicine, etc. Robert W. Burgess, as quoted by C.B. Gupta, put the aim
of Statistics as:
“The fundamental gospel of Statistics is to push back the domain of ignorance, prejudice,
rule of thumb, arbitrary or premature decisions, traditions and dogmatism, and to increase
the domain in which decisions are made and principles are formulated on the basis of
analyzed facts.”
BLOCK 2: CLASSIFICATION AND PRESENTATION
OF STATISTICAL DATA
INTRODUCTION
In the preceding block, you have learnt about the collection of data. When conducting a
statistical study, you must gather data for the particular variable under study.
In this block, you will learn about classification and presentation of data. The first part
deals with ‘classification of data’ and the following unit deals with ‘presentation of data’.
The purpose of this block is to explain how to organize data by constructing ‘frequency
distribution’ and how to present the data by constructing graphs and charts. The graphs
and charts illustrated in this block include histograms, frequency polygons, ogives, pie
charts, bar charts, time series graphs, and pictographs (pictograms).
BLOCK 3: MEASURES OF CENTRAL
TENDENCY
INTRODUCTION
Visual presentation of data discloses some characteristic features of a mass of data.
Further summarization is essential to show the relationship between variables and to
correlate one variable with another. To describe the characteristic features of an entire
mass of data with a single figure, the most obvious measure, and one that helps in making
quicker and better decisions, is the measure of central tendency, also called the average.
An average gives a bird's-eye view of a huge mass of data that is not easily intelligible,
since it is a numerical value that acts as a central point about which the other values in
a series are dispersed.
BLOCK 4: MEASURES OF DISPERSION
(VARIATION)
UNIT 9: RANGE AND QUARTILE DEVIATION
UNIT 10: MEAN DEVIATION
UNIT 11: STANDARD DEVIATION
INTRODUCTION
BLOCK 5: ELEMENTARY PROBABILITY THEORY
INTRODUCTION
Dear students, this block consists of two units. The first, on counting methods,
gives you a clear idea of how to determine the possible number of elements in an
event as well as in the sample space of an experiment, while the second unit gives you the
techniques for determining the probability of an event. You also need to realize that in
ordinary language, probability, chance, likelihood, and odds are used interchangeably.
BLOCK 6: INTRODUCTION TO
PROBABILITY DISTRIBUTIONS
INTRODUCTION
In the preceding block, you learnt about probability, one of the most important tools
in statistics. It is important in decision making because it provides a mechanism for
measuring, expressing and analyzing the uncertainties associated with future events.
You also saw that the probability of an event could be computed either by summing the
probabilities of the experimental outcomes (sample points) comprising the event or by
using the relationships established by the addition, conditional probability and
multiplication laws of probability.
In this block, you will learn what a ‘probability distribution’ is. The first part deals with
discrete probability distributions, and the following unit gives an introduction to
continuous probability distributions, focusing on the normal probability
distribution.
BLOCK 7: SAMPLING AND SAMPLING DISTRIBUTION OF THE
MEAN
BLOCK 9: CORRELATION AND REGRESSION
UNIT 21 CORRELATION
UNIT 22 REGRESSION
INTRODUCTION
In this block, we shall look at methods that investigate whether two quantitative
variables are related. In practical life, we come across data sets that deal
with more than one variable, where the variables are interrelated and interdependent. For example, an
instructor wonders whether the Mathematics mark and the Statistics mark are related: does
a good performance in one subject go with a good performance in the other?
She/he decides that s/he can most easily discover this by plotting the marks for all the
students on a sheet of graph paper. In addition, she/he tries to see the relationship
mathematically. These mathematical methods are Karl Pearson’s coefficient of
correlation and Spearman’s Rank coefficient of correlation.
INTRODUCTION
In the preceding block, you have learnt about the collection of data. When conducting a
statistical study, you must gather data for the particular variable under study.
In this block, you will learn about classification and presentation of data. The first part
deals with ‘classification of data’ and the following unit deals with ‘presentation of data’.
The purpose of this block is to explain how to organize data by constructing ‘frequency
distribution’ and how to present the data by constructing graphs and charts. The graphs
and charts illustrated in this block include histograms, frequency polygons, ogives, pie
charts, bar charts, time series graphs, and pictographs (pictograms).
The aim of this unit is to study about the collection of data for a statistical study and
discuss the various types of classification of data, and then to organize these data into a
frequency distribution.
3.1. INTRODUCTION
After collecting relevant information (data) for the purpose of statistical investigation, the
next important task is classification and presentation of this data. It is difficult to grasp
the meaning of any considerable volume of numerical data unless their mass is somehow
reduced to relatively few convenient classes or categories and presented with the
This section discusses classification of data. Presentation of data using graphs and charts
3.2. DEFINITION OF CLASSIFICATION OF DATA
Purposes of Classification:-
To eliminate unnecessary detail.
To bring out clearly points of similarity & dissimilarity
To enable one to form mental pictures of objects on measurements
To enable one to make comparisons and draw inferences
Example
Region Common Language Spoken
1 Tigrigna
2 Afar
3 Amharic
4 Oromifa
Example
Year (in EC) Population (in million)
1974 30
1986 52
1991 60
Example 3. Employees in a Factory x
Educated Un educated
Example 4.
Mr. x Height (X) in cm
A 160
B 182
C 175
D 178
Note: There are two kinds of variables, which can have values: Discrete Variable and
Continuous Variable.
A. Discrete Variables – are variables that are associated with enumeration or counting
Example
Number of students in a class
Number of children in a family, etc
When the raw data have been collected, they should be put into an ordered array in
ascending or descending order so that they can be looked at more objectively. Then the
data must be organized into a frequency distribution (FD), which simply lists the values or classes with their
corresponding frequencies in tabular form. Here, frequency refers to the number of
times a certain value occurs in the data.
The tabular representation of values of a variable together with the corresponding
frequency is called a Frequency Distribution (FD).
Definition:
A frequency distribution is the organization of raw data in table form, using classes and
frequencies.
Solution:
No. of Children No. of Family Frequency
(Values) (Tallies)
0 // 2
1 //// 4
2 //// 5
3 /// 3
4 // 2
Total 16
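The tallying step behind the table above can be sketched in Python. This is only an illustration: the raw list `children` is a hypothetical data set chosen so that its counts match the frequency column of the table, and the helper name `ungrouped_fd` is an assumption, not part of the original text.

```python
from collections import Counter

# Hypothetical raw data: number of children in 16 families, chosen so that
# the counts agree with the frequency column of the table above.
children = [0, 0, 1, 1, 1, 1, 2, 2, 2, 2, 2, 3, 3, 3, 4, 4]


def ungrouped_fd(values):
    """Return (value, frequency) pairs sorted by value."""
    counts = Counter(values)          # tally how often each value occurs
    return sorted(counts.items())


fd = ungrouped_fd(children)
for value, freq in fd:
    # Print the value, a tally made of slashes, and the frequency.
    print(f"{value:>5}  {'/' * freq:<6} {freq}")
print("Total", sum(freq for _, freq in fd))
```

The same function can be applied to any list of discrete values, such as the test scores in CYP 1.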
CYP 1
Consider the following scores in a statistics test obtained by 20 students in a given class.
10, 4, 4, 7, 5, 7, 7, 8, 5, 7, 8, 5, 10, 8, 7, 5, 7, 8, 7, 4
Prepare an ungrouped FD
B. Grouped Frequency Distribution (GFD)
If the mass of the data is very large, it is necessary to condense the data into an
appropriate number of classes or groups of values of a variable and to indicate the number
of observed values that fall into each class. Therefore, a GFD is a frequency
distribution in which the values of a variable are grouped into classes and each class is
paired with the number of observations it contains.
Example*
Values (xi)        1 - 25    26 - 50    51 - 75    76 - 100
Frequency (fi)       3         10         18          6
i. Class: a group of values of a variable between two specified numbers called the lower
class limit (LCL) and the upper class limit (UCL).
In Example *, the GFD contains four classes: 1 – 25, 26 – 50, 51 – 75, and 76 – 100
LCL1 = 1, UCL1 = 25        LCL3 = 51, UCL3 = 75
LCL2 = 26, UCL2 = 50       LCL4 = 76, UCL4 = 100
ii. Class Frequency (or Simply Frequency): refers to the number of observations
corresponding to a class.
In Example * the class frequency of the 1st, 2nd, 3rd, & 4th classes are respectively 3, 10,
18 and 6.
iii. Class Boundaries: are boundaries obtained by subtracting half of the unit of
measurement (u) from the lower limits or by adding ½ (u) on the upper limits of a class.
i.e UCBi = UCLi + ½ (u)
LCBi = LCLi - ½ (u)
Where UCBi = Upper Class Boundaries and
LCBi = Lower Class Boundaries
Remark: The unit of measurement (u) is the gap between any two successive classes, i.e.
u = LCL of a class – UCL of the preceding class.
In Example *, consider the 2nd class, 26 – 50. Since u = 26 – 25 = 1,
LCL2 = 26 UCL2 = 50
LCB2 = 26 - ½(1) = 25.5 UCB2 = 50 + ½(1) =50.5
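As a rough check of the boundary formulas LCBi = LCLi - ½(u) and UCBi = UCLi + ½(u), the calculation can be sketched in Python for the four classes of Example *. The function name `class_boundaries` is an illustrative assumption.

```python
def class_boundaries(lcl, ucl, u=1):
    """Return (LCB, UCB) for a class with limits lcl..ucl and unit of measurement u."""
    return lcl - u / 2, ucl + u / 2


# Class limits of Example *: 1-25, 26-50, 51-75, 76-100
limits = [(1, 25), (26, 50), (51, 75), (76, 100)]

# The unit of measurement is the gap between two successive classes: 26 - 25 = 1
u = limits[1][0] - limits[0][1]

boundaries = [class_boundaries(lcl, ucl, u) for lcl, ucl in limits]
for (lcl, ucl), (lcb, ucb) in zip(limits, boundaries):
    print(f"{lcl} - {ucl}: LCB = {lcb}, UCB = {ucb}")
```

Note that the upper boundary of one class coincides with the lower boundary of the next, which is what lets the rectangles of a histogram sit side by side.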
iv. Class Width (size of a class or class interval): it is the difference between the upper
and lower class limits or the difference between the upper and lower class boundaries of
any class.
Remarks:
1. If both the LCL & UCL are included in a class, it is called an inclusive class. For
inclusive classes,
Class width (cw) = UCBi - LCBi
2. If LCL is included and the UCL is not included in a class, it is called an exclusive
class. For exclusive classes
cw = UCLi – LCLi
vi. Range (R) : is the difference between the largest (L) and the smallest (S)
values in a data
R=L–S
b. How many observations (items) are linked into the last class?
c. Find i. the LCL and UCL of the fourth class
ii. the UCB and LCB of the third class
iii. the class interval ( class width) of the fifth class
iv. the class mark (mid point) of the second class
Remark Unequal class intervals create problem in graphing and computing some
statistical measures
20 48 65 25 48 49
35 25 72 42 22 58
53 42 23 57 65 37
18 65 37 16 39 42
49 68 69 63 29 67
a. construct a GFD with a suitable number of classes
b. complete the distribution obtained in (a) with class boundaries & class marks
iii. Class width (cw) = Range / n = 56 / 6 = 9.33
For the sake of convenience, take cw to be 10 (note that it is also
possible to choose the cw to be 9).
iv. Take the lower limit of the 1st class (LCL1) to be 16 and u = 1,
i.e. LCL1 = 16 and UCL1 = LCL1 + cw – u = 16 + 10 – 1 = 25
LCL2 = LCL1 + cw = 16 + 10 = 26     UCL2 = UCL1 + cw = 25 + 10 = 35
LCL3 = LCL2 + cw = 26 + 10 = 36     UCL3 = UCL2 + cw = 35 + 10 = 45
a)
Class (xi)    Frequency (fi)
16 – 25       7
26 – 35       2
36 – 45       6
46 – 55       5
56 – 65       6
66 – 75       4
b)
Class (xi)    Frequency (fi)    CBi            cmi
16 – 25       7                 15.5 – 25.5    20.5
26 – 35       2                 25.5 – 35.5    30.5
36 – 45       6                 35.5 – 45.5    40.5
46 – 55       5                 45.5 – 55.5    50.5
56 – 65       6                 55.5 – 65.5    60.5
66 – 75       4                 65.5 – 75.5    70.5
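The construction steps used in this example (choose a first lower limit, add the class width repeatedly, then tally) can be sketched in Python using the 30 observations given above. The function `grouped_fd` and its parameter names are illustrative assumptions; the choices LCL1 = 16, cw = 10 and u = 1 are taken from the worked solution.

```python
# The 30 observations from the example above.
data = [20, 48, 65, 25, 48, 49, 35, 25, 72, 42, 22, 58, 53, 42, 23,
        57, 65, 37, 18, 65, 37, 16, 39, 42, 49, 68, 69, 63, 29, 67]


def grouped_fd(values, first_lcl, cw, n_classes, u=1):
    """Build a grouped frequency distribution with inclusive class limits."""
    rows = []
    for k in range(n_classes):
        lcl = first_lcl + k * cw            # lower class limit
        ucl = lcl + cw - u                  # upper class limit
        freq = sum(1 for v in values if lcl <= v <= ucl)
        rows.append({
            "class": (lcl, ucl),
            "f": freq,
            "cb": (lcl - u / 2, ucl + u / 2),   # class boundaries
            "cm": (lcl + ucl) / 2,              # class mark (mid point)
        })
    return rows


table = grouped_fd(data, first_lcl=16, cw=10, n_classes=6)
for row in table:
    print(row["class"], row["f"], row["cb"], row["cm"])
```

Changing `cw` to 9 shows how the choice of class width alters the resulting distribution.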
CYP 3
Construct a grouped frequency distribution for the following ages of 50 persons with 6
classes.
37 40 69 35 36 70 72 62 36 72
65 64 47 59 55 42 45 50 46 65
54 63 51 50 61 60 58 58 56 58
55 45 49 51 50 56 44 60 70 44
52 43 55 46 42 62 57 48 60 55
Remark: The frequency distribution does not tell us directly the number of units above
or below specified values of the classes. This can be determined from a ‘cumulative
frequency distribution’.
Class (xi)    Frequency (fi)    Less than cumulative    More than cumulative
                                frequency (<cfi)        frequency (>cfi)
3 – 6         4                 4                       30
7 – 10        7                 11                      26
11 – 14       10                21                      19
15 – 18       6                 27                      9
19 – 22       3                 30                      3
This means that, from the ‘less than’ cumulative frequency distribution, there are 4
observations below 6.5, 11 observations below 10.5, etc., and from the ‘more than’
cumulative frequency distribution, 30 observations are above 2.5, 26 above 6.5, etc.
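The two cumulative columns can be reproduced with running totals. The sketch below uses the class frequencies 4, 7, 10, 6, 3 from the table above; the variable names are illustrative.

```python
from itertools import accumulate

freqs = [4, 7, 10, 6, 3]   # class frequencies from the table above

# 'Less than' cumulative frequency: running total from the first class down.
less_than_cf = list(accumulate(freqs))

# 'More than' cumulative frequency: running total from the last class up.
more_than_cf = list(accumulate(reversed(freqs)))[::-1]

print(less_than_cf)
print(more_than_cf)
```

The last 'less than' value and the first 'more than' value both equal the total number of observations, which is a quick consistency check on any cumulative table.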
3.8. RELATIVE FREQUENCY DISTRIBUTION (RFD)
It enables the researcher to know the proportion or percentage of cases in each class.
Relative frequencies can be obtained by dividing the frequency of each class by the total
number of observations:
$$Rf_i = \frac{f_i}{n}$$
Where Rfi – is the relative frequency of the ith class
fi – is the frequency of the ith class
n – is the total number of observations
Note: Pfi = Rfi × 100%
Where Pfi is percentage frequency of each class.
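A minimal sketch of the relative and percentage frequency calculation follows. The frequencies 4, 7, 10, 6, 3 are borrowed from the cumulative frequency table earlier in this unit purely for illustration.

```python
freqs = [4, 7, 10, 6, 3]      # class frequencies, reused for illustration
n = sum(freqs)                # total number of observations

rel = [f / n for f in freqs]              # Rf_i = f_i / n
pct = [round(100 * r, 1) for r in rel]    # Pf_i = Rf_i x 100%, rounded to 1 decimal

for f, r, p in zip(freqs, rel, pct):
    print(f"f = {f:>2}   Rf = {r:.3f}   Pf = {p}%")
```

The relative frequencies always add up to 1 (and the percentage frequencies to 100%, apart from rounding).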
This unit discussed the definitions of classification of data and a frequency distribution.
In order to describe situations, draw conclusions or make inferences about random
events, one must organize the data in some meaningful way. The most convenient
method of organizing data is to construct a frequency distribution.
Therefore, a frequency distribution was seen as a distribution showing the
correspondence of values or classes with their respective frequencies.
CYP 1
Value(xi) Frequency(fi)
4 3
5 4
7 7
8 4
10 2
CYP 2
a) 12
b) 3
c) i) L.C.L4 = 20 and U.C.L4 = 24
ii) Since u = 10 – 9 = 1 (or any gap between two consecutive classes)
L.C.B3 = L.C.L3 – ½(u) = 15 – ½(1) = 14.5
U.C.B3 = U.C.L3 + ½(u) = 19 + ½(1) = 19.5
iii) class interval = class width = cw = UCB5 – LCB5 = 29.5 – 24.5 = 5
iv) class mark (cm2) = (UCB2 + LCB2) / 2 = (14.5 + 9.5) / 2 = 24/2 = 12
CYP 3
lower limits of the next class. Keep adding until there are 6 classes as
shown
35
42
49
56
63
70
v) Subtract one unit from the lower limit of the second class to get the upper
limit of the first class; then add the class width(cw) to each upper limit
to get all the upper limits. i.e. UCL1 = LCL2 - 1 = 42 – 1 = 41. So the
first class is 35-41.
vi) Tally the data (count the number of observations linked in to the respective
classes) and write the numerical values for tallies in the frequency column.
Therefore, the frequency distribution would be:
a) A frequency distribution is the organization of raw data, in table form, that lists
values or classes with their corresponding frequencies.
b) The mid point of a class is found by adding the upper and lower limits and
dividing by
c) If the gap between any two successive classes is one and the limits of a class are
10-19,
then the width of the class is 9.
d) If the limits of a class in a frequency distribution are 26-30, then the boundaries
are
25.5-30.5.
e) When data is first collected, it is called raw data.
2. Classify each variable as discrete or continuous.
32 21 28 31 35 46 48 49 49 48
36 37 22 31 28 34 20 45 44 48
38 33 33 23 28 29 33 26 36 30
43 42 32 36 24 27 27 32 45 45
39 39 38 32 33 25 30 28 37 36
42 43 38 40 35 34 20 30 36 32
40 38 38 40 46 36 35 21 31 35
41 42 39 40 46 44 32 37 22 27
41 39 40 38 44 45 48 36 32 23
40 41 40 44 49 49 49 49 37 33
Construct a Grouped Frequency Distribution (GFD) with five classes for the above data.
3.12. GLOSSARY
Frequency: The number of values in a specific class of the distribution or the number of
times a value occurs in the distribution.
Cumulative Frequencies: Refer to the total frequency of all values up to and including
the upper boundary of the class interval that is under consideration.
Class: Refers to a group of data considered as one item in a frequency distribution.
Range: Means the difference between the largest and the smallest values in a set of data.
Class Limits: Means limits of different classes in a frequency distribution.
Class Boundaries: Boundaries that are obtained by adding and subtracting half of the unit of
measurement.
3.13. REFERENCES
CONTENTS:
4.0. Aims and Objectives
4.1. Introduction
4.2. Histogram
4.3. Frequency Polygon
4.4. Cumulative Frequency Curve (Ogive)
4.5. Line Graph
4.6. Vertical Line Graph
4.7. Bar Chart (Bar Diagram)
4.8. Types of Bar Charts
4.9. Pie Chart
4.10. Pictograph (Pictogram)
4.11. Summary
4.12. Answer to Check Your Progress Questions (CYP)
4.13. Model Examination Questions
4.14. Glossary
4.15. References
The aim of this unit is to study how to construct and present data using different types of
graphs, charts, and diagrams that facilitate comparisons and, in general, give a
good overall picture of the data.
At the end of this unit, you will be able to:
4.1. INTRODUCTION
This unit deals with the study of organizing a set of raw data in to a Frequency
polygon, & a cumulative frequency curve (ogive). The other types of numerical
information will be summarized & presented in the form of bar chart, pie chart or a
pictogram.
Definition:
4.2. HISTOGRAM
After you complete a frequency distribution, your next step will be to construct a
“picture” of these data values using a histogram. A histogram is a graph consisting of a
series of adjacent rectangles whose bases are equal to the class widths of the
corresponding classes and whose heights are proportional to the corresponding class
frequencies. Here, class boundaries are marked along the horizontal axis (x – axis) and
the class frequencies along the vertical axis (y – axis) according to a suitable scale. It
describes the shape of the data. You can use it to answer quickly such questions as: Are
the data symmetric? Where do most of the data values lie?
11 – 14 10
15 – 18 6
19 - 22 3
Total 30
Solution:
[Figure: histogram of the distribution above. Vertical axis: class frequency (fi).]
along the y – axis. Empty classes are included at each end so that the curve is anchored
to the x – axis.
Solution:
[Figure: A frequency polygon for the distribution in example 9. Horizontal axis: class marks (cmi); vertical axis: frequency (fi).]
CYP 2 construct a frequency polygon for the frequency distribution given under CYP 1
Solution:
[Figure: A ‘less than’ ogive showing the frequency distribution above. Horizontal axis: upper class boundary (UCBi), marked at 6.5, 10.5, 14.5, 18.5 and 22.5; vertical axis: less than cumulative frequency (<Cfi).]
B) ‘More than’ ogive: here, lower class boundaries are plotted against the ‘more
than’ cumulative frequencies of their respective class and they are joined by
adjacent lines.
Example 4. Draw a ‘More than’ ogive for the frequency distribution in Example 11
Solution:
[Figure: A ‘more than’ ogive for the above frequency distribution. Horizontal axis: lower class boundaries (LCBi), marked at 2.5, 6.5, 10.5, 14.5 and 18.5; vertical axis: more than cumulative frequency (>Cfi).]
It represents the relationship between time (on the x-axis) and the values of a variable (on the
y-axis). The values are recorded with respect to the time of occurrence.
Solution:
[Figure: A line graph showing the above time series. Horizontal axis: year (1986 to 1991); vertical axis: values.]
4.6. VERTICAL LINE GRAPH:
Family A B C D E
Number of children 3 2 7 6 4
Solution:
[Figure: Vertical line graph showing the number of children in families A, B, C, D and E. Horizontal axis: family; vertical axis: number of children.]
Histograms, frequency polygons and ogives are used for data having an interval or ratio level
of measurement. Other ways of presenting statistical data, suited to particular
kinds of situations, are bar charts, pie charts and pictographs.
A bar chart is a series of equally spaced bars of uniform width where the height (length) of
a bar represents the amount (magnitude) of frequency corresponding to a category.
Bars may be drawn horizontally or vertically. Vertical bar graphs are preferred as they
allow comparison with other bars.
It represents a single set of data (variable) classified in different categories. Single bars
are drawn with the respective frequencies.
Example 18: Revenue (in millions of Birr) of company x from 1980 to 1982 is given
below
Year    Revenue
1980    50
1981    150
1982    200
Solution:
[Figure: bar chart of revenue by year, 1980 to 1982. Horizontal axis: year; vertical axis: revenue.]
Here, two or more bars are grouped with the corresponding frequency to represent two or
more interrelated data in each category. The bars of related variables are kept adjacent to
each other for every set of values. These charts can be used if the overall total is not
required. Each bar is shaded or colored separately and a key is given to distinguish
them.
Example19: The following table shows the production of wheat and maize in hundreds
of quintals.
1981 20 60
1982 60 100
Solution:
[Figure: multiple bar chart of wheat and maize production by year, 1980 to 1982. Horizontal axis: year; vertical axis: number of quintals; key: maize, wheat.]
It is used to present data by subdividing a single bar with respect to the proportional
frequency. Each portion of the bar is then shaded or colored and a key is given to
distinguish them.
Example20: The number of quintals of wheat and maize (in millions of quintals)
produced by country x in the indicated years.
[Figure: subdivided bar chart, "The number of quintals of wheat and maize produced by country X". Wheat and maize respectively: 150 and 150 in 1980, 300 and 200 in 1981, 350 and 100 in 1982. Horizontal axis: year; vertical axis: number of quintals.]
It is a subdivided bar chart where percentages are used in each classification rather than the
actual frequencies.
Example 21: Construct a percentage bar chart for the data in Example 20.
Solution:
Year    % of Wheat Production       % of Maize Production
1980    150/300 × 100 = 50          150/300 × 100 = 50
1981    300/500 × 100 = 60          200/500 × 100 = 40
1982    350/450 × 100 = 78          100/450 × 100 = 22
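The percentages in the table above can be recomputed with a short Python sketch. The dictionary layout and the name `production` are assumptions made for illustration; the quantities are the wheat and maize figures used in the table.

```python
# (wheat, maize) production per year, as used in the percentage table above.
production = {1980: (150, 150), 1981: (300, 200), 1982: (350, 100)}

percent = {}
for year, (wheat, maize) in production.items():
    total = wheat + maize
    # Each component is expressed as a percentage of that year's total,
    # rounded to the nearest whole number as in the table.
    percent[year] = (round(100 * wheat / total), round(100 * maize / total))

for year, (w, m) in percent.items():
    print(year, f"wheat {w}%", f"maize {m}%")
```

Because every bar in a percentage bar chart has the same height (100%), only the split inside each bar changes from year to year.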
[Figure: percentage bar chart of wheat and maize production, 1980 to 1982. Horizontal axis: year; vertical axis: percentage produced (0% to 100%); key: wheat, maize.]
4.9. PIE CHART
A pie chart is a circle divided in to various sectors with areas proportional to the value of
the component they represent. It shows the components in terms of percentages not in
absolute magnitude. The degree of the angle formed at the center has to be proportional
[Figure: pie chart with sectors for Food, House rent, Clothing and Misc.; data labels 300, 350, 100 and 250.]
Example 23: In comparing the population of a country from 1990 to 1992, we simply
draw pictures of people where each picture may represent 1,000,000 people.
[Figure: pictograph of the population in 1990, 1991 and 1992. Key: one picture = 1,000,000 people.]
4.11. SUMMARY
This unit discussed how to present organized data. Once a frequency distribution is
constructed, representing the data with graphs is a simple task. The most
commonly used graphs in research statistics are the histogram, the frequency polygon and the
ogive; other graphs and diagrams, such as bar charts, pie charts and pictograms, can also
be used. Some of these graphs are seen frequently in newspapers, magazines, and
various statistical reports.
CYP 1
[Figure: graph with class boundaries (CBi) from 5 to 35 on the x-axis and frequency on the y-axis.]
CYP 2
[Figure: graph with class marks (cmi) from 2.5 to 37.5 on the x-axis; y-axis labelled cumulative frequency.]
4.13. MODEL EXAMINATION QUESTION
4.14. GLOSSARY
Frequency Polygon: Refers to the graph obtained when the mid points of the tops of the
rectangles in a histogram having equal class intervals are connected by line segments.
Frequency Curve: Refers to a smooth frequency polygon for data that can take a
continuous set of values.
Bar Chart: Refers to a graph made up of bars whose lengths are proportional to
quantities in a set of data.
Pie Chart: Refers to a diagram wherein proportions are shown as sectors of a circle.
4.15 REFERENCES
BLOCK 4: MEASURES OF CENTRAL
TENDENCY
INTRODUCTION
Visual presentation of data discloses some characteristic features of a mass of data.
Further summarization is essential to show the relationship between variables and to
correlate one variable with another. To describe the characteristic features of an entire
mass of data with a single figure, the most obvious measure, and one that helps in making
quicker and better decisions, is the measure of central tendency, also called the average.
An average gives a bird's-eye view of a huge mass of data that is not easily intelligible,
since it is a numerical value that acts as a central point about which the other values in
a series are dispersed.
UNIT 5: DEFINITION AND PURPOSE OF AVERAGES
CONTENTS:
5.1 Definition
5.2 Purpose of Average
5.3 Requisites of a good average
5.4 Glossary
5.5 References
5.1 DEFINITION
Statistics provides tools to reduce each group of values into a single summary figure
representing that group. These representative values are called averages (the measures
of central tendency). In other words, they are measures which condense a huge,
unwieldy set of numerical data into a single value. Its value always lies between the
To sum up, the averages are very much useful in:
i) Describing the distribution in concise manner
ii) Comparative study of different distributions
iii) Computing various other statistical measures such as dispersion, skewness
and other basic characteristics of a mass of data.
5.4 GLOSSARY
Fluctuation - Move up and down or be irregular (of price, level, etc.)
Extreme values - Refers to the largest or smallest variant values which are borne by the
number of a set. The expression signifies values neighboring the end
values.
Inference - Drawing conclusion from facts or by reasoning.
Parameter - Refers to characteristic or determining feature.
5.5 REFERENCES
Business Statistics, C.R. REEDY. M Com Ph. D., 1994
Business Statistics [A textbook for B.Com. Students of Indian Universities].
R.H. DHARESHWAR, M.Sc. M.Phil. 1999
UNIT 6: MATHEMATICAL MEASURES OF CENTRAL TENDENCY
CONTENTS:
6.0 Aims and Objectives
6.1 Introduction
6.2 Summation Notation and Its Properties
6.3 Arithmetic Mean (AM)
6.4 Geometric Mean (GM)
6.5 Harmonic Mean (HM)
6.6 Advantages and Application Areas of the Three Means
6.7 Summary
6.8 Model Examination Questions
6.9 Answers to Check Your Progress Questions
6.10 Glossary
6.11 References
6.1 INTRODUCTION
The definition should not leave anything to the discretion of the person who calculates
averages. Averages should be computed with sufficient ease and rapidity, and
should not involve much mathematical complexity. The most popular and widely
used measure for representing the entire data by one value is the arithmetic mean.
The summation operator, $\sum$, implies that the values that follow it are to be summed or added
together.
$$\sum_{i=m}^{n} x_i$$
where n is the upper limit, m is the lower limit, and $x_i$ is the i-th value of the variable x.
Example: $\sum_{i=1}^{5} x_i = x_1 + x_2 + x_3 + x_4 + x_5$
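Properties:
1. The summation of sums or differences
$$\sum_{i=1}^{n} (x_i + y_i) = \sum_{i=1}^{n} x_i + \sum_{i=1}^{n} y_i, \qquad \sum_{i=1}^{n} (x_i - y_i) = \sum_{i=1}^{n} x_i - \sum_{i=1}^{n} y_i$$
Example: Suppose x1 = 1 , x2 = 3 , x3 = 4 , y1 = 2 , y2 = 5 , y3 = 3
Then $\sum_{i=1}^{3} (x_i + y_i) = \sum_{i=1}^{3} x_i + \sum_{i=1}^{3} y_i$ and $\sum_{i=1}^{3} (x_i - y_i) = \sum_{i=1}^{3} x_i - \sum_{i=1}^{3} y_i$ …… left for the student
2. Multiplication by a constant
$$\sum_{i=1}^{n} k x_i = k \sum_{i=1}^{n} x_i$$
3. Summation of a constant
$$\sum_{i=1}^{n} k = nk$$
Example: $\sum_{i=1}^{4} 6 = 4 \times 6$
6 + 6 + 6 + 6 = 24
For m < n,
$$\sum_{i=m}^{n} k = (n - m + 1)k$$
Example: $\sum_{i=4}^{6} 8 = (6 - 4 + 1)8$
8 + 8 + 8 = 3(8)
24 = 24
4. Sum of summations
$$\sum_{i=1}^{k} x_i + \sum_{i=k+1}^{n} x_i = \sum_{i=1}^{n} x_i \quad \text{for any } k < n$$
Example: $\sum_{i=1}^{3} x_i + \sum_{i=4}^{6} x_i = \sum_{i=1}^{6} x_i$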
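The summation properties above can be checked numerically with a short Python sketch. The x and y values are the ones given in the example for property 1; the constant k = 6 and the variable names are illustrative assumptions.

```python
x = [1, 3, 4]   # x1, x2, x3 from the example for property 1
y = [2, 5, 3]   # y1, y2, y3
k = 6           # an arbitrary constant, chosen for illustration

# Property 1: summation of sums and of differences
sum_of_sums = sum(a + b for a, b in zip(x, y))
sum_of_diffs = sum(a - b for a, b in zip(x, y))

# Property 2: a constant factor can be taken outside the summation
const_multiple = sum(k * a for a in x)

# Property 3: summing the constant 8 for i = 4, 5, 6 gives (6 - 4 + 1) * 8
const_sum = sum(8 for i in range(4, 6 + 1))

# Property 4: a sum can be split at any index and the parts added back together
split_sum = sum(x[:2]) + sum(x[2:])

print(sum_of_sums, sum_of_diffs, const_multiple, const_sum, split_sum)
```

Each computed quantity can be compared with the right-hand side of the corresponding identity, for example `sum(x) + sum(y)` for the first one.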
6 6
Let xi 10, x 148 , x1 = 3 , x2 = 2
2
CYP 1 i
i 3 i 3
6 6 6 6
xi xi xi ( xi 2) (2 xi 3) 2
2
Find i. ii. iii. iv.
i 1 i 1 i 1 i 1
2
v.
i 1
(ixi 4)
6.3.0 Definition
The arithmetic mean is the sum of the values in a group divided by the number of items
in that group. Let x1, x2, …, xn be n values of a variable x, then their arithmetic mean is
defined by:
$$\bar{x} = \frac{x_1 + x_2 + \dots + x_n}{n} = \frac{\sum_{i=1}^{n} x_i}{n} = \frac{\sum x}{n}$$
Where $\sum x$ – sum of all observations
n – total number of observations
Direct method: $\bar{x} = \frac{\sum_{i=1}^{n} x_i}{n}$
Short cut method: $\bar{x} = A + \frac{\sum d}{n}$
Where n – number of items, A – assumed mean, $\sum d$ – sum of deviations, i.e. $d_i = x_i - A$
Example: Find the arithmetic mean for the following data by
i. direct method ii. short cut method
23.4 15.6 22.1 20.0 26.7 31.4 18.9 22.3
Solution:
i. $\sum_{i=1}^{8} x_i = 180.4$, n = 8
$$\bar{x} = \frac{\sum_{i=1}^{8} x_i}{n} = \frac{180.4}{8} = 22.55$$
ii. Let A = 22, then $d_i$: 1.4, -6.4, 0.1, -2, 4.7, 9.4, -3.1, 0.3
$\sum_{i=1}^{8} d_i = 4.4$, n = 8
$$\bar{x} = A + \frac{\sum_{i=1}^{8} d_i}{n} = 22 + \frac{4.4}{8} = 22 + 0.55 = 22.55$$
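Both methods can be sketched in Python with the eight values from this example. The function names `mean_direct` and `mean_shortcut` are illustrative; the assumed mean A = 22 is the one used in the solution.

```python
values = [23.4, 15.6, 22.1, 20.0, 26.7, 31.4, 18.9, 22.3]


def mean_direct(xs):
    """Direct method: sum of the values divided by their number."""
    return sum(xs) / len(xs)


def mean_shortcut(xs, assumed):
    """Short cut method: assumed mean plus the mean of the deviations from it."""
    deviations = [x - assumed for x in xs]
    return assumed + sum(deviations) / len(xs)


print(round(mean_direct(values), 2))
print(round(mean_shortcut(values, assumed=22), 2))
```

Any assumed mean gives the same answer; a value close to the centre of the data simply keeps the deviations small and the hand arithmetic easy.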
Direct method: $\bar{x} = \frac{\sum f_i x_i}{n} = \frac{\sum fx}{n}$
Short cut method: $\bar{x} = A + \frac{\sum fd}{n}$
Where f – frequency, d – deviation of items from assumed mean (xi – A),
A – assumed mean, n – number of observations
Example: The marks of 50 students in a class test are given below. Calculate the arithmetic
mean by i. direct method ii. short cut method.
Marks              20   30   40   50   60   70
No. of Students     8   10   16    8    5    3
Solution:
Marks (xi)    fi    fx      d = x – 40 (A = 40)    fd
20            8     160     -20                    -160
30            10    300     -10                    -100
40            16    640     0                      0
50            8     400     10                     80
60            5     300     20                     100
70            3     210     30                     90
Total         50    2010                           10
i. $\bar{x} = \frac{\sum fx}{n} = \frac{2010}{50} = 40.20$
ii. $\bar{x} = A + \frac{\sum fd}{n} = 40 + \frac{10}{50} = 40.20$
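The grouped-data calculation can be sketched in Python with the marks and frequencies from this example. The function names are illustrative assumptions; A = 40 is the assumed mean used in the solution.

```python
marks = [20, 30, 40, 50, 60, 70]      # values x
students = [8, 10, 16, 8, 5, 3]       # frequencies f


def grouped_mean(xs, fs):
    """Direct method for a discrete frequency distribution: sum(f*x) / sum(f)."""
    return sum(f * x for x, f in zip(xs, fs)) / sum(fs)


def grouped_mean_shortcut(xs, fs, assumed):
    """Short cut method: A + sum(f*d) / n, where d = x - A."""
    n = sum(fs)
    return assumed + sum(f * (x - assumed) for x, f in zip(xs, fs)) / n


print(grouped_mean(marks, students))
print(grouped_mean_shortcut(marks, students, assumed=40))
```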
For continuous series:
Direct method: $\bar{x} = \frac{\sum f \, cm_i}{n}$
Step deviation method: $\bar{x} = A + \frac{\sum f d'}{n} \times c$
Where f – frequency, n – number of observations,
cmi – class mark, A – assumed mean,
d – deviation of class marks from assumed mean (cmi – A),
d' – d/c, c – class width
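Example: In a survey, the number of persons at different ages is found as follows:
Age in Year       5 - 15   15 - 25   25 - 35   35 - 45   45 - 55   55 - 65
No. of Persons    8        10        14        20        16        12
Solution:
Age        f     cm    f·cm    d = cm – 30    fd      d' = d/10    fd'
5 - 15     8     10    80      -20            -160    -2           -16
15 - 25    10    20    200     -10            -100    -1           -10
25 - 35    14    30    420     0              0       0            0
35 - 45    20    40    800     10             200     1            20
45 - 55    16    50    800     20             320     2            32
55 - 65    12    60    720     30             360     3            36
Total      80          3020                   620                  62
i. $\bar{x} = \frac{\sum f \, cm}{n} = \frac{3020}{80} = 37.75$
ii. $\bar{x} = A + \frac{\sum fd}{n} = 30 + \frac{620}{80} = 30 + 7.75 = 37.75$
iii. $\bar{x} = A + \frac{\sum f d'}{n} \times c = 30 + \frac{62}{80} \times 10 = 37.75$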
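For a continuous series, the direct and step deviation methods can be sketched in Python using the age data of this example. The function names are illustrative; the assumed mean A = 30 and class width c = 10 follow the worked solution.

```python
classes = [(5, 15), (15, 25), (25, 35), (35, 45), (45, 55), (55, 65)]
persons = [8, 10, 14, 20, 16, 12]     # frequencies f


def class_marks(class_limits):
    """Mid point of each class."""
    return [(lo + hi) / 2 for lo, hi in class_limits]


def mean_direct(class_limits, fs):
    """Direct method: sum(f * cm) / n."""
    return sum(f * cm for cm, f in zip(class_marks(class_limits), fs)) / sum(fs)


def mean_step_deviation(class_limits, fs, assumed, c):
    """Step deviation method: A + (sum(f * d') / n) * c, with d' = (cm - A) / c."""
    n = sum(fs)
    total = sum(f * (cm - assumed) / c
                for cm, f in zip(class_marks(class_limits), fs))
    return assumed + total / n * c


print(mean_direct(classes, persons))
print(mean_step_deviation(classes, persons, assumed=30, c=10))
```

Dividing the deviations by the class width keeps the intermediate numbers small, which is the main reason the step deviation method is preferred for hand calculation.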
Classes 10 - 15 15 - 20 20 - 25 25 - 30 30 - 35
Frequencies 5 6 7 7 5
2) The algebraic sum of the deviations of the given values from the arithmetic mean
is equal to zero. Mathematically,
$\sum (x_i - \bar{x}) = 0$ … for ungrouped data
$\sum f_i (x_i - \bar{x}) = 0$ … for grouped data
$\sum f_i (x_i - \bar{x})^2 < \sum f_i (x_i - A)^2$ … for grouped data, where A is different from the
mean.
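4) Suppose the mean of the values x1, x2, …, xn is $\bar{x}_0$. Then
i. if a constant k is added to each xi, then the new mean $\bar{x}_n$ will be $\bar{x}_0 + k$.
Proof: The arithmetic mean of x1 + k, x2 + k, …, xn + k is
$$A.M = \frac{\sum_{i=1}^{n} (x_i + k)}{n} = \frac{(x_1 + k) + (x_2 + k) + \dots + (x_n + k)}{n}$$
$$A.M = \frac{(x_1 + x_2 + \dots + x_n) + (k + k + \dots + k)}{n} = \frac{x_1 + x_2 + \dots + x_n}{n} + \frac{nk}{n}$$
$$A.M = \bar{x}_0 + k$$
ii. if each value is multiplied by a constant k, then the new mean will be $k\bar{x}_0$.
Proof: The A.M of kx1, kx2, …, kxn is
$$A.M = \frac{\sum_{i=1}^{n} k x_i}{n} = \frac{kx_1 + kx_2 + \dots + kx_n}{n} = \frac{k(x_1 + x_2 + \dots + x_n)}{n}$$
$$A.M = k\bar{x}_0$$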
Example: Given data 12, 10, 8, 6, 16, 7, 11. If each item is multiplied by 4/5 and 8 is
added, what will be the new mean?
$$\bar{x}_0 = \frac{\sum_{i=1}^{7} x_i}{7} = \frac{70}{7} = 10$$
New mean: $\bar{x}_n = \frac{4}{5}\bar{x}_0 + 8 = \frac{4}{5}(10) + 8 = 16$
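The property used in this example (multiplying every value by a constant and adding a constant changes the mean in the same way) can be checked with a short Python sketch. Variable names are illustrative.

```python
data = [12, 10, 8, 6, 16, 7, 11]
k, c = 4 / 5, 8                      # multiply each item by 4/5, then add 8

old_mean = sum(data) / len(data)

# New mean obtained from the property: k * old mean + c
new_mean_by_property = k * old_mean + c

# New mean obtained the long way: transform every item, then average
transformed = [k * x + c for x in data]
new_mean_direct = sum(transformed) / len(transformed)

print(old_mean, new_mean_by_property, round(new_mean_direct, 10))
```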
CYP 3 Given data 3, 8, 9, 4, 7, 5, 10, 11, 6 if each item is multiplied by 2 and 6 is
added, then
i. The new mean will be _______________
ii. xi x __________________
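For n groups, $\bar{x}_c = \frac{N_1 \bar{x}_1 + N_2 \bar{x}_2 + \dots + N_n \bar{x}_n}{N_1 + N_2 + \dots + N_n} = \frac{\sum_{i=1}^{n} N_i \bar{x}_i}{\sum_{i=1}^{n} N_i}$
Example: The mean heights of 25 males and 20 females are 161.0 cm and 155.6 cm respectively. What
will be the combined mean height?
$\bar{x}_M$ = 161.0 cm, $\bar{x}_F$ = 155.6 cm, $N_M$ = 25, $N_F$ = 20
$$\bar{x}_c = \frac{\bar{x}_M N_M + \bar{x}_F N_F}{N_M + N_F} = \frac{161.0 \times 25 + 155.6 \times 20}{25 + 20} = \frac{7137}{45} = 158.60 \text{ cm}$$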
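A minimal Python sketch of the combined mean follows, using the height example above. The function name `combined_mean` is an illustrative assumption.

```python
def combined_mean(means, sizes):
    """Combined mean of several groups: sum(N_i * mean_i) / sum(N_i)."""
    return sum(n * m for m, n in zip(means, sizes)) / sum(sizes)


# Mean heights (cm) and group sizes for males and females from the example.
height = combined_mean(means=[161.0, 155.6], sizes=[25, 20])
print(round(height, 2))
```

The combined mean is not the simple average of the group means unless the groups are of equal size; here the simple average would be 158.3 cm rather than 158.6 cm.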
CYP 4 In a factory, 120 workers get an average wage of birr 30 a day, 160 workers get
Birr 50 a day, 80 workers get Birr 60 a day and 40 workers get birr 80 a day.
Find
i. the average of averages.
ii. the general average.
This relative importance is technically known as weight. In cases where the relative
importance of the different items is not the same, we compute the weighted arithmetic mean.
If w1, w2, …, wn are weights attached to the values x1, x2, … , xn respectively, then the
weighted AM is defined as
$$\bar{x}_w = \frac{x_1 w_1 + x_2 w_2 + \dots + x_n w_n}{w_1 + w_2 + \dots + w_n} = \frac{\sum wx}{\sum w}$$
Example: An auto ride costs Birr 5 for the first km, Birr 4 for the next 3kms and Birr 9
for each of the subsequent kms. Find the average cost per km for 10 kms.
$$\bar{x}_w = \frac{\sum xw}{\sum w} = \frac{71.00}{10} = 7.10 \text{ Birr}$$
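The weighted mean in this example can be sketched in Python. The sketch assumes, as the total of 71.00 implies, that the rate of Birr 4 applies to each of the next 3 km and that the remaining 6 km are charged at Birr 9 each; the function name `weighted_mean` is illustrative.

```python
def weighted_mean(values, weights):
    """Weighted arithmetic mean: sum(w * x) / sum(w)."""
    return sum(w * x for x, w in zip(values, weights)) / sum(weights)


rates = [5, 4, 9]   # cost per km (Birr) in each stage of the ride
kms = [1, 3, 6]     # number of km travelled at each rate (the weights)

average_cost = weighted_mean(rates, kms)
print(round(average_cost, 2))
```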
Examples:
1. The average mark of 100 students was found to be 40, but later it was discovered that a
score of 33 was misread as 83. Find the correct average corresponding to the correct
sum.
$\bar{x} = \frac{\sum x_i}{N} = 40$, so $\sum x_i = \bar{x} \times N = 40 \times 100 = 4000$ … wrong sum
Wrong Entry = 83
Correct Entry = 33
Correct Mean $= \frac{4000 - 83 + 33}{100} = \frac{3950}{100} = 39.5$
2. The average age of a class of 35 pupils is 14 years. When the age of the class
teacher is added to the sum of the ages of the pupils, the average rises by 0.5 year.
What must be the age of the teacher?
$\bar{x} = \frac{\sum x_i}{N} = 14$ with N = 35, so $\sum x_i = 14 \times 35 = 490$ … Sum of ages of the pupils
$\bar{x} = 14.5$ with N = 36, so $\sum x_i = 14.5 \times 36 = 522$ … Sum of ages of the pupils and the teacher
Age of the teacher is 522 - 490 = 32 years.
3. Goals scored by a football team in successive matches are 5, 2, 4, 3, 6, 0, 4 and 6. How many goals must the team score in the next match for the average to come to 4 goals per match?
Total goals scored in 8 matches = 5 + 2 + 4 + 3 + 6 + 0 + 4 + 6 = 30
Total goals required in 9 matches = $\bar{x} \cdot N$ = 4 × 9 = 36
Hence the goals required in the 9th match to bring the average to 4 = 36 – 30 = 6
CYP6 The mean of 200 items is 50. Later on it is discovered that two items were
wrongly taken as 92 and 8 instead of 192 and 88. Find out the correct mean.
CYP7 The average rainfall for a week, excluding Sunday, was 10cm. Due to heavy
rainfall on Sunday, the average rainfall for the week rose to 15cm. How much
rainfall was there on Sunday?
Symbolically, let x1, x2, … , xn be the n values of a variable x; then their G.M is defined as
$$\text{G.M} = \sqrt[n]{x_1 \cdot x_2 \cdots x_n}$$
If the number of observations is three or more, computing the nth root directly is very tedious. To simplify the computation, logarithms are used. In terms of logarithms:
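$$\log \text{G.M} = \log \sqrt[n]{x_1 \cdot x_2 \cdots x_n} = \log\,(x_1 \cdot x_2 \cdots x_n)^{1/n} = \frac{1}{n}\,\log\,(x_1 \cdot x_2 \cdots x_n)$$
$$= \frac{1}{n}\left(\log x_1 + \log x_2 + \cdots + \log x_n\right) = \frac{1}{n}\sum_{i=1}^{n} \log x_i$$
$$\text{Antilog}\,(\log \text{G.M}) = \text{Antilog}\left[\frac{1}{n}\sum_{i=1}^{n} \log x_i\right]$$
$$\text{G.M} = \text{Antilog}\left[\frac{1}{n}\sum_{i=1}^{n} \log x_i\right]$$
For grouped data:
$$\text{G.M} = \text{Antilog}\left[\frac{1}{n}\sum f_i \log x_i\right], \quad \text{where } n = \sum f_i$$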
Log x
i 1
i 9.4021 n = 5
1
G.M Anti log 9.4021 Anti log 1.5670 36.9
5
ii. x:          10      16      22      28      34
    f:          5       4       3       6       2
    Log x:      1.0000  1.2041  1.3424  1.4472  1.5315
    fi Log xi:  5.0000  4.8164  4.0272  8.6832  3.0630
$$\sum f_i \log x_i = 25.5898, \quad n = \sum f_i = 20$$
$$\text{G.M} = \text{Antilog}\left[\frac{1}{20}(25.5898)\right] = \text{Antilog}\,(1.2795) = 19.03$$
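iii. Classes:      30 – 40  40 – 50  50 – 60  60 – 70
     fi :          5        8        4        3
     CMi :         35       45       55       65
     Log CMi :     1.5441   1.6532   1.7404   1.8129
     fi Log CMi :  7.7205   13.2256  6.9616   5.4387
$$\sum f_i \log CM_i = 33.3464, \quad n = \sum f_i = 20$$
$$\text{G.M} = \text{Antilog}\left[\frac{1}{20}(33.3464)\right] = \text{Antilog}\,(1.6673) = 46.48$$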
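The log-based procedure can be reproduced without log tables. The following is a minimal Python sketch, assuming base-10 logarithms as in the worked examples; the function name `geometric_mean` is an illustrative choice, and frequencies default to 1 for ungrouped data.

```python
import math


def geometric_mean(values, freqs=None):
    """G.M = antilog( sum(f * log10(x)) / sum(f) )."""
    if freqs is None:
        freqs = [1] * len(values)
    if any(x <= 0 for x in values):
        raise ValueError("the geometric mean needs strictly positive values")
    n = sum(freqs)
    mean_log = sum(f * math.log10(x) for x, f in zip(values, freqs)) / n
    return 10 ** mean_log


# Discrete series (x, f) and class marks (CM, f) from examples ii and iii.
print(geometric_mean([10, 16, 22, 28, 34], [5, 4, 3, 6, 2]))
print(geometric_mean([35, 45, 55, 65], [5, 8, 4, 3]))
```

Because the code uses full-precision logarithms rather than four-figure tables, the last digit may differ slightly from a hand calculation.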
CYP8 Calculate GM for the following data.
i. x: 8 40 175 1209 2000
ii. x: 2 3 4 5 6
f: 5 7 8 3 2
iii. Classes: 0 – 10 10 – 20 20 – 30 30 – 40 40 – 50 50 – 60
fi : 2 5 6 18 13 6
6.5.1 Computations of Harmonic Mean for Ungrouped and Grouped Data
For ungrouped data:
$$\text{H.M} = \frac{n}{\sum_{i=1}^{n} \frac{1}{x_i}}$$
For grouped data:
$$\text{H.M} = \frac{n}{\sum \frac{f_i}{x_i}}, \quad \text{where } n = \sum f_i$$
Solution:
i. For the values 20 and 30:
$$\text{H.M} = \frac{2}{\frac{1}{20} + \frac{1}{30}} = \frac{2}{\frac{5}{60}} = \frac{120}{5} = 24$$
ii.
x    2    3     4  5    6
f    5    7     8  3    2
f/x  2.5  2.33  2  0.6  0.33
$$\sum f_i = 25, \quad \sum \frac{f_i}{x_i} = 7.76, \quad \text{H.M} = \frac{25}{7.76} = 3.22$$
iii.
Classes 20 - 24 25 - 29 30 - 34 35 - 39 40 - 44 45 - 49 50 - 54
fi 11 18 32 37 21 47 13
Cmi 22 27 32 37 42 47 52
fi/Cmi 0.5 0.67 1 1 0.5 1 0.25
$$\sum f_i = 179, \quad \sum \frac{f_i}{CM_i} = 4.92, \quad \text{H.M} = \frac{179}{4.92} = 36.38$$
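The harmonic mean formulas for ungrouped and grouped data can be sketched in a few lines of Python. This is an illustrative helper, not part of the original module; the name `harmonic_mean` is an assumption, and frequencies default to 1 for ungrouped data.

```python
def harmonic_mean(values, freqs=None):
    """H.M = sum(f) / sum(f / x); with freqs omitted every f is 1."""
    if freqs is None:
        freqs = [1] * len(values)
    if any(x <= 0 for x in values):
        raise ValueError("the harmonic mean needs strictly positive values")
    return sum(freqs) / sum(f / x for x, f in zip(values, freqs))


print(harmonic_mean([20, 30]))                          # ungrouped data
print(harmonic_mean([2, 3, 4, 5, 6], [5, 7, 8, 3, 2]))  # discrete series
class_marks = [22, 27, 32, 37, 42, 47, 52]
print(harmonic_mean(class_marks, [11, 18, 32, 37, 21, 47, 13]))
```

The code keeps the reciprocals unrounded, so the grouped results can differ in the second decimal place from a hand calculation that rounds each f/x to two decimals.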
ii.
Marks 40 50 60 70
No. of Students 20 30 50 10
iii.
Classes 10 - 20 20 - 30 30 - 40 40 - 50 50 - 60 60 - 70
fi 4 6 10 12 5 3
6.6.1 Advantages:
All three means (AM, GM and HM) are
i. rigidly defined.
ii. based on all the observations.
iii. suitable for further mathematical treatment.
AM iv. is easy to calculate and understand.
v. is least affected by fluctuations of sampling compared to other averages.
GM iv. gives greater weight to smaller values and smaller weight to larger values.
v. is a proper average for measuring relative change (such as the percentage increase in population or in sales over a period of time).
HM iv. is not affected very much by fluctuations of sampling.
v. is particularly useful in averaging speeds and special types of rates and ratios where a time factor is involved.
vi. gives greater weight to smaller values, since the reciprocals of the values are involved.
6.6.2 Application Problems
1. Prove that
i. AM = GM = HM if all the values are equal in a series.
ii. HM < GM < AM if the values are different in a series.
Solution:
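i. Suppose there are two items x and y in the series. Then
$$\text{AM} = \frac{x + y}{2}, \quad \text{GM} = \sqrt{xy}, \quad \text{HM} = \frac{2}{\frac{1}{x} + \frac{1}{y}}$$
If x = y = 7, then
$$\text{AM} = \frac{7 + 7}{2} = 7, \quad \text{GM} = \sqrt{7 \times 7} = 7, \quad \text{HM} = \frac{2}{\frac{1}{7} + \frac{1}{7}} = \frac{2 \times 7}{2} = 7$$
Therefore, AM = GM = HM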
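ii. Suppose there are two positive items x and y in the series. Then
$$\text{AM} = \frac{x + y}{2}, \quad \text{GM} = \sqrt{xy}$$
If x ≠ y, then $\sqrt{x} - \sqrt{y} \neq 0$, so
$$(\sqrt{x} - \sqrt{y})^2 > 0$$
$$x + y - 2\sqrt{xy} > 0$$
$$x + y > 2\sqrt{xy}$$
$$\frac{x + y}{2} > \sqrt{xy}$$
$$\text{AM} > \text{GM}$$
Now consider $\sqrt{xy} < \frac{x + y}{2}$, which is proved above. Multiplying both sides by $\frac{2\sqrt{xy}}{x + y}$, we get
$$\frac{2xy}{x + y} < \sqrt{xy}$$
Since
$$\frac{2xy}{x + y} = \frac{2}{\frac{x + y}{xy}} = \frac{2}{\frac{1}{x} + \frac{1}{y}} = \text{HM},$$
it follows that
$$\text{HM} < \text{GM}$$
Therefore, HM < GM < AM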
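Note: We can have the following relationship between the three means.
$$\text{AM} \cdot \text{HM} = \frac{x + y}{2} \cdot \frac{2xy}{x + y} = xy$$
To equate AM · HM with GM, we put AM · HM under a square root:
$$\text{GM} = \sqrt{xy} = \sqrt{\frac{x + y}{2} \cdot \frac{2xy}{x + y}} = \sqrt{\text{AM} \cdot \text{HM}}$$
Thus $\text{GM} = \sqrt{\text{AM} \cdot \text{HM}}$ if there are only two positive observations in the series.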
2. The price of a commodity increased by 5% from 1979 to 1980, by 9% from 1980 to
1981 and by 73% from 1981 to 1982. The average increase from 1979 to 1982 is
quoted as 25.6% and not 29%. Verify.
Solution:
Year    Price at the end of the year, taking the preceding year as 100% (X)    Log X
1980    100 + 5 = 105      2.0212
1981    100 + 9 = 109      2.0374
1982    100 + 73 = 173     2.2380
Total                      6.2966
AM = (5 + 9 + 73)/3 = 87/3 = 29
GM = Antilog[1/3(6.2966)] = Antilog(2.0989) = 125.6
Therefore, the average rise in price is 125.6 - 100 = 25.6%
Verification:
Year    Rise    Actual price    Price at 25.6% growth    Price at 29% growth
1979            100             100                      100
1980    5%      105             125.6                    129
1981    9%      114.45          157.75                   166.41
1982    73%     198             198                      214.67
Thus GM is the best average to give us the true rise in price.
3. World population has increased from 5 billion to 6 billion within 12 years. Calculate the average rate of increase per year.
Solution:
The average annual rate of increase is computed by applying the formula
$$P_n = P_o (1 + r)^n \quad \text{or} \quad r = \sqrt[n]{\frac{P_n}{P_o}} - 1$$
Where Pn - the amount at the end of the period
Po - the amount at the beginning of the period
n - time (years)
r - rate of change
Pn = 6, Po = 5, n = 12, r = ?
$$r = \sqrt[12]{\frac{6}{5}} - 1 = 1.0153 - 1 = 0.0153$$
The average increase per annum is about 1.5%.
Therefore, GM is used in determining the average percentage change in an amount.
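The growth-rate formula is easy to check numerically. The following is a minimal Python sketch under the same compound-growth assumption, Pn = Po(1 + r)^n; the function name `average_growth_rate` is illustrative.

```python
def average_growth_rate(start, end, periods):
    """Solve end = start * (1 + r) ** periods for r."""
    if start <= 0 or end <= 0 or periods <= 0:
        raise ValueError("start, end and periods must all be positive")
    return (end / start) ** (1 / periods) - 1


r = average_growth_rate(5, 6, 12)
print(f"average rate per year: {r:.4f}")
# Compounding r over the 12 years should reproduce the final population.
print(f"check: {5 * (1 + r) ** 12:.4f}")
```

The second print statement is a self-check: growing the starting amount at the computed rate for n periods should give back the final amount.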
4. A machine depreciates by 40% in the first year, by 25% in the second year and by 10% per annum for the next three years. Each percentage being calculated on the diminishing value, what is the average percentage of depreciation for the entire period?
Solution:
Depreciation (%)    After depreciation (%) = X    Log X
40%                 60%                           1.7782
25%                 75%                           1.8751
10%                 90%                           1.9542
10%                 90%                           1.9542
10%                 90%                           1.9542
Total                                             9.5159
GM = Antilog [1/5(9.5159)]
= Antilog (1.9032)
≈ 80
Rate of depreciation per annum is 100 - 80 = 20%
5. The weighted GM of 5 numbers 10, 15, 25, 12 and 20 is 17.15. If the weights of the
first four numbers are 2, 3, 5 and 2 respectively, find out the weight of the fifth
number.
Solution:
X      W        Log X     (Log X).W
10     2        1.0000    2.0000
15     3        1.1761    3.5283
25     5        1.3979    6.9895
12     2        1.0792    2.1584
20     x        1.3010    1.3010x
Total  12 + x             14.6762 + 1.3010x
$$\log 17.15 = \frac{14.6762 + 1.3010x}{12 + x}$$
$$1.2343 = \frac{14.6762 + 1.3010x}{12 + x}$$
$$14.8116 + 1.2343x = 14.6762 + 1.3010x$$
$$-0.0667x = -0.1354$$
$$x = 2.03$$
The missing weight is 2.
6. A cyclist pedals from his house to his college at a speed of 8 kmph and back from the
college to home at 12 kmph. Find the average speed.
Solution:
Let the distance between the house and the college be x km. Then the distance from house to college is covered in x/8 hrs and from college to house in x/12 hrs, and the total distance 2x (house to college and back) is covered in (x/8 + x/12) hrs.
$$\text{Average Speed} = \frac{\text{Total distance traveled}}{\text{Time taken}} = \frac{2x}{\frac{x}{8} + \frac{x}{12}} = \frac{2x}{\frac{5x}{24}} = \frac{48x}{5x} = 9.60 \text{ kmph}$$
7. Mr. Raga traveled a distance of 900 kms by train at an average speed of 60 kmph, 200
km by boat at speed of 20 kmph, 1000 km by plane at 800 kmph speed and finally 4
km by taxi at 25 kmph speed. What is the average speed for the entire distance?
Solution:
X (speed, kmph)    W (distance, km)    W/X (time, hrs)
60                 900                 15.00
20                 200                 10.00
800                1000                1.25
25                 4                   0.16
Total              2104                26.41
$$\text{Weighted HM} = \frac{\sum W}{\sum (W/X)} = \frac{2104}{26.41} = 79.67 \text{ kmph}$$
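The weighted harmonic mean used for average speed is simply total distance divided by total time. A minimal Python sketch follows; the function name `average_speed` and the list-of-pairs input format are illustrative assumptions.

```python
def average_speed(legs):
    """legs: list of (distance_km, speed_kmph) pairs.

    Returns total distance / total time, i.e. the harmonic mean of the
    speeds weighted by the distances.
    """
    total_distance = sum(distance for distance, _ in legs)
    total_time = sum(distance / speed for distance, speed in legs)
    return total_distance / total_time


# Mr. Raga's journey: train, boat, plane and taxi.
journey = [(900, 60), (200, 20), (1000, 800), (4, 25)]
print(round(average_speed(journey), 2))

# The cyclist: equal distances at 8 kmph and 12 kmph (any common distance works).
print(average_speed([(1, 8), (1, 12)]))
```

When every leg has the same length, the distances cancel and the result reduces to the simple harmonic mean of the speeds.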
CYP 10 If the arithmetic mean and the geometric mean of two items is 12.5 and 10
respectively, then
i. find the HM of the two items.
ii. find the value of the two items.
CYP 11 A motorist travels at a uniform speed of 20 kmph, 60 kmph and 30 kmph from
A to B, B to C and C to D respectively. Find the average speed.
CYP 12 In a factory, a unit of work is completed by A in 5 minutes, by B in 7 minutes,
by C in 4 minutes, by D in 8 minutes and by E in 6 minutes.
i. What is their average rate of work?
ii. What is the average number of units of work completed per minute?
iii. At this rate, how many units of work will they complete in six hours a
day?
CYP 13 Find the average rate of increase in Population which in the first decade had
increased by 20%, in the next by 30% and in the third by 40%.
6.7. SUMMARY
The arithmetic mean is the average most widely used in practice in all areas, because its value represents all the items of the variable.
Geometric mean is widely used in averaging ratios and percentages and in computing
average rates of increase or decrease.
Harmonic mean is useful in comparing the values of a variable with constant quantity of
another variable, i.e. time, rate, distance covered, quantities purchased or sold per unit
etc.
3. Find the class intervals if the AM of the following distribution is 30.1 and
assumed mean is 31.5.
4. The mean weight of 150 students of a class is 60kgs. The mean weight of boys is
70kgs and that of girls is 55kgs. Find the number of boys and girls in the class.
5. The price of a commodity increased by 20% in 1989, decreased by 12% in 1990
and increased by 15% in 1991. Calculate the average annual change in price.
6. If the price of a commodity triples in a period of 6 years, what is the average
percentage increase per year?
7. A train runs the first 40 kms at a speed of 60 kmph, the next 60 kms at a speed of
80 kmph and the last 80 kms at a speed of 100 kmph. What is the average speed
of the train for the whole journey?
8. If the GM of two positive observations is 2/3 of their AM and the sum of the two
observations is 18, then
i. their HM is ____________
ii. the two observations are _________ and _________
6.9. ANSWERS TO CHECK YOUR PROGRESS QUESTIONS
CYP 3 i. 20 ii. 0
CYP 4 i. 55 ii. 49
CYP 6 50.9
CYP 7 45cm.
CYP 11 30 kmph.
CYP 12 i. 5.65 minutes. ii. 0.177 units of work / minute. iii. 63.72 units of
work.
CYP 13 i. 29.7%
6.10. GLOSSARY
Assumed mean - Refers to an estimated or approximate value for the arithmetic mean
or average which is used to simplify its calculation. The nearer it is to
the mean, the smaller are the numbers involved.
Class Interval - The range of interval between the highest and lowest values allowed
in a particular class.
Depreciate - Make or become less in value (being diminished in value).
Deviation - Refers to the difference between a value of a variable and the mean of
its distribution.
Rate - Refers to standard of reckoning, obtained by bringing two numbers or
amounts into relationship like a period of time and a number of people,
Currencies, Tax, etc.
Time Series - Refers to a set of values of a variable recorded over a period of time.
6.11. REFERENCES
UNIT 7: POSITIONAL MEASURES OF CENTRAL TENDENCY
CONTENTS:
7.0 Aims and Objectives
7.1 Introduction
7.2 Mode
7.3 Median
7.4 Quartiles, Deciles and Percentiles for Grouped Data
7.5 Summary
7.6 Model Examination Questions
7.7 Answers to Check Your Progress Questions
7.8 Glossary
7.9 References
7.1 INTRODUCTION
The mode and median are called positional measures of central tendency. The term
position refers to the place of a value in the series. The values that divide a series into a number
of equal parts are called partition values. Besides the median, which divides a series into
two equal parts, the quartiles, deciles and percentiles are important partition values.
7.2 MODE
Importance:
1. Like the median, the mode can be used as a measure of central location for qualitative as well as quantitative data. For example, if a beauty rating yields three impressions or scores, ‘very beautiful’, ‘beautiful’ and ‘not beautiful’, and ‘beautiful’ is the most frequent rating, then the modal value is ‘beautiful’.
2. Unlike the mean, the mode is not affected by extreme values.
3. Mode can be used when one or more of the classes are open-ended.
For grouped data: Discrete Series: Mode x̂ = the value of the variable corresponding to the maximum frequency.
Continuous Series: The class corresponding to the maximum frequency is called the modal class. The value of the mode is obtained by the following interpolation formula.
$$\text{Mode } \hat{x} = l + \frac{f_1 - f_0}{(f_1 - f_0) + (f_1 - f_2)} \times c$$
or
$$\text{Mode } \hat{x} = l + \frac{\Delta_1}{\Delta_1 + \Delta_2} \times c$$
Where l – LCB (lower class boundary) of the modal class
f1 – maximum frequency
f0 – frequency preceding f1
f2 – frequency succeeding f1
c – magnitude of the class
∆1 = f1 – f0
∆2 = f1 – f2
iii.
Classes 0-9 10 - 19 20 - 29 30 - 39 40 - 49 50 - 59 60 - 69 70 - 79
fi 328 350 720 664 598 524 378 244
Solution:
i. Mode = value which occurs most often
Mode = 25
ii. Mode = Value of the variable with maximum frequency
Mode = 40
iii. Modal Class = 19.5 - 29.5
l = 19.5, f0 = 350, f1 = 720, f2 = 664, c = 10
$$\text{Mode } \hat{x} = l + \frac{f_1 - f_0}{(f_1 - f_0) + (f_1 - f_2)} \times c = 19.5 + \frac{720 - 350}{(720 - 350) + (720 - 664)} \times 10 = 19.5 + \frac{3700}{426} = 28.1854$$
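The interpolation formula for the mode of a continuous series can be sketched in Python as follows. The function name `grouped_mode` is an illustrative assumption, equal class widths are assumed, and when several classes share the maximum frequency the sketch simply takes the first one.

```python
def grouped_mode(lower_boundaries, freqs, width):
    """Mode = l + (f1 - f0) / ((f1 - f0) + (f1 - f2)) * c for the modal class."""
    k = max(range(len(freqs)), key=lambda i: freqs[i])  # first class with max frequency
    f1 = freqs[k]
    f0 = freqs[k - 1] if k > 0 else 0                # frequency preceding f1
    f2 = freqs[k + 1] if k < len(freqs) - 1 else 0   # frequency succeeding f1
    delta1, delta2 = f1 - f0, f1 - f2
    if delta1 + delta2 == 0:
        raise ValueError("all frequencies are equal; the mode is not defined")
    return lower_boundaries[k] + delta1 / (delta1 + delta2) * width


# Classes 0-9, 10-19, ..., 70-79 have lower class boundaries -0.5, 9.5, ..., 69.5.
boundaries = [-0.5 + 10 * i for i in range(8)]
freqs = [328, 350, 720, 664, 598, 524, 378, 244]
print(round(grouped_mode(boundaries, freqs, 10), 4))
```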
7.3. MEDIAN
The median is that value of the variable, which divides the group in to two equal parts,
one part comprising all the values greater and the other all the values less than median.
Or median can be defined as the middle value of a set of data values when they are
arranged in ascending or descending order.
Importance:
In dealing with qualitative data, the median is a more suitable average.
The median is recommended if the distribution has unequal classes, since it is simpler
to compute than the mean.
The median is especially useful in the case of open-ended classes, since it is a positional
rather than a calculated average.
The magnitudes of extreme deviations do not influence the median.
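Computation of Median for Ungrouped and Grouped Data
For ungrouped data:
First, rearrange the values in order of magnitude.
Then apply the following formula.
$$\text{Median } \tilde{x} = \text{value of the } \left(\frac{N + 1}{2}\right)^{\text{th}} \text{ item} = x_{\frac{n + 1}{2}} \quad \text{(where n is odd)}$$
$$\text{Median } \tilde{x} = \frac{1}{2}\left[\text{value of the } \left(\frac{N}{2}\right)^{\text{th}} \text{ item} + \text{value of the } \left(\frac{N}{2} + 1\right)^{\text{th}} \text{ item}\right] = \frac{1}{2}\left(x_{\frac{n}{2}} + x_{\frac{n}{2} + 1}\right) \quad \text{(where n is even)}$$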
b) 8, 5, 2, 6, 15, 10, 25
ii.
x 4 6 8 10 12 14 16
f 2 4 5 3 2 4 1
iii.
Solution:
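i. a. Rearranging: 12 16 18 23 25 25 27 27 28 33 33 42
n = 12 … even
$$\tilde{x} = \frac{1}{2}\left(x_{\frac{n}{2}} + x_{\frac{n}{2} + 1}\right) = \frac{1}{2}(x_6 + x_7) = \frac{1}{2}(25 + 27) = 26$$
b. Rearranging: 2 5 6 8 10 15 25
n = 7 … odd
$$\tilde{x} = x_{\frac{n + 1}{2}} = x_4 = 8$$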
ii.
x 4 6 8 10 12 14 16
f 2 4 5 3 2 4 1
<cfi 2 6 11 14 16 20 21
n = 21
Median = the value of the ((N + 1)/2)th item
= the value of the ((21 + 1)/2)th item
= the value of the 11th item
= 8
iii.
x 50 - 60 60 - 70 70 - 80 80 - 90 90 - 100 100 - 110
fi 20 21 50 40 53 16
<cfi 20 41 91 131 184 200
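Median class = the class containing the (n/2)th item = 100th item, i.e. 80 - 90
l = 80, c = 10, f = 40, c.f = 91
$$\text{Median } \tilde{x} = l + \frac{c}{f}\left(\frac{n}{2} - c.f\right) = 80 + \frac{10}{40}(100 - 91) = 80 + \frac{9}{4} = 82.25$$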
ii.
x 28 30 32 34 36 38 40 42
f 14 15 16 24 16 10 6 4
iii.
x 30 - 34 35 - 39 40 - 44 45 - 49 50 - 54 55 - 59
fi 5 10 15 20 6 4
Draw both the more than and less than ogives on the same graph. From the point of
intersection of these two curves, draw a perpendicular line to the x – axis. The foot of the
perpendicular line is the value of the median.
Classes 0 - 20 20 - 40 40 - 60 60 - 80 80 - 100
fi 15 25 30 14 16
Solution:
Classes 0 - 20 20 - 40 40 - 60 60 - 80 80 - 100
fi 15 25 30 14 16
<cfi 15 40 70 84 100
>cfi 100 85 60 30 16
[Figure: The less-than (<cfi) and more-than (>cfi) ogives. Horizontal axis: class boundaries (CBi); vertical axis: cumulative frequency (CFi).]
The perpendicular line drawn from the intersection point meets the x-axis approximately
at 46. Therefore, the Median of the distribution is 46.
Importance:
The quartiles are more widely used in Economics and Business, while the deciles and percentiles are important in Psychology and Educational Statistics concerning grades, rates, ranks, etc. The working principle for computing a partition value is basically the same as that of computing the median.
$$Q_i = l + \frac{c}{f}\left(\frac{iN}{4} - c.f\right)$$
$$D_i = l + \frac{c}{f}\left(\frac{iN}{10} - c.f\right)$$
$$P_i = l + \frac{c}{f}\left(\frac{iN}{100} - c.f\right)$$
Example: For the data given below, compute the value of Quartiles, D3, D7, P15 and P88
and interpret.
Solution:
Q1 – size of the (N/4)th item = 25th item, so 10 – 20 is the quartile class
l = 10, c = 10, f = 15, c.f = 10
$$Q_1 = l + \frac{c}{f}\left(\frac{N}{4} - c.f\right) = 10 + \frac{10}{15}(25 - 10) = 20$$
The marks of 25% of the students are less than 20.
Q2 – size of the (2N/4)th item = 50th item, so 20 – 40 is the quartile class
l = 20, c = 20, f = 25, c.f = 25
$$Q_2 = l + \frac{c}{f}\left(\frac{2N}{4} - c.f\right) = 20 + \frac{20}{25}(50 - 25) = 40$$
The marks of half of the students are below 40.
Q3 – size of the (3N/4)th item = 75th item, so 40 – 60 is the quartile class
l = 40, c = 20, f = 30, c.f = 50
$$Q_3 = l + \frac{c}{f}\left(\frac{3N}{4} - c.f\right) = 40 + \frac{20}{30}(75 - 50) = 56.67$$
The marks of three quarters of the students are below 56.67.
D3 – size of the (3N/10)th item = 30th item, so 20 – 40 is the decile class
l = 20, c = 20, f = 25, c.f = 25
$$D_3 = l + \frac{c}{f}\left(\frac{3N}{10} - c.f\right) = 20 + \frac{20}{25}(30 - 25) = 24$$
The marks of 30% of the students are below 24.
D7 – size of the (7N/10)th item = 70th item, so 40 – 60 is the decile class
l = 40, c = 20, f = 30, c.f = 50
$$D_7 = l + \frac{c}{f}\left(\frac{7N}{10} - c.f\right) = 40 + \frac{20}{30}(70 - 50) = 53.33$$
The marks of 70% of the students are below 53.33.
P15 – size of the (15N/100)th item = 15th item, so 10 – 20 is the percentile class
l = 10, c = 10, f = 15, c.f = 10
$$P_{15} = l + \frac{c}{f}\left(\frac{15N}{100} - c.f\right) = 10 + \frac{10}{15}(15 - 10) = 13.3$$
The marks of 15% of the students are below 13.3.
P88 – size of the (88N/100)th item = 88th item, so 60 – 80 is the percentile class
l = 60, c = 20, f = 14, c.f = 80
$$P_{88} = l + \frac{c}{f}\left(\frac{88N}{100} - c.f\right) = 60 + \frac{20}{14}(88 - 80) = 71.43$$
The marks of 88% of the students are below 71.43.
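Since quartiles, deciles and percentiles share one working principle, a single helper can compute all of them. The sketch below is illustrative Python, not part of the original module. The data table for this example is not reproduced in the text, so the class boundaries and frequencies are inferred from the values of l, c, f and c.f quoted in the solution; the last class (80 – 100 with frequency 6) is an assumption chosen so that N = 100.

```python
def grouped_quantile(boundaries, freqs, i, parts):
    """Partition value l + (c / f) * (i * N / parts - c.f).

    boundaries holds one more entry than freqs (class k runs from
    boundaries[k] to boundaries[k + 1]); parts is 4 for quartiles,
    10 for deciles and 100 for percentiles.
    """
    n = sum(freqs)
    target = i * n / parts
    cumulative = 0  # c.f of the classes before the current one
    for k, f in enumerate(freqs):
        if f > 0 and cumulative + f >= target:
            lower, upper = boundaries[k], boundaries[k + 1]
            return lower + (upper - lower) / f * (target - cumulative)
        cumulative += f
    raise ValueError("i must lie between 0 and parts")


# Inferred (hypothetical) table: classes 0-10, 10-20, 20-40, 40-60, 60-80, 80-100.
boundaries = [0, 10, 20, 40, 60, 80, 100]
freqs = [10, 15, 25, 30, 14, 6]
for label, i, parts in [("Q1", 1, 4), ("Q2", 2, 4), ("Q3", 3, 4),
                        ("D3", 3, 10), ("D7", 7, 10),
                        ("P15", 15, 100), ("P88", 88, 100)]:
    print(label, round(grouped_quantile(boundaries, freqs, i, parts), 2))
```

Calling the helper with i = 1 and parts = 2 gives the median, which shows that the median is just the middle partition value.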
CYP 16 Compute the value of Quartiles, D4, P69 and interpret for the data given below.
i. 46 35 28 52 54 43 35 49 46 50 41
ii.
Daily Wages 40 45 50 55 60 65 70
No. of Workers 9 22 26 18 13 8 5
iii.
Rent in Birr     150-250 250-350 350-450 450-550 550-650 650-750 750-850 850-950
No. of Houses    8       10      15      25      40      20      15      7
7.5 SUMMARY
The arithmetic mean and median satisfy the conditions of definition and stability. The median
has a distinct merit over the mean insofar as it is easy to calculate. The mode can be located just by
inspection. If every value occurs the same number of times, the mode is a useless
measure. The median, quartiles, deciles and percentiles are closely related, since all of them
are partition values.
Weight in Kgs 30 - 40 40 - 50 50 - 60 60 - 70 70 - 80 80 - 90
No. of People 18 37 45 27 15 8
3. For the data given below, find the missing frequencies if median is 37 and mode
is 43 million birr.
Fund raised in millions of birr    0 - 10 10 - 20 20 - 30 30 - 40 40 - 50 50 - 60
No. of NGO’s                       3      F2      16      20      F2      16
Marks 31 - 39 41 - 49 51 - 59 61 - 69 71 - 79 81 - 89 91 - 99
No. of Students 12 10 12 9 6 7 4
5. For the following data Q1 is found to be 41. Find the missing frequency.
Classes 30 - 34 35 - 39 40 - 44 45 - 49 50 - 54 55 - 59
fi 8 10 f3 20 12 25
CYP16 i. Q1 = 35 Q2 = 46 Q3 = 50 D4 = 43 P69 = 50
ii. Q1 = 45 Q2 = 50 Q3 = 60 D4 = 50 P69 = 55
iii. Q1 = 458 Q2 = 580 Q3 = 685 D4 = 542 P69 = 646.5
7.8 GLOSSARY
7.9 REFERENCES
UNIT 8: RELATIONSHIP BETWEEN MEAN, MEDIAN AND MODE
CONTENTS:
8.0 Aims and Objectives
8.1 Introduction
8.2 Symmetric and Moderately Skewed Distribution
8.3 Summary
8.4 Model Exam Questions
8.5 Answers to Check Your Progress Questions
8.6 Glossary
8.7 References
8.1 INTRODUCTION
For a moderately skewed distribution, the median lies between the mean and the mode. An
approximate relationship among these averages is:
Mean – Mode = 3 (Mean – Median) or
Mean – Median = 1/3 (Mean – Mode).
From this empirical relationship, we can see that the median is closer to the mean than to the mode. If
the maximum frequency is repeated or if the grouping gives two modal classes, then the
distribution is called a bi-modal distribution. In such a situation, the mode is obtained by:
Mean – Mode = 3 (Mean – Median) or
Mode = 3 Median – 2 Mean
Example: Find the value of mode for the following distribution.
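Wages             0 - 10 10 - 20 20 - 30 30 - 40 40 - 50 50 - 60 60 - 70 70 - 80
No. of Persons    10     40      20      0       10      40      16      14
$$\bar{x} = \frac{\sum f_i x_i}{N} = \frac{5890}{150} = 39.2667$$
$$\tilde{x} = l + \frac{c}{f}\left(\frac{n}{2} - c.f\right) = 40 + \frac{10}{10}(75 - 70) = 45$$
Then
$$\hat{x} = 3\tilde{x} - 2\bar{x} = 3(45) - 2(39.2667) = 135 - 78.5334 = 56.4666$$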
CYP 19 Calculate mode using the empirical relationship of mean and median for the
following distribution.
A distribution is said to be symmetrical when the values of the variables, equidistant from
the mean, have equal frequencies.
Consider the following frequency distribution
Classes 20 - 30 30 - 40 40 - 50 50 - 60 60 - 70 70 - 80 80 - 90
fi 12 18 25 36 25 18 12
In this distribution, the frequencies on either side of the central frequency are mirror images of each other. Such a distribution is called a Symmetric Frequency Distribution. If we calculate the mean, median and mode for this distribution, we find that $\bar{x} = \tilde{x} = \hat{x} = 55$.
Symmetric               Positively Skewed       Negatively Skewed
Mean = Median = Mode    Mean > Median > Mode    Mean < Median < Mode
Q2 – Q1 = Q3 – Q2       Q2 – Q1 < Q3 – Q2       Q2 – Q1 > Q3 – Q2
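$$\bar{x} = \frac{\sum f_i x_i}{N} = \frac{6742}{100} = 67.42$$
$$\tilde{x} = l + \frac{c}{f}\left(\frac{n}{2} - c.f\right) = 65.5 + \frac{3}{35}(50 - 25) = 65.5 + 2.1 = 67.6$$
$$\hat{x} = l + \frac{f_1 - f_0}{(f_1 - f_0) + (f_1 - f_2)} \times c = 65.5 + \frac{35 - 18}{(35 - 18) + (35 - 34)} \times 3 = 65.5 + \frac{51}{18} = 68.3$$
Since $\bar{x} < \tilde{x} < \hat{x}$, the distribution is negatively skewed.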
8.3 SUMMARY
Skewness discloses the difference between the manner in which the observations are
distributed in a particular distribution and the manner in which they are distributed in a normal distribution.
8.4 MODEL EXAMINATION QUESTIONS
2. For a certain symmetric distribution the first and the last deciles are 200 and 360
respectively. What is the modal value of the distribution?
3. Test the skewness of the following distribution.
8.6 GLOSSARY
Bi - modal - Refers to a distribution of data points in which two values occur more
frequently than the rest of the values in the data set.
Empirical - Derived from or relating to experiment and observation, rather than
theory.
Skewness - A form of asymmetry in a frequency distribution.
Symmetric FD -A frequency distribution in which the distribution of frequencies is
identical on both sides of the mode. The Mean, Median and Mode
coincide.
8.7 REFERENCES
UNIT 13. PROBABILITY
CONTENTS
13.0 Aims and Objectives
13.1 Introduction
13.2 Definition of Probability
13.3 Properties of Probability
13.4 Multiplication Rule of Probability
13.5 Conditional Probability
13.6 Addition Rule of Probability
13.7 Summary
13.8 Answer to check Your Progress questions
13.9 Model Exams
13.10 Glossary
13.11 References
13.1 INTRODUCTION
In the previous unit you have seen methods of counting the number of
elements in an event as well as in the sample space of an experiment, which is the basis
for determining the probability of an event. There are two approaches to the definition of
probability, but we will be interested in and work mainly with one of them, namely
the classical definition of probability. You will see different techniques for determining
the probability of an event, but in every case you need to apply the counting methods
discussed in the previous unit.
Notation: the probability of an event E is written as P(E) and read as "the probability of event E". It is always expressed by a number between 0 and 1 inclusive, i.e. for any event E we must have
$$0 \le P(E) \le 1.$$
Note that the probability P of an event E is a numerical measure of how likely event E is to occur.
Suppose in an experiment there are n equally likely outcomes and an event E can happen in m of these; then
$$P(E) = \frac{m}{n} = \frac{n(E)}{n(u)}$$
Example 1: In rolling a regular die, what is the probability of getting an even number on the upper face?
Solution: When a regular die is rolled, the number that faces up can be any one of the six equally likely outcomes 1, 2, 3, 4, 5 or 6, and three of these are even.
Hence n(u) = 6, n(E) = 3, where E = {2, 4, 6} and u = {1, 2, 3, 4, 5, 6}
$$P(E) = \frac{3}{6} = \frac{1}{2}$$
Example 2: In rolling a pair of regular dice, what is the probability of scoring a sum of
a) 8 b) 9 c) 10 d) 11 e) 12
Solution: n (u) = 36
a) E1 = {(2, 6), (3, 5), (4, 4), (5, 3), (6, 2)} then P (E1) = 5/36
b) E2 = {(3, 6), (4, 5), (5, 4), (6, 3)} then P (E2) = 4/36
c) E3 = {(4, 6), (5, 5), (6, 4)} then P (E3) = 3/36
d) E4 = {(5, 6), (6, 5)} then P (E4) = 2/36
e) E5 = {(6, 6)} then P (E5) = 1/36
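The classical definition can be checked by listing the sample space explicitly. The following is a small Python sketch; the function name `probability_of_sum` is illustrative. It enumerates the 36 equally likely outcomes of two dice and counts those with the required sum.

```python
from fractions import Fraction
from itertools import product


def probability_of_sum(target, faces=6):
    """Return the favourable outcomes and P(E) = n(E) / n(u) for two dice."""
    sample_space = list(product(range(1, faces + 1), repeat=2))
    event = [pair for pair in sample_space if sum(pair) == target]
    return event, Fraction(len(event), len(sample_space))


for total in (8, 9, 10, 11, 12):
    event, p = probability_of_sum(total)
    print(total, event, p)
```

`Fraction` reduces each result to lowest terms, so 4/36 is printed as 1/9 and 3/36 as 1/12.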
Example 3: Five cards bearing the numerals 1, 3, 5, 7 and 9 are placed in a box and two are withdrawn at random. What is the probability that the sum of the numbers shown on the cards drawn is
a) 4 b) 8 c) 16 d) an even number e) an odd number
Solution: U = {(1, 3), (1, 5), (1, 7), (1, 9), (3, 5), (3, 7), (3, 9), (5, 7), (5, 9), (7, 9)}
n(u) = 10
a) P(E1) = 1/10 b) P(E2) = 2/10 c) P(E3) = 1/10
d) P(E4) = 1 e) P(E5) = 0
Where E1 = {(1, 3)}, E2 = {(1, 7), (3, 5)}, E3 = {(7, 9)}, E4 = u and E5 = { }
Relative Frequency Definition of Probability: If an experiment is performed a large number of times and an event E occurs in some of the repetitions, then
$$P(E) = \frac{\text{Number of times E occurred}}{\text{Number of times the experiment was repeated}}$$
Example 4: In an experiment of tossing a fair coin, if 1000 tosses of the coin result in 523 heads, then the observed relative frequency of heads is 523/1000 = 0.523. If another 1000 tosses result in 489 heads, then the observed relative frequency of heads is 489/1000 = 0.489. Then the observed relative frequency of heads in the total of 2000 tosses is
$$\frac{523 + 489}{2000} = \frac{1012}{2000} = 0.506.$$
According to the statistical definition, continuing in this manner, the observed relative frequency of heads gets closer and closer to the number called the probability of a head in a single toss of the coin, which is 0.5.
Example5: How many five-digit numerals can be written using the digits 1, 3, 5, 7 and 9
if no digit is repeated in each numeral? If each numeral is equally likely to be chosen,
what is the probability that the number chosen: -
a) is odd b) is even c) has unit digit is 9
d) is greater than 50,000 e) less than 40,000
Solution: The number of five-digit numerals that can be written is P (5, 5) = 5! = 120
a) 1, since all the digits used to write the numerals are odd, the unit digit is certainly odd and thus all the five-digit numerals are odd.
b) 0, since the unit digit can never be even, a five-digit numeral can never be even; thus it is an impossible event.
c) 1/5, since the number of five-digit numerals whose unit digit is 9 is 4! = 24 and the probability of this event is 24/120 = 1/5.
d) 3/5, since for the number to be greater than 50,000 the ten-thousands digit has to be selected only from 5, 7 or 9, and thereafter each of the digits 1, 3, 5, 7 or 9 which was not already selected can be used once. Hence there are 3 × 4 × 3 × 2 × 1 = 72 different numbers greater than 50,000. So the probability of this event is 72/120 = 3/5.
e) 2/5, since there are 2 × 4 × 3 × 2 × 1 = 48 different five-digit numerals less than 40,000. So the probability of this event is 48/120 = 2/5.
Example 6: From a jar containing 4 white, 3 red and 2 black balls, all identical except for color, three balls are drawn at random. How many different outcomes are there? What is the probability that an outcome consists of
a. 3 white balls b. 3 red balls c. 2 white and 1 red
d. 1 white and 2 red e. 1 white and 2 black f. 2 red and 1 black
g. 1 red and 2 black h. one from each color i. 2 white and 1 black balls
Solution: There are 9 balls in total. Hence the number of possible outcomes of drawing 3 balls at random is c (9, 3) = 84. Thus
a. P (3W) = c (4, 3) / c (9, 3) = 4/84 = 1/21
b. P (3R) = c (3, 3) / c (9, 3) = 1/84
c. P (2W, 1R) = c (4, 2) × c (3, 1) / c (9, 3) = (6 × 3)/84 = 18/84 = 3/14
d. P (1W, 2R) = c (4, 1) × c (3, 2) / c (9, 3) = (4 × 3)/84 = 1/7
e. P (1W, 2B) = c (4, 1) × c (2, 2) / c (9, 3) = 4/84 = 1/21
f. P (2R, 1B) = c (3, 2) × c (2, 1) / c (9, 3) = 6/84 = 1/14
g. P (1R, 2B) = c (3, 1) × c (2, 2) / c (9, 3) = 3/84 = 1/28
h. P (1W, 1R, 1B) = c (4, 1) × c (3, 1) × c (2, 1) / c (9, 3) = (4 × 3 × 2)/84 = 2/7
i. P (2W, 1B) = c (4, 2) × c (2, 1) / c (9, 3) = (6 × 2)/84 = 12/84 = 1/7
CYP 2 If 3 light bulbs are chosen at random from 10 bulbs of which 3 are defective, what is the probability that
a. none of them is defective b. all are defective
c. exactly one is defective d. exactly two are defective
e. at least two are non-defective?
CYP 3 A committee of 3 persons is to be randomly chosen from a group of 4 men and 2 women. What is the probability that exactly one of the members of the committee is a woman?
CYP 4 Suppose a two-letter word consisting of one vowel and one consonant is written using the letters of the word “GONDAR”. Whether or not the word has a meaning, what is the probability that a randomly chosen word is either “DO” or “GO”?
CYP 5 A three-digit whole number is written using the digits 1, 2, 3, …, 9. If a digit is used at most once in a number, what is the probability that a randomly chosen number is divisible by 2?
13.3 PROPERTIES OF PROBABILITY OF EVENT
Definition:
In an experiment, an event that is certain to occur is called a sure event, and an event that is certain not to occur is called an impossible event.
Note: In an experiment any event E is either a sure event, an impossible event or somewhere in between. Therefore the probability of any event E can be expressed as 0 ≤ P (E) ≤ 1, where the probability of a sure event is 1 and the probability of an impossible event is 0, i.e. P(s) = 1, P(∅) = 0 and 0 < P(E) < 1 for any event E such that E ≠ s and E ≠ ∅, and the sum of the probabilities of all the sample points is 1, where s is the sample space of the experiment.
Example 1: In rolling a fair die on a flat surface, the event of getting a “7” on the upper face is an impossible event and its probability is 0, while the event of getting a number between 0 and 7 on the upper face is a sure event and its probability is 1. But the probability of an event E which is a proper subset of the sample space is between 0 and 1, provided that E ≠ ∅.
Definition:
In an experiment two or more events are said to be mutually exclusive events if and only if they
cannot occur simultaneously.
Note: In an experiment mutually exclusive events are pairwise disjoint, and their union is a
subset of the sample space of the experiment.
Example: In rolling a fair die, the event of getting the set of prime number E1 and the set
of composite number E2 on the upper face are two mutually exclusive events since E1 =
{2, 3, 5} and E2 = {4, 6} can not occur simultaneously.
Definition:
In an experiment two events are said to be complementary if and only if they are disjoint and their
union gives the sample space.
Rule of complementary events: If E and E′ are two complementary events of an
experiment, then P(E) + P(E′) = 1
Example3: In rolling a regular die, what is the probability that the face appears up
shows not composite number?
Example 4: In tossing a fair 5-cent coin three times, what is the probability of getting at least one head in the three tosses?
Solution: U = {HHH, HHT, HTH, HTT, THH, THT, TTH, TTT}. Let E be the event consisting of no head, i.e. E = {TTT}; then E′ is the event consisting of at least one head.
Since P(E) = 1/8, P(E′) = 1 - P(E) = 1 - 1/8 = 7/8.
Example 5: Suppose a family plans to have four children. What is the probability that not all the children have the same sex, if it is equally likely for a son or a daughter to be born?
Solution: n (u) = 16. Let E be the event that the children are all sons or all daughters, i.e. E = {SSSS, DDDD}; then P(E) = 2/16 and P(E′) = 1 - P(E) = 1 - 2/16 = 14/16 = 7/8.
Definition:
Two events are said to be independent if the occurrence of one does not affect the
probability of the occurrence of the other. Several events are similarly independent if the
occurrence of any one of them does not affect the probabilities of the occurrence of the others. If
two events are not independent, they are said to be dependent. Similarly, several
events that are not independent are said to be dependent.
Example 6: In rolling a pair of fair dice, let E1 be the event that a prime number appears on the upper face of the first die and E2 be the event that a composite number appears on the upper face of the second die. Since the occurrence of E1 does not affect the probability of the occurrence of E2, E1 and E2 are said to be independent events.
Example 7: Suppose a box contains 10 balls, all identical except in color, of which 6 are white and 4 are black. If one ball is drawn at random and is found to be white, and a second ball is then drawn at random without replacement, the probability that the second ball is white is 5/9 and that it is black is 4/9. But the probability that the first ball was white was 6/10, and that it was black was 4/10. Hence the two events are dependent events, since the occurrence of one affects the probability of the occurrence of the other.
Note: If the balls were drawn with replacement, the two events would be independent
since the probabilities of a second event to occur would not be affected by the occurrence
of the first.
Example 8: If 3 light bulbs are chosen at random from a dozen bulbs of which 4 are
defective, what is the probability that
a) none is defective b) all are defective
c) 1 is defective and 2 are non-defective d) 2 are defective and 1 is non-defective
Solution: There are C(12, 3) = 220 ways of choosing 3 bulbs from 12.
a) $\frac{C(8,3)}{220} = \frac{56}{220} = \frac{14}{55}$
b) $\frac{C(4,3)}{220} = \frac{4}{220} = \frac{1}{55}$
c) $\frac{C(4,1)\,C(8,2)}{220} = \frac{4 \times 28}{220} = \frac{112}{220} = \frac{28}{55}$
d) $\frac{C(4,2)\,C(8,1)}{220} = \frac{6 \times 8}{220} = \frac{48}{220} = \frac{12}{55}$
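The counting argument in Example 8 can be checked with a short Python sketch. The function name `prob_defective` and its parameters are illustrative choices, not part of the course text; the sketch only assumes that every set of 3 bulbs is equally likely.

```python
from fractions import Fraction
from math import comb


def prob_defective(k, total=12, defective=4, drawn=3):
    """P(exactly k defective bulbs when `drawn` bulbs are chosen without replacement)."""
    good = total - defective
    favourable = comb(defective, k) * comb(good, drawn - k)
    return Fraction(favourable, comb(total, drawn))


# One probability for each possible number of defective bulbs (0 to 3).
results = {k: prob_defective(k) for k in range(4)}
for k, p in results.items():
    print(k, "defective:", p)
```

The four probabilities cover every possible outcome, so they add up to 1, which is a quick check on the arithmetic.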
Example 9: Suppose that from a box containing 7 white and 3 black balls we draw 2 balls,
one after the other, without replacement. What is the probability of drawing 1 white and 1
black ball?
Solution: The probability of drawing a white ball 1st and a black ball 2nd is $\frac{7}{10} \cdot \frac{3}{9} = \frac{21}{90}$.
The probability of drawing a black ball 1st and a white ball 2nd is $\frac{3}{10} \cdot \frac{7}{9} = \frac{21}{90}$. Hence the total
probability of drawing 1 white and 1 black is
$\frac{21}{90} + \frac{21}{90} = \frac{42}{90} = \frac{7}{15}$, or equivalently $P(1w, 1b) = \frac{C(7,1)\,C(3,1)}{C(10,2)} = \frac{7 \times 3}{45} = \frac{7}{15}$.
CYP 6 Suppose 30 men and 20 women are attending a conference. If 3 participants
are randomly selected to report on the discussion, find the probability that at least one is
a woman.
CYP 7 Suppose a test consists of 10 true-false questions. An unprepared student answers
every question by guessing at random. What is the probability that he gives
a) all correct answers b) no correct answer c) 5 correct answers d) at least one correct answer
CYP 8 Among the 12 nominees for the board of directors of a farm cooperative, there are
8 men and 4 women. In how many ways can the members select any two of the
nominees as directors? What is the probability that the selection consists of
a) both men b) both women c) one man and one woman
Example 1: Suppose a die is thrown twice. What is the probability of the 1st throw being
less than 3 and the 2nd throw being less than 4?
Solution: Let E1 be the event of the 1st throw being less than 3, and E2 be the event of the
2nd throw being less than 4. Then $P(E_1 \cap E_2) = P(E_1) \cdot P(E_2) = \frac{2}{6} \cdot \frac{3}{6} = \frac{1}{6}$.
Example 2: Suppose one box contains 5 black and 3 white balls and a second box
contains 4 black and 6 white balls. If one ball is drawn from each box, what is the
probability that
a) both are black b) both are white c) 1 is white and 1 is black
Solution: a) Let E1 be the event of drawing a black ball from the 1st box and E2 be the
event of drawing a black ball from the 2nd box. Then E1 and E2 are independent.
$P(E_1 \cap E_2) = P(E_1) \cdot P(E_2) = \frac{5}{8} \cdot \frac{4}{10} = \frac{1}{4}$
b) The event of drawing a white ball from the 1st box is the complement E1' and the event
of drawing a white ball from the 2nd box is the complement E2'. These are also independent events.
$P(E_1' \cap E_2') = P(E_1') \cdot P(E_2') = \frac{3}{8} \cdot \frac{6}{10} = \frac{9}{40}$
c) We get 1 white and 1 black if either we draw black from the 1st box and white from
the 2nd box, or white from the 1st box and black from the 2nd box. Thus
$P((E_1 \cap E_2') \cup (E_1' \cap E_2)) = P(E_1 \cap E_2') + P(E_1' \cap E_2) = P(E_1) \cdot P(E_2') + P(E_1') \cdot P(E_2)$
$= \frac{5}{8} \cdot \frac{6}{10} + \frac{3}{8} \cdot \frac{4}{10} = \frac{21}{40}$
or
$P(1w, 1b) = 1 - [P(E_1 \cap E_2) + P(E_1' \cap E_2')] = 1 - \left(\frac{10}{40} + \frac{9}{40}\right) = \frac{21}{40}$
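As a cross-check on the two-box example, the following Python sketch lists every equally likely pair of balls and counts the favourable ones. The variable and function names are illustrative only.

```python
from fractions import Fraction
from itertools import product

# Contents of the two boxes in the example.
box1 = ["black"] * 5 + ["white"] * 3
box2 = ["black"] * 4 + ["white"] * 6

# Every (ball from box 1, ball from box 2) pair is equally likely.
pairs = list(product(box1, box2))


def prob(event):
    """Probability of an event given as a true/false test on a pair of balls."""
    return Fraction(sum(1 for pair in pairs if event(pair)), len(pairs))


both_black = prob(lambda pair: pair == ("black", "black"))
both_white = prob(lambda pair: pair == ("white", "white"))
one_of_each = prob(lambda pair: pair[0] != pair[1])

print(both_black, both_white, one_of_each)
```

Counting pairs directly gives the same values as the multiplication rule, and `one_of_each` also equals 1 minus the other two probabilities.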
Example 3: What is the probability of getting two consecutive kings if two cards are
drawn at random from a deck of 52 playing cards and
a) the 1st card is replaced before the 2nd card is drawn
b) the 1st card is not replaced before the 2nd card is drawn
Solution: a) There are 4 kings among the 52 cards. Thus the probability that the 1st and
the 2nd card are both kings is $\frac{4}{52} \cdot \frac{4}{52} = \frac{1}{169}$ (the two events are independent).
b) If the 1st card drawn is a king and is not replaced, then only 3 kings remain among the
other 51 cards, so the probability that the 1st and the 2nd card are both kings is $\frac{4}{52} \cdot \frac{3}{51} = \frac{1}{221}$.
Example 4: If A and B are events such that P(A) = 0.7, P(B) = 0.4 and $P(A \cap B) = 0.2$,
are A and B independent events?
Solution: Since $P(A) \cdot P(B) = 0.7 \times 0.4 = 0.28$ and $P(A \cap B) = 0.2$, we have
$P(A \cap B) \neq P(A) \cdot P(B)$; therefore A and B are not independent events.
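The independence test used in Example 4 compares P(A) · P(B) with P(A ∩ B). A minimal Python sketch of that comparison follows; the function name `are_independent` and the tolerance argument are assumptions made for illustration, the tolerance being there only because decimal probabilities are stored approximately.

```python
from math import isclose


def are_independent(p_a, p_b, p_a_and_b, tol=1e-9):
    """Return True when P(A and B) equals P(A) * P(B), up to rounding error."""
    return isclose(p_a * p_b, p_a_and_b, abs_tol=tol)


# Values from Example 4: 0.7 * 0.4 = 0.28, which differs from 0.2.
print(are_independent(0.7, 0.4, 0.2))
```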
CYP 9 If P(A) = 0.8, P(B) = 0.25 and $P(A \cap B) = 0.2$, are A and B independent
events?
CYP 10 Find the probability that a “6” turns up once in two tosses of a fair die.
CYP 11 Two cards are drawn from a well-shuffled deck of 52 playing cards. Find the
probability that they are both picture cards a) if the cards are drawn with replacement
b) if the cards are drawn without replacement
CYP 12 Find the probability of three consecutive 2’s turning up in rolling a fair die three
times.
When two events are dependent, the concept of conditional probability is used to show
the occurrence of the related events.
Definition:
If A and B are two dependent events, then the probability of event B occurring given that
event A has occurred, denoted by $P(B \mid A)$ and read as "the probability of B given A",
is called the conditional probability of B given that A has occurred. It is given by
$$P(B \mid A) = \frac{P(B \cap A)}{P(A)}$$
Note: If A and B are independent events then $P(B \mid A)$ must equal P(B), since the
occurrence of A should not affect P(B). Hence
$P(A \cap B) = P(A) \cdot P(B)$ if A and B are independent events, and
$P(A \cap B) = P(A) \cdot P(B \mid A) = P(B) \cdot P(A \mid B)$ if A and B are dependent events.
Example 1: Suppose there are 30 applicants for a job in a certain organization, who are
cross-classified by their sex and color.
            Black   White
Male          12       8
Female         4       6
Assume that each applicant is equally likely to be chosen for the job. What is the
probability that the applicant chosen is
a) black b) white c) male d) female e) male and black
f) female and black g) male and white h) female and white
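Solution: Let B stand for the set of black applicants, W for the set of white applicants,
M for the set of male applicants and F for the set of female applicants.
a) $P(B) = \frac{12 + 4}{30} = \frac{16}{30} = \frac{8}{15}$        b) $P(W) = \frac{8 + 6}{30} = \frac{14}{30} = \frac{7}{15}$
c) $P(M) = \frac{12 + 8}{30} = \frac{20}{30} = \frac{2}{3}$        d) $P(F) = \frac{4 + 6}{30} = \frac{10}{30} = \frac{1}{3}$
e) $P(M \cap B) = \frac{12}{30}$        g) $P(M \cap W) = \frac{8}{30}$
f) $P(F \cap B) = \frac{4}{30}$        h) $P(F \cap W) = \frac{6}{30}$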
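Solution: Using the same table, the conditional probabilities are:
a) $P(M \mid B) = \frac{P(M \cap B)}{P(B)} = \frac{12}{30} \div \frac{8}{15} = \frac{12}{16} = \frac{3}{4}$
b) $P(M \mid W) = \frac{P(M \cap W)}{P(W)} = \frac{8}{30} \cdot \frac{15}{7} = \frac{4}{7}$
c) $P(F \mid B) = \frac{P(F \cap B)}{P(B)} = \frac{4}{30} \cdot \frac{15}{8} = \frac{1}{4}$
d) $P(F \mid W) = \frac{P(F \cap W)}{P(W)} = \frac{6}{30} \cdot \frac{15}{7} = \frac{3}{7}$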
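The conditional probabilities for the applicant table can be reproduced from the raw counts. The sketch below is one possible way to organise the calculation in Python; the dictionary layout and the function names are illustrative assumptions.

```python
from fractions import Fraction

# Counts from the applicant table: (sex, color) -> number of applicants.
counts = {
    ("male", "black"): 12, ("male", "white"): 8,
    ("female", "black"): 4, ("female", "white"): 6,
}
total = sum(counts.values())  # 30 applicants


def p_color(color):
    """Marginal probability that the chosen applicant has the given color."""
    return Fraction(sum(n for (_, c), n in counts.items() if c == color), total)


def p_joint(sex, color):
    """Joint probability of a given sex and color."""
    return Fraction(counts[(sex, color)], total)


def p_sex_given_color(sex, color):
    """Conditional probability P(sex | color) = P(sex and color) / P(color)."""
    return p_joint(sex, color) / p_color(color)


print(p_sex_given_color("male", "black"))    # P(M | B)
print(p_sex_given_color("female", "white"))  # P(F | W)
```

For a fixed color the conditional probabilities of male and female add up to 1, since given the color the applicant must be one or the other.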
CYP 13
Suppose there are 80 employees in a company, classified by their academic background
and experience as shown below. If an employee is randomly selected to be the
chairperson of the employees' association, find the probability that the selected person
has
a) experience below 10 years given that he (she) is a graduate
b) experience below 10 years given that he (she) is not a graduate
c) experience of 10 years or above given that he (she) is a graduate
d) experience of 10 years or above given that he (she) is not a graduate
                     Graduate   Not graduate
Below 10 years           8           26
10 years or above       22           24
13.6 ADDITION RULE OF PROBABILITY
Solution: Let E1 be the event of achieving a sum of 7; then E1 = {(1,6), (2,5), (3,4), (4,3),
(5,2), (6,1)}, hence $P(E_1) = \frac{6}{36}$. Let E2 be the event of achieving a sum of 8; then
E2 = {(2,6), (3,5), (4,4), (5,3), (6,2)}, hence $P(E_2) = \frac{5}{36}$. Let E3 be the event of achieving
a sum of 9; then E3 = {(3,6), (4,5), (5,4), (6,3)}, hence $P(E_3) = \frac{4}{36}$. Since E1, E2 and E3 are
mutually exclusive events,
$P(E_1 \cup E_2 \cup E_3) = P(E_1) + P(E_2) + P(E_3) = \frac{6}{36} + \frac{5}{36} + \frac{4}{36} = \frac{15}{36}$
Example 2: Nine cards bearing the numerals 1, 2, 3, …, 9 are placed in a box and one card
is withdrawn at random. What is the probability that the card drawn bears either an
odd number or a multiple of 3?
Solution: Let E1 be the event that an odd number is drawn, i.e. E1 = {1, 3, 5, 7, 9}, and E2 be
the event that a multiple of 3 is drawn, i.e. E2 = {3, 6, 9}. Then E1 ∩ E2 = {3, 9} and
$P(E_1 \cup E_2) = P(E_1) + P(E_2) - P(E_1 \cap E_2) = \frac{5}{9} + \frac{3}{9} - \frac{2}{9} = \frac{6}{9} = \frac{2}{3}$
Example 3: Find the probability of drawing a black card or a king when one card is drawn
at random from a deck of 52 cards.
Solution: Let E1 be the event of drawing a black card, so n(E1) = 26, and E2 be the
event of drawing a king, so n(E2) = 4, where 2 of the kings are black.
$P(E_1 \cup E_2) = P(E_1) + P(E_2) - P(E_1 \cap E_2) = \frac{26}{52} + \frac{4}{52} - \frac{2}{52} = \frac{28}{52} = \frac{7}{13}$
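The addition rule in the card example can be verified by listing all 52 cards. In the Python sketch below the deck representation and helper names are illustrative assumptions; the point is that direct counting and the addition rule give the same value.

```python
from fractions import Fraction
from itertools import product

ranks = ["A", "2", "3", "4", "5", "6", "7", "8", "9", "10", "J", "Q", "K"]
suit_colors = {"spades": "black", "clubs": "black", "hearts": "red", "diamonds": "red"}

# A card is a (rank, suit) pair; 13 ranks x 4 suits = 52 cards.
deck = list(product(ranks, suit_colors))


def prob(event):
    """Probability of an event given as a true/false test on a single card."""
    return Fraction(sum(1 for card in deck if event(card)), len(deck))


def is_black(card):
    return suit_colors[card[1]] == "black"


def is_king(card):
    return card[0] == "K"


direct = prob(lambda card: is_black(card) or is_king(card))
by_rule = prob(is_black) + prob(is_king) - prob(lambda card: is_black(card) and is_king(card))

print(direct, by_rule)
```

Subtracting the probability of a black king prevents the two black kings from being counted twice.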
CYP 14 Find the probability of scoring a sum of 9 or 10 in a toss of a pair of fair dice.
CYP 15 Use the addition rule and the rule of complementary events to find a formula for
the probability of getting neither event A nor event B.
13.7 SUMMARY
ii. $P(A \text{ and } B) = P(A) \cdot P(B \mid A) = P(B) \cdot P(A \mid B)$ if A and B are dependent
c. The probability of one or the other of two mutually exclusive events A and
B happening is given by $P(A \cup B) = P(A) + P(B)$.
The probability of one or the other of any two events A and B happening is
given by
$P(A \cup B) = P(A) + P(B) - P(A \cap B)$
d. The probability of one or the other of k mutually exclusive events A1, A2,
…, Ak happening is given by
$P(A_1 \cup A_2 \cup \dots \cup A_k) = P(A_1) + P(A_2) + \dots + P(A_k)$
e. The probability of any event E is given by the sum of the probabilities of
the individual outcomes comprising event E.
f. The probability of k independent events A1, A2, …, Ak all happening is given
by
$P(A_1 \cap A_2 \cap \dots \cap A_k) = P(A_1) \cdot P(A_2) \cdots P(A_k)$
CYP 1 The sample space consists of C(10, 5) = 252 members. If two are women, the
remaining three must be men. The event consists of $C(6,3) \cdot C(4,2) = 20 \times 6 = 120$
members.
The probability that exactly two of the members of the committee are women
is $\frac{120}{252} = \frac{10}{21}$.
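CYP 2 The sample space consists of C(10, 3) = 120 members.
a. $\frac{C(7,3)}{120} = \frac{35}{120}$
b. $\frac{C(3,3)}{120} = \frac{1}{120}$
c. $\frac{C(3,1)\,C(7,2)}{120} = \frac{63}{120}$
d. $\frac{C(3,2)\,C(7,1)}{120} = \frac{21}{120}$
e. $\frac{C(7,1)\,C(3,2) + C(3,3)}{120} = \frac{22}{120}$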
CYP 3 $\frac{C(2,1)\,C(4,2)}{C(6,3)} = \frac{2 \times 6}{20} = \frac{12}{20}$
CYP 4 The sample space consists of 16 two-letter words: 8 are vowel-consonant pairs and
8 are consonant-vowel pairs. Then $P(\text{DO or GO}) = \frac{2}{16} = \frac{1}{8}$.
CYP 5 The sample space consists of $9 \times 8 \times 7 = 504$ members and the event consists of
$7 \times 8 \times 4 = 224$ members, so the probability is $\frac{224}{504} = \frac{4}{9}$.
CYP 6 The probability that at least one is a woman
$= 1 - P(\text{no woman}) = 1 - \frac{C(30,3)}{C(50,3)} = 1 - \frac{4060}{19600} = \frac{15540}{19600} = \frac{111}{140} \approx 0.793$
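CYP 11 a. $\frac{12}{52} \cdot \frac{12}{52} = \left(\frac{3}{13}\right)^2 = \frac{9}{169}$        b. $\frac{12}{52} \cdot \frac{11}{51} = \frac{33}{663} = \frac{11}{221}$
CYP 12 They are independent events. The probability that a “2” turns up in a single toss is $\frac{1}{6}$;
then the probability of a “2” in three consecutive tosses is $\frac{1}{6} \cdot \frac{1}{6} \cdot \frac{1}{6} = \frac{1}{6^3} = \frac{1}{216}$.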
CYP 13 Let A be the set of employees with experience 10 years or above
B be the set of employees with experience below 10 years
G be the set of employees who are graduates
N be the set of employees who are not graduates
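a) $P(B \mid G) = \frac{P(B \cap G)}{P(G)} = \frac{8/80}{30/80} = \frac{8}{30} = \frac{4}{15}$
b) $P(B \mid N) = \frac{P(B \cap N)}{P(N)} = \frac{26/80}{50/80} = \frac{26}{50} = \frac{13}{25}$
c) $P(A \mid G) = \frac{P(A \cap G)}{P(G)} = \frac{22/80}{30/80} = \frac{22}{30} = \frac{11}{15}$
d) $P(A \mid N) = \frac{P(A \cap N)}{P(N)} = \frac{24/80}{50/80} = \frac{24}{50} = \frac{12}{25}$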
CYP 14 n(S) = 36, n(E) = 7, E = {(3, 6), (4, 5), (5, 4), (6, 3), (4, 6), (5, 5), (6, 4)}
$P(E) = \frac{7}{36}$
1) A natural number is written using the digits 1, 2, 3, 4, 5, with each digit used at most
once in the number. What is the probability that a randomly chosen such number is
a) < 2000 b) between 200 and 3000 c) > 3000
2) In the decimal system of notation, how many three-digit numerals can be written? What
is the probability that a randomly chosen three-digit number is
a) an odd number b) an even number
c) greater than 300 d) less than 700
4) From a class of 12 boys and 18 girls, three students are randomly selected to represent
their class. What is the probability that
a) all are boys b) all are girls
c) at least one is a girl d) at least one is a boy
5) A die is thrown and a card is drawn from a well-shuffled deck of 52 playing cards at
the same time. What is the probability that the outcome consists of
a) an even number from the die and a diamond from the cards
b) a 2 from the die and a picture card from the cards
c) a prime number from the die and a king from the cards
d) a composite number from the die and a red card from the cards
7) If 7 men and 5 women have applied for a job and 4 applicants are randomly selected
from this group, find the probability that
a) all 4 are women b) 2 are men c) at least one is a woman d) at least one
is a man.
8) A box contains 12 fuses of which 3 are defective. Two fuses are randomly selected,
one after the other, without replacement. Find the probability that
a) both are defective b) both are non-defective c) one is defective and
one is non-defective.
9) From a lot consisting of 100 items of which 10 are defective, three items are chosen
at random without replacement. What is the probability that
a) all are defective b) all are non-defective c) one is defective d) two are defective
10) One box contains 5 black and 4 white balls; a second box contains 4 black and 3
white balls. If one ball is drawn from each box, what is the probability that
a) both are white b) both are black c) the two balls have different colors
11) Three balls are drawn successively from a box containing 5 green, 4 yellow and 3
red balls. What is the probability that they are drawn in the order green, yellow and red if
each ball is
a) replaced b) not replaced before the next draw.
12) There are 50 applicants for a job in a company. Some are college graduates and some
are not; some have at least 5 years of experience and some have less than 5 years of
experience, with the exact breakdown as given below. Suppose the order in which the
applicants are interviewed by the manager is random. Let G be the event that the first
applicant interviewed is a college graduate and E be the event that the first applicant
interviewed has at least 5 years of experience. Determine each of the following
probabilities.
College Graduate Not College Graduate
13) Find the probability of getting a “red card” or a “card with 6” if one card is drawn
at random from a well-shuffled deck of 52 playing cards.
14) A day of the week is randomly selected. What is the probability that it is neither
Tuesday nor Thursday?
15) A box contains 9 cards, each bearing exactly one of the numbers 1, 2, 3, …, 9. If 3 cards
are drawn one after the other without replacement, what is the probability that the drawn
cards are numbered odd-even-odd or even-odd-even?
16) If P(A) = 0.3 and P(B) = 0.6, what is known about P(A or B) if A and B are
a) mutually exclusive events b) not mutually exclusive events
13.10 GLOSSARY
UNIT 14: DISCRETE PROBABILITY DISTRIBUTION
CONTENTS
14.0 Aims and Objectives
14.1 Introduction
14.2 Random Variable
14.3 Definition of Probability Distribution
14.4 Types of Probability Distributions
14.5 Expected Value and Variance of a Probability Distribution
14.6 The Binomial Probability Distribution
14.7 The Poisson Probability Distribution
14.8 Summary
14.9 Answers to Check Your Progress Questions
14.10 Check Your Progress Questions (CYP)
14.11 Glossary
14.12 References
The aim of this unit is to study the concept of random variable and then discuss the most
commonly used discrete probability distributions, the Binomial and Poisson probability
distributions.
14.1 INTRODUCTION
In block 3, you learnt how to construct a frequency polygon for a given frequency
distribution. It seemed that there was no way of telling in advance what the polygon
would look like or what the mean and the standard deviation would be. As a result, it
may be necessary to study the behavior of the frequency polygon further, so as to
understand the general behavior of the distribution and draw conclusions that are
useful for decision-making.
This section focuses on the definitions of random variable and probability distribution.
Then, you will deal with the two most common discrete probability distributions.
In block 6, we defined the concept of ‘experiment’ and its associated outcome. A random
variable provides a means of assigning numerical values to experimental outcomes. The
definition of a random variable is as follows:
Definition:
Notation: Random variables are usually denoted by capital letters like X, Y, Z, etc.
Example 1: Consider the experiment of tossing a fair coin once.
The sample space is S = {H, T}, where H denotes the outcome ‘Head’ and T
denotes the outcome ‘Tail’. So, there are two possible outcomes, H or T.
Now, let the random variable X represent the number of heads that appear; then X
can take the value 0 or 1.
Let the random variable Y denote the outcome ‘A number greater than 2
occurs’. Then the random variable can assume the values 3, 4, 5 or 6.
Example 3: Consider the experiment of rolling two fair dice once simultaneously.
If the random variable T indicates the outcome ‘the sum of the numbers on the
two dice is greater than 10’, then T can take the pairs (5, 6), (6, 5) or (6, 6), since
in each of these cases the sum of the numbers is greater than 10.
CYP1
Let two fair coins be tossed once simultaneously. If the random variable X denotes ‘A tail
appears ’
What are the possible values of the random variable X?
Depending upon the numerical values it can assume, a random variable can be classified
into two major divisions.
A) Discrete Random Variable: is a random variable that may assume either a finite
number of values or an infinite sequence (e.g. 1, 2, 3…) of values. In general, a
discrete random variable takes whole number values, which can be counted or
enumerated.
Example: The number of students who are enrolled for a diploma program in
Unity University College, the number of defective batteries observed in assessing
its quality, the number of customers who visit a shop during one day of operation
are all examples of discrete random variables.
CYP2
Decide whether each of the following random variables is discrete or continuous. Put
your answer on the space provided.
2. The number of indigenous birds, which are visited each day in the Awash
National Park.
________________________________________________
3. The amount of time elapsed to cover a distance between two stations in a city.
________________________________________________
The probability distribution for a random variable describes how the probabilities are
distributed over the values of the random variable. For a discrete random variable X, the
probability function is denoted by P(X). The probability function provides the probability
for each value of the random variable.
A probability distribution may in general be defined as follows:
Definition:
Example 1: Construct a probability distribution for the number of heads in tossing two
fair coins simultaneously once.
Outcome, X 0 1 2
Probability, P(X) ¼ ½ ¼
The probability distribution shows that the probability that the random variable can
assume the value 0 is ¼, the value 1 is ½ and the value 2 is ¼. Note that the sum of these
probabilities is 1.
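The table for the number of heads can also be built by listing the sample space and counting. The following Python sketch does this for any number of fair coins; the function name `heads_distribution` is an illustrative choice.

```python
from collections import Counter
from fractions import Fraction
from itertools import product


def heads_distribution(n_coins):
    """Probability distribution of the number of heads when n_coins fair coins are tossed."""
    outcomes = list(product("HT", repeat=n_coins))  # e.g. ('H', 'T')
    counts = Counter(outcome.count("H") for outcome in outcomes)
    return {x: Fraction(c, len(outcomes)) for x, c in sorted(counts.items())}


dist = heads_distribution(2)
for x, p in dist.items():
    print(x, p)
```

With two coins the sample space is {HH, HT, TH, TT}, so one head occurs in two of the four equally likely outcomes.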
Example 2: The number of mistakes a typist made in ten days of assessment is shown in
the following table.
No of mistakes   2   3   4   5
No of days       1   4   3   2
Solution:
a) In constructing the probability distribution, our random variable is the number of
mistakes the typist committed. Let X denote this random variable. Then we assign
to each value of X a probability equal to the number of days on which that many
mistakes were made divided by the total number of days.
The probability distribution is shown below:
No of mistakes, X   2      3      4      5
Probability, P(X)   1/10   4/10   3/10   2/10
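b) In the graph of the probability distribution, the number of mistakes committed is
labeled on the x-axis and the corresponding probability, P(X), on the y-axis.
[Figure: graph of the probability distribution, with the number of mistakes on the x-axis and the probability P(X), from 0.1 to 0.4, on the y-axis.]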
In the construction of the probability distribution for a discrete random variable, the
following two conditions must be satisfied.
Properties (Required Conditions) for a Discrete Probability Distribution
i) The sum of the probabilities of all the events in the sample space must equal 1,
i.e. $\sum P(x) = 1$
ii) The probability of each event in the sample space must be between 0 and 1
inclusive,
i.e. $0 \le P(x) \le 1$
For instance, in the above example, these two conditions are satisfied since
$\sum P(X) = P(2) + P(3) + P(4) + P(5) = 0.1 + 0.4 + 0.3 + 0.2 = 1$ and
each of these probabilities is greater than or equal to 0 and less than or equal to 1.
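The two required conditions are easy to check mechanically. The Python sketch below applies them to the typist example; the function name `is_valid_distribution` is an illustrative assumption, and exact fractions are used so that the sum is compared with 1 without rounding error.

```python
from fractions import Fraction


def is_valid_distribution(probabilities):
    """Check that every probability lies in [0, 1] and that they sum to exactly 1."""
    probs = list(probabilities)
    return all(0 <= p <= 1 for p in probs) and sum(probs) == 1


# Distribution of the number of mistakes from the typist example.
typist = {2: Fraction(1, 10), 3: Fraction(4, 10), 4: Fraction(3, 10), 5: Fraction(2, 10)}

print(is_valid_distribution(typist.values()))
```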
For some discrete random variables, the probability distribution can be given as a formula
that yields P(x) for every possible value of x.
Solution:
The outcome x assumes the values 0, 2 and 3.
Outcome, x          0     2     3
Probability, P(x)   0/5   2/5   3/5
CYP3
1. Construct a probability distribution for the number of tails in tossing three fair
coins once.
2. Assign a probability function, which can generalize all the outcomes in tossing a
fair coin once.
The expected value, or mean, of a random variable is a measure of the central location for
the random variable. It is denoted by E(x) or μ. The mathematical expression for the
expected value of a discrete random variable x is as follows:
Expected value of a discrete random variable:
$E(x) = \mu = x_1 P(x_1) + x_2 P(x_2) + \dots + x_n P(x_n)$, or
$$E(x) = \sum_{i=1}^{n} x_i \, P(x_i)$$
where x1, x2, …, xn are the outcomes and P(x1), P(x2), …, P(xn) are the
corresponding probabilities.
The above formula shows that in order to compute the expected value of a discrete
random variable, we must multiply each value of the random variable by the
corresponding probability P(x) and then add the resulting products.
14.5.2 Variance
While the expected value provides the mean value for the random variable, we often
need a measure of dispersion, or variability, for the random variable, just as we needed
the variance in block 5 to summarize the dispersion in a data set. The mathematical
expression for the variance of a discrete random variable is as follows:
$$\sigma^2 = \sum_{i=1}^{n} (x_i - \mu)^2 \, P(x_i) = \sum_{i=1}^{n} x_i^2 \, P(x_i) - \mu^2$$
and the standard deviation is $\sigma = \sqrt{\sigma^2}$.
Example 1: If three fair coins are tossed, find the expected number of heads that will
occur and obtain the variance.
Solution:
Begin by constructing the probability distribution for the number of heads in tossing the
three coins.
The probability distribution is constructed below:
No of heads, x      0     1     2     3
Probability, P(x)   1/8   3/8   3/8   1/8
Then,
$E(x) = \sum_{i=1}^{4} x_i P(x_i) = x_1 P(x_1) + x_2 P(x_2) + x_3 P(x_3) + x_4 P(x_4)$
$= 0 \cdot \frac{1}{8} + 1 \cdot \frac{3}{8} + 2 \cdot \frac{3}{8} + 3 \cdot \frac{1}{8}$
$= 0 + \frac{3}{8} + \frac{6}{8} + \frac{3}{8} = \frac{12}{8} = \frac{3}{2} = 1.5$
The theoretical mean μ = 1.5 implies that if the experiment is repeated a large number of
times, the average number of heads per repetition will be close to 1.5.
$\sigma^2 = \sum_{i=1}^{4} (x_i - \mu)^2 P(x_i)$
$= (x_1 - \mu)^2 P(x_1) + (x_2 - \mu)^2 P(x_2) + (x_3 - \mu)^2 P(x_3) + (x_4 - \mu)^2 P(x_4)$
$= (0 - 1.5)^2 \cdot \frac{1}{8} + (1 - 1.5)^2 \cdot \frac{3}{8} + (2 - 1.5)^2 \cdot \frac{3}{8} + (3 - 1.5)^2 \cdot \frac{1}{8}$
$= 0.28125 + 0.09375 + 0.09375 + 0.28125$
$\sigma^2 = 0.75$
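The mean and variance formulas can be applied directly to the distribution of the number of heads in three tosses. The Python sketch below is illustrative only (the names `mean` and `variance` are not from the text) and uses exact fractions to avoid rounding.

```python
from fractions import Fraction


def mean(dist):
    """Expected value: sum of x * P(x) over all values x."""
    return sum(x * p for x, p in dist.items())


def variance(dist):
    """Variance: sum of (x - mu)^2 * P(x) over all values x."""
    mu = mean(dist)
    return sum((x - mu) ** 2 * p for x, p in dist.items())


# Number of heads in three tosses of a fair coin.
heads = {0: Fraction(1, 8), 1: Fraction(3, 8), 2: Fraction(3, 8), 3: Fraction(1, 8)}

mu = mean(heads)
var = variance(heads)
sd = float(var) ** 0.5

print(mu, var, round(sd, 3))
```

The same variance is obtained from the shortcut form, the sum of x² · P(x) minus μ², which here is 3 − 2.25 = 0.75.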
Example 2: One thousand tickets are sold at $1 each for a color television valued at
$350. What is the expected value if a person purchases one ticket?
Solution:
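The gain x is $349 (the prize less the ticket price) with probability 1/1000 and -$1 (the
ticket price) with probability 999/1000.
Hence,
E(x) = $349 · 1/1000 + (-$1) · 999/1000 = -$0.65
Or,
E(x) = expected winnings - $1 = $350 · 1/1000 - $1 = -$0.65
i.e. the average loss is $0.65 for each of the 1000 ticket holders.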
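The raffle calculation generalises to any prize, ticket price and number of tickets. The Python sketch below assumes, as in the example, a single prize and one ticket per person; the function and parameter names are illustrative.

```python
from fractions import Fraction


def expected_gain(prize, ticket_price, tickets_sold):
    """Expected net gain from holding one ticket in a single-prize raffle."""
    p_win = Fraction(1, tickets_sold)
    win_gain = prize - ticket_price      # net gain if the ticket wins
    lose_gain = -ticket_price            # net gain if the ticket loses
    return win_gain * p_win + lose_gain * (1 - p_win)


gain = expected_gain(prize=350, ticket_price=1, tickets_sold=1000)
print(float(gain))
```

A negative expected gain means that, on average, a ticket holder loses money.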
CYP4
Five balls numbered 0, 2, 4, 6 and 8 are placed in a bag. After the balls are mixed, one is
selected, its number is noted, and then it is replaced. If this experiment is repeated many
times,
The Binomial Probability Distribution is a discrete probability distribution that has many
applications. It is associated with a multi-step experiment that we call the Binomial
experiment, which is a probability experiment satisfying the following four requirements.
Definition:
A probability distribution showing the outcomes of a Binomial experiment along with the
corresponding probabilities is termed as a Binomial Probability Distribution.
In a Binomial experiment, the probability of exactly x successes in n trials is given by:
$$P(x) = \frac{n!}{(n-x)!\,x!} \, p^x q^{n-x}$$
Note: q = 1 - p and $0 \le x \le n$
Example 1: Consider the experiment of tossing a coin three times. Show that it is a
binomial experiment and find the probability of getting exactly two heads.
Solution:
Now, to find the probability of getting two heads, let p denotes the probability of getting
a head on a single toss.
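Then p = 1/2, q = 1 - 1/2 = 1/2, n = 3 and x = 2.
$$P(x) = \frac{n!}{(n-x)!\,x!} \, p^x q^{n-x}$$
$P(2) = \frac{3!}{(3-2)!\,2!} \left(\frac{1}{2}\right)^2 \left(\frac{1}{2}\right)^{3-2} = \frac{3!}{1!\,2!} \cdot \frac{1}{4} \cdot \frac{1}{2} = \frac{3}{8} = 0.375$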
Example 2: A new drug is effective 60% of the time. What is the probability that in a
random sample of 4 patients, it will be effective on exactly two of them?
Solution:
This is a Binomial experiment, as the requirements of a Binomial experiment are satisfied.
Define ‘effective’ as ‘success’ and ‘non effective’ as ‘failure’. Then,
p = 0.6, q = 1 - 0.6 = 0.4, n = 4, x = 2
Required: P(2) = ?
$P(2) = \frac{4!}{(4-2)!\,2!} (0.6)^2 (0.4)^2 = 6 \times 0.36 \times 0.16 = 0.3456$
Hence, the drug will be effective on two of a random sample of 4 patients with a
probability of 0.3456 (or 34.56%).
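The binomial formula can be evaluated in a few lines of Python, which is a convenient check on hand calculations like the two examples above. The function name `binomial_pmf` is an illustrative choice; `math.comb(n, x)` computes n! / ((n - x)! x!).

```python
from math import comb


def binomial_pmf(x, n, p):
    """P(exactly x successes in n independent trials with success probability p)."""
    q = 1 - p
    return comb(n, x) * p**x * q ** (n - x)


# Exactly two heads in three tosses of a fair coin.
print(binomial_pmf(2, 3, 0.5))

# Drug effective on exactly two of four patients, p = 0.6.
print(round(binomial_pmf(2, 4, 0.6), 4))
```

Summing `binomial_pmf(x, n, p)` over x = 0, 1, …, n gives 1, as required of any probability distribution.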
Under the column `n’ choose the number 4, proceed horizontally and correspond it with
x=3, then read the number that matches p=0.5, which is of course.
CYP5
A survey found that 30% of teenage consumers receive their spending money from part-
time jobs. If five teenagers are selected at random, find the probability that at least three
of them will have part-time jobs.
Mean and Variance of a Probability Distribution
Definition
The mean, variance and standard deviation of a variable that has the Binomial
distribution are found as:
Mean: $\mu = n \cdot p$
Variance: $\sigma^2 = n \cdot p \cdot q$
Standard deviation: $\sigma = \sqrt{n \cdot p \cdot q}$
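Example 1: A coin is tossed four times. Find the mean, variance and standard deviation of
the number of heads that will be obtained.
Solution:
n = 4, p = 1/2, q = 1/2
$\mu = n \cdot p = 4 \cdot \frac{1}{2} = 2$, $\sigma^2 = n \cdot p \cdot q = 4 \cdot \frac{1}{2} \cdot \frac{1}{2} = 1$, $\sigma = \sqrt{1} = 1$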
Example 2: A die is rolled 240 times. Find the mean, variance and standard deviation for
the number of 3’s that will be rolled.
Solution:
n = 240, p = 1/6, q = 5/6
$\mu = n \cdot p = 240 \cdot \frac{1}{6} = 40$
$\sigma^2 = n \cdot p \cdot q = 240 \cdot \frac{1}{6} \cdot \frac{5}{6} \approx 33.33$
$\sigma = \sqrt{33.33} \approx 5.77$
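The three binomial summary formulas can be wrapped in one small helper. The Python sketch below is illustrative (the name `binomial_summary` is an assumption) and reproduces the coin and die examples.

```python
from math import sqrt


def binomial_summary(n, p):
    """Return (mean, variance, standard deviation) of a Binomial(n, p) variable."""
    q = 1 - p
    mean = n * p
    variance = n * p * q
    return mean, variance, sqrt(variance)


coin = binomial_summary(4, 0.5)       # number of heads in four tosses
die = binomial_summary(240, 1 / 6)    # number of 3's in 240 rolls

print(coin)
print(tuple(round(value, 2) for value in die))
```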
CYP6
Calculate the mean and variance of the number of `Head’ that will appear if a fair coin is
tossed 1000 times.
A discrete probability distribution that is useful when n is large and p is small and when
the independent variables occur over a period of time is called the Poisson probability
distribution.
i) The probability of an occurrence is the same for any two intervals of equal length.
ii) The occurrence or non-occurrence in any interval is independent of the
occurrence or non-occurrence in any other interval.
Example 1: In an investigation of the safety of a dangerous intersection, past police
records indicate a mean of five accidents per month. Assume the number of accidents is
distributed according to the Poisson distribution. Find the probability that in any month
there are
a) Exactly 3 accidents.
b) Fewer than 2 accidents.
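Solution: The mean is λ = 5 accidents per month and $P(x) = \frac{\lambda^x e^{-\lambda}}{x!}$, with e ≈ 2.7183.
a) x = 3
$P(3) = \frac{5^3 \cdot (2.7183)^{-5}}{3!} = \frac{125 \times 0.00674}{6} = 0.1404$
b) $P(0) + P(1) = \frac{5^0 \cdot (2.7183)^{-5}}{0!} + \frac{5^1 \cdot (2.7183)^{-5}}{1!}$
$= 0.00674 + 0.03370$
$\approx 0.0404$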
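The Poisson probabilities for the accident example, with a mean of five accidents per month, can be computed as follows. This is a minimal sketch; the function name `poisson_pmf` and the parameter name `lam` (for the mean λ) are illustrative choices.

```python
from math import exp, factorial


def poisson_pmf(x, lam):
    """P(exactly x occurrences) for a Poisson variable with mean lam."""
    return lam**x * exp(-lam) / factorial(x)


exactly_three = poisson_pmf(3, 5)
fewer_than_two = poisson_pmf(0, 5) + poisson_pmf(1, 5)

print(round(exactly_three, 4))
print(round(fewer_than_two, 4))
```

"Fewer than 2" means 0 or 1 accidents, so the two probabilities are added.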
$P(x; \lambda) = \frac{e^{-\lambda} \lambda^x}{x!} = \frac{(2.7183)^{-0.4} \cdot (0.4)^3}{3!} = 0.00715$
Thus, there is less than a 1% probability that a given page contains exactly 3 errors.
CYP7
A sales firm receives, on average, 3 calls per hour on its toll-free number. For any
given hour, find the probability that the firm receives
a) At most 3 calls
b) At least 3 calls
c) 5 or more calls.
14.8 SUMMARY
This unit discussed the definition of a random variable as a variable whose values are
determined by the outcomes of a probability experiment, and the probability distribution
of such an experiment as a table (or formula) that lists the values of the random variable
together with their probabilities of occurrence. A random variable can be discrete or
continuous, depending on the values it assumes.
CYP1
The sample space is S = {HH, HT, TH, TT}. There are three possibilities; no tail occurs,
one tail occurs or two tails occur.
Hence, the random variable can take the value 0, 1 or 2.
CYP2
1. Continuous random variable
2. Discrete random variable
3. Continuous random variable
CYP3
Let the random variable Y denote the number of tails that appear. The possible cases are
shown below.
No of tails, Y      0     1     2     3
Probability, P(Y)   1/8   3/8   3/8   1/8
CYP4
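Number on ball, X   0     2     4     6     8
Probability, P(X)   1/5   1/5   1/5   1/5   1/5
a) $E(x) = \sum_{i=1}^{5} x_i P(x_i) = 0 \cdot \frac{1}{5} + 2 \cdot \frac{1}{5} + 4 \cdot \frac{1}{5} + 6 \cdot \frac{1}{5} + 8 \cdot \frac{1}{5} = 4$
b) $\sigma^2 = \sum_{i=1}^{5} x_i^2 P(x_i) - \mu^2 = 0^2 \cdot \frac{1}{5} + 2^2 \cdot \frac{1}{5} + 4^2 \cdot \frac{1}{5} + 6^2 \cdot \frac{1}{5} + 8^2 \cdot \frac{1}{5} - 4^2$
$= 24 - 16 = 8$
and $\sigma = \sqrt{8} \approx 2.83$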
Given p = 0.3, q = 1 - 0.3 = 0.7, n = 5 and x = 3, 4, 5
P(at least 3) = P(3) + P(4) + P(5)
= 0.1323 + 0.02835 + 0.00243
= 0.16308
So the probability that at least three of them will have part-time jobs is 0.16308.
CYP6
The experiment is Binomial with p = ½, q = 1 - ½ = ½ and n = 1000
$\mu = n \cdot p = 1000 \cdot \frac{1}{2} = 500$
$\sigma^2 = n \cdot p \cdot q = 1000 \cdot \frac{1}{2} \cdot \frac{1}{2} = 250$
CYP7
14.10 MODEL EXAMINATION QUESTIONS
2. The only information available to you regarding the probability distribution of a set
of outcomes is the following list of frequencies:
X 0 1 2 3 4 5
FREQUENCY 18 48 180 252 72 30
3. A psychologist has determined that the number of hours required to obtain the trust
of a new patient is 1, 2 or 3. Let x be a random variable indicating the time in
hours required to gain the patient’s trust. The following probability function has
been proposed.
$P(x) = \frac{x}{6}$ for x = 1, 2 or 3
a. Is this a valid probability function? Explain.
b. What is the probability that it takes exactly 2 hours to gain the patient’s trust?
c. What is the probability that it takes at least 2 hours to gain the patient’s trust?
4. For a Binomial distribution with n=6 and p=0.3, find the following probabilities
a) P(r = 5) b) P(r > 4) c) P(r < 2) d) P(r 3)
5. Find the mean and standard deviation of the Binomial distribution with
a) n = 12, p = 0.25
b) n = 25, p = 0.4
c) n = 2,250, p = 0.95
6. When a new machine is functioning properly, only 3% of the items produced are
defective. Assume that we will randomly select two parts produced on the machine and
that we are interested in the number of defective parts found.
a) Describe the conditions under which this situation would be a Binomial
experiment.
b) Draw a tree diagram showing this as a two-trial experiment.
c) How many experimental outcomes result in exactly one defect being found?
d) Compute the probabilities associated with finding no defects, exactly 1 defect,
and 2 defects.
7. At a particular university, it has been found that 20% of the students withdraw
without completing the introductory statistics course. Assume that 20 students have
registered for the course this time.
a) What is the probability that 2 or fewer will withdraw?
b) What is the probability that exactly 4 withdraw?
c) What is the probability that more than 3 will withdraw?
d) What is the expected number of withdrawals?
8. Of the next-day express mailings handled by a postal service, 85% are actually
received by the addressee 1 day after the mailing. What is the expected value and
variance for the number of 1-day deliveries in a group of 250 express mailings?
10. Airline passengers arrive randomly and independently at the passenger-screening
facility at a major international airport. The mean arrival rate is 10 passengers per minute.
a) What is the probability of no arrivals in a 1-minute period?
b) What is the probability that 3 or fewer passengers arrive in a 1-minute
period?
c) What is the probability of at least one arrival in a 1-minute period?
14.11 GLOSSARY
Discrete Random Variable: A random variable that may assume only a finite or infinite
sequence of values.
Continuous Random Variable: A random variable that may assume all values in an
interval or collection of intervals.
Probability Distribution: A description of how the probabilities are distributed over
the values the random variable can take on.
Expected Value: A measure of the mean, or central location, of a random
variable.
14.12 REFERENCES
UNIT 15: CONTINUOUS PROBABILITY DISTRIBUTION
CONTENTS
15.0 Aims and Objectives
15.1 Introduction
15.2 The Normal Probability Distribution
15.3 Area Under the Normal Curve
15.4 Applications of the Normal Distribution
15.5 Summary
15.6 Answers to Check Your Progress Questions (CYP)
15.7 Model Examination Questions
15.8 Glossary
15.9 References
The aim of this unit is to enable you to understand the normal probability distribution
and to apply it in solving problems.
15.1 INTRODUCTION
So far, we have been concerned with discrete probability distributions. In this unit, we
shall turn to cases in which the variable can take on any value within a given range and in
which the probability distribution is continuous.
There are two basic reasons why the normal distribution occupies such a prominent place
in statistics. First, it has some properties that make it applicable to a great many situations
in which it is necessary to make inferences by taking samples. Second, the normal
distribution comes close to fitting the actual observed frequency distributions of many
phenomena, including human characteristics (weights, heights and IQs).
Many continuous variables, such as height and weight, have distributions that are
bell-shaped; these are called approximately normally distributed variables. The most
important probability distribution used to describe a continuous random variable is
the normal probability distribution.
The normal probability distribution is a continuous, symmetric, bell-shaped
distribution of a variable.
The shape and position of the normal distribution curve depend on two parameters, the
mean and the standard deviation. Each normally distributed variable will have its own
normal distribution curve.
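Properties of the normal probability distribution
[Figure: the normal curve, with the areas 34.13%, 13.59% and 2.28% marked on each side of the mean; about 68% of the area lies within one standard deviation of the mean, about 95% within two and about 99.7% within three.]
The equation of the normal curve is
$f(x) = \frac{1}{\sigma\sqrt{2\pi}} e^{-\frac{(x - \mu)^2}{2\sigma^2}}$
Where $\mu$ = mean, $\sigma$ = standard deviation, $\pi \approx 3.14159$ and $e \approx 2.7183$.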
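The normal density and the "about 68%, 95% and 99.7%" areas can be checked numerically. The following is a minimal sketch using Python's standard library; the function name `normal_pdf` and the variable `within` are illustrative names, not part of this unit.

```python
import math
from statistics import NormalDist


def normal_pdf(x, mu, sigma):
    """Normal density f(x) for mean mu and standard deviation sigma."""
    coeff = 1.0 / (sigma * math.sqrt(2.0 * math.pi))
    return coeff * math.exp(-((x - mu) ** 2) / (2.0 * sigma ** 2))


# Standard normal distribution: mean 0, standard deviation 1.
std = NormalDist(mu=0.0, sigma=1.0)

# Area within k standard deviations of the mean, for k = 1, 2, 3.
within = {k: std.cdf(k) - std.cdf(-k) for k in (1, 2, 3)}

for k, area in within.items():
    print(f"within {k} standard deviation(s): {area:.4f}")
```

The areas are obtained from the cumulative distribution function, which plays the role of the printed table.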
The standard normal probability distribution
A random variable that has a normal distribution with a mean of 0 and a standard
deviation of 1 is said to have a standard normal probability distribution.
Recall that the standard score (z-score) of a value is the number of standard
deviations that value is from the mean.
All normally distributed variables can be transformed into the standard normal
distributed variable by using the formula for the standard score:
z = (value – mean) / standard deviation
Or, $z = \frac{X - \mu}{\sigma}$
CYP1
1. Write the two parameters that determine the shape and position of the normal
curve.
2. What is the total area under the normal curve?
3. Determine the area of the normal curve within the range μ - σ and μ + 3σ.
4. Find the z-score of the value 20 if the entire distribution has a mean of 10 and
the standard deviation is 3.
15.3 AREA UNDER THE NORMAL CURVE
As with other continuous random variables, probability calculations with any normal
probability distribution are made by computing areas under the graph of the probability
density function. Thus, to find the probability that a normal random variable lies within
any specific interval, we must compute the area under the normal curve over that
interval.
For the standard normal probability distribution, areas under the normal curve have been
computed and are available in tables that can be used in computing probabilities. The
normal probability distribution table is available at the end of this block.
For the solution of problems using the normal distribution, the following steps are used.
1. Draw a picture
2. Transform the given value to z-value
3. Shade the area desired
4. Read the area from the standard normal distribution table.
Example 1: Find the area under the normal curve between z = 0 and z = 2.34.
Solution:
The standard normal curve representation is shown, with the area between 0 and 2.34
shaded. From the table, the intersection of z = 2.3 with 0.04 gives 0.4904 or
49.04%, which is the required area.
Example 2: Find the area under the normal distribution curve between z = -1.93 and
z = 2.35.
Solution: The required area is the area between -1.93 and 0 plus the
area between 0 and 2.35:
Area = 0.4732 + 0.4906 = 0.9638 or 96.38%. Note that it is equivalent to say that the
probability of the z-value lying between z = -1.93 and z = 2.35 is 96.38%. This can also
be written as:
P(-1.93 < z < 2.35) = 0.9638
Example 3: Find the probability that the z-value of a normally distributed variable lies
to the left of 1.65
Solution
The probability that the z-value
lies to the left of 1.65 is equivalent to
finding the area under the standard
normal curve, which is to the left of 1.65
Hence, total area = area to the left of 0
plus area between 0 and 1.65 = P(z < 1.65)
= 0.5000 + 0.4505 = 0.9505 or 95.05%,
which is the required probability.
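The three table look-ups in Examples 1 to 3 can be reproduced with a cumulative distribution function instead of the printed table. This is a minimal sketch using Python's standard library; the helper name `area_between` is an illustrative assumption.

```python
from statistics import NormalDist

# Standard normal distribution: mean 0, standard deviation 1.
Z = NormalDist()


def area_between(z_low, z_high):
    """Area under the standard normal curve between two z-values."""
    return Z.cdf(z_high) - Z.cdf(z_low)


example1 = area_between(0, 2.34)      # area between z = 0 and z = 2.34
example2 = area_between(-1.93, 2.35)  # area between z = -1.93 and z = 2.35
example3 = Z.cdf(1.65)                # area to the left of z = 1.65

print(round(example1, 4), round(example2, 4), round(example3, 4))
```

Small differences from the table in the last decimal place are possible, because the table values are themselves rounded.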
CYP2
If a random variable x has a normal distribution with a mean of 5.6 and a standard
deviation of 1.4, find
a) P(5 < x < 6) b) P(x < 7) c) P(x > 8.4)
15.4 APPLICATIONS OF THE NORMAL DISTRIBUTION
The area under the normal curve is used to solve practical application problems such as
finding probabilities or percentages of values. In order to solve such problems you need
only transform the values of the variable into the z values and read the standard normal
distribution table.
Example 1: The scores for an IQ test are normally distributed with a mean of 100 and a
standard deviation of 15. Find the percentage of IQ scores that will fall below 112.
Solution
Step 1: Draw a figure and represent the area.
Step 2: Find the z-value corresponding to an IQ score of 112.
$z = \frac{x - \mu}{\sigma} = \frac{112 - 100}{15} = 0.8$
Step 3: From the table,
P(z < 0.8) = P(z < 0) + P(0 < z < 0.8) = 0.5000 + 0.2881 = 0.7881
Hence, 78.81% of the IQ scores fall below 112.
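Example 2: The monthly salaries of 2000 workers are normally distributed with a mean
of birr 550 and a standard deviation of birr 80. Find the number of workers whose
monthly salaries are
a) Between birr 600 and 700
b) Less than birr 700.
Solution (b): The z-value corresponding to birr 700 is
$z = \frac{700 - 550}{80} = 1.875$
(in the figure, the salaries 550, 600 and 700 correspond to z = 0, 0.625 and 1.875).
Hence, 96.99% x 2000 = 1939.8.
Approximately 1940 of the workers earn a monthly salary less than birr 700.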
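The salary example can be checked numerically. The sketch below is only illustrative: it takes the standard deviation to be birr 80, the divisor used in the z-value computation above, and the variable names are assumptions. Because it does not round z to two decimals before the look-up, its results can differ slightly from the table-based figures.

```python
from statistics import NormalDist

# Salaries: mean birr 550, standard deviation birr 80 (taken from the z computation).
salaries = NormalDist(mu=550, sigma=80)
workers = 2000

# b) proportion earning less than birr 700
p_below_700 = salaries.cdf(700)

# a) proportion earning between birr 600 and birr 700
p_600_to_700 = salaries.cdf(700) - salaries.cdf(600)

print("less than 700:", round(workers * p_below_700))
print("between 600 and 700:", round(workers * p_600_to_700))
```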
Example 3: A college desires to accept only the top 10% of all graduating seniors on
the basis of the results of a national placement test. The test has a mean of 500 and a
standard deviation of 100. Find the cut-off score for the exam.
Solution:
The area is shown.
We solve the problem backward.
We need to determine the point on the x-axis that cuts off the upper 10% of the area.
Let it be denoted by x. The area between the mean and x is then 0.5000 - 0.1000 = 0.4000.
From the table, the z-value that corresponds to the area 0.4000 is approximately 1.28.
Then, $\frac{x - 500}{100} = 1.28 \Rightarrow x = 628$
Hence the score 628 should be used as a cut-off score. Any student scoring below 628
should not be admitted.
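Working backward from an area to a score is the inverse of the table look-up. A minimal sketch using the inverse cumulative distribution function from Python's standard library is shown below; the variable names are illustrative.

```python
from statistics import NormalDist

# Placement test scores: mean 500, standard deviation 100.
scores = NormalDist(mu=500, sigma=100)

# The top 10% lies above the 90th percentile.
z_cut = NormalDist().inv_cdf(0.90)   # z-value with 90% of the area to its left
cutoff = scores.inv_cdf(0.90)        # the same point on the score scale

print(round(z_cut, 2), round(cutoff))
```

The unrounded z-value is slightly larger than the tabulated 1.28, so the unrounded cut-off is a fraction of a point above 628.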
CYP3
A standardized test has a mean of 50 and a standard deviation of 10. The scores are
normally distributed. If the test is administered to 800 students, approximately how many
will score between 48 and 62?
T – DISTRIBUTION
The variation between the sample mean and the population mean is given by
$Z = \frac{\bar{X} - \mu}{\sigma_{\bar{X}}}$, where $\sigma_{\bar{X}} = \frac{\sigma}{\sqrt{n}}$.
In large samples the population standard deviation can be approximated by the
sample standard deviation, so that $\sigma_{\bar{X}} = \frac{S}{\sqrt{n}}$. This relationship is not valid for small
samples because of wide fluctuations in the values of the sample standard deviation (s).
Based upon this variation, Gosset came up with different sets of critical scores, called
t-scores. (Gosset wrote articles under the name "Student"; hence the distribution of
t-scores is known as Student's t-distribution.) These t-scores are to be used in place of
Z-scores. The larger the sample size, the closer the value of the t-score will be to the
value of the Z-score.
The t-score distribution is useful not only when the sample size is small but also when the
population standard deviation is not known. A small sample must come from a normal or
near-normal distribution in order for a t-test to be used. The t-scores should not be used if
the small samples came from a population which is distributed in a non-normal pattern.
categories of two or more independent samples (where given contingency table), the df
would be (k – 1) (r – 1), where r-is the number of rows and k-number of columns. For
example, if a sample of 100 students were categorized as freshman, sophomores, juniors
and seniors, then there are four categories and k is 4.
The χ² test is used to test whether there is a significant difference between the observed
number of responses in each category and the expected number of responses for such
category under the assumptions of the null hypothesis. In other words, the objective is to
find how well the distribution of observed frequencies (fo) fits the distribution of expected
frequencies (fe). Hence this test is also called the goodness-of-fit test.
Example: -
Find the critical value of χ² from the table of the χ²-distribution if the level of significance
is 0.05 and the degree of freedom is 2.
Answer: χ² = 5.991
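For the special case of 2 degrees of freedom, the table value can be checked by hand, because the right-tail area of the χ² distribution then has the closed form P(χ² > x) = e^(-x/2). The sketch below relies on that closed form and therefore applies only when the degree of freedom is 2; the function name is illustrative.

```python
import math


def chi2_critical_df2(alpha):
    """Critical value x with P(chi-square > x) = alpha, valid for 2 degrees of freedom only."""
    # Solve exp(-x / 2) = alpha for x.
    return -2.0 * math.log(alpha)


critical = chi2_critical_df2(0.05)
print(round(critical, 3))
```

For other degrees of freedom there is no such simple formula, and the table (or statistical software) is needed.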
15.5 SUMMARY
This unit extended probability distributions to the case of a continuous random variable.
With a continuous probability distribution, we associate a probability density function;
the probability that the random variable x lies in an interval is the area under that
function over the interval. We have also discussed how areas under the standard normal
curve are used to solve practical problems that can be reduced to a normal
distribution.
CYP1
1. The mean and the standard deviation
2. 1 or 100%
3. Recall that 68% of the values lie within μ ± 1σ, 13.59% between μ + 1σ and μ + 2σ, and
2.28% between μ + 2σ and μ + 3σ.
Hence, 68 + 13.59 + 2.28 = 83.87% of the values lie between μ - σ and μ + 3σ, which is
equivalently the area between them.
4. Given x = 20, μ = 10, σ = 3
$Z = \frac{\text{value} - \text{mean}}{\text{SD}} = \frac{x - \mu}{\sigma} = \frac{20 - 10}{3} = \frac{10}{3} = 3.33$
CYP2
a) The Z-scores for x = 5 and x = 6 are
$Z = \frac{5 - 5.6}{1.4} = -0.43$ and $Z = \frac{6 - 5.6}{1.4} = 0.29$
p(5 < x < 6) = p(-0.43 < Z < 0.29) = p(-0.43 < Z < 0) + p(0 < Z < 0.29)
= 0.1664 + 0.1141
= 0.2805 or 28.05%
b) The Z-score for x = 7 is
$Z = \frac{7 - 5.6}{1.4} = 1$
p(x < 7) = p(Z < 1)
= p(Z < 0) + p(0 < Z < 1)
= 0.5000 + 0.3413
= 0.8413 or 84.13%
c) The Z-score for x = 8.4 is
$Z = \frac{8.4 - 5.6}{1.4} = 2$
p(x > 8.4) = p(Z > 2)
= p(Z > 0) - p(0 < Z < 2)
= 0.5000 – 0.4772
= 0.0228 or 2.28%
CYP3
The Z-scores for 48 and 62 are
$Z = \frac{48 - 50}{10} = -0.2$ and $Z = \frac{62 - 50}{10} = 1.2$
p(48 < x < 62) = p(-0.2 < Z < 1.2)
= p(-0.2 < Z < 0) + p(0 < Z < 1.2)
= 0.0793 + 0.3849
= 0.4642 or 46.42%
Hence, 46.42% x 800 ≈ 371 students scored between 48 and 62.
3. The demand for a new product is assumed to be normally distributed with μ = 200
and σ = 40. Letting x be the number of units demanded, find the following:
a) P (180 < x < 220)
b) P (x > 250)
c) P (225 < x < 250)
4. The test scores from a college admissions test are normally distributed, with a
mean of 450 and a standard deviation of 100.
a) What percentage of the people taking the test score between 400 and 500?
b) Suppose that someone receives a score of 630. What percentage of the
people taking the test score better?
c) If a particular university will not admit anyone scoring below 480, what
percentage of the persons taking the test would be acceptable to the
university?
5. Lamps used in residential area street lighting are constructed to have a mean
lifetime of 400 days with a standard deviation of 30 days. Furthermore, their
lifetimes are normally distributed. What percentage of such lamps last
a) Longer than 1 year (365days)?
b) Between 375 and 425 days?
c) Longer than 480 days?
15.8 GLOSSARY
15.9 REFERENCES
CONTENTS
16.0 Aims and Objectives
16.1 Introduction
16.2 Definition of Sample and Census Survey
16.3 Advantages and Disadvantages of Sample Survey
16.4 Summary
16.5 Glossary
16.6 Reference Books
16.1 INTRODUCTION
to the population, which is one of the characteristic features of research, needs scientific
approach of searching for facts. Therefore, sampling must be scientific.
A subset of the population selected for the study is known as sample. The group from
which the samples are selected is called Universe or Population.
Sample survey: is a procedure, which makes one able to draw inferences about the
population by observing or measuring few items.
Census survey: is a method of inquiry, which makes one able to draw inferences by
observing each item constituting the population.
Sampling refers to the method of selecting a sample from the universe. A proper
procedure is to be adopted for evaluating the sample plan in order to select representative
units of the universe. Sampling occupies a key role in the study and has acquired the
status of a technical job.
The number of units in the sample is called the sample size.
* Sample size should be neither too small nor too large, but optimum. An optimum size
fulfills the needs of efficiency, representativeness, reliability and validity.
The size of sample for a study is determined on the basis of the following factors
i- the size of the population
ii- the availability of resources
iii- the degree of accuracy
iv- the homogeneity or heterogeneity of the population
v- the nature of the study
vi- the method of sampling technique adopted
vii- the nature of respondents
If the sample is drawn using a scientific approach, the adopted sample design is good and
the sample size is adequate, the sample method has some merits over the census method.
These are:
1- Sampling saves time and money.
2- It is more convenient as it involves fewer personnel.
3- It is useful when population is infinitely large.
4- It can be more accurately supervised and data can be carefully selected.
5- It is useful for inspecting the quality of units in cases where we have to resort to
sampling, such as testing the quality of bulbs, tubes, strength of stencils, testing
explosives, etc.
Sampling method has its limitations and problems, which are:
1- It would give unreliable data if not designed and executed carefully. Samples are
like medicines. They can be harmful if taken carelessly or without knowledge of
their effect.
2- The service of skilled, trained, qualified personnel for supervision; and
sophisticated equipment and statistical techniques are required. In the absence of
these, it may not be reliable.
3- Sample survey is not useful when information is needed about each and every unit
of the population.
16.4 SUMMARY
In the field of statistical analysis, it is often not possible to take the entire population
for consideration due to time, cost and other constraints. Therefore, random samples are
taken from the population; these are analyzed properly and lead to generalizations that
are valid for the entire population. A small sample properly selected may be a true
representative of the universe, while a large sample poorly chosen may be unreliable. So
the selection of a sample should be done in such a manner that every item in the universe
has an equal chance of inclusion in the sample. Thus a good sample possesses two
characteristics, which are:
16.5 GLOSSARY
Business Statistics, Dr. J.S. Chandan, Prof. Jagjit Singh, K.K. Khanna, 1995,
Reprint 1996.
Business Statistics, Theory and Practice, C.R. Reddy, M.Com., Ph.D., 1994.
UNIT 17: TYPES OF SAMPLING TECHNIQUES
CONTENTS
17.0 Aims and Objectives
17.1 Introduction
17.2 Types of Sampling Techniques
17.3 Summary
17.4 Answers to Self-Assessment Questions
17.5 Model Exam Questions
17.6 Glossary
17.7 Reference Books
17.1 INTRODUCTION
Statistical methods are especially appropriate for handling data (information), which are
subject to variations, and for which we can observe only a fraction of the totality of
observations, which may exist. Under this situation, techniques must be devised by which
we can make inferences about the nature of the totality of the universe from the particular
observation we have.
Sampling technique refers to the method of selecting a sample from the universe
(population). It occupies a key role in a study and has acquired the status of being a
technical job. The right type of sampling technique is of paramount importance in the
execution of a sample survey in accordance with the objectives and the scope of the
inquiry. The sampling methods may broadly be classified as:
Random sampling method is a method of selection of a sample such that each item within
the population has equal chance of being selected.
In this method, there is no place for investigator’s bias in sample selection since it
depends on probability. It provides more accurate estimates in the sense of greater
precision.
Suppose population size is 100 and sample size is 10. i.e. N = 100 and n = 10. Hundred
chits would be prepared bearing the serial number of units in the universe. These chits
would be put together and shuffled thoroughly, and then ten would be drawn one by one.
The sampling units corresponding to the number on the selected chits will form a random
sample. This method gives a sample that is quite independent of the nature of the
universe. It is still commonly used in practice.
The other most practical and inexpensive method is the method of “Random Number
Tables” (RNT). If we have to select a sample of size n from a universe of size N less than
9, then the numbers in the RNT are read as single digits, 0 to 9. If N is less than 99, then
the RNT numbers are read from 00 to 99. If N is less than 999, then from 000 to 999, and
so on.
Then, select any number K from the RNT. If K ≤ N, the Kth unit is selected for the sample,
and if K > N, divide K by N and take the unit given by the remainder, the Rth unit. This
process continues till n units are selected.
Example: From 40 big enterprises in Addis Ababa, we want to study the case of only 5 of
them. Let 12, 59, 67, 81 and 97 be the numbers selected from the RNT. Then which of
the items of the population are selected for the sample.
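The selection rule above (keep K if it does not exceed N, otherwise use the remainder of K divided by N) can be sketched in a few lines. The function name `select_units` is illustrative, and the handling of a zero remainder is an assumption, since the text does not say what to do in that case.

```python
def select_units(random_numbers, population_size):
    """Map numbers read from a random number table to unit numbers 1..N."""
    selected = []
    for k in random_numbers:
        if k <= population_size:
            unit = k
        else:
            unit = k % population_size  # remainder after dividing K by N
            if unit == 0:
                # Assumption: a zero remainder is mapped to the last unit, N.
                unit = population_size
        selected.append(unit)
    return selected


# The example above: N = 40 enterprises and five numbers read from the RNT.
sample = select_units([12, 59, 67, 81, 97], 40)
print(sample)
```

The sketch does not deal with repeated units; if the same unit came up twice, a further number would have to be read from the table.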
Under this method, the whole population is divided into a number of homogeneous
groups or strata. From each of these strata, random sample of size n is selected. Thus,
stratified RS means selecting a number of random samples, one from each stratum of the
universe. It is used when each group has small variation within itself but wide variation
between the groups.
- The size of sample items which must be selected from the ith stratum is denoted by ni
and is given by
$n_i = \frac{n N_i}{N}$
Where n – Sample size
N – Population size
Ni – Size of the ith stratum
Ni               N1          N2        N3     N4         N5
Field of study   Accounting  Business  Law    Marketing  Architecture
No. of students  3000        2000      1500   2500       1000
Solution: n = 120
N1 = 3000, N2 = 2000, N3 = 1500, N4 = 2500, N5 = 1000
N = N1 + N2 + N3 + N4 + N5 = 10,000
Then, $n_1 = \frac{n N_1}{N} = \frac{120 \times 3000}{10{,}000} = 36$ and $n_2 = \frac{n N_2}{N} = \frac{12}{1000} \times 2000 = 24$
Or $n_1 = \frac{n}{N} N_1 = \frac{120}{10{,}000} \times 3000 = 36$, i.e. $\frac{n}{N} = 0.012$
$n_3 = \frac{12}{1000} \times 1500 = 18$, $n_4 = \frac{12}{1000} \times 2500 = 30$, $n_5 = \frac{12}{1000} \times 1000 = 12$
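Proportional allocation applies the same ratio n/N to every stratum. The sketch below reproduces the allocation for the five fields of study; the function name `proportional_allocation` and the dictionary layout are illustrative assumptions.

```python
def proportional_allocation(strata_sizes, sample_size):
    """Return n_i = n * N_i / N for each stratum."""
    total = sum(strata_sizes.values())  # N, the population size
    return {name: sample_size * size / total for name, size in strata_sizes.items()}


strata = {
    "Accounting": 3000,
    "Business": 2000,
    "Law": 1500,
    "Marketing": 2500,
    "Architecture": 1000,
}

allocation = proportional_allocation(strata, 120)
for field, n_i in allocation.items():
    print(field, n_i)
```

When n N_i / N is not a whole number, the stratum sample sizes have to be rounded, and the rounded values may no longer add up exactly to n.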
In this method, a random starting point is selected from the list representing the universe
and the remaining units are automatically selected in a definite sequence at an equal
spacing from one another. This method is recommended if the sample units are arranged
in systematic order such as chronological, geographical, alphabetical, etc. and also if the
sample units in the universe are uniquely identified. Systematic sampling is also called
sampling by regular intervals or sampling by fixed intervals.
- To get a systematic sample of size n from a population of size N, draw a random
number i from 1 to K, where $K = \frac{N}{n}$, and then select the items i, i + K, i + 2K, i + 3K, …
In general, the elements of the sample are the $\left(i + w\frac{N}{n}\right)$th items, where 0 ≤ w ≤ n – 1.
Or we can have an alternative method,
Ai = A1 + (i – 1) K, where A1 – the random starting point or the first sample item,
Ai – the ith item in the sample.
Example 1: - From the files of 24 cases of the federal high court, only 4 cases are to be
examined. The fifth file was selected randomly. Indicate the remaining three
elements of the sample.
Solution: - N = 24 , n = 4 , A1 = 5
$K = \frac{N}{n} = \frac{24}{4} = 6$
Then A2 = A1 + (2 – 1) K
= A1 + K = 5 + 6 = 11. The 11th file is the second element
A3 = A1 + (3 – 1) K
= A1 + 2K = 5 + 2 (6) = 17. The 17th file is the third element.
A4 = A1 + (4 – 1) K
= A1 + 3K = 5 + 3 (6) = 23. The 23rd file is the fourth element.
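The rule Ai = A1 + (i – 1) K lends itself to a short sketch. The function name `systematic_sample` is illustrative, and the sketch assumes that N is an exact multiple of n and that the starting point lies between 1 and K.

```python
def systematic_sample(population_size, sample_size, start):
    """Return the item numbers A_1, A_2, ..., A_n with A_i = A_1 + (i - 1) * K."""
    k = population_size // sample_size  # sampling interval K = N / n
    return [start + i * k for i in range(sample_size)]


# Example 1: N = 24 files, n = 4, random starting point A1 = 5.
files = systematic_sample(24, 4, 5)
print(files)
```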
Example 2 : - If the 4th and 12th elements of a systematic sample are 70 and 126 (in the
population) respectively, then which item of the population is the first element of this
systematic sample?
Solution: - A12 – A4 = (A1 + 11K) – (A1 + 3K) = 8K, so 8K = 126 – 70 = 56 and K = 7.
Then A4 = A1 + 3K
70 = A1 + 3(7)
A1 = 70 – 21 = 49
The 49th item of the population is the random starting point for the systematic sample.
In this method, the chance of including any elementary unit of the population in the
sample cannot be determined. It is simple to adopt and no complicated procedure is
needed to draw a sample.
There are many non-random sampling techniques, some of which are Judgment,
Convenient and Quota sampling.
Judgment Sampling: - The exercise of good perception and appropriate strategy are
taken into account. Samples are selected deliberately by the investigator. It is a personal
view. So it becomes satisfactory with regards to one’s research needs. For example, if a
sample of 10 students is to be singled out from a class of 50 for analyzing the habits of
students, the investigator would select ten students, who in his opinion are representative
of the class.
Convenient Sampling: - Elements of the sample are selected by taking those elements of
the population, which are readily available or convenient for the investigator.
Quota sampling: - In this technique, a quota is set up according to given criteria, but the
sample within the prescribed quota is selected by the personal judgment of the
investigator. It is suitable in market and public opinion surveys where stratification is
very difficult. However, it suffers from a lack of representativeness, as the interviewer
may select samples convenient for him with regard to location and sample unit.
It is a combination of the judgment and stratified sampling methods, so it enjoys the
merits of both.
Example: - If we ask about Canada dry for a prescribed quota of 20 households, 15
students and 10 children, then this method is quota sampling.
SAQ 1 A researcher used a random number table ranging from 000 to 999 and
selected 85, 199, 350, 740 and 960 randomly. If the total number of
observations is 120, which items should be included in the sample?
SAQ 3 If the 3rd and 5th items of a systematic sample are 21 and 37 (in
population) and if there are 8 items in the sample, then
a – Give the remaining items in the sample.
b – Find the total number of items in the population.
17.3 SUMMARY
Statisticians prefer a sample survey to a census survey because it is possible to obtain the
required accuracy, errors can be controlled effectively, follow-up in case of non-response
is easy, it is efficient when statistical results are needed urgently, and it is economical as
it covers only representative units of the universe. This is highly significant in carrying
out surveys in developing countries with budding economies, which cannot afford a
census survey due to lack of finance.
SAQ 2
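N1 = 200 (Mathematics), N2 = 360 (Statistics), N3 = 400 (Biology), N4 = 480 (Chemistry)
$\frac{N}{n} = 40$, so $\frac{n}{N} = \frac{1}{40}$; ni = ?
$n_1 = \frac{n N_1}{N} = \frac{n}{N} N_1 = \frac{1}{40} \times 200 = 5$
$n_2 = \frac{n N_2}{N} = \frac{n}{N} N_2 = \frac{1}{40} \times 360 = 9$
$n_3 = \frac{n N_3}{N} = \frac{n}{N} N_3 = \frac{1}{40} \times 400 = 10$
$n_4 = \frac{n N_4}{N} = \frac{n}{N} N_4 = \frac{1}{40} \times 480 = 12$
And the total number of students taken for the study is 36.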
SAQ 3
A3 = 21, A5 = 37, n = 8
A3 = A1 + (3 – 1) K = A1 + 2K
A5 = A1 + (5 – 1) K = A1 + 4K
A5 – A3 = (A1 + 4K) – (A1 + 2K)
37 – 21 = A1 – A1 + 4K – 2K
16 = 2K
K = 8
a) A3 = 21 = A1 + 2 (8), so A1 = 21 – 16 = 5
A2 = A1 + K = 5 + 8 = 13
A4 = A1 + 3K = 5 + 3 (8) = 29
A6 = A1 + 5K = 5 + 5 (8) = 45
A7 = A1 + 6K = 5 + 6 (8) = 53
A8 = A1 + 7K = 5 + 7 (8) = 61
b) $\frac{N}{n} = K \Rightarrow N = nK$
= 8 (8)
= 64
There are 64 items in the population.
1) In a systematic random sampling, the 10th and 15th sample elements correspond to the
indices (serial numbers) 68 and 103 respectively. Find the index for the 5th systematic
sample.
2) Discuss the difference between random and non-random sampling techniques.
3) Classify each of the following samples as random, systematic, stratified or cluster.
a. Every fifth teenager entering an amusement park is asked to select his or her
favorite ride.
b. All police officers of a small town are interviewed to determine whether they feel
the crime rate has changed over the past year.
4) Unity University College has registered 12,000 students for the last four years. The
college administration would like to know the number of students who have
participated in co-curricular activities. For the purpose of the study, the administrator
collected the names of 400 students from the files by taking proportional number of
students from each of the years (batches) for interview.
Based on the above information, find
a. The variable of interest
b. The source of data (primary or secondary)
c. The population
d. The sample
e. The sampling technique used
6) A personnel manager selected 20 workers for interview from the master list of 320
workers. He randomly selected the 4th worker.
a – What type of sampling method did he prefer?
b – What is the population size?
c – What is the sample size?
d – Find the interval or constant of coding.
e – From the master list, which workers are going to be the 5th, 12th and 19th to be
interviewed?
7) In a certain systematic sample, the sum of the 5th and 6th items is 60 and the 3rd item is
one third of the 8th item.
a ) Find the interval.
b) Find the first item in the sample.
c) If total number of items is 42, what will be the total number of items selected
for the sample?
8) A research was conducted on four weredas in Addis Ababa. Number of people for
each Wereda is given below. If sample size to population size is given in the ratio 1:10
for Wereda 2, then find
a) Total number of people taken for the sample.
Wereda         1       2       3      4
No. of people  10,000  11,000  9,000  12,000
17.6 GLOSSARY
Cluster: refers to number of things of the same kind or homogeneous, found closely
together.
Estimate: forming judgment about or approximate calculation of size, cost, etc.
Probability: refers to a mechanism, which measures and analyzes the chance of
occurrence of an uncertain event.
Proportionate: corresponding in degree, amount or Ratio.
Random: means each and every unit in the population will have an equal chance of
being selected.
Sampling Units: Sampling unit is the unit in terms of which the enumerator collects the
data.
Sampling: The process of taking sample and making inference to the population.
Sampling Frame: The listing of all units in the population under study.
Sampling Error: The difference between the results obtained from a sample study and
the results that would have been obtained from an equal complete coverage.
Non-Sampling Error: Errors that can arise even in census or complete enumeration.
They mainly arise at the stage of acquiring, recording and processing of data.
Parameters: Values obtained from a population and used to describe or summarize
population characteristics.
Statistics: Values obtained from samples and used to describe sample characteristic
(behavior).
Business Statistics, Dr. J.S. Chandan, Prof. Jagjit Singh, K.K. Khanna, 1995, Reprint
1996.
Business Statistics (A text book for B. Com. Students of Indian Universities), R.H.
DHARESHWAR, M.Sc. M. Phil. 1999.
Business Statistics (Practical) T.K. Nagpal, P.S.Narayana, 1988.
From your high school mathematics, you know that a coordinate pair (a,b) exists for every
point in the x-y coordinate plane, since every point on each axis has a real number
associated with it. Hence each point located in the plane can be associated with a unique
ordered pair of real numbers.
In this block we develop some of the basic tools used in coordinate geometry and apply
these tools to write different forms of the equation of a line and to solve systems of linear
equations graphically.
UNIT 22 DISTANCE FORMULA AND MID-POINT OF A LINE SEGMENT
CONTENTS
22.0 Aims and Objectives
22.1 Introduction
22.2 Length of a horizontal & a vertical line segment
22.3 Distance Formula
22.4 Summary
22.5 Answer to self-assessment questions (SAQ)
22.6 Model Examination questions
22.7 References
22.1 INTRODUCTION
In this unit you will learn how to find the distance between points in a coordinate plane by
considering different cases, and the mid-point of a line segment using the coordinates of
its end points. Recall that the distance between two points is the length of the segment
that connects them.
Definition:
Definition:
[Figure: the points P1(-3,1), P2(4,1), P3(1,3) and P4(1,-2) plotted in the x-y plane.]
Solution:
a) P1P2 is a horizontal line segment because the y-coordinates of P1 and P2 are
equal, and P3P4 is a vertical line segment because both P3 and P4 have the same x-coordinate.
a) P (3,4), Q (3,1)
b) P (4,3), Q (1,3)
22.3 DISTANCE FORMULA
So far we have seen distance formulas for horizontal and vertical line segments. The
basic tool in coordinate geometry is the distance between any two points, which is easily
derived using the Pythagorean theorem.
Let P1 (x1, y1) and P2 (x2, y2) be two points in a rectangular coordinate system, and refer
to the figure below.
[Figure: the right triangle with vertices P1(x1, y1), Q(x2, y1) and P2(x2, y2); the vertical side QP2 has length |y2 - y1|.]
We can see that:
$(P_1P_2)^2 = |x_2 - x_1|^2 + |y_2 - y_1|^2$
$P_1P_2 = \sqrt{(x_2 - x_1)^2 + (y_2 - y_1)^2}$
Theorem: The distance between two points P1 and P2, denoted d(P1, P2), is given by
$d(P_1, P_2) = \sqrt{(x_2 - x_1)^2 + (y_2 - y_1)^2}$ or $d = \sqrt{(x_2 - x_1)^2 + (y_2 - y_1)^2}$
Where P1 (x1, y1), P2 (x2, y2) and ‘d’ is the distance between P1 and P2
Note: The formula given above can be used to find the distance between any two points
in a coordinate plane.
Solution
Let P1 (x1, y1) = P1 (-3,6) and P2 (x2, y2) = P2 (0,2)
Then $d(P_1, P_2) = \sqrt{(x_2 - x_1)^2 + (y_2 - y_1)^2}$
$= \sqrt{(0 - (-3))^2 + (2 - 6)^2}$
$= \sqrt{25}$
= 5
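The distance formula translates directly into code. This is a minimal sketch using Python's standard library; the function name `distance` is illustrative, and points are represented as (x, y) tuples.

```python
import math


def distance(p1, p2):
    """Distance between two points (x1, y1) and (x2, y2) in the plane."""
    dx = p2[0] - p1[0]
    dy = p2[1] - p1[1]
    return math.hypot(dx, dy)  # sqrt(dx**2 + dy**2)


# The worked solution above: P1(-3, 6) and P2(0, 2).
d = distance((-3, 6), (0, 2))
print(d)
```

Swapping the two points gives the same result, because the differences are squared.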
SAQ2 Find the distance between the points a) (5,-2) and (-6,-4)
b) (0,0) and (5,6)
MIDPOINT OF A LINE SEGMENT
[Figure: the segment joining P1(x1, y1) and P2(x2, y2), with its mid-point $M\left(\frac{x_1 + x_2}{2}, \frac{y_1 + y_2}{2}\right)$.]
Example1: Find the coordinates of the mid point of AB for A (7,1) , B (-3,5)
Solution: A (7 , 1) , B (-3 , 5). Letting (x1, y1) = (7, 1) and (x2, y2) = (-3, 5),
$M = \left(\frac{7 + (-3)}{2}, \frac{1 + 5}{2}\right)$
= (2,3)
Therefore the mid-point of AB has the coordinates (2,3)
Example2: M is the mid point of CD. Find the coordinates of D for C (-5,4) and M (-2,1).
Solution: Let C have coordinates (x1, y1) and D have coordinates (x2, y2).
$M\left(\frac{x_1 + x_2}{2}, \frac{y_1 + y_2}{2}\right) = (-2, 1)$
$\frac{-5 + x_2}{2} = -2$ and $\frac{4 + y_2}{2} = 1$
-5 + x2 = -4 and 4 + y2 = 2
x2 = 1 and y2 = -2
Therefore D has the coordinates (1, -2).
Example:3
ΔABC has vertices A (-4,-3) , B (4,-1) and C (-2,3). Find the length of the median CM,
where M is the mid-point of AB.
Solution:
To get the length of the median CM, we need both the mid-point theorem and the
distance formula.
First find the coordinates of M, using the mid-point formula.
Let M(x, y) be the mid-point of AB, where
$x = \frac{-4 + 4}{2} = 0$ and $y = \frac{-3 + (-1)}{2} = -2$
Therefore M(x, y) = (0, -2).
Then find d(C, M), using the distance formula:
$d(C, M) = \sqrt{(x_2 - x_1)^2 + (y_2 - y_1)^2}$
$= \sqrt{(0 - (-2))^2 + (-2 - 3)^2}$
$= \sqrt{4 + 25}$
$= \sqrt{29}$
Therefore, the length of the median CM is $\sqrt{29}$ units.
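The median example combines the mid-point formula with the distance formula. A minimal sketch is shown below; the function name `midpoint` is illustrative, and `math.dist` (available from Python 3.8) computes the distance between two points.

```python
import math


def midpoint(p, q):
    """Mid-point of the segment joining p = (x1, y1) and q = (x2, y2)."""
    return ((p[0] + q[0]) / 2, (p[1] + q[1]) / 2)


A, B, C = (-4, -3), (4, -1), (-2, 3)

M = midpoint(A, B)               # mid-point of AB
median_length = math.dist(C, M)  # length of the median CM

print(M, median_length)
```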
SAQ3 Find the coordinates of the mid-point of AB
a) A (5,0) , B(-4,1) b) A (-3,1) , B (8,-5)
SAQ4 Show that AC and BD have the same mid point for
A (-3,-5) , B (2,-3) , C (3,5) , and D (-2,3)
22.4 SUMMARY
In this unit, we have discussed the distance and mid-point formulas. The distance
between any two points can be computed by using the distance formula. The mid-point is
a unique point on the line segment that is equidistant from the two end points. The
coordinate geometry.
SAQ1: a) PQ = 3 b) PQ = 3
SAQ2: a) $\sqrt{125} = 5\sqrt{5}$ b) $\sqrt{61}$
SAQ3: a) (½,½)
b) (5/2,-2)
SAQ4: The mid-point of AC is (0, 0). The mid-point of BD is (0, 0).
Therefore, they have the same mid-point.
3. Find the coordinates of the mid-point of CD, if:
a) C (3,8) , D (-5,2)
b) C (3,7) , D (-3,-7)
c) C (-2,6) , D (5,-5)
4. M is the mid point of AB. Find the coordinates of B, if:
a) M (-3,-1) , A (7,5)
b) M (5,0) , A (-8,2)
c) M (-6,1) , A (10,-3)
5. In ΔABC, M is the midpoint of AB. Show that AM = MB = MC for A (7,1) , B (1,-7)
and C (1,1).
22.7 REFERENCES
UNIT 23: EQUATIONS OF A STRAIGHT LINE
CONTENTS
23.0: Aims & Objectives
23.1: Introduction
23.2: Two-points form of equation of a line
23.3: Point-slope form of equation of a line
23.4: Slope-intercept form of equation of a line
23.5: Intercepts form of equation of a line
23.6: General form of equation of a line
23.7: Summary
23.8: Answer to SAQ
23.9: Model examination
23.10 References
The aim of this unit is to let you be well accustomed to forming the different forms of
equations of a line and generalize it to a more compact form.
-draw a line, given the coordinate of one point and the slope of the line, & then write its
equation
-write an equation of a line in standard form, given the coordinate of two points on the
line.
23.1 INTRODUCTION
In this unit we investigate some standard equations whose graphs are straight lines. The
concept of the slope of a line is the key point here; it helps us to relate the points of a
straight line.
Slope of a line
If we take two points P1 (x1,y1) and P2 (x2,y2) on a line, then the ratio of
the change in y to the change in x as we move from point P1 to P2 is
called the slope of the line, i.e. the slope of a line is the measure of the
“steepness” of the line.
Definition
If a line passes through two distinct points P1 (x1,y1) and P2 (x2,y2), then
its slope, usually denoted by m, is given by the formula
$m = \frac{y_2 - y_1}{x_2 - x_1}$, $x_1 \neq x_2$
= Vertical Change / Horizontal Change
[Figure: the line through P1(x1, y1) and P2(x2, y2), with the horizontal change x2 - x1 and the vertical change y2 - y1 marked at the point (x2, y1).]
If a line passes through P1 (x1,y1) and P2 (x2,y2), then the equation of the
line is given by the formula
$\frac{y - y_1}{x - x_1} = \frac{y_2 - y_1}{x_2 - x_1} = m$, where (x, y) is any point on the line other than P1 and P2.
Hence the equation $\frac{y - y_1}{x - x_1} = \frac{y_2 - y_1}{x_2 - x_1}$ is the two-points form of the equation of a straight line.
Example: Find the slope and equation of a line that passes through the points A(3,4) and
B (-5,6).
Solution: A (3, 4) , B (-5, 6). Let (x1, y1) = (3, 4) and (x2, y2) = (-5, 6).
\[ \frac{y - 4}{x - 3} = \frac{6 - 4}{-5 - 3} = \frac{2}{-8} \]
so the slope is m = -¼. Cross-multiplying gives
-8 (y-4) = 2 (x-3)
-8y + 32 = 2x – 6
-8y = 2x – 38
y = -¼x + 19/4
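The two-points form can also be checked numerically. The following is a minimal sketch, not part of the original module: the helper name `line_through` is hypothetical, and exact fractions are used so that no rounding occurs. It takes the points A (3,4) and B (-5,6) from the example and returns the slope and the y-intercept of the line.

```python
from fractions import Fraction


def line_through(p1, p2):
    """Return (slope, y_intercept) of the non-vertical line through p1 and p2.

    Hypothetical helper for illustration; p1 and p2 are (x, y) pairs of integers.
    """
    (x1, y1), (x2, y2) = p1, p2
    if x1 == x2:
        raise ValueError("vertical line: the slope is undefined")
    m = Fraction(y2 - y1, x2 - x1)   # slope = vertical change / horizontal change
    b = y1 - m * x1                  # from y - y1 = m (x - x1) with x = 0
    return m, b


slope, intercept = line_through((3, 4), (-5, 6))
print(slope, intercept)

# Both given points must satisfy y = m x + b.
for x, y in [(3, 4), (-5, 6)]:
    assert slope * x + intercept == y
```

Substituting the two given points back into y = mx + b, as the final loop does, is a quick way to confirm a hand computation.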
SAQ1 Find the equation of the line that passes through the points
a) (0,1) and (6,-2)
b) (-2,-3) and (2,-6)
Note that P (x,y) is any variable point on the line other than P1, and P1 (x1,y1) is a fixed point.
[Figure: a line through the fixed point P1 (x1,y1) and the variable point P (x,y)]
Example
Write the equation of a line that passes through (2,3) having slope 2/3.
SAQ2
Write the equation of a line that passes through (1,-1) having a slope of -½.
If a line has slope m and y intercept b, then an equation of the line is given by y=mx+b
Example1: The equation of a line is –2x – 6y = 12. Find the slope and the y-intercept of the line.
Solution: First write the given equation in slope-intercept form.
-6y = 2x + 12
y = -1/3 x – 2
Therefore, m = -1/3 and the Y-intercept is –2.
Note: The Y-intercept of a line is the y-coordinate of the point where the line intersects
the
Y-axis.
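Example2: Write the equation of the line in slope-intercept form that passes through the points (3,0) and (5,-4).
Solution: The slope is m = (-4 – 0)/(5 – 3) = -2. Substituting the point (3,0) into y = mx + b gives 0 = -2(3) + b, so b = 6.
Therefore, the equation of the line with slope m and Y-intercept b is given by
Y = mx + b. Since m = -2 and b = 6, it gives
Y = -2x + 6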
SAQ3
Write the equation of the line with slope m and Y-intercept b.
a) m = -3, b = 4 b) m = 7, b = -2 c) m = 0 , b = -1/2
If a line ℓ has X-intercept (a,0) and Y-intercept (0,b), where neither a nor b is zero, then the equation of line ℓ is given by:
\[ \frac{x}{a} + \frac{y}{b} = 1, \qquad a, b \neq 0 \]
Example: The x-intercept and the y-intercept of the line ℓ are 2 and 3 respectively. Write the equation of the line in intercept form.
Solution:
x-intercept = a = 2 and y-intercept = b = 3. Substituting into x/a + y/b = 1:
\[ \frac{x}{2} + \frac{y}{3} = 1 \]
Therefore, this is the equation of the line.
SAQ4
Indicate the slope, x-intercept and y-intercept, and write the equation in the intercepts form.
a) y = -3/5x + 4 b) 4x – 3y = 24
Solution: -2 (y + 3) = x-5
-2y – 6 = x-5
-2y – x = 1
-2y – x – 1 = 0 or 2y + x + 1 = 0
SAQ5
Write 3(y-1) = 2x + 4 in standard form (general form)
23.7 SUMMARY
The equation of a line is a first-degree equation that relates the coordinates of any point on the line. The graph of any equation that can be written in the form Ax + By + C = 0,
where A, B, C ∈ R with A and B not both zero, is a line. An equation of the line through
SAQ1 a) 2y = 2 – x b) 4y = -3 (x + 6)
SAQ2 2y = - (x + 1)
SAQ4 a) ∴ The equation:
\[ \frac{x}{20/3} + \frac{y}{4} = 1 \]
b) m = 4/3, b = -8, x-intercept 6
∴ The equation:
\[ \frac{x}{6} - \frac{y}{8} = 1 \]
SAQ5 3y – 2x – 7 = 0 or
2x –3y + 7 = 0
1. For each line whose equation is given, find the slope and coordinate of any one
point on the line.
a) y – 6 = 2 (x-5) b) y – 7/3 = (x + ¾) c) y = -x
23.10 REFERENCES
UNIT 24 PERPENDICULAR AND PARALLEL LINES
CONTENTS
24.0 Aims and Objectives
24.1 Introduction
24.2 Parallel and Perpendicular lines
24.3 Summary
24.4 Answer to SAQ
24.5 Model Examination
24.6 References
The aim of this unit is to let you differentiate parallel and perpendicular lines.
24.1 INTRODUCTION
From geometry course, we know that two vertical lines are parallel to each other and that
a horizontal line and vertical lines are perpendicular to each other. In this unit you will
see some technique that can help you to see when two non-vertical lines are parallel and
Theorem: Given two non-vertical lines ℓ1 and ℓ2 with slopes m1 and m2, respectively, then
ℓ1 // ℓ2 if and only if m1 = m2
ℓ1 ⊥ ℓ2 if and only if m1 · m2 = -1
where // means parallel to and ⊥ means perpendicular to.
Example: Given a line ℓ: 2x –y = 2 and the point P (1,2), find an equation of the line ℓ1
through P that is a) Parallel to ℓ b) perpendicular to ℓ
y = 2x –2
m=2
a) The slope of the line ℓ1 parallel to ℓ is the same as that of ℓ. Hence, slope of ℓ1 = 2 = slope of ℓ.
For ℓ1, m = 2 and P (1,2) is on ℓ1 , then let ( x1, y1) = (1, 2). Hence
ℓ1 : y-y1 = m (x-x1)
ℓ1: y – 2 = 2(x –1)
ℓ1: y = 2x –2 + 2
ℓ1: y= 2x
b) ℓ ⊥ ℓ1
(Slope of ℓ) · (Slope of ℓ1) = -1
Slope of ℓ1 = -1/(slope of ℓ) = -1/2. Let m1 be the slope of ℓ1.
Therefore m1 = -1/2 and P (1,2) is on ℓ1. Take (x1, y1) = (1, 2). Hence the equation of ℓ1 is given by:
y – y1 = m1 (x – x1)
y – 2 = -1/2 (x – 1)
y – 2 = -x/2 + ½
y = -x/2 + 5/2
2y = -x + 5, i.e. y = -½ x + 5/2
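The parallel and perpendicular cases of the example can be reproduced with a short sketch. The function names below are hypothetical and chosen only for illustration. Given the slope of ℓ (here 2, read off from y = 2x – 2) and the point P (1,2), it returns the slope and y-intercept of the parallel line and of the perpendicular line through P.

```python
from fractions import Fraction


def line_through_point(m, point):
    """Slope-intercept pair (m, b) of the line with slope m through the point."""
    x1, y1 = point
    return m, y1 - m * x1            # y - y1 = m (x - x1)  =>  b = y1 - m x1


def parallel_and_perpendicular(m, point):
    """Lines through `point` parallel and perpendicular to a line of slope m.

    Assumes the given line is neither vertical nor horizontal (m != 0),
    so that the perpendicular slope -1/m exists.
    """
    if m == 0:
        raise ValueError("a horizontal line has a vertical perpendicular")
    parallel = line_through_point(m, point)            # same slope
    perpendicular = line_through_point(-1 / m, point)  # slopes multiply to -1
    return parallel, perpendicular


parallel, perpendicular = parallel_and_perpendicular(Fraction(2), (1, 2))
print("parallel:      y = %sx + %s" % parallel)
print("perpendicular: y = %sx + %s" % perpendicular)
```

The same two calls, with slope -2 and the point (2, -3), can be used to check an answer to SAQ1.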
SAQ1 Given a line L with equation 4x + 2y = 3 and the point P (2, -3), find an
equation of a line through P that is
a) Parallel to L b) Perpendicular to L
Write the final answers in the slope-intercept form, i.e. y = mx + b
24.3 SUMMARY
One of the fundamental relationships that exists between straight lines is the relationship
of being parallel to or perpendicular to each other. In this unit we have described the
relation by the help of their slope.
SAQ1: a) y = -2x + 1
b) y = X/2 - 4
24.5 MODEL EXAMINATION
1.Give the slope of a line parallel to the line with the given equation. Then give the
slope of a line perpendicular to the line with the given equation.
a) y = 3x –1 b) y = -2x + 4 c) y = x + 3
2. Use slope to show that PR ⊥ SQ, if
P (2,-1) , Q (5,3) , R (1,6) , S (-2,2)
3. Write an equation in slope-intercept form of the line passing through the given
point and parallel to the line whose equation is given by
a) (5,-2) ; y = -6x + 1 b) (0,6) ; 2x + 4y = 10
4. Write an equation in slope-intercept form of the line passing through the given
point and perpendicular to the line whose equation is given
a) (-3,1) ; y = 2/3 x – 4 b) (-4,-5) ; 3x + 2y = -7
24.6 REFERENCES
UNIT 25: SYSTEM OF LINEAR EQUATIONS
CONTENTS
The aim of this unit is to let you see the different possible cases that arise when solving two linear
equations in two variables (unknowns).
25.1 INTRODUCTION
In this chapter we review how system of linear equation are solved algebraically and
equation.
or
2. No solution
or
3. Infinitely many solutions
There are no other possibilities.
Solution by Graphing
We first graph both equations in the same rectangular coordinate system. Then the
coordinates of any points that the graph have in common must be solution to the system,
since they must satisfy both equations.
[Figure: graphs of 2x + 3y = 18 and x + y = 7, intersecting at the point (3,4)]
Therefore, T.S = {(3,4)}, i.e. x = 3 and y = 4.
Solution by substitution
We solve the system in the next example using substitution method.
Solution:
Step 1: Solve either equation for one variable in terms of the other.
3x – y = 7
-y = -3x + 7
y = 3x –7
Step 2: Substitute the expression obtained in step 1 into the other equation in the
system and solve the resulting linear equation in one variable.
2x – 3y = 7
2x – 3(3x –7) = 7
2x – 9x + 21 = 7
-7x = -14
:- x=2
Step 3: Substitute the value of the variable determined in Step 2 into any one of the two
equations. Since 3x – y = 7 gives y = 3x – 7, substituting x = 2 into y = 3x – 7 we get
y = 3(2) – 7
y = -1
SAQ2
Solve by substitution,
3x – 4y = 18
2x + y = 1
In this method, we multiply the equation by appropriate numbers so that when we add the
two equations, one of the two variables may be eliminated and get a linear equation in
one variable, solve for that variable and substitute the result in any one of the two
equation, to solve for the second variable.
3x – 2y = 8
2x + 5y = -1
15x – 10y = 40
4x + 10y = -2
19x = 38
x=2
2(2) + 5y = -1
5y = -1 – 4
y = -1
T.S = {(2,-1)}
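Elimination by addition can be sketched in a few lines. This is an illustrative sketch only: the function name `solve_2x2` is hypothetical, the coefficients are assumed to be integers, and each equation is assumed to have at least one non-zero coefficient. It also reports the two special cases (no solution, infinitely many solutions) that arise when the variables cannot be eliminated separately.

```python
from fractions import Fraction


def solve_2x2(a1, b1, c1, a2, b2, c2):
    """Solve a1*x + b1*y = c1 and a2*x + b2*y = c2 by elimination.

    Multiplying the first equation by b2 and the second by b1 and
    subtracting eliminates y:  (a1*b2 - a2*b1) * x = c1*b2 - c2*b1.
    Eliminating x in the same way gives (a1*b2 - a2*b1) * y = a1*c2 - a2*c1.
    """
    d = a1 * b2 - a2 * b1
    if d == 0:
        # Left-hand sides are proportional: the lines are parallel or identical.
        if a1 * c2 == a2 * c1 and b1 * c2 == b2 * c1:
            return "infinitely many solutions"
        return "no solution"
    x = Fraction(c1 * b2 - c2 * b1, d)
    y = Fraction(a1 * c2 - a2 * c1, d)
    return x, y


# The worked example: 3x - 2y = 8 and 2x + 5y = -1
print(solve_2x2(3, -2, 8, 2, 5, -1))
# Parallel lines and identical lines
print(solve_2x2(1, 1, 1, 1, 1, 2))
print(solve_2x2(1, 1, 1, 2, 2, 2))
```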
SAQ3
Solve using elimination by addition:
6x + 3y = 3
5x + 4y = 7
25.3 SUMMARY
So far we have discussed different methods of solving systems of linear equations in two
variables. The techniques we discussed can be applied to a system of n linear equations
in n variables, but this may be time consuming. So we need another technique, which is
[Figure: graphs of the lines X – Y = 3 and X + 2Y = -3]
2. Solve by substitution
a) 2x – y = 3
   x + 2y = 14
b) 2x + y = 6
   x – y = -3
25.6 REFERENCES
This block is about matrices and determinants. The concepts of matrices and
determinants are useful for solving different problems which can be reduced to systems of
linear equations.
The first unit is about matrices (singular: matrix). A matrix is a rectangular array of
numbers. The second unit is about determinants. In that unit we are
going to associate with each square matrix a real number, called the determinant of the
matrix.
UNIT 26 MATRICES
CONTENTS:
26.1 INTRODUCTION
In this unit we discuss matrices. We define and study, some algebraic operations on
matrices, including addition, subtraction, multiplication, and transpose.
Matrices are a relatively new concept in mathematics. They were not devised until 1857
when the British mathematician Arthur Cayley (1821 – 1895) began to use them in the
26.2 DEFINITION OF MATRIX
A capital letter is generally used to name a matrix and lower case letters with double
subscripts generally denote its entries. A matrix A can be written as:
A = (aij)m×n, where the notation aij indicates the entry in row i and column j.
In general, we can write a matrix as:
CYP1 For matrix A in the above example the 2nd and 3rd columns are _______ and
_____respectively.
Remark:
26.3 TYPES OF MATRICES
i) Row Matrix: A matrix, which has exactly one row, is called a row matrix.
Example: (5 9 6 2) is a row matrix, but
ii.) Column Matrix: A matrix which has exactly one column is called a column matrix.
Example:
\[ \begin{pmatrix} 3 \\ 2 \\ 1 \end{pmatrix} \quad \text{and} \quad \begin{pmatrix} 3 \\ 2 \end{pmatrix} \]
are examples of column matrices.
iii) Square Matrix: A matrix whose number of columns and number of rows are equal is called a
square matrix.
0 0 0 1 2 3 4
1 2
Example: (2) , , 0 0 0 , 5 6 7 8
3 2 0 0 0 0 2 1 3
are examples of square matrices.
\[ A = \begin{pmatrix} 1 & 2 & 3 & 4 \\ 5 & 6 & 7 & 8 \\ 8 & 0 & 2 & 1 \end{pmatrix} \]
This matrix (matrix A) is an m×n = 3×4 matrix. Since 4 ≠ 3, matrix A is
not a square matrix.
iv) Null or zero matrix: A matrix each of whose elements is zero is called a null
matrix or zero matrix.
0 0 0 0
Example: ( 0 ) , , ( 0 0 ) , are examples of null or zero matrices
0 0 0 0
v) Diagonal Matrix: A square matrix whose every element (entry) other than the main diagonal elements is zero is called a
diagonal matrix. (The main diagonal of a square matrix runs from the upper left to the lower right.)
Example:
\[ \begin{pmatrix} 3 & 0 \\ 0 & 1 \end{pmatrix}, \quad \begin{pmatrix} 1 & 0 & 0 \\ 0 & 5 & 0 \\ 0 & 0 & 2 \end{pmatrix} \]
are examples of diagonal matrices.
Note:
Example:
\[ A = \begin{pmatrix} 0 & 0 & 0 \\ 0 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix} \]
is a diagonal matrix. Clearly A is a square matrix, having 3 rows and 3 columns, and
a12 = 0 , a13 = 0 , a21 = 0 , a23 = 0 , a31 = 0 , a32 = 0
CYP2 Let
\[ A = \begin{pmatrix} 0 & 2 \\ 3 & 0 \end{pmatrix} \]
Is A a diagonal matrix? (Why?)
[Hint: If aij = 0 for i ≠ j, then the matrix is a diagonal matrix]
vi) Scalar Matrix: A diagonal matrix, whose diagonal elements are equal, is called
scalar matrix.
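Example:
\[ \begin{pmatrix} 0 & 0 \\ 0 & 0 \end{pmatrix}, \quad \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix}, \quad \begin{pmatrix} 3 & 0 & 0 & 0 \\ 0 & 3 & 0 & 0 \\ 0 & 0 & 3 & 0 \\ 0 & 0 & 0 & 3 \end{pmatrix} \]
are examples of scalar matrices.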
vii) Identity Matrix: A diagonal matrix whose diagonal elements are all equal to
1 (unity) is called identity matrix(or unit matrix), and it is denoted by I
Example:
\[ \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}, \quad \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix} \]
are examples of identity matrices.
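viii) Triangular matrix: A square matrix whose elements aij = 0 whenever i<j is called
a lower triangular matrix.
Similarly, a square matrix whose elements aij = 0 whenever i>j is called an upper
triangular matrix.
Example:
\[ \begin{pmatrix} 2 & 0 \\ 3 & 0 \end{pmatrix}, \quad \begin{pmatrix} 1 & 0 & 0 \\ 4 & 5 & 0 \\ 6 & 8 & 9 \end{pmatrix} \]
are lower triangular matrices, and
\[ \begin{pmatrix} 1 & 2 & 3 & 4 \\ 0 & 5 & 4 & 6 \\ 0 & 0 & 8 & 2 \\ 0 & 0 & 0 & 7 \end{pmatrix}, \quad \begin{pmatrix} 1 & 2 \\ 0 & 3 \end{pmatrix} \]
are upper triangular matrices.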
6
3 a b
D) a. E) ( 2 3 6 -1 ) F)
0 e f
2
0 0 0 0 1 0 0 0
0 0 0 0 0 1 0 0
G) H)
0 0 0 0 0 0 1 0
0 0 0 0 1
0 0 0
2) List the matrices in (1) above that can be described as follows.
i. A row matrix iv. A zero matrix
ii. A square matrix v. A diagonal matrix
iii. A column matrix vi. An identity matrix
Definition:
Two matrices are equal if and only if they have the same dimensions and the elements in
all corresponding positions are equal.
2 3 5 / 2 2 3 5 / 2
Example 1. but
3 3/ 3 1 3 1 1
4 9 4 6
9 2 9 2
Example 2. Find the value of each variable if
\[ \begin{pmatrix} x + 3 & 1 \\ 2 & 6 \end{pmatrix} = \begin{pmatrix} 4 & 1 \\ 2 & 3y \end{pmatrix} \]
Solution: Since the matrices are equal, elements in corresponding positions are equal.
x + 3 = 4 and 3y = 6
Therefore, x = 1 and y = 2
Note: A zero matrix with order (dimension) m×n can be denoted by O_{m×n}; for example
\[ O_{2 \times 3} = \begin{pmatrix} 0 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix} \]
x y x 2 3 y 10 z
A) O22 = B) O23 =
z o 0 0 0
4 x 3 y z 1 3 6 3x y 4
C) D)
19 1 5 0 19 1 5 0 1x 3 y 2
26.4 OPERATIONS OF MATRICES
Definition:
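Solution: Each pair of matrices has the same order. Add the corresponding entries.
A)
\[ A + B = \begin{pmatrix} -5 & 0 \\ 4 & 1/2 \end{pmatrix} + \begin{pmatrix} 6 & -3 \\ 2 & 3 \end{pmatrix} = \begin{pmatrix} -5 + 6 & 0 + (-3) \\ 4 + 2 & 1/2 + 3 \end{pmatrix} = \begin{pmatrix} 1 & -3 \\ 6 & 7/2 \end{pmatrix} \]
B)
\[ A + B = \begin{pmatrix} 1 & 3 \\ -1 & 5 \\ 6 & 0 \end{pmatrix} + \begin{pmatrix} -1 & -2 \\ 1 & -2 \\ -3 & 1 \end{pmatrix} = \begin{pmatrix} 1 + (-1) & 3 + (-2) \\ -1 + 1 & 5 + (-2) \\ 6 + (-3) & 0 + 1 \end{pmatrix} = \begin{pmatrix} 0 & 1 \\ 0 & 3 \\ 3 & 1 \end{pmatrix} \]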
5 6 4
B) C = , D =
3 4 1
Solution:
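A) We subtract corresponding entries (elements)
\[ C - D = \begin{pmatrix} 1 & 2 \\ -2 & 0 \\ -3 & -1 \end{pmatrix} - \begin{pmatrix} 1 & -1 \\ 1 & 3 \\ 2 & 3 \end{pmatrix} = \begin{pmatrix} 1 - 1 & 2 - (-1) \\ -2 - 1 & 0 - 3 \\ -3 - 2 & -1 - 3 \end{pmatrix} = \begin{pmatrix} 0 & 3 \\ -3 & -3 \\ -5 & -4 \end{pmatrix} \]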
B) The matrices do not have the same order, so we cannot subtract.
The product of two matrices A and B AB is defined only when the number of columns of
A is the same as the number of rows in B.
Definition:
Dot Products
The dot product of a 1×n row matrix and an n×1 column matrix is a real number given
by:
\[ (a_1 \; a_2 \; \dots \; a_n) \cdot \begin{pmatrix} b_1 \\ b_2 \\ \vdots \\ b_n \end{pmatrix} = a_1 b_1 + a_2 b_2 + \dots + a_n b_n \]
Remark: the dot between the two matrices is important. If the dot is omitted, the
multiplication is of another type, which we consider later.
Example:
\[ (3 \; 2 \; 1) \cdot \begin{pmatrix} 5 \\ 1 \\ 4 \end{pmatrix} = 3 \cdot 5 + 2 \cdot 1 + 1 \cdot 4 = 15 + 2 + 4 = 21 \]
Definition:
The product of two matrices A and B AB is defined only on the assumption that the
B is a
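Example: For
\[ A = \begin{pmatrix} 3 & 1 & -1 \\ 2 & 0 & 3 \end{pmatrix}, \quad B = \begin{pmatrix} 1 & 6 \\ 3 & -5 \\ -2 & 4 \end{pmatrix} \quad \text{and} \quad C = \begin{pmatrix} 4 & -6 \\ 1 & 2 \end{pmatrix} \]
Find each of the following
A) AB B) BA C) BC D) AC
A) A is a 2×3 matrix and B is a 3×2 matrix, so AB will be a 2×2 matrix
\[ AB = \begin{pmatrix} 3 & 1 & -1 \\ 2 & 0 & 3 \end{pmatrix} \begin{pmatrix} 1 & 6 \\ 3 & -5 \\ -2 & 4 \end{pmatrix} = \begin{pmatrix} 3 \cdot 1 + 1 \cdot 3 + (-1)(-2) & 3 \cdot 6 + 1 \cdot (-5) + (-1) \cdot 4 \\ 2 \cdot 1 + 0 \cdot 3 + 3 \cdot (-2) & 2 \cdot 6 + 0 \cdot (-5) + 3 \cdot 4 \end{pmatrix} \]
\[ = \begin{pmatrix} 3 + 3 + 2 & 18 - 5 - 4 \\ 2 + 0 - 6 & 12 + 0 + 12 \end{pmatrix} = \begin{pmatrix} 8 & 9 \\ -4 & 24 \end{pmatrix} \]
B) B is a 3×2 matrix and A is a 2×3 matrix, so BA will be a 3×3 matrix
\[ BA = \begin{pmatrix} 1 & 6 \\ 3 & -5 \\ -2 & 4 \end{pmatrix} \begin{pmatrix} 3 & 1 & -1 \\ 2 & 0 & 3 \end{pmatrix} = \begin{pmatrix} 1 \cdot 3 + 6 \cdot 2 & 1 \cdot 1 + 6 \cdot 0 & 1 \cdot (-1) + 6 \cdot 3 \\ 3 \cdot 3 + (-5) \cdot 2 & 3 \cdot 1 + (-5) \cdot 0 & 3 \cdot (-1) + (-5) \cdot 3 \\ -2 \cdot 3 + 4 \cdot 2 & -2 \cdot 1 + 4 \cdot 0 & (-2)(-1) + 4 \cdot 3 \end{pmatrix} \]
\[ = \begin{pmatrix} 15 & 1 & 17 \\ -1 & 3 & -18 \\ 2 & -2 & 14 \end{pmatrix} \]
C) B is a 3×2 matrix and C is a 2×2 matrix, so BC will be a 3×2 matrix
\[ BC = \begin{pmatrix} 1 & 6 \\ 3 & -5 \\ -2 & 4 \end{pmatrix} \begin{pmatrix} 4 & -6 \\ 1 & 2 \end{pmatrix} = \begin{pmatrix} 1 \cdot 4 + 6 \cdot 1 & 1 \cdot (-6) + 6 \cdot 2 \\ 3 \cdot 4 + (-5) \cdot 1 & 3 \cdot (-6) + (-5) \cdot 2 \\ -2 \cdot 4 + 4 \cdot 1 & (-2)(-6) + 4 \cdot 2 \end{pmatrix} = \begin{pmatrix} 10 & 6 \\ 7 & -28 \\ -4 & 20 \end{pmatrix} \]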
D) The product AC is not defined because the number of columns of A which is 3 is not
equal to the number of rows of C which is 2.
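The row-by-column rule can be sketched directly with nested lists. The function name `matmul` is hypothetical, and the entries of A, B and C below are assumptions: they are the matrices of the example with the signs that make the worked products consistent. The sketch also shows that AC is rejected because the number of columns of A differs from the number of rows of C.

```python
def matmul(A, B):
    """Product of two matrices given as lists of rows.

    Entry (i, j) of the result is the dot product of row i of A
    with column j of B; this needs len(A[0]) == len(B).
    """
    if len(A[0]) != len(B):
        raise ValueError("columns of the first matrix must equal rows of the second")
    columns_of_B = list(zip(*B))
    return [[sum(a * b for a, b in zip(row, col)) for col in columns_of_B]
            for row in A]


A = [[3, 1, -1],
     [2, 0, 3]]
B = [[1, 6],
     [3, -5],
     [-2, 4]]
C = [[4, -6],
     [1, 2]]

print(matmul(A, B))   # a 2x2 matrix
print(matmul(B, A))   # a 3x3 matrix, so AB and BA differ even in size
print(matmul(B, C))   # a 3x2 matrix

try:
    matmul(A, C)      # A has 3 columns but C has only 2 rows
except ValueError as err:
    print("AC is not defined:", err)
```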
Note
1. If A is a square matrix, then A can be multiplied by itself.
i.e. A · A = A² (called a power of a matrix)
2. The scalar product of a number k and a matrix A is the matrix denoted by kA, obtained
by multiplying each entry of A by the number k. The number k is called a scalar.
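Example: Find A² and kA if
\[ A = \begin{pmatrix} 1 & 0 \\ 3 & 4 \end{pmatrix} \quad \text{and} \quad k = 3 \]
Solution:
\[ A^2 = \begin{pmatrix} 1 & 0 \\ 3 & 4 \end{pmatrix} \begin{pmatrix} 1 & 0 \\ 3 & 4 \end{pmatrix} = \begin{pmatrix} 1 & 0 \\ 15 & 16 \end{pmatrix} \]
\[ kA = 3 \begin{pmatrix} 1 & 0 \\ 3 & 4 \end{pmatrix} = \begin{pmatrix} 3 & 0 \\ 9 & 12 \end{pmatrix} \]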
0 3 2 0
Example: let A = and B =
1 1 1 4
0 3 2 0 3 12
Then AB = =
1 1 1 4 1 4
2 0 0 3 0 6
But BA = =
1 4 1 1 4 1
Therefore, AB ≠ BA
Definition:
Let A be a matrix. The matrix obtained from A by interchanging its rows and
corresponding columns is called the transpose of A, and is denoted by Aᵗ or A′.
Example: Let
\[ A = \begin{pmatrix} 1 & 2 & 3 \\ 4 & 5 & 6 \end{pmatrix} \]
then the transpose of A is given by
\[ A^t = \begin{pmatrix} 1 & 4 \\ 2 & 5 \\ 3 & 6 \end{pmatrix} \]
It is obtained simply by interchanging the rows and corresponding columns.
Note:
1. (At)t = A
2. (A + B) t = A t + B t
3. (AB) t = B t A t
CYP7 Let
\[ A = \begin{pmatrix} 2 & 3 \\ 4 & 5 \end{pmatrix} \quad \text{and} \quad B = \begin{pmatrix} 1 & 5 \\ 6 & 7 \end{pmatrix} \]
Verify (Aᵗ)ᵗ = A , (A + B)ᵗ = Aᵗ + Bᵗ , (AB)ᵗ = BᵗAᵗ
26.5 SUMMARY
In this unit we have seen that two matrices are equal if and only if they have the same
dimension (or orders) and all corresponding elements are equal. Matrices having the
same dimensions can be added (or subtracted) by adding (or subtracting) corresponding
elements.
In matrix algebra, a real number is called a scalar. The scalar product of a real number k
and a matrix A is the matrix kA.
CYP4 A) x = 0 , y = 0 , z = 0 B) x = -2 , y = 0 , z = 10
C ) x = -1 , y = 6 , z = 4 D) x = 1 , y -1
CYP5 A) |x| B) 22
0 64 40
CYP6 A) ( 18 ) B) ( 6 ) C) 9 11 11
3 39 23
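CYP7
\[ A^t = \begin{pmatrix} 2 & 4 \\ 3 & 5 \end{pmatrix}, \quad (A^t)^t = \begin{pmatrix} 2 & 3 \\ 4 & 5 \end{pmatrix} = A \]
\[ A + B = \begin{pmatrix} 3 & 8 \\ 10 & 12 \end{pmatrix}, \quad (A + B)^t = \begin{pmatrix} 3 & 10 \\ 8 & 12 \end{pmatrix} \]
\[ A^t + B^t = \begin{pmatrix} 3 & 10 \\ 8 & 12 \end{pmatrix} = (A + B)^t, \quad \text{since } A^t = \begin{pmatrix} 2 & 4 \\ 3 & 5 \end{pmatrix} \text{ and } B^t = \begin{pmatrix} 1 & 6 \\ 5 & 7 \end{pmatrix} \]
\[ (AB)^t = \begin{pmatrix} 20 & 34 \\ 31 & 55 \end{pmatrix} = B^t A^t \]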
26.7 MODEL EXAMINATION QUESTIONS
Instruction: write the short and precise answers on the space provided.
3 2
3 2 5 4 2 0
Let A = , B = 5 6 , C = , k = 5 , and = 3
4 6 1 0 1 1 3 5
Then
A) A + C = ______________________ B) A – C = _________________________
C) At + B = ______________________ D) (At + B) t = ______________________
E) k(A + C) = ____________________ F) ( K + )A = ______________________
G) AB = _________________________ H) (AB) t = ________________________
I) BtAt = _________________________
26.8 REFERENCES
UNIT 27 DETERMINANTS
CONTENTS:
27.1 INTRODUCTION
In the previous block (block6), you remember that we have seen how to solve systems of
linear equations that involve two or more variables. This unit is concerned with a new
term: 'determinant'. The concept is needed whenever it is necessary to assign a real
number to a square matrix. In line with this, we will use determinants to solve systems of linear
equations. Moreover, we will use determinants to find the inverse of a matrix, which, in
turn, is used to solve systems of linear equations.
27.2 DEFINITION
With every square matrix, we associate a number called its determinant. The number of
elements in any row or column is called the order of the determinant.
Note: the determinant of a matrix is usually displayed in the same form as the matrix, but
with vertical bars rather than brackets enclosing the elements.
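Example: Let
\[ A = \begin{pmatrix} 6 & 4 \\ 2 & 3 \end{pmatrix} \]
Evaluate det A.
Solution:
\[ \det A = \begin{vmatrix} 6 & 4 \\ 2 & 3 \end{vmatrix} = 6 \cdot 3 - 2 \cdot 4 = 10 \]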
Definition:
A convenient method for finding the six terms needed to evaluate a 3×3 determinant is
shown below.
1. Copy the first two columns of the matrix in order to the right of the third
column.
a1 b1 c1 a1 b1
a2 b2 c2 a2 b2
a3 b3 c3 a3 b3
2. Multiply each element in the first row of the original matrix by the other two
elements from left-to-right down ward on the diagonal, these products are the first
three terms of the determinant.
a1 b1 c1 a1 b1
a2 b2 c2 a2 b2
a3 b3 c3 a3 b3
3. Multiply each element in the last row of the original matrix by the other two
elements from left-to-right upward on the diagonal, the opposites of these
products are the last three terms of the determinant
a1 b1 c1 a1 b1
a2 b2 c2 a2 b2
a3 b3 c3 a3 b3
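Example: Evaluate
\[ \det A = \begin{vmatrix} 3 & 5 & 0 \\ 2 & 4 & -1 \\ 1 & 6 & 3 \end{vmatrix} \]
Solution: Copy the first two columns to the right of the third column:
\[ \begin{array}{ccc|cc} 3 & 5 & 0 & 3 & 5 \\ 2 & 4 & -1 & 2 & 4 \\ 1 & 6 & 3 & 1 & 6 \end{array} \]
det A = 3·4·3 + 5·(-1)·1 + 0·2·6 – 1·4·0 – 6·(-1)·3 – 3·2·5
= 36 – 5 + 0 – 0 + 18 – 30 = 19
Answer: det A = 19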
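The three-step diagonal method described above can be sketched as a small function. The name `det3_diagonals` is hypothetical, and the matrix entries are taken from the example, assuming the entry in row 2, column 3 is -1 as the expanded products indicate. Note that this shortcut applies to 3×3 determinants only.

```python
def det3_diagonals(m):
    """Determinant of a 3x3 matrix by the 'copy two columns' diagonal method."""
    (a1, b1, c1), (a2, b2, c2), (a3, b3, c3) = m
    # Step 2: the three left-to-right downward diagonals
    downward = a1 * b2 * c3 + b1 * c2 * a3 + c1 * a2 * b3
    # Step 3: the three left-to-right upward diagonals, whose opposites are taken
    upward = a3 * b2 * c1 + b3 * c2 * a1 + c3 * a2 * b1
    return downward - upward


A = [[3, 5, 0],
     [2, 4, -1],
     [1, 6, 3]]
print(det3_diagonals(A))
```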
Definition:
For a square matrix A = (aij), the minor Mij of an element aij is the determinant of the
matrix formed by deleting the ith row and the jth column of A.
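Example: For the matrix
\[ A = (a_{ij}) = \begin{pmatrix} 3 & 2 & 5 \\ -1 & 4 & 9 \\ 6 & 0 & 7 \end{pmatrix} \]
find each of the following
A) M12 B) M23 C) M33
Solution:
A) Delete the first row and the second column and find the determinant of the 2×2
matrix formed by the remaining elements.
\[ M_{12} = \begin{vmatrix} -1 & 9 \\ 6 & 7 \end{vmatrix} = (-1)(7) - (6)(9) = -7 - 54 = -61 \]
B) Delete the second row and the third column and find the determinant of the 2×2
matrix formed by the remaining elements.
\[ M_{23} = \begin{vmatrix} 3 & 2 \\ 6 & 0 \end{vmatrix} = 3 \cdot 0 - 6 \cdot 2 = -12 \]
C) Delete the third row and the third column and find the determinant of the 2×2
matrix formed by the remaining elements.
\[ M_{33} = \begin{vmatrix} 3 & 2 \\ -1 & 4 \end{vmatrix} = 3 \cdot 4 - (-1 \cdot 2) = 12 + 2 = 14 \]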
Definition:
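For a square matrix A = (aij), the cofactor Aij of an element aij is given by:
\[ A_{ij} = (-1)^{i+j} M_{ij}, \quad \text{where } M_{ij} \text{ is the minor of } a_{ij} \]
Example: For the matrix
\[ A = (a_{ij}) = \begin{pmatrix} 3 & 2 & 5 \\ -1 & 4 & 9 \\ 6 & 0 & 7 \end{pmatrix} \]
find each of the following.
A) A11 B) A23 C) A31
Solution:
A)
\[ M_{11} = \begin{vmatrix} 4 & 9 \\ 0 & 7 \end{vmatrix} = 4 \cdot 7 - 0 \cdot 9 = 28 \]
Then, A11 = (-1)^{1+1} M11 = (-1)² (28) = 28
B)
\[ M_{23} = \begin{vmatrix} 3 & 2 \\ 6 & 0 \end{vmatrix} = 3 \cdot 0 - 6 \cdot 2 = -12 \]
Then, A23 = (-1)^{2+3} M23 = (-1)⁵ (-12) = 12
C)
\[ M_{31} = \begin{vmatrix} 2 & 5 \\ 4 & 9 \end{vmatrix} = 2 \cdot 9 - 4 \cdot 5 = -2 \]
Then, A31 = (-1)^{3+1} M31 = (-1)⁴ (-2) = -2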
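Minors and cofactors follow mechanically from the two definitions, so they are easy to sketch in code. The function names below are hypothetical, rows and columns are numbered from 1 as in the text, and the matrix is the one used in the examples with the entry in row 2, column 1 assumed to be -1.

```python
def det(m):
    """Determinant by cofactor expansion along the first row."""
    if len(m) == 1:
        return m[0][0]
    return sum((-1) ** j * m[0][j] * det(delete_row_col(m, 0, j))
               for j in range(len(m)))


def delete_row_col(m, i, j):
    """Matrix left after deleting row i and column j (both 0-based)."""
    return [row[:j] + row[j + 1:] for k, row in enumerate(m) if k != i]


def minor(m, i, j):
    """Minor M_ij, with i and j counted from 1 as in the text."""
    return det(delete_row_col(m, i - 1, j - 1))


def cofactor(m, i, j):
    """Cofactor A_ij = (-1)^(i+j) * M_ij."""
    return (-1) ** (i + j) * minor(m, i, j)


A = [[3, 2, 5],
     [-1, 4, 9],
     [6, 0, 7]]

print(minor(A, 1, 2), minor(A, 2, 3), minor(A, 3, 3))
print(cofactor(A, 1, 1), cofactor(A, 2, 3), cofactor(A, 3, 1))
```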
1 0 0 2
4 1 0 0
CYP2 For the matrix A = , Find:
5 6 7 8
2 3 1 0
A) M41 and M44 B) M12 C) A41and A44 D) A12
Definition:
= -36 - 240 + 240
= -36
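where
\[ D = \begin{vmatrix} a_1 & b_1 \\ a_2 & b_2 \end{vmatrix} \]
is the determinant of the coefficient matrix,
\[ D_x = \begin{vmatrix} c_1 & b_1 \\ c_2 & b_2 \end{vmatrix}, \quad D_y = \begin{vmatrix} a_1 & c_1 \\ a_2 & c_2 \end{vmatrix}, \quad \text{and } D \neq 0 \]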
Note that the denominator D contains the coefficients of x and y, in the same position as
in the original equations. For x, the numerator is obtained by replacing the x-coefficient
in D (the a’s) by the c’s. For y, the numerator is obtained by replacing the y-coefficient in
D (the b’s) by the c’s.
Example: solve using Cramer’s rule
2x – y = 5
x – 2y = 1
Solution: we have
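\[ D = \begin{vmatrix} 2 & -1 \\ 1 & -2 \end{vmatrix} = -4 - (-1) = -3 \neq 0 \]
\[ x = \frac{D_x}{D} = \frac{\begin{vmatrix} 5 & -1 \\ 1 & -2 \end{vmatrix}}{-3} = \frac{-10 - (-1)}{-3} = \frac{-9}{-3} = 3 \]
\[ y = \frac{D_y}{D} = \frac{\begin{vmatrix} 2 & 5 \\ 1 & 1 \end{vmatrix}}{-3} = \frac{2 - 5}{-3} = \frac{-3}{-3} = 1 \]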
Hence, the solution is (3 , 1) and the solution set is {(3 , 1)}
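For two equations in two variables, Cramer's rule amounts to three 2×2 determinants. The sketch below uses hypothetical function names and assumes the system is written as a1x + b1y = c1 and a2x + b2y = c2, as in the note above. It is applied to the example 2x – y = 5, x – 2y = 1.

```python
from fractions import Fraction


def det2(p, q, r, s):
    """Determinant of the 2x2 array with rows (p, q) and (r, s)."""
    return p * s - r * q


def cramer_2x2(a1, b1, c1, a2, b2, c2):
    """Solve a1*x + b1*y = c1, a2*x + b2*y = c2 by Cramer's rule."""
    D = det2(a1, b1, a2, b2)
    if D == 0:
        raise ValueError("D = 0: Cramer's rule does not apply")
    Dx = det2(c1, b1, c2, b2)   # x-coefficients replaced by the constants
    Dy = det2(a1, c1, a2, c2)   # y-coefficients replaced by the constants
    return Fraction(Dx, D), Fraction(Dy, D)


x, y = cramer_2x2(2, -1, 5, 1, -2, 1)
print(x, y)
```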
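\[ D_y = \begin{vmatrix} a_1 & d_1 & c_1 \\ a_2 & d_2 & c_2 \\ a_3 & d_3 & c_3 \end{vmatrix}, \quad D_z = \begin{vmatrix} a_1 & b_1 & d_1 \\ a_2 & b_2 & d_2 \\ a_3 & b_3 & d_3 \end{vmatrix}, \quad \text{and } D \neq 0 \]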
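Solution: A) we have
\[ D = \begin{vmatrix} 1 & -3 & 7 \\ 1 & 1 & 1 \\ 1 & -2 & 3 \end{vmatrix} = -10, \quad D_x = \begin{vmatrix} 13 & -3 & 7 \\ 1 & 1 & 1 \\ 4 & -2 & 3 \end{vmatrix} = 20 \]
\[ D_y = \begin{vmatrix} 1 & 13 & 7 \\ 1 & 1 & 1 \\ 1 & 4 & 3 \end{vmatrix} = -6, \quad D_z = \begin{vmatrix} 1 & -3 & 13 \\ 1 & 1 & 1 \\ 1 & -2 & 4 \end{vmatrix} = -24 \]
\[ \text{Then } x = \frac{D_x}{D} = \frac{20}{-10} = -2, \quad y = \frac{D_y}{D} = \frac{-6}{-10} = \frac{3}{5}, \quad z = \frac{D_z}{D} = \frac{-24}{-10} = \frac{12}{5} \]
B)
\[ D = \begin{vmatrix} 1 & 1 & 1 \\ 2 & 5 & 7 \\ 2 & 1 & -1 \end{vmatrix} = -4 \neq 0, \quad D_x = \begin{vmatrix} 9 & 1 & 1 \\ 52 & 5 & 7 \\ 0 & 1 & -1 \end{vmatrix} = -4 \]
\[ D_y = \begin{vmatrix} 1 & 9 & 1 \\ 2 & 52 & 7 \\ 2 & 0 & -1 \end{vmatrix} = -12, \quad D_z = \begin{vmatrix} 1 & 1 & 9 \\ 2 & 5 & 52 \\ 2 & 1 & 0 \end{vmatrix} = -20 \]
\[ \text{Then } x = \frac{D_x}{D} = \frac{-4}{-4} = 1, \quad y = \frac{D_y}{D} = \frac{-12}{-4} = 3, \quad z = \frac{D_z}{D} = \frac{-20}{-4} = 5 \]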
Hence, the solution is ( 1 , 3 , 5 ) , and the solution set is {( 1 , 3 , 5 )}
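Cramer's rule for n equations in n variables can be sketched by replacing one column of the coefficient matrix at a time. The function names are hypothetical. The two systems below are assumptions read off from the determinants in the solution: A) x – 3y + 7z = 13, x + y + z = 1, x – 2y + 3z = 4 and B) x + y + z = 9, 2x + 5y + 7z = 52, 2x + y – z = 0.

```python
from fractions import Fraction


def det(m):
    """Determinant by cofactor expansion along the first row."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for j in range(len(m)):
        sub = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * m[0][j] * det(sub)
    return total


def cramer(coefficients, constants):
    """Solve a square linear system by Cramer's rule (requires D != 0)."""
    D = det(coefficients)
    if D == 0:
        raise ValueError("D = 0: Cramer's rule does not apply")
    solution = []
    for k in range(len(coefficients)):
        # Replace column k of the coefficient matrix by the constants.
        replaced = [row[:k] + [c] + row[k + 1:]
                    for row, c in zip(coefficients, constants)]
        solution.append(Fraction(det(replaced), D))
    return solution


system_a = cramer([[1, -3, 7], [1, 1, 1], [1, -2, 3]], [13, 1, 4])
system_b = cramer([[1, 1, 1], [2, 5, 7], [2, 1, -1]], [9, 52, 0])
print(system_a)
print(system_b)
```

Cofactor expansion is convenient for small systems like these, but the number of operations grows very quickly with n, so it is a teaching device rather than a practical method for large systems.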
CYP5 Solve the following system of equations using Cramer’s rule.
x – 3y – 2z = 9
3x + 2y + 6z = 20
4x – y + 3z = 25
Definition:
Cofactor matrix is defined to be the matrix obtained by replacing every number aij of the
given matrix A by its cofactor in the determinant of A.
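Example 1: Let
\[ A = \begin{pmatrix} 1 & 2 & 3 \\ 4 & 5 & 1 \\ 2 & 4 & 0 \end{pmatrix} \]
then the cofactor matrix of A is given by:
\[ C = \begin{pmatrix} +\begin{vmatrix} 5 & 1 \\ 4 & 0 \end{vmatrix} & -\begin{vmatrix} 4 & 1 \\ 2 & 0 \end{vmatrix} & +\begin{vmatrix} 4 & 5 \\ 2 & 4 \end{vmatrix} \\ -\begin{vmatrix} 2 & 3 \\ 4 & 0 \end{vmatrix} & +\begin{vmatrix} 1 & 3 \\ 2 & 0 \end{vmatrix} & -\begin{vmatrix} 1 & 2 \\ 2 & 4 \end{vmatrix} \\ +\begin{vmatrix} 2 & 3 \\ 5 & 1 \end{vmatrix} & -\begin{vmatrix} 1 & 3 \\ 4 & 1 \end{vmatrix} & +\begin{vmatrix} 1 & 2 \\ 4 & 5 \end{vmatrix} \end{pmatrix} = \begin{pmatrix} -4 & 2 & 6 \\ 12 & -6 & 0 \\ -13 & 11 & -3 \end{pmatrix} \]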
Definition:
Let A be a matrix and let C be its cofactor matrix, then the transpose Ct of C is called
the adjoint of A (to be written, in short Adj A).
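\[ C = \begin{pmatrix} -4 & 2 & 6 \\ 12 & -6 & 0 \\ -13 & 11 & -3 \end{pmatrix} \quad \text{and} \quad C^t = \begin{pmatrix} -4 & 12 & -13 \\ 2 & -6 & 11 \\ 6 & 0 & -3 \end{pmatrix} \]
Therefore,
\[ \operatorname{Adj} A = C^t = \begin{pmatrix} -4 & 12 & -13 \\ 2 & -6 & 11 \\ 6 & 0 & -3 \end{pmatrix} \]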
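The inverse of a matrix A, denoted by A⁻¹, is given by the formula
\[ A^{-1} = \frac{1}{|A|} \operatorname{Adj} A \]
where |A| ≠ 0 and |A| is the determinant of matrix A.
Example3: Find the inverse of the matrix
\[ A = \begin{pmatrix} 1 & 2 & 3 \\ 4 & 5 & 1 \\ 2 & 4 & 0 \end{pmatrix} \]
Solution: We notice that A is a square matrix.
\[ \det A = 1 \begin{vmatrix} 5 & 1 \\ 4 & 0 \end{vmatrix} - 2 \begin{vmatrix} 4 & 1 \\ 2 & 0 \end{vmatrix} + 3 \begin{vmatrix} 4 & 5 \\ 2 & 4 \end{vmatrix} = -4 + 4 + 3 \cdot 6 = 18 \neq 0 \]
hence A is invertible.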
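The chain cofactor matrix, adjoint, inverse can be sketched for the matrix of this example. The function names are hypothetical, and the entries of A are assumed to be all positive, which is what the determinant computation above implies. Exact fractions are used so that the product of A with its inverse can be compared with the identity matrix without rounding.

```python
from fractions import Fraction


def det(m):
    """Determinant by cofactor expansion along the first row."""
    if len(m) == 1:
        return m[0][0]
    return sum((-1) ** j * m[0][j] * det([row[:j] + row[j + 1:] for row in m[1:]])
               for j in range(len(m)))


def cofactor_matrix(m):
    """Matrix whose (i, j) entry is the cofactor of m[i][j] (m at least 2x2)."""
    n = len(m)
    return [[(-1) ** (i + j)
             * det([row[:j] + row[j + 1:] for k, row in enumerate(m) if k != i])
             for j in range(n)] for i in range(n)]


def transpose(m):
    return [list(col) for col in zip(*m)]


def inverse(m):
    """Inverse as (1 / det) * adjoint, where the adjoint is the transposed cofactor matrix."""
    d = det(m)
    if d == 0:
        raise ValueError("det = 0: the matrix is not invertible")
    adjoint = transpose(cofactor_matrix(m))
    return [[Fraction(entry, d) for entry in row] for row in adjoint]


A = [[1, 2, 3],
     [4, 5, 1],
     [2, 4, 0]]

A_inv = inverse(A)
product = [[sum(a * b for a, b in zip(row, col)) for col in zip(*A_inv)] for row in A]
print(det(A))
print(transpose(cofactor_matrix(A)))
print(product)   # should be the 3x3 identity matrix
```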
3 2 2 2 2 3
2 2 3 2 3 2
2 5
2 2
1 1 1 1 2
C = = 2 1 4
2 2 3 2 3 2 1
2 0 1
1 1 1 1 2
3 2 2 2 2 3
2 2 1
Adj A = C = 2 1 0
t
5 4 1
2 2 1
1 1
–1
Hence, A = Adj A = 2 1 0
A 1
5 4 1
2 2 1
= 2 1 0
5 4 1
Solution: We write the coefficients on the left in a matrix. We then write the product of
that matrix and the column matrix containing the variables, and set the result equal to the
column matrix containing the constant on the right.
4 2 1 x 3
9 0 1 y 5
4 5 2 z 1
4 2 1 x 3
If we let A = 9 0 1 X = y and B = 5
4 5 2 z 1
We can write this matrix equation as AX = B. Solve systems of linear equations using a
matrix equation like AX = B.
AX = B , where
1 5 x 2
A = , X = , and B =
2 1 y 4
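If |A| ≠ 0 so that A⁻¹ exists, the system has the unique solution X = A⁻¹B.
\[ \begin{pmatrix} 1 & -1 \\ 1 & 1 \end{pmatrix} \begin{pmatrix} x \\ y \end{pmatrix} = \begin{pmatrix} 2 \\ 4 \end{pmatrix} \]
\[ A^{-1} = \frac{1}{|A|} \operatorname{adj} A = \frac{1}{2} \begin{pmatrix} 1 & 1 \\ -1 & 1 \end{pmatrix} = \begin{pmatrix} 1/2 & 1/2 \\ -1/2 & 1/2 \end{pmatrix} \quad \text{(see inverse matrix)} \]
Therefore, AX = B ⟹ X = A⁻¹B
\[ \begin{pmatrix} x \\ y \end{pmatrix} = \begin{pmatrix} 1/2 & 1/2 \\ -1/2 & 1/2 \end{pmatrix} \begin{pmatrix} 2 \\ 4 \end{pmatrix} = \begin{pmatrix} \tfrac{1}{2}(2) + \tfrac{1}{2}(4) \\ -\tfrac{1}{2}(2) + \tfrac{1}{2}(4) \end{pmatrix} = \begin{pmatrix} 3 \\ 1 \end{pmatrix} \]
x = 3, and y = 1
Hence, the solution is (3 , 1)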
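AX = B , where
\[ A = \begin{pmatrix} 1 & -3 & 7 \\ 1 & 1 & 1 \\ 1 & -2 & 3 \end{pmatrix}, \quad X = \begin{pmatrix} x \\ y \\ z \end{pmatrix}, \quad \text{and} \quad B = \begin{pmatrix} 13 \\ 1 \\ 4 \end{pmatrix} \]
We have
\[ A^{-1} = \frac{1}{|A|} \operatorname{adj} A \]
C = cofactor matrix of matrix A:
\[ C = \begin{pmatrix} +\begin{vmatrix} 1 & 1 \\ -2 & 3 \end{vmatrix} & -\begin{vmatrix} 1 & 1 \\ 1 & 3 \end{vmatrix} & +\begin{vmatrix} 1 & 1 \\ 1 & -2 \end{vmatrix} \\ -\begin{vmatrix} -3 & 7 \\ -2 & 3 \end{vmatrix} & +\begin{vmatrix} 1 & 7 \\ 1 & 3 \end{vmatrix} & -\begin{vmatrix} 1 & -3 \\ 1 & -2 \end{vmatrix} \\ +\begin{vmatrix} -3 & 7 \\ 1 & 1 \end{vmatrix} & -\begin{vmatrix} 1 & 7 \\ 1 & 1 \end{vmatrix} & +\begin{vmatrix} 1 & -3 \\ 1 & 1 \end{vmatrix} \end{pmatrix} = \begin{pmatrix} 5 & -2 & -3 \\ -5 & -4 & -1 \\ -10 & 6 & 4 \end{pmatrix} \]
\[ \operatorname{Adj} A = C^t = \begin{pmatrix} 5 & -5 & -10 \\ -2 & -4 & 6 \\ -3 & -1 & 4 \end{pmatrix} \]
With |A| = -10,
\[ A^{-1} = \frac{1}{-10} \begin{pmatrix} 5 & -5 & -10 \\ -2 & -4 & 6 \\ -3 & -1 & 4 \end{pmatrix} = \begin{pmatrix} -1/2 & 1/2 & 1 \\ 1/5 & 2/5 & -3/5 \\ 3/10 & 1/10 & -2/5 \end{pmatrix} \]
X = A⁻¹B
\[ \begin{pmatrix} x \\ y \\ z \end{pmatrix} = \begin{pmatrix} -1/2 & 1/2 & 1 \\ 1/5 & 2/5 & -3/5 \\ 3/10 & 1/10 & -2/5 \end{pmatrix} \begin{pmatrix} 13 \\ 1 \\ 4 \end{pmatrix} = \begin{pmatrix} -\tfrac{1}{2}(13) + \tfrac{1}{2}(1) + 1(4) \\ \tfrac{1}{5}(13) + \tfrac{2}{5}(1) - \tfrac{3}{5}(4) \\ \tfrac{3}{10}(13) + \tfrac{1}{10}(1) - \tfrac{2}{5}(4) \end{pmatrix} = \begin{pmatrix} -2 \\ 3/5 \\ 12/5 \end{pmatrix} \]
x = -2, y = 3/5, and z = 12/5
Therefore the solution of the given system is (-2, 3/5, 12/5) and the solution set is {(-2, 3/5, 12/5)}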
CYP7 Use inverse matrix to solve:
A) x + 2y + 3z = 11
2x + 4y + 5z = 21
3x + 5y + 6z = 27
B) x + 2y + z = 8
2x + 3y + 2z = 14
3x + 2y + 2z = 13
C) 3x – y + z = 2
-15x + 6y – 5z = 5
5x – 2y + 2z = 3
27.6 SUMMARY
For each n×n matrix A there is a real number called the determinant of A.
If A is a matrix with det A ≠ 0, then the inverse of A is given by:
\[ A^{-1} = \frac{1}{|A|} \operatorname{adj} A \]
where Adj A = Cᵗ and C is called the cofactor matrix of A.
The solution of a system of n linear equations in n variables is given by
\[ x = \frac{D_x}{D}, \quad y = \frac{D_y}{D}, \quad z = \frac{D_z}{D}, \ \dots \]
where D is the determinant of the matrix of coefficients of the variables (D ≠ 0) and Dx , Dy , Dz , … are derived from D
by replacing the coefficients of x , y , z , … respectively, by the constants.
This method is called Cramer's Rule.
CYP5 ( x , y , z) = (2 ,-5 , 4)
17 1 31
7 3 3 35 7 70
1
B)
3
CYP6 A) 1 1 0 0
5 10
1 0 1 23
3 22
35 7 35
CYP7 A) (x , y , z) = ( 2 , 3 , 1) B) (x , y , z) = ( 1 , 2 , 3)
C) (x , y , z) = ( 1 , 15 , 14)
3 0 4
2. Given matrix A = 2 1 3
4 1 0
Then A) M12 = ________________ C) A23 = ____________________
B) M32 = ________________ D) A33 = _____________________
3 0 4
1. Given a matrix A = 2 1 3
4 1 0
Then, A) find det A
B) Determine the cofactor matrix of A.
C) Find the Adjoint of A
D) Find the inverse of A.
2. Given the following system of equations
x + 2y + 3z = 14
3x + y + 2z = 11
2x + 3y + z = 1
Then, A) solve using Cramer’s Rule B) solve using inverse matrix
27.9 REFERENCES
Contents
21.0 Aims and Objectives
21.1 Introduction
21.2 Definition of Correlation
21.3 Types of Correlation
21.4 Scatter Diagram
21.5 Degree of Correlation
21.6 Measuring Simple Linear Correlation
21.7 Measuring Rank Correlation
21.8 Summary
21.9 Answers to Check Your Progress Questions
21.10 Model Examination Questions
21.11 Glossary
21.12 References
The aim of this unit is to explain the nature, scope and methods of studying correlation.
We introduce two measures relating to correlation.
After going through this unit, the student should be able to:
Define correlation
Distinguish between simple and multiple, positive and negative, and linear and
non –linear correlation.
Recognize the degree of correlation between variables.
Explain the method of studying correlation by scatter diagram method.
Compute correlation by applying the formula of Karl Pearson.
Calculate coefficient of correlation by rank correlation method.
21.1 INTRODUCTION
Most of the methods we have developed so far deal with one variable
only. Their scope was strictly confined to the various values of one variable. For example,
measures of central tendency, variation and skewness deal with the various values of a
single variable. Those statistical measures are important for comparison and analysis, but
they are not useful in looking for a quantitative relationship between variables.
However, often several different characteristics are measured on each member of a
sample, and it may be of great interest to ask whether the variables are interrelated.
A businessperson may want to know whether the volume of sales for a given month is
related to the amount of advertising the firm does during that month. Educators, on the
other hand, are interested in determining whether the number of hours a student studies is
related to the student's score in a particular exam. Medical researchers, in their
professional field, are interested in questions such as 'Is caffeine related to heart
damage?' These are only a few of the many issues that can be addressed by using the
technique of correlation analysis.
21.3 TYPES OF CORRELATION
On the basis of the nature of relation between the variables, correlation may be
categorized as follow:
A) Simple and Multiple Correlation
B) Positive and Negative Correlation
C) Linear and Non- Linear Correlation
Example: 21.3.0
A manager may wish to see whether the number of years the sales people have been
working for the company has anything to do with the amount of sales of the
representatives. The only two variables are: years of experience and amount of sales.
Example: 21.3.1
An educator may wish to investigate the relationship between a student's success in the
University and factors such as the number of hours spent studying, the student's IQ,
and the student's background. This type of study involves several variables:
success, hours spent, IQ and background.
A positive correlation exists when both variables increase or decrease at the same time
Example 21.3.2
A firm's sales volume and advertisement are related, and the relationship is positive,
since, generally, the more the firm advertises, the higher its sales volume.
In a negative correlation, as one variable increases the other variable decreases, and vice
versa
Example: 21.3.3
If one compares the strength of people over 60 years of age, one will find that as age
increases, the strength generally decreases.
Both linear and non-linear correlation can be identified with reference to the amount of
change in the values of variables.
If the amount of change in one variable bears a constant ratio to the amount of change
in the other variable, it is known as linear correlation.
Example: 21.1.3.4
If the amount of change in one variable does not bear a constant ratio to the amount of
change in the other variable, the correlation is said to be non-linear.
Example: 21.1.3.5
The variable Y: 40, 46, 50, 75, 90
CYP1
Explain simple linear correlation
In simple correlation, the researcher collects data on two variables to see whether a
relationship exists between variables or not.
Example 21.1.4.0
If a researcher wishes to see whether there is a relationship between the number of hours
studied by students and their test scores on an exam, he or she must select a random
sample of students, determine the hours each studied, and obtain their grades on the
exam.
Table 21.1.4.0
Student Hours Studied (X) Grade out of 100(Y)
Alemu 5 82
Challa 1 60
Daniel 4 87
Hailemichel 1 68
Fantaye 3 74
The two variables in this investigation are: Hours studied (X) and Grade
obtained out of 100(Y)
These two variables are called the independent variable and the dependent
variable.
Hence, in Example 21.1.4.0 the variable “the number of hours studied” is the
independent variable. It is denoted as the X – variable.
Thus, the grade the students received on the exam, in Example 21.1.4.0, is the dependent
variable, designated as the Y – variable.
Remark: The reason for distinction between the variables is that, one assumes that the
grade the student earns (Y) depends on the number of hours the student studied (X).
Since scatter diagram is an aid for understanding the correlation techniques, after the plot
is drawn, it should be analyzed to determine which type of relationship, if any, exists.
With the help of the dots plotted on the graph, the relationship can be read as follows:
Closeness of the dots to one another on the diagram shows a high degree of correlation.
If the points on the diagram rise from the lower left-hand corner to the upper right-hand
corner, the correlation is said to be positive.
Correlation is said to be negative if the points show a decreasing tendency from
the upper left-hand corner to the lower right-hand corner.
If all the points lie on a straight line in a positive correlation, the correlation is
said to be perfectly positive, that is, r = +1.
If the correlation is negative and all the points fall on a straight line, it is said to be
perfectly negative, that is, r = -1.
If the plotted dots lie in a haphazard manner, correlation is said to be absent, i.e.
no correlation.
The following is a diagrammatical illustration: Figure 21.1.4.0
[Figure 21.1.4.0: scatter diagrams with axes X and Y showing a low degree of positive correlation (r is close to 0 from the positive side), a low degree of negative correlation (r is close to 0 from the negative side), no correlation (r = 0), and a high degree of negative correlation (r is close to –1)]
Example 21.1.4.1
Construct a scatter diagram for the data obtained in Example 21.1.4.0
[Figure 21.1.4.1: scatter diagram of the grade (Y) against hours studied (X)]
CYP2
A) Construct a scatter diagram for the data obtained in a study on the number of absences
and the mark scored of seven randomly selected students from a statistics class.
Table 21.1.4.1
Student Number of Absences (X) Mark Scored (Y)
A 6 82
B 2 86
C 15 43
D 9 74
E 12 58
F 1 90
G 8 78
The extent of the relationship between the variables is measured with a statistical quantity known as the correlation coefficient. The correlation coefficient always lies between –1 and +1. The algebraic sign (+) indicates a positive relationship between the variables, and the sign (–) denotes a negative relationship. If no relationship exists between the variables under study, the correlation coefficient will be zero. The values +1 and –1 denote perfect positive and perfect negative correlation respectively.
If the correlation coefficient is close to +1, there is a high degree of positive correlation. On the other hand, if the correlation coefficient is close to –1, there is a high degree of negative correlation.
The correlation coefficient computed from the sample data measures the strength and
direction of a relationship between two variables. The symbol for the sample correlation
coefficient is r.
CYP3
We use a measure called the correlation coefficient to determine the strength of the
relationship between two variables. To do so, there are several ways to compute the
values of the correlation coefficient. One simple method is to use the formula shown
below.
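$$ r = \frac{N\sum XY - \sum X \sum Y}{\sqrt{\left[N\sum X^2 - \left(\sum X\right)^2\right]\left[N\sum Y^2 - \left(\sum Y\right)^2\right]}} $$
Formula 17.1.6.0 is called Karl Pearson’s coefficient of correlation,
where $\sum X$ = total of the X series
$\sum Y$ = total of the Y series
$\sum X^2$ = sum of the squares of the X series
$\sum Y^2$ = sum of the squares of the Y series
$\sum XY$ = sum of the products of the X and Y series
N = number of pairs observed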
Example 21.1.6.0
A manager wishes to find out whether there is a relationship between the number of radio advertisements aired per week and the amount of sales (in hundreds of Birr) of a product. The data for the sample are given below.
Number of advertisements (X) 2 5 8 8 10 12
Sales (Y) 2 4 7 6 9 10
To know the relationship, we have to compute the value of the correlation coefficient for
the sample data.
Solution:
Step 1: Make a table, as shown below
X Y
2 2
5 4
8 7
8 6
10 9
12 10
Step 2: Find the product of the X and Y values and place the products in the column labeled XY
Step 3: Square the X values and place them in the column labeled X²
Step 4: Square the Y values and place them in the column labeled Y²
Step 5: Find the sum of each column. The completed table is given below.
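X Y XY X² Y²
2 2 4 4 4
5 4 20 25 16
8 7 56 64 49
8 6 48 64 36
10 9 90 100 81
12 10 120 144 100
ΣX = 45 ΣY = 38 ΣXY = 338 ΣX² = 401 ΣY² = 286
$$ r = \frac{N\sum XY - \sum X \sum Y}{\sqrt{\left[N\sum X^2 - \left(\sum X\right)^2\right]\left[N\sum Y^2 - \left(\sum Y\right)^2\right]}} = \frac{6(338) - (45)(38)}{\sqrt{\left[6(401) - (45)^2\right]\left[6(286) - (38)^2\right]}} = \frac{318}{\sqrt{(381)(272)}} \approx 0.988 $$
Therefore, the relationship between the number of radio advertisements aired per week and the amount of sales (in hundreds of Birr) is strong and positive.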
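The raw-sum formula for r used in this example can be sketched in a few lines of Python. This is only an illustration: the function name `pearson_r` and the variable names are assumptions made here, and the data are the advertisement and sales figures of the example.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Karl Pearson's r computed from the raw sums N, ΣX, ΣY, ΣXY, ΣX², ΣY²."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length series with at least two pairs")
    n = len(xs)
    sum_x = sum(xs)
    sum_y = sum(ys)
    sum_xy = sum(x * y for x, y in zip(xs, ys))
    sum_x2 = sum(x * x for x in xs)
    sum_y2 = sum(y * y for y in ys)
    numerator = n * sum_xy - sum_x * sum_y
    denominator = sqrt((n * sum_x2 - sum_x ** 2) * (n * sum_y2 - sum_y ** 2))
    return numerator / denominator


# Data of the radio advertisement example (X = advertisements, Y = sales).
ads = [2, 5, 8, 8, 10, 12]
sales = [2, 4, 7, 6, 9, 10]
r = pearson_r(ads, sales)
print(round(r, 3))  # 0.988
```

A value this close to +1 agrees with the conclusion that the relationship is strong and positive.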
CYP 4
A) Calculate the correlation coefficient for the data obtained in a study on the number of hours a person exercises each week and the amount of milk (in liters) each person consumes per week.
Person Hours of exercise (X) Milk consumed (Y)
A 3 48
B 0 8
C 2 32
D 5 64
E 8 10
F 5 32
G 10 56
H 2 72
I 1 48
________________________________________________________________________
________________________________________________________________________
________________________________________________________________________
________________________________________________________________________
________________________
21.7 MEASURING RANK CORRELATION
So far we have assumed that X and Y can both be measured on a continuous scale and that they are jointly normally distributed. However, neither of these assumptions may be safe to make. The rank correlation coefficient is used when the data are not normal or when the shape of the distribution is not known. It is especially useful for variables like beauty and leadership ability, and in general for items that cannot be expressed in quantitative terms. Spearman studied such variables and developed a method of finding the correlation between two of them. As a result, this method is called Spearman’s rank correlation coefficient. The formula is presented next.
$$ r_s = 1 - \frac{6\sum d^2}{n(n^2 - 1)} $$
where d = difference between corresponding ranks
n = number of pairs observed
Ranks are given to all the items in the distribution in the increasing or decreasing order of
their magnitude. One way of ranking is, the highest value in the distribution is assigned
first rank, and the next highest value is given the second rank and so on.
Example 21.1.7.0
A university wishes to establish whether there is a connection between academic and sporting achievements. Eight pupils are selected and ranked. Calculate Spearman’s rank correlation coefficient.
Pupils A B C D E F G H
Academic Rank 2 3 8 5 6 1 4 7
Sporting Rank 1 8 4 6 7 2 5 3
Solution:
Step 1: Make a table as shown below and find d²
Academic Rank Sporting Rank d d²
2 1 1 1
3 8 -5 25
8 4 4 16
5 6 -1 1
6 7 -1 1
1 2 -1 1
4 5 -1 1
7 3 4 16
Σd² = 62
$$ r_s = 1 - \frac{6(62)}{8(8^2 - 1)} = 1 - \frac{372}{504} \approx 0.26 $$
The connection between academic and sporting achievement is a weak positive correlation.
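The rank formula applied above can be sketched in Python as follows. The function name `spearman_rs` is an assumption made for illustration, and the sketch takes ranks that have already been assigned, as in this example.

```python
def spearman_rs(ranks_x, ranks_y):
    """Spearman's rank correlation: 1 - 6*sum(d^2) / (n*(n^2 - 1))."""
    if len(ranks_x) != len(ranks_y) or len(ranks_x) < 2:
        raise ValueError("need two equal-length rank lists with at least two pairs")
    n = len(ranks_x)
    # d is the difference between corresponding ranks.
    sum_d2 = sum((rx - ry) ** 2 for rx, ry in zip(ranks_x, ranks_y))
    return 1 - 6 * sum_d2 / (n * (n ** 2 - 1))


# Ranks of the eight pupils A to H from the example.
academic = [2, 3, 8, 5, 6, 1, 4, 7]
sporting = [1, 8, 4, 6, 7, 2, 5, 3]
rs = spearman_rs(academic, sporting)
print(round(rs, 2))  # 0.26
```

Identical rankings give 1 and exactly reversed rankings give -1, which is a quick sanity check on the formula.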
CYP5
According to rank correlation coefficient, all the calculations are based on the original
value of the observations rather than the ranks assigned to them.
A) (Say true or false) _____________________.
B) If your answer is false, why?
________________________________________________________________________
________________________________________________________________________
____________
CYP6
Find the Spearman’s rank correlation coefficient from the following data in respect of
marks scored by 10 students in final exam out of 100 in English and Mathematics.
Students A B C D E F G H I J
Mark in English 50 60 65 30 40 35 70 75 80 45
Mark in Math 45 55 60 40 45 60 58 62 72 76
________________________________________________________________________
________________________________________________________________________
________________________________________________________________________
________________________________________________________________________
________________________
21.8 SUMMARY
Many relationships between variables exist in practical life. Correlation analysis is a statistical measure that helps to find out the direction and strength (degree) of the relationship between two or more variables. On the basis of the nature of the relationship, correlation analysis may be classified into three categories,
namely
Correlation is measured with the help of the coefficient of correlation, which always lies between -1 and +1. The values +1 and –1 indicate perfect positive and perfect negative correlation respectively, whereas zero (0) implies the absence of correlation between the two variables. The closer the value of the correlation coefficient is to +1 or –1, the stronger the relationship between the variables.
CYP1, The relationship of two variables, which is linear, is called simple linear
correlation.
CYP2 A) [Scatter diagram: Mark Scored (Y, vertical axis) against Number of absences (X, horizontal axis).]
B) The correlation (relationship) is negative which is strong
1) d
2) A) r = 0.067
B) Weak positive correlation
3) A) False
B) According to rank correlation coefficient, the calculations are based on ranks
rather than the original values of the observation.
4) rs = 0.515
The following table shows the connection between the number of hours devoted by five
sample students to study a course Quantitative Methods I and their marks on this course.
Sample Students S1 S2 S3 S4 S5
Answer the following questions (1-3) based on the given information above,
1. The two variables are
a. Quantitative Methods I and Quantitative Methods II
b. Mathematics and Statistics
c. Marks obtained in Quantitative Methods I and the hour of study
d. Not given
4. The table given next displays the ranks of six sample students in their English and Mathematics tests out of 10.
Students A B C D E F
5. From question #4, the type of correlation between English and Mathematics is
____________________.
(NB rs is interpreted as r)
a. Perfect negative d. Weak negative
b. We cannot determine e. None
c. Weak positive
6. Assume Σd² = 0, where d is the difference between the corresponding ranks, then the Spearman’s correlation coefficient is ______.
a. We cannot determine because n is not given.
b. –1 c. 1 d. 0.98
1. Consider the following relationship between TV advertisement of lollipop and its total sales.
Advertisement Cost (in hundreds of Birr) Total Sales (in thousands of Birr)
2 10
3 15
5 12
8 17
10 18
12 20
Required:
a. Construct a scatter diagram on the usual co-ordinate plane (X-Y axis)
b. Find the Karl Pearson’s correlation coefficient, r.
c. Determine the relationship of TV advertisement of lollipop and its sale.
Required:
a. Compute the Karl Pearson’s coefficient of correlation
b. What type of correlation is it?
3. ABC trading Pvt. Ltd. Co. wishes to determine the relationship between sales
experience and sales volume. A random sample of 11 sales employees is selected and
their years of experience (X) and current annual sales (Y) are collected. The distribution
is given next.
X 1 4 6 7 11 10 3 3 5 9 8
Y 3 5 8 1 10 13 13 7 2 4 6
Determine:
a) The Pearson’s coefficient of correlation ( r ) between experience and sale.
b) The Spearman’s coefficient of correlation (rs) between experience and sale.
(NB r may or may not be equal to rs)
21.11 GLOSSARY
21.12 REFERENCES
UNIT 22: REGRESSION
CONTENTS
The aim of this unit is to explain the meaning, importance and computational process of
regression.
22.1 INTRODUCTION
The term “regression” was originally employed by Galton in 1877 for indicating certain
relationships in the theory of heredity but it has come to imply the statistical method
developed to investigate those relationships.
There are problems in business and industry where two or more variables show a mutual
relationship and are hence capable of simultaneous analysis. One of the variables is of
primary interest, which may not be measured directly. On certain other occasions the
variable of primary interest can be measured but direct measurement of this variable is
very expensive. In these problems we first try to build up a suitable functional
relationship between the primary variable and one or more auxiliary variables. On the
basis of this functional relationship we attempt to predict the value of the primary
variables for given values of the auxiliary variables. Several illustrations can be given.
B) A manufacturer of farm tools wishes to study whether the volume of sales can be
predicted from the corresponding farm income. On the basis of the data on sales
and farm income a prediction model can be obtained which may be used to
predict sales from the farm income.
After having established the fact that two variables are closely related, we may be interested in estimating (predicting) the value of one variable given the value of the other. Regression analysis reveals the average relationship between two variables, and this makes estimation or prediction possible.
Regression analysis attempts to establish the nature of the relationship between variables, that is, to study the functional relationship between the variables and thereby provide a mechanism for prediction or forecasting.
The shape of the scatter diagram can be taken as a broad guideline about the nature of the functional relationship between X and Y. If the scatter diagram of the data is broadly linear (the points seem to fall on a line), then we are justified in using a linear function to predict the value of Y (say) from a single variable X.
If a linear relationship is assumed and only one auxiliary or independent variable is used to predict the primary or dependent variable, we have what is called simple linear regression analysis. It is called simple because it involves only one regressor (independent) variable, and linear because of the observed linear relationship.
Suppose n pairs of observations (xi, yi) (i = 1, 2, …, n) are available. If a linear relationship is assumed, one can use a linear regression model:
Y = a + bx ……(22.2.1)
where the intercept a and the slope b are unknown constants
to assume that ‘x’ is measured with minimum error and ‘y’ the response variable depends
up on the value of ‘x’ and is therefore a random variable.
The linear regression model given in equation (22.2.1) has two parameters, a and b. We shall develop a procedure for estimating these parameters. Suppose n pairs of observations (xi, yi) (i = 1, 2, …, n) are given. The scatter diagram of these observations is shown in Fig 22.2.1. Consider the line drawn through the points in Fig 22.2.1:
y = a + bx (22.2.2)
For a given $x_i$, the y value on the line is $a + bx_i$. The corresponding observed y value is, by definition, $y_i$. Hence the difference $e_i = y_i - (a + bx_i) = y_i - a - bx_i$ is a measure of the deviation of the observed $y_i$ value from the line.
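[Fig. 22.2.1: scatter diagram of the points (x1, y1), (x2, y2), (x3, y3), …, (xn, yn) about the line y = a + bx, with the deviations e1, e2, e3, …, en of the points from the line.]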
If we square the $e_i$ values to eliminate the effect of positive and negative deviations and add them up, then the quantity
$$ R = \sum_{i=1}^{n} e_i^2 \qquad (22.2.3) $$
is called the sum of the squares of errors in the prediction, where $e_i = y_i - \hat{y}_i$ is the difference between the observed value of y ($y_i$) and the expected value of y ($\hat{y}_i$).
The smaller the value of R, the better the line represents the observations. Using this approach, we estimate the values of the unknown constants a and b so that the quantity R (the sum of the squares of errors) is minimized. The estimates of a and b obtained by minimizing $R = \sum_i e_i^2$ are called the least square estimates. The least square estimates have the property that the sum of squares of the deviations of the observations from the line is a minimum. The process of estimating the parameters ‘a’ and ‘b’ is sometimes known as fitting the linear regression to the observed data.
The line y = a + bx, where ‘a’ and ‘b’ have been estimated by the least square method, is sometimes referred to as the line of best fit.
Regression lines are drawn on the assumption of ‘Least Squares’. According to this assumption, the sum of squares of the deviations of the observed values of y from the fitted line is the minimum possible, i.e. $\sum_{i=1}^{n} (y_i - \hat{y}_i)^2$ is a minimum. Further, the sum of deviations above the line is equal to the sum of deviations below the line, i.e. $\sum_i (y_i - \hat{y}_i) = 0$.
The algebraic expressions of regression lines are called regression equations. In the case
of two variables say X and Y, there are two regression lines (equations)
i. The regression equation of Y on X and
ii. The regression equation of X on Y
22.5.1 Regression Equation of Y on X
Symbolically, the regression equation of Y on X is expressed as
Yc = a + bx (22.2.4)
where Yc is the most probable (computed) value of Y and ‘a’ and ‘b’ are constants. While ‘a’ denotes the level of the fitted line, ‘b’ denotes the slope of the line (i.e. the change in the ‘Y’ variable per unit change in the ‘X’ variable).
Given Y = a + bx, the sum of the squares of errors is given by
$$ \sum_i e_i^2 = \sum_i (y_i - a - bx_i)^2 \qquad (*) $$
(*) can be minimized by the principle of maxima and minima, that is, by differentiating (*) partially with respect to ‘a’ and ‘b’, which yields the normal equations:
$$ \sum_i y_i = na + b\sum_i x_i $$
$$ \sum_i x_i y_i = a\sum_i x_i + b\sum_i x_i^2 \qquad (22.2.5) $$
Here $\sum_i x_i$, $\sum_i y_i$, $\sum_i x_i y_i$ and $\sum_i x_i^2$ indicate the totals that are obtained from the original values of the x and y series, and n denotes the number of pairs observed for the purpose of regression analysis.
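The differentiation step is not written out in the text. As a sketch under the usual rules of calculus, writing $R(a, b)$ for the sum of the squares of errors (a notation assumed here), setting the two partial derivatives to zero gives the normal equations:

```latex
\begin{align*}
R(a,b) &= \sum_i (y_i - a - b x_i)^2 \\
\frac{\partial R}{\partial a} &= -2 \sum_i (y_i - a - b x_i) = 0
  \;\Longrightarrow\; \sum_i y_i = n a + b \sum_i x_i \\
\frac{\partial R}{\partial b} &= -2 \sum_i x_i (y_i - a - b x_i) = 0
  \;\Longrightarrow\; \sum_i x_i y_i = a \sum_i x_i + b \sum_i x_i^2
\end{align*}
```

The first condition is the statement that the deviations from the fitted line sum to zero, which is why the deviations above and below the line balance.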
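Solving the normal equations simultaneously for ‘a’ and ‘b’ we obtain:
$$ b = \frac{n\sum_i x_i y_i - \sum_i x_i \sum_i y_i}{n\sum_i x_i^2 - \left(\sum_i x_i\right)^2} \qquad (22.2.6) $$
or
$$ b = \frac{\frac{1}{n}\sum_i x_i y_i - \bar{x}\,\bar{y}}{\frac{1}{n}\sum_i x_i^2 - \bar{x}^2} \qquad (22.2.7) $$
or
$$ b = \frac{\sum_i x_i y_i - n\bar{x}\,\bar{y}}{\sum_i x_i^2 - n\bar{x}^2} \qquad (22.2.8) $$
or
$$ b = r \cdot \frac{Sd_y}{Sd_x} \qquad (22.2.9) $$
where r is the correlation coefficient between the x and y variables, $Sd_y$ is the standard deviation of the y variable, and $Sd_x$ is the standard deviation of the x variable, and
$$ a = \bar{y} - b\bar{x} \qquad (22.2.10) $$
Substituting these values in (22.2.4), we have the desired prediction formula (equation) of y on x as:
y = a + bx
$$ Y = \bar{y} + r\frac{Sd_y}{Sd_x}(x - \bar{x}) = \underbrace{\bar{y} - r\frac{Sd_y}{Sd_x}\bar{x}}_{a} + \underbrace{r\frac{Sd_y}{Sd_x}}_{b}\,x \qquad (22.2.11) $$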
Example 1: The data below were obtained from 5 families and show family size and mean monthly expenditure.
Family 1 2 3 4 5
Family Size (X) 4 3 6 5 2
Expenditure (in hundreds) (Y) 5 2 8 3 4
Let Y = a + bx
By least square estimation, the unknown constants a and b are given by:
$$ b = \frac{\sum_i x_i y_i - n\bar{x}\,\bar{y}}{\sum_i x_i^2 - n\bar{x}^2} \quad \text{and} \quad a = \bar{y} - b\bar{x} $$
xi yi xi² xiyi
4 5 16 20
3 2 9 6
6 8 36 48
5 3 25 15
2 4 4 8
Σxi = 20 Σyi = 22 Σxi² = 90 Σxiyi = 97
$\bar{x} = 20/5 = 4$, $\bar{y} = 22/5 = 4.4$
Then
$$ b = \frac{97 - 5 \times 4 \times 4.4}{90 - 5(4)^2} = \frac{97 - 88}{90 - 80} = \frac{9}{10} = 0.9 $$
Then
$$ a = \bar{y} - b\bar{x} = 4.4 - 0.9 \times 4 = 4.4 - 3.6 = 0.8 $$
Thus the regression equation is given by:
Y = a + bx
Y = 0.8 + 0.9x, the regression equation of Y on X.
In the table, write the family sizes in the first column and the expenditures in the second column. The third column, $x_i^2$, is the square of each entry of the first column, and the fourth column is the product of the paired entries of the first and second columns.
$\sum x_i$ means the sum of the first column, and $\bar{x}$ means the arithmetic mean of the X variable, i.e. $\bar{x} = \frac{1}{n}\sum x_i$, where n is the number of paired observations. In our case n = 5. Similarly, $\sum y_i$ means the sum of the second column, $\sum x_i^2$ means the sum of the third column and $\sum x_i y_i$ means the sum of the fourth column. $\bar{y} = \frac{1}{n}\sum y_i$ is the arithmetic mean of the Y variable.
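The column-by-column procedure just described can be sketched in Python. The function name `fit_y_on_x` is an assumption made for illustration. It computes the slope both from the raw sums and from the means, which are two algebraically equivalent forms of the least square slope, and uses the family size and expenditure data of Example 1.

```python
def fit_y_on_x(xs, ys):
    """Least square fit of Y = a + b*x; returns (a, b, b_from_means)."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length series with at least two pairs")
    n = len(xs)
    sum_x, sum_y = sum(xs), sum(ys)
    sum_x2 = sum(x * x for x in xs)               # third column of the table
    sum_xy = sum(x * y for x, y in zip(xs, ys))   # fourth column of the table
    x_bar, y_bar = sum_x / n, sum_y / n
    # Slope from the raw sums.
    b = (n * sum_xy - sum_x * sum_y) / (n * sum_x2 - sum_x ** 2)
    # The same slope written with the means.
    b_from_means = (sum_xy - n * x_bar * y_bar) / (sum_x2 - n * x_bar ** 2)
    a = y_bar - b * x_bar
    return a, b, b_from_means


family_size = [4, 3, 6, 5, 2]
expenditure = [5, 2, 8, 3, 4]
a, b, b_alt = fit_y_on_x(family_size, expenditure)
print(round(a, 2), round(b, 2), round(b_alt, 2))  # 0.8 0.9 0.9
```

Both slope formulas agree, and the intercept follows from the two means.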
Example 2:
The data below show the demand for a certain commodity versus its price.
xi yi xi² xiyi
10 40 100 400
12 48 144 576
16 52 256 832
14 46 196 644
15 50 225 750
Σxi = 67 Σyi = 236 Σxi² = 921 Σxiyi = 3202
$\bar{x} = 67/5 = 13.4$, $\bar{y} = 236/5 = 47.2$
$$ b = \frac{\sum_i x_i y_i - n\bar{x}\,\bar{y}}{\sum_i x_i^2 - n\bar{x}^2} = \frac{3202 - 5(13.4)(47.2)}{921 - 5(13.4)^2} = \frac{3202 - 3162.4}{921 - 897.8} = \frac{39.6}{23.2} \approx 1.707 $$
and
$$ a = \bar{y} - b\bar{x} = 47.2 - 1.707 \times 13.4 \approx 24.33 $$
Thus the regression equation of Y on X is given by Y = a + bx:
Y = 24.33 + 1.707x
22.5.2 Regression of X on Y
Symbolically, the regression equation of X on Y is expressed as X = a + by (22.2.13), where X is the most probable (computed) value of X and ‘a’ and ‘b’ are constants. Here ‘a’ denotes the level of the fitted line (i.e. the distance of the line directly above or below the origin) and ‘b’ denotes the slope of the line (i.e. the change in the X variable per unit change in the Y variable). The values of ‘a’ and ‘b’ are obtained through the following two normal equations:
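$$ \sum_i x_i = na + b\sum_i y_i $$
$$ \sum_i x_i y_i = a\sum_i y_i + b\sum_i y_i^2 \qquad (22.2.13) $$
$$ b = \frac{n\sum_i x_i y_i - \sum_i x_i \sum_i y_i}{n\sum_i y_i^2 - \left(\sum_i y_i\right)^2} \qquad (22.2.14) $$
$$ = \frac{\sum_i x_i y_i - n\bar{x}\,\bar{y}}{\sum_i y_i^2 - n\bar{y}^2} \qquad (22.2.15) $$
$$ = r \cdot \frac{Sd_x}{Sd_y} \qquad (22.2.16) $$
and
$$ a = \bar{x} - b\bar{y} \qquad (22.2.17) $$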
Example: Given the data below, fit the regression line of X on Y.
X 2 3 7 6 4
Y 6 3 8 1 5
Solution:
xi yi yi² xiyi
2 6 36 12
3 3 9 9
7 8 64 56
6 1 1 6
4 5 25 20
Σxi = 22 Σyi = 23 Σyi² = 135 Σxiyi = 103
$\bar{x} = 22/5 = 4.4$, $\bar{y} = 23/5 = 4.6$
In the regression of X on Y, we use the linear model X = a + by, where a and b are estimated by
$$ b = b_{xy} = \frac{\sum_i x_i y_i - n\bar{x}\,\bar{y}}{\sum_i y_i^2 - n\bar{y}^2} = \frac{103 - 101.2}{135 - 105.8} = \frac{1.8}{29.2} \approx 0.0616 \approx 0.062 $$
and
$$ a = \bar{x} - b\bar{y} = 4.4 - 0.0616 \times 4.6 \approx 4.12 $$
Then the regression equation of X on Y is given by
X = 4.12 + 0.062y
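The same fit can be sketched in Python. The function name `fit_x_on_y` is an assumption made for illustration; it mirrors the mean-based formula for the slope of X on Y and uses the five pairs of this example.

```python
def fit_x_on_y(xs, ys):
    """Least square fit of X = a + b*y; returns (a, b)."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length series with at least two pairs")
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sum_xy = sum(x * y for x, y in zip(xs, ys))
    sum_y2 = sum(y * y for y in ys)
    # Note the denominator uses the y series, not the x series.
    b = (sum_xy - n * x_bar * y_bar) / (sum_y2 - n * y_bar ** 2)
    a = x_bar - b * y_bar
    return a, b


xs = [2, 3, 7, 6, 4]
ys = [6, 3, 8, 1, 5]
a_xy, b_xy = fit_x_on_y(xs, ys)
print(round(a_xy, 2), round(b_xy, 3))  # 4.12 0.062
```

Carrying the unrounded slope through to the intercept is what gives 4.12 for a.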
22.6 PREDICTION
The term prediction usually implies the estimation of a future value of a variate either by
the projection of a trend regression line or by the use of a probabilistic model. Unlike
estimation, which is mainly used to mean the determination of an approximate parameter
value from a sample, prediction is mainly used in association with linear, multivariate or
bivariate regression models. As it can be used as a means of predicting the future value
Y, the regressor X is sometimes also termed as a predictor.
From the two linear regression equations; Y on X and X on Y, given the value of the
independent variable, one can predict the value of the dependent variable.
1 65 68
2 72 74
3 60 63
4 67 62
5 75 80
6 52 50
7 54 53
8 69 66
9 58 61
Solution:
Let the estimated shrinkage be denoted by the X variable and the actual shrinkage be denoted by the Y variable.
a) To regress Y on X and X on Y, let us use the following table for the computation of the unknown constants ‘a’ and ‘b’
X Y X2 XY Y2
$\bar{x} = \frac{572}{9} = 63.56$, $\bar{x}^2 = (63.56)^2 = 4039.9$
$\bar{y} = \frac{577}{9} = 64.11$, $\bar{y}^2 = (64.11)^2 = 4110.1$
$\bar{x}\,\bar{y} = 63.56 \times 64.11 = 4074.8$
n = 9
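The regression equation of Y on X is given by Y = a + bx, where
$$ b = \frac{\sum xy - n\bar{x}\,\bar{y}}{\sum x^2 - n\bar{x}^2} \quad \text{and} \quad a = \bar{y} - b\bar{x} $$
This equation will help us to predict the actual shrinkage given the appraiser’s estimate of shrinkage.
The regression equation of X on Y is given by X = a + by, where
$$ b = \frac{\sum xy - n\bar{x}\,\bar{y}}{\sum y^2 - n\bar{y}^2} \quad \text{and} \quad a = \bar{x} - b\bar{y} = 63.56 - 0.79 \times 64.11 = 12.9 $$
This equation will help us to predict the appraiser’s estimate of shrinkage given the actual shrinkage.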
b) Given x = 70, to predict y we use the regression equation of Y on X, y = -6.11 + 1.1048x, and substitute 70 in place of x, getting
y = -6.11 + 1.1048(70) = 71.23
i.e. if the appraiser’s estimate of shrinkage is 70, then the expected (approximate) actual shrinkage is about 71.23.
c) Given the actual shrinkage y = 78, to predict the appraiser’s estimate of shrinkage we use the regression equation of X on Y formulated in ‘a’ above, i.e. x = 12.9 + 0.79y:
x = 12.9 + 0.79(78) = 74.52
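Prediction is simply substitution into the fitted equation. A minimal Python sketch, using the coefficients quoted in parts b) and c) and a helper name `predict` assumed here for illustration:

```python
def predict(a, b, value):
    """Value on the fitted line a + b*value."""
    return a + b * value


# Y on X: actual shrinkage predicted from an appraiser's estimate of 70.
actual_hat = predict(-6.11, 1.1048, 70)
# X on Y: appraiser's estimate predicted from an actual shrinkage of 78.
estimate_hat = predict(12.9, 0.79, 78)
print(round(actual_hat, 2), round(estimate_hat, 2))  # 71.23 74.52
```

Each direction of prediction uses its own regression equation; the Y on X line is not solved backwards to predict X.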
The two regression lines cross each other at $(\bar{x}, \bar{y})$.
The correlation coefficient can be calculated from the regression coefficients by the relation
$r^2 = b_{yx} \cdot b_{xy}$ …… (22.2.18)
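This relation can be checked numerically. The Python sketch below uses the nine pairs of estimated (X) and actual (Y) shrinkage from the table in this section; the variable names are assumptions made for illustration. Because it works from unrounded sums, the two slopes it produces differ slightly from the rounded figures in the worked solution.

```python
from math import sqrt

estimate = [65, 72, 60, 67, 75, 52, 54, 69, 58]  # X: appraiser's estimate
actual = [68, 74, 63, 62, 80, 50, 53, 66, 61]    # Y: actual shrinkage

n = len(estimate)
sum_x, sum_y = sum(estimate), sum(actual)
sum_xy = sum(x * y for x, y in zip(estimate, actual))
sum_x2 = sum(x * x for x in estimate)
sum_y2 = sum(y * y for y in actual)

# Shared numerator and the two denominators of the raw-sum formulas.
s_xy = n * sum_xy - sum_x * sum_y
s_xx = n * sum_x2 - sum_x ** 2
s_yy = n * sum_y2 - sum_y ** 2

b_yx = s_xy / s_xx              # regression coefficient of Y on X
b_xy = s_xy / s_yy              # regression coefficient of X on Y
r = s_xy / sqrt(s_xx * s_yy)    # Karl Pearson's r

# The two printed numbers should coincide.
print(round(b_yx * b_xy, 4), round(r ** 2, 4))
```

The identity holds because both slopes share the same numerator, so their product is that numerator squared over the product of the two denominators, which is exactly r squared.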
CYP 1
Given bxy = 0.45 and byx = 1.44, find out coefficient of determination
________________________________________________________________________
________________________________________________________________________
________________________________________________________________________
________________________________________________________________________
________________________
1. Correlation literally means the relationship between two or more variables which vary in sympathy, so that a movement in one tends to be accompanied by a corresponding movement in the other(s). On the other hand, regression means stepping back or returning to the average value, and is a mathematical measure expressing the average relationship between the two variables.
2. The correlation coefficient ‘rxy’ between two variables X and Y is a measure of the direction and degree of the mutual linear relationship between the two variables. It is symmetric, i.e. rxy = ryx, and it is immaterial which of X or Y is the dependent variable and which is the independent variable. Regression analysis, by contrast, aims at establishing the functional relationship between the two variables under study and then using this relationship to predict or estimate the value of the dependent variable for any given value of the independent variable. It also reflects the nature of the variables, i.e. which is dependent and which is independent. Regression coefficients are not symmetric in X and Y, i.e. bxy ≠ byx.
3. Correlation need not imply cause and effect relationship between the variables
under study. However, regression analysis clearly indicates the cause and effect
relationship between the variables. The variable corresponding to cause is taken as
independent variable and the variable corresponding to effect is taken as dependent
variable.
5. There may be non-sense correlation between two variables, which is due to pure
chance and has no practical relevance. E.g. the correlation between the size of shoe
and the intelligence of a group of individuals. There is no such thing like non-sense
regression.
6. Correlation analysis is confined only to the study of linear relationship between the
variables and therefore, has limited applications. Regression analysis had much
243
wider applications as it studies linear as well as non- linear relationship between the
variables.
The degree of association between the two variables is based on the ranks of the observations and not on their numerical values. The number indicating the position of a given value in the ranking is termed its rank. Ranks are given to the values of the variable in either ascending or descending order.
Spearman’s rank correlation coefficient is designated by r and is given by
$$ r = 1 - \frac{6\sum_{i=1}^{n} D_i^2}{n^3 - n} $$
where $\sum D_i^2$ is the sum of the squares of the differences of pairs of ranks.
The value of r ranges between –1 and +1. If there is no relationship between the two variables, then its value is 0. If the relationship is perfect, or if all the points on the scatter diagram fall on a straight line, then the value of r is +1 or –1, depending on the direction of the line. Other values of r show an intermediate degree of relationship between the two variables.
Example: -
Find the rank correlation coefficient of the data given below:
Beauty 1.50 1.75 1.60 1.70 1.95 1.90 1.88
Behavior 1.10 1.50 1.20 1.25 1.60 1.44 1.30
Solution: -
Beauty (X) Behavior (Y) Rank of Beauty Rank of Behavior Difference of ranks Di Square of Difference Di²
1.50 1.10 7 7 0 0
1.75 1.50 4 2 2 4
1.60 1.20 6 6 0 0
1.70 1.25 5 5 0 0
1.95 1.60 1 1 0 0
1.90 1.44 2 3 -1 1
1.88 1.30 3 4 -1 1
ΣDi² = 6
$$ R = 1 - \frac{6\sum_i D_i^2}{n^3 - n} = 1 - \frac{6 \times 6}{7^3 - 7} = 1 - \frac{36}{336} = 1 - 0.1071 = 0.8929 $$
The relationship between the two variables is strong.
In general, for a high degree of correlation, which leads to better estimates and prediction, the coefficient of correlation r must have a high value.
Note: if two or more values of a variable are the same, their ranks will be equal. In this case we jump (skip) the rank or ranks that would otherwise follow the rank of the equal values.
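Assigning the ranks and applying the formula can be sketched in Python. The function names are assumptions made for illustration, rank 1 is given to the largest value as in the example, and the treatment of ties (equal values share a rank and the following rank is skipped) is one reading of the note above rather than a rule the text states in full.

```python
def ranks_descending(values):
    """Rank 1 for the largest value; equal values share a rank, next rank(s) skipped."""
    # A value's rank is one more than the number of strictly larger values.
    return [1 + sum(other > v for other in values) for v in values]


def rank_correlation(xs, ys):
    """1 - 6*sum(D^2) / (n^3 - n), with ranks taken from the raw values."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length series with at least two pairs")
    n = len(xs)
    rank_x = ranks_descending(xs)
    rank_y = ranks_descending(ys)
    sum_d2 = sum((p - q) ** 2 for p, q in zip(rank_x, rank_y))
    return 1 - 6 * sum_d2 / (n ** 3 - n)


beauty = [1.50, 1.75, 1.60, 1.70, 1.95, 1.90, 1.88]
behavior = [1.10, 1.50, 1.20, 1.25, 1.60, 1.44, 1.30]
print(ranks_descending(beauty))    # [7, 4, 6, 5, 1, 2, 3]
print(ranks_descending(behavior))  # [7, 2, 6, 5, 1, 3, 4]
print(round(rank_correlation(beauty, behavior), 4))  # 0.8929
```

With tied values, for example `[10, 20, 20, 5]`, this ranking gives `[3, 1, 1, 4]`: the two equal values share rank 1 and rank 2 is skipped.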
CYP
Calculate the coefficient of correlation for the data given below (heights of fathers and sons):
Father (X): 63 65 66 67 67 68
Son (Y): 66 68 65 67 69 70
22.9 ANSWERS TO CHECK YOUR PROGRESS QUESTIONS
1. r² = bxy . byx
= 0.45 × 1.44
= 0.648
22.10 SUMMARY
Regression analysis is a statistical technique with the help of which the values of an unknown variable are estimated on the basis of the known values of another variable. Regression analysis helps to establish the functional relationship between variables, which is done with the help of regression lines. The algebraic expressions of regression lines are known as regression equations.
A regression coefficient indicates the degree and the direction of change in the dependent variable in response to a unit change in the independent variable. Since there are two regression equations for two variables, there are two regression coefficients: one for the regression equation of x on y and the other for the regression equation of y on x. The regression coefficient of x on y indicates the degree and direction of change in the ‘x’ variable in response to a unit change in the ‘y’ variable, and the regression coefficient of y on x indicates the degree and direction of change in the ‘y’ variable in response to a unit change in the ‘x’ variable. The square root of the product of the two regression coefficients is equal in magnitude to the coefficient of correlation between the variables.
X 30 40 75 60 50 42 70 72
Y 40 25 35 40 65 52 60 35
2) Calculate the two regression coefficients and correlation coefficient from the following
data.
X 6.9 8.5 5.8 8.6 9.6 8.0 9.7
Y 2.9 3.8 6.5 2.3 5.5 3.5 3.2
3) You are given that x = 190 , y = 85 , xy = 575, x2 = 15600 and y2 = 7100 for
ten paired observations. Calculate the two regression equations.
(Ans: X on Y : X = 0.613y + 17.16; Y on X : Y = 0.0867X + 83.35)
X 4 2 6 8 10 5 7
Y 7 10 8 9 5 6 4
22.12 GLOSSARY
3. Regression Line: a device used for estimating the value of one variable from the value of the other. It consists of a line drawn through the points in such a manner as to represent the average relationship between the two variables.
22.13 REFERENCES
Gupta S.P. “ Statistical Method”, sultan chand & company, New Delhi
Gupta S.C. “Fundamental of Statistics”, Himalaya Pub. House, Bombay.
Simpson and Kafka “ Basic Statistics” oxford and I.B.H. publishing company,
Calcutta.