Professional Documents
Culture Documents
Statistics
Statistics
STATISTICS
CA FOUNDATION STATISTICS
Dear Student,
The coaching is very exhaustive and wholly concept based. The conceptual
explanations are entirely supported by good problems that cover the
past and the problems which peep into the future. Also, the coaching
is very systematic, well - planned and absolutely time bound. For a
change, say good - bye to mechanical learning. I am sure you will feel
that the study is a pleasurable job and not a painful exercise.
INDEX
3 MEASURES OF DISPERSION
(Average of Second Order)
115-153
7 RANDOM VARIABLE
Theory of Expectation - THEORY
374-399
Introduction:
Statistics in India
• Kautilya recorded birth and death in Arthashastra during Chandragupta Maurya’s
regime.
• Abul Fazal, during Akbar’s regime, recorded agriculture in the book Ain-i-Akbari.
“STATISTICS” DEFINED
IN SINGULAR SENSE IN PLURAL SENSE
It is defined as the scientific method By Statistics, we mean aggregate
of collecting, presenting, analyzing of facts which are known as
the data and drawing inference from “DATA” (Singular Datum).
the same.
Features of Statistics:
a) Statistics deals with masses and not individuals.
b) Statistics deals with quantitative data . Qualitative data are also to be expressed in
quantitative terms.
1
CA FOUNDATION STATISTICS
i) All Statistics are Numerical Statements but all Numerical Statements are not
statistics
APPLICATION OF STATISTICS
Statistics is used in
a) Mathematics
b) Economics
c) Accountancy
d) Auditing
f) Social Science
i) Management.
2
CA FOUNDATION STATISTICS
LIMITATIONS OF STATISTICS
i. Statistics does not study qualitative phenomenon directly.
v. Statistics results are true on the average sense only. They are not exact
ii. Population : It is used to denote the totality of the set of objects under considering.
vi. Continuous & Discrete Variable : A variable which can assume for its value any real
quantity within a specified interval is a continuous variable e.g height, weight etc
and the variables which can assume only whole numbers are discrete variables
eg :-. number of members in the family, no of accidents etc.
3
CA FOUNDATION STATISTICS
CLASSWORK SECTION
Related MCQ’s:
1. Which of the following statement is true?
a) Statistics is derived from the French word “Statistik”.
b) Statistics is derived from the Italian word “Statista”.
c) Statistics is derived from the Latin word “Statistique”.
d) None of these
2. The word statistics is used in ______ senses, namely __________ and ________.
a) two, singular, plural b) two, simple, complicated
c) two, single, combined d) none of the above
7. Statistics is applied in
a) Commerce & Industry b) Business Management
c) Economics d) All of the above
4
CA FOUNDATION STATISTICS
8. Statistics can:
a) prove anything
b) disprove anything
c) neither prove nor disprove anything, is just a tool
d) none of the above
5
CA FOUNDATION STATISTICS
6
CA FOUNDATION STATISTICS
Direct Personal Interview Indirect Interview Telephonic Interview
7
CA FOUNDATION STATISTICS
2. SCRUTINY OF DATA
It means checking the data for accuracy & consistency. Intelligence, patience &
experience is used by scrutinizing the data.
3. CLASSIFICATION OF DATA
Definitions : When the items / individuals are classified, according to some common
non-measurable characteristics processed by them, they are said to form a statistical
class, and when they are classified according to some common measureable
characteristics processed by them, they are said to form a statistical group.
Types of Classifications
Related MCQ’s:
16. A statistical survey may either be ________ purpose or ________ purpose survey.
a) general, specific
b) general, without
c) all, individual
d) none of the above
8
CA FOUNDATION STATISTICS
9
CA FOUNDATION STATISTICS
29. When data are observed __________ the type of classification is known as
chronological classification.
a) for some hours
b) over a period of time
c) seriously
d) none of the above
10
CA FOUNDATION STATISTICS
11
CA FOUNDATION STATISTICS
Presentation of Data
Presentation of Data
Textual
Textual Presentation : It is in written form. It is simple but dull, monotonous &
comparison is not possible
Tabular
Tabular Presentation : Presentation of data with the help of a statistical table having
rows & columns.
1. Title : Every Table must be given a suitable title, which usually appears at the top
of the table (below the table number or next to the table number). A title is meant
to describe in brief and concise form the contents of the table and should be self-
explanatory.
2. Stub : Stubs are the headings or designations for the horizontal rows.
12
CA FOUNDATION STATISTICS
4. Body : The arrangement of the data according to the descriptions given in the captions
(columns) and stubs(rows) forms the body of the table. It contains the numerical
information which is to be presented to the readers and forms the most important
part of the table.
Other Parts :
6. Table Number :
7. Head Note :
8. Foot Note :
9. Source Note
13
CA FOUNDATION STATISTICS
Foot Note :
Source Note :
14
CA FOUNDATION STATISTICS
Types of Tabulatio
Types of Tabulation
Simple Complex
Simple Tabulation : In this type the number or measurement of the items are placed
below the headings showing the characteristics.
Complex Tabulation : In this type each numerical figure in the table is the value of
the measurement having the characteristics shown both by the column and the row
headings.
Related MCQ’s
35. The most accurate mode of data presentation is :
a) Diagrammatic method b) Tabular
c) Textual presentation d) None of the above.
36. When the accuracy in presentation is more important than the method of presentation
it is done through:
a) Textual b) Diagrammatic
c) Tabular d) Either b) or c)
15
CA FOUNDATION STATISTICS
16
CA FOUNDATION STATISTICS
Advantages :
1. It is more attractive and informative to an ordinary person.
3. It reveals the hidden facts which are not apparent from the tabular presentation.
4. Two or more sets of values can be compared very easily from a diagram.
Line Diagram :
It is used for time related data (Time series).
When there is wide range of fluctuations, logarithmic or ratio charts are used.
17
CA FOUNDATION STATISTICS
Bar Diagram
1. Vertical Bar Chart ( or Colum Chart) :
This is generally used to represent a time series data or a data which is classified by
the values of the variable. (Measurable characteristics).
5. Percentage Bars :
Percentag e Bars are particularly useful in statistical work which requires the
portrayal of relative changes.
6. Deviation Bars
Deviation Bars are popularly used for representing net quantities – excess or deficit
i.e. net profit, net loss, net exports or imports, etc. Such bars can have both positive
and negative values. Positive values are shown above the base line and negative
values below it.
18
CA FOUNDATION STATISTICS
7. Broken Bars
In certain series there may be wide variations in values – some value may be very
small and others very large. In order to gain space for the smaller bars of the series,
larger bars may be broken.
Example :
Draw a pie chart to represent the following data on the proposed outlay during a
Five-year Plan of a Government :
19
CA FOUNDATION STATISTICS
ii) The area of each rectangle is proportional to the frequency of the corresponding
class.
20
CA FOUNDATION STATISTICS
iv) It helps us to get an idea about the frequency curve and frequency polygon.
v) Comparison among the frequencies can be made for different class intervals.
Example
The monthly profits in rupees of 100 shops are distributed as follows:
Profits per Shop 0-100 100-200 200-300 300-400 400-500 500-600
No. of Shops 12 18 27 20 17 6
Draw the histogram to the data and hence find the modal value.
In the histogram, the top right corner of the highest rectangle is joined by a straight
line to the top right corner of the preceding rectangle. Similarly, top left corner
of the highest rectangle is joined to the top left corner of the following rectangle.
From the point of intersection of these two lines a perpendicular is drawn on the
horizontal axis. The foot of the perpendicular indicates the Mode. This is read from
the horizontal scale and the modal value is found to be 256 (in `) approximately.
Profits (`)
2. Frequency Polygon and Frequency Curve
i) In this method, the frequency of each class is plotted against the mid-value
of the corresponding class. The points thus obtained are joined successively
by straight lines. The polygon is then completed by joining two end-points to
the mid-values of two empty classes assumed in either side of the frequency
distribution.
ii) Frequency polygon can be obtained from the histogram by joining the successive
21
CA FOUNDATION STATISTICS
mid-points of the top of the rectangles which constitute the histogram and the
polygon is completed in the same manner as before.
iii) If in a frequency distribution the widths of the classes are reduced, then the
number of classes will increase. As a result the vertices of a frequency polygon
will come very close to each other. In that case, if we join the points by smooth
free hand line instead of straight lines, a smooth curve is obtained which is
known as a Frequency Curve.
Example
Draw the cumulative frequency diagram (both more-than and less-than ogive) of
the following frequency distribution and locate graphically the Median:
Marks-Group 0-10 10-20 20-30 30-40 40-50 50-60 60-70 Total
No. of Students 4 8 11 15 12 6 3 59
22
CA FOUNDATION STATISTICS
60 56 3
70 59 0
Class Boundary
23
CA FOUNDATION STATISTICS
2. U Shaped Curve
In this curve, the frequency is minimum at the central part, and slowly but steadily
it reaches to two extremities. The distribution of people travelling on streets will be
exhibited through this kind of curves.
3. J Shaped Curve:
The J Shaped Curve starts with the minimum frequency and then gradually reaches
its maximum frequency at the other extremity. The distribution of commuters in a
particular time interval will be exhibited through this kind of curves.
Diagram
4. Asymmetrical Curves
(A) In case of symmetrical curves or bell shaped curves the
(i) Mean (M) = Median (Me) = Mode (Mo)
ii) Skewness = 0
24
CA FOUNDATION STATISTICS
(B) In case of Asymmetrical curves Mean, Median & Mode are unequal and accordingly
skewness 0
Asymmetrical Curves
Related MCQ’s :
48. In a two-dimensional diagram:
a) only height is considered
b) only width is considered
c) both height and width is considered
d) height, width and thickness are considered
49. Graph is a :
a) Line diagram b) Bar diagram
c) Pie diagram d) Pictogram.
25
CA FOUNDATION STATISTICS
53. Ogive is a
a) line diagram b) bar diagram
c) both a) and b) above d) none of these
56. When the width of all classes is same, frequency polygon has not the same area as
the Histogram :
a) true b) false
c) both a) and b) above d) none of the above
26
CA FOUNDATION STATISTICS
61. The breadth of the rectangle is equal to the length of the class-interval in
a) ogive b) histogram
c) both a) and b) above d) none of these.
64. The curve obtained by joining the points, whose x-coordinates are the upper limits
of the class-intervals and y-coordinates are corresponding cumulative frequencies
is called
a) ogive b) frequency polygon
c) histogram d) frequency curve
66. Less than type and more than type ogives meet at a point known as:
a) Mean b) Median c) Mode d) Quartile
27
CA FOUNDATION STATISTICS
67. If we plot less than and more than type frequency distribution, then the graph
plotted is:
a) Frequency Curve b) Histogram
c) Ogive d) None of these
69. When the two curves of ogive intersect, the point of intersection provides:
a) First Quartile b) Third Quartile
c) Second Quartile d) None of these
74. Arrange the dimensions of Bar Diagram, Cube Diagram, Pie Diagram in sequence.
a) 1, 3, 2 b) 2, 1, 3 c) 2, 3, 1 d) 3, 2, 1
75. The most appropriate diagram to represent the data relating to the monthly
expenditure on different items by a family is:
a) Pie Diagram b) Line Diagram c) Histogram d) Frequency Polygon
28
CA FOUNDATION STATISTICS
FREQUENCY DISTRIBUTION
1. Tabular representation of statistical data is usually made in ascending order of
magnitude relating to measurable characteristics according to individual value or
group of values.
iii) Class Interval (or class) : A large number of observations having wide range, is
usually classified into number of groups. Each of these groups is known as a class.
iv) Class frequency, Total Frequency : The number of observations which is class
contains, is known as its class frequency. The total number of observations in
the frequency distribution is known as ‘Total Frequency’.
v) Class Limit : The two ends of a class interval are known as class limits of that
class. The smaller of the two ends is called LOWER Class Limits and the greater
is called Upper Class Limit. These classification are called non-overlapping or
mutually inclusive classification.
vi) Class Boundaries : When we consider a continuous variable, the observation are
recorded nearest to a certain unit. For example, let us consider the distribution
of weight of a group of persons. If we measure the weight nearest to the pound,
then a class interval like (100-109) will include all the observations between
29
CA FOUNDATION STATISTICS
99.5 lb to 109.5 lb. Similarly, all the observations between 109.5 lb to 119.5
lb will be included in the class interval (110- 119). For the class interval (100-
109), 99.5 is the lower class-boundary and 109.5 is the upper class boundary.
For the class (110-119), the lower and upper class boundary respectively 109.5
and 119.5. These classifications are called overlapping or mutually exclusive
classification.
Class boundaries can be calculated from the class limits by the following rule:
where, d is the common difference between the upper limit of a class and the
lower limit of the next class. d/2 is called the Correction Factor
viii) Width or Size : This is the length of a class and is obtained by the difference
between the upper and lower class boundaries of that class.
ix) Frequency Density : This is defined as the frequency per unit width of the class.
Frequency Density =
30
CA FOUNDATION STATISTICS
x) Relative Frequency : This is the ratio of the class frequency to the total frequency,
i.e. Relative frequency =
2. This distribution is prepared from the grouped frequency distribution by taking the
end values (ie. class boundaries and not class limits)
3. Number of observation less than or equal to the class boundaries are called “Less-
Than” Type Cumulative Frequency Distribution.
4. Number of observation greater than or equal to class boundaries are called “ More-
Than” Type Cumulative Frequency Distribution.
5. It can be made both for discrete series i.e. ungrouped data as well as for grouped
data.
Example 2 :
From the following frequency distribution construct the cumulative frequency distribution:
Weights of 60 students in a class
Weights of 60 students in a class
31
CA FOUNDATION STATISTICS
50-54 14
55-59 6
60-64 2
Total 60
Otherwise
Cumulative Frequency Distribution of weights of 60 students
Here the less than cumulative frequency of the second class is 8. This implies that
there are 8 students whose weights are less than 39.5 kg (the upper boundary of
that class). The more than cumulative frequency of the second class is 57, i.e. there
are 57 students whose weights are more than 34.5 kg(the lower boundary of that
class).
32
CA FOUNDATION STATISTICS
Example 3 :
(a) Marks CF (Less than) C.I Frequency
Less than 20 5 10-20 5
Less than 30 18 20-30 13
Less than 40 30 30-40 12
Less than 50 35 40-50 5
---- -----
N= 35 = f
Related MCQ’s
76. The number of observations corresponding to a particular class is known the _____
of that class.
a) frequency b) weight c) power d) both c) and a) above
33
CA FOUNDATION STATISTICS
82. For determining the class frequency it is necessary that these classes are:
a) Mutually exclusive b) Not mutually exclusive
c) Independent d) None of these
34
CA FOUNDATION STATISTICS
88. When one end of a class is not specified, the class is called.
a) closed-end class b) open-end class
c) both a) and b) above d) neither a) nor b) above
89. When all classes have equal width, the heights of the rectangles in Histogram will
be numerically equal to the
a) class frequencies b) class boundaries
c) both a) and b0 above d) none of the above
91. Most extreme values which would ever be included in a class interval are called:
a) Class Interval b) Class Limits
c) Class Boundaries d) None of the above
35
CA FOUNDATION STATISTICS
96. For a particular class boundary, the less than cumulative frequency and more than
cumulative frequency add up to
a) Total Frequency b) 50% of the total Frequency
c) Either a) or b) d) None
Theory Answers
1 b 21 d 41 c 61 b 81 a
2 a 22 b 42 d 62 a 82 a
3 b 23 a 43 a 63 c 83 b
4 a 24 b 44 b 64 a 84 b
5 c 25 d 45 d 65 d 85 b
6 b 26 c 46 d 66 b 86 c
7 d 27 a 47 d 67 c 87 a
8 c 28 a 48 c 68 b 88 b
9 c 29 b 49 a 69 c 89 a
10 c 30 c 50 b 70 c 90 b
11 c 31 c 51 a 71 c 91 c
12 c 32 d 52 b 72 b 92 b
13 b 33 b 53 d 73 d 93 a
14 b 34 d 54 b 74 a 94 c
15 b 35 b 55 a 75 a 95 a
16 a 36 c 56 b 76 a 96 a
17 c 37 a 57 b 77 c
18 a 38 c 58 d 78 a
19 b 39 a 59 c 79 a
20 a 40 d 60 a 80 d
36
CA FOUNDATION STATISTICS
Numerical Problems
In 1995, out of the 2,000 students in a college; 1,400 were for graduation and the
rest of Post-Graduation (PG). Out of 1,400 Graduate students 100 were girls, in all
there were 600 girls in the college. In 2000, number of graduate students increased
to 1,700 out of which 250 were girls, but the number of PG students fall to 500 of
which only 50 were boys. In 2005, out of 800 girls 650 were for graduation, whereas
the total number of graduates was 2,200. The number of boys and girls in PG
classes were equal.
98. Calculate the percentage increase in the number of graduate students in 2005 as
compared to 1995.
99. Out of 1000 persons, 25% were industrial workers and the rest were agricultural
workers. 300 persons enjoyed world cup matches on TV. 30% of the people who
had not watched world cup matches were industrial workers. What is the number
of agricultural workers who had enjoyed world cup matches on TV?
a) 230 b) 240 c) 250 d) 260
100. The class marks of a distribution are: 26, 31, 36, 41, 46, 51, 56, 61, 66, 71. Find the
true class limits.
101. When the class intervals are 10 – 19, 20 – 29, 30 – 39, ... .... ... Upper class boundaries
(UCB) and the Upper class limits (UCL) of the 2nd class interval are:
a) 29, 29 b) 20, 29 c) 29.5, 29.5 d) 29.5, 29
102. The class mark of the Class Intervals: 10 – 20, 20 – 30, 30 – 40, are:
a) 15, 25, 35 b) 14.5, 24.5, 34.5
c) 30, 50, 70 d) None of the above
103. From the following data find the number of class intervals, if class length is given
as 5:
73, 72, 65, 41, 54, 80, 50, 46, 49, 53
a) 5 b) 6 c) 7 d) 8
37
CA FOUNDATION STATISTICS
104. The marks obtained by 30 students in a class test, out of 50 marks, according to
their roll numbers are: 41, 25, 33, 12, 21, 19, 39, 19, 21, 12, 1, 19, 17, 12, 19, 17,
12, 17, 17, 41, 41, 19, 41, 33, 12, 21, 33, 5, 1, 21. If the data are arranged in the
form of a frequency distribution with classes 1-10, 11-20, 21-30, 31-40, 41-50
then the frequencies of these 5 class intervals are:
a. 4, 13, 5, 4, 4 b. 3, 14, 5, 4, 4
c. 4, 13, 4, 5, 4 d. None of the above
105. The number of accidents for several days in a locality is given below:
No. of accidents : 0 1 2 3 4 5 6
Frequency : 15 19 22 31 9 3 2
What is the number of cases when 3 or less accident occurred?
a) 56 b) 6 c) 68 d) 87
107. Find the number of observations between 250 and 300 from the following data :
Value More than: More than More than More than More than
200 250 300 350
No. of observation: 56 38 15 0
a) 56 b) 23 c) 15 d) 8
108. Cost of sugar in a month under the heads raw materials, labour, direct production
and others were 12, 20, 35 and 23 units respectively. The difference between their
central angles for the largest and smallest components of the cost of sugar is (in
degrees):
a) 48 b) 56 c) 72 d) 92
38
CA FOUNDATION STATISTICS
6 – 10 8
11 – 15 9
16 – 20 12
21 – 25 31
26 – 30 20
31 – 35 11
Total 100
39
CA FOUNDATION STATISTICS
HOMEWORK SECTION
40
CA FOUNDATION STATISTICS
7. Find the number of observations between 350 and 400 from the following data:
Value: More than 200 More than 350 More than 400 More than 450
No. of 48 25 12 0
observations:
(a) 13 (b) 15 (c) 17 (d) 19
8. When the width of all classes is same, frequency polygon has not the same area as
the Histogram:
(a) False (b) True (c) Both (d) None
11. Cost of sugar in a month under the heads raw materials, labour, direct production
and others were 12, 20, 35 and 23 units respectively. What is the difference between
the central angles for the largest and smallest components of the cost of sugar?
(a) 72° (b) 48° (c) 56° (d) 92°
41
CA FOUNDATION STATISTICS
15. Most extreme values which would ever be included in a class interval are called:
(a) Class Interval (b) Class Limits
(c) Class Boundaries (d) None of these
16. In 2000, out total of 1,750 workers of factory, 1,200 were members of a trade union.
The number of women employed was 200 of which 175 did not belong to a trade
union. In 2004, there were 1,800 employees who belong to a trade union and 50
who did not belong to trade union. Of all the employees in 2004, 300 were women
of whom only 8 did not belong to the trade union. On the basis of this information,
the ratio of female members of the trade union in 2000 and 2004 is:
(a) 292 : 25 (b) 8 : 175
(c) 175 : 8 (d) 25 : 292
19. Out of 1000 persons, 25 per cent were industrial workers and the rest were
agricultural workers. 300 persons enjoyed world cup matches on T.V. 30 per cent of
the people who had not watched world cup matches were industrial workers. What
is the number of agricultural workers who had enjoyed world cup matches on TV?
(a) 230 (b) 250 (c) 240 (d) 260
42
CA FOUNDATION STATISTICS
25. From the following data find the number class intervals if class length is given as 5.
73, 72, 65, 41, 54, 80, 50, 46, 49, 53.
(a) 6 (b) 5 (c) 7 (d) 8
26. The most appropriate diagram to represent the data relating to the monthly
expenditure on different items by a family is
(a) Histogram (b) Pie-diagram
(c) Frequency polygon (d) Line graph
43
CA FOUNDATION STATISTICS
31. Less than type and more than type gives meet at a point known as:
(a) Mean (b) Median (c) Mode (d) None
32. Arrange the dimensions of Bar diagram, Cube diagram, Pie diagram in sequence.
(a) 1, 3, 2 (b) 2, 1, 3 (c) 2, 3, 1 (d) 3, 2, 1
35. If we plot less than and more than type frequency distribution, then the graph
plotted is _____.
(a) Histogram (b) Frequency Curve
(c) Ogive (d) None of these
44
CA FOUNDATION STATISTICS
40. The statistical measure computed from the sample observations alone have been
termed as
(a) estimate (b) parameter (c) statistic (d) attribute
41. When the two curves of ogive intersect, the point of intersection provides:
(a) First Quartile (b) Second Quartile
(c) Third Quartile (d) Mode
42. The Choronological classification of data are classified on the basis of:
(a) Attributes (b) Area (c) Time (d) Class Interval
43. Arrange the following dimension wise: pie-diagram, bar-diagram and cubic
diagram.
(a) 1, 2, 3 (b) 3, 1, 2 (c) 3, 2, 1 (d) 2, 1, 3
45
CA FOUNDATION STATISTICS
47. The data given below refers to the marks gained by a group of students:
Marks Below 10 Below 20 Below 30 Below 40 Below 50
No. of Students 15 38 65 84 100
Then the no. of students getting marks more than 30 would be _____.
(a) 50 (b) 53 (c) 35 (d) 62
50. Difference between the maximum and minimum value of a given data is called
(a) Width (b) Size (c) Range (d) Class
51. If class interval is 10 – 14, 15 – 19, 20 – 24, then the first class is
(a) 10 – 15 (b) 9.5 – 14.5 (c) 10.5 – 15.5 (d) 9 – 15
52. Difference between the upper and lower boundary of a class is called _____.
(a) Class interval (b) Mid value
(c) Class boundary (d) Frequency
53. There were 200 employees in an office in which 150 were married. Total male
employees were 160 out of which 120 were married. What was the number of
female unmarried employees?
(a) 30 (b) 10 (c) 40 (d) 50
46
CA FOUNDATION STATISTICS
56. The most appropriate diagram to represent the five – year plan outlay of India in
different economic sectors is:
(a) Pie diagram (b) Histogram
(c) Line-Graph (d) Frequency Polygon
57. If the fluctuations in the observed value are very small as compared to the size of
the item, it is presented by:
(a) Z chart (b) Ogive curve
(c) False base line (d) Control chart
59. 100 persons are classified into male / female and graduate / non-graduate classes.
This data classification is:
(a) Cardinal data (b) Ordinal data
(c) Spatial Series data (d) Temporal data
60. If we draw a perpendicular on x-axis from the point of inter-section of both ‘less
than’ and ‘more than’ frequency curves we will get the value of _____
(a) mode (b) median
(c) arithmetic mean (d) third quartile
61. Histogram is used for the presentation of the following type of series
(a) Time series (b) Continuous frequency distribution
(c) Discrete frequency distribution (d) Individual observation
47
CA FOUNDATION STATISTICS
62. Curves obtained by joining the points whose x coordinates are the upper limits of
the class intervals and y coordinates are the corresponding cumulative frequencies
is called
(a) Frequency Polygon (b) Frequency curve
(c) Histogram (d) Ogive
63. The number of observations between 150 and 200 based on the following data is
Value More than 100 More than 150 More than More than
200 250
No. of observations: 76 63 28 05
(a) 46 (b) 35 (c) 28 (d) 23
64. The number of car accidents in several days in a locality are given below:
No. of accidents: 0 1 2 3 4 5 6 7
Frequency: 12 9 11 13 8 9 6 3
What will be the number of cases when 4 or more accidents occurred?
(a) 32 (b) 41 (c) 26 (d) 18
68. Find the number of observation between 250 and 300 from the following data:
Value more than 200 250 300 500
No. of observation 56 38 15 0
(a) 38 (b) 23 (c) 15 (d) None of the above
48
CA FOUNDATION STATISTICS
73. If the data represent costs spent on conducting an examination under various heads,
then the most suitable diagram will be:
(a) Pie diagram (b) Frequency diagram
(c) Bar diagram (d) Multiple bar diagram
74. The point of intersection of less than ogive and greater than ogive curve gives us:
(a) Mean (b) Mode
(c) Median (d) None of the above
49
CA FOUNDATION STATISTICS
80. Data are said to be ______ if the investigator himself is responsible for the collection
of the data.
(a) Primary data
(b) Secondary data
(c) Mixed of primary and secondary data
(d) None of the above
81. A suitable graph for representing the portioning of total into sub parts in statistics
is:
(a) A Pie chart (b) A pictograph (c) An ogive (d) Histogram
82. The number of times a particular items occurs in a class interval is called its:
(a) Mean (b) Frequency
(c) Cumulative frequency (d) None of the above
50
CA FOUNDATION STATISTICS
84.
Class 0 – 10 10 – 20 20 – 30 30 – 40 40 – 50
Frequency 4 6 20 8 3
For the class 20 – 30. Cumulative frequency is:
(a) 10 (b) 26 (c) 30 (d) 41
85. Which of the following graph is suitable for cumulative frequency distribution?
(a) Ogive (b) Histogram (c) G.M. (d) A.M.
51
CA FOUNDATION STATISTICS
HOMEWORK SOLUTION
2. (b) According to the History of Statistics we can see that one school of thought is of
the view that statistics is derived from the Italian word ‘Statist’.
45
` 2,500 = 90 × 100 = 50%
4. (d) The source of data, if any, in any kind to tabulation is shown in the footnote.
5. (c) Divided Bar Chart, also known as percentage Bar Diagrams, is good for both the
things i.e. for comparing different components of a variable as well as the relating
of the different components to the whole.
6. (b) Relative frequency of a class interval is defined as the ratio of the class frequency
to the total frequency. Therefore, Relative frequency for a particular class lies
between 0 and 1 both inclusive.
7. (a) The number of observation which are more than 350 in inclusive of those
observation which are more than 400 and 450.
\ Deducting those number of observations which are more than 400 and 450 from
the number of observations which are 350, we will get the number of observations
lying between 350 and 400.
8. (a) When the width of all classes is same frequency, polygon has the same area as
the histogram.
52
CA FOUNDATION STATISTICS
12
i.e. 90 × 360° = 140
Smallest component of cost of sugar
= 12 units
12
i.e. 90 × 360° = 48°
\ Difference between the central angles for the largest and smallest components
of the cost of sugar
= 140° – 48° = 92°
12. (b) Frequency density of a class interval is defined as the ratio of the frequency of
that class interval to the corresponding class length.
13. (c) Multiple Bar Chart also known as Grouped Bar Chart is one dimensional diagram
in which two or more bars adjoining each other are constructed to represent the
values of different variables or the values of various components of the same
variable.
Multiple Bar Chart or Grouped Bar Chart is considered to compare two or more
related series.
53
CA FOUNDATION STATISTICS
14. (a) Histogram is a graph that represents the class frequencies in a frequency
distribution by vertical adjacent rectangles. A Histogram is two-dimensional, i.e. a
histogram comprises of both length as well as the width. As the Product of length
and width indicates the area. Therefore Histogram is referred to as an Area Diagram.
Its area represents the total frequency as distributed through the classes.
15. (c) Most extreme values which would be ever included in a class-interval are called
as class boundaries, also referred to as actual class limit, are defined as the limits
up to which the two limits, (actual) of each class may be extended to fill up the gap
that exist between the classes.
16. (d) Title: Sex distribution of Trade Union and Non-union members.
Required ratio of female members of the trade union is 2000 : 2004 = 25 : 292.
54
CA FOUNDATION STATISTICS
18. (c) The bell-shaped curve looks like a bell. On a bell-shape curve, the frequency,
starting from a rather low value, gradually reaches the maximum value, somewhere
near the central part and then gradually decreases to reach is lowest value at the
other extremity. Similar is the case of profits of a company. It rises till the resources
are fully utilized and if the resources are still utilized then due to over-utilization of
resources, the profits start declining. This can be clearly depicted through the data
given below:
19. (d)
Category T.V. NTV Total
Agricultural workers 260 490 750
Industrial workers 40 210 250
Total 300 700 1000
Therefore, number of agricultural workers who had enjoyed world cup matches on
T.V. = 260.
20. (c) Ogives are considered for obtaining quartiles graphically. If a perpendicular is
drawn from the point of intersection of the two o-gives, i.e. less than type ogive and
more than type give, on the horizontal axis, then x-value of this point gives us the
value of median, the second or middle quartile.
55
CA FOUNDATION STATISTICS
21. (b) Indirect oral investigation is a method in which a third person is contacted who
is expected to know the necessary details about the persons for whom the enquiry
is meant. This method is suitable when it is not possible or deliverable to approach
informant directly.
23. (d) Each column is given a heading to explain what the figures in the columns
represent. These column headings of a table are known as caption.
24. (a) The Government source like Indian trade journal – weekly, reserve Bank of
Indian Bulletin – monthly, etc and International sources like WHO, World Bank, IMF,
etc are some of the important sources of secondary data.
27. (b) Option (b) represents statistical data which can be understood by referring the
definition of statistics keeping note of the following points.
1. Statistics are aggregate of facts. A single figure cannot be called as statistics
because it cannot be compared to draw any conclusion out of it.
2. All statistical facts are expressed in numbers. Qualitative expressions like
young, old, etc do not constitute statistics.
3. Statistics should be placed in relation to each other so as to facilitate comparison.
For this purpose, the data must be homogenous and not heterogenous. e.g.
height and weight are heterogenous in character.
56
CA FOUNDATION STATISTICS
28. (c) Given data represents unclassified and ungrouped data. Therefore, the given
series is an individual series.
31. (b) By plotting cumulative frequency against the respective class boundary, we
get Ogives. There are two type of ogives:
(i) Less than type ogive.
(ii) More than type ogive.
Olives may be considered for obtaining quartiles graphically. If a perpendicular is
drawn from the point of intersection of two ogives on the horizontal axis, then
then the x-value of this point gives us the value of median, the second or middle
quartile.
Therefore, the meeting point of less than type ogive and more than type ogive is
known as ‘Median’.
35. (c) If we plot less than and more than type frequency distribution, then the graph
plotted is Ogive.
Ogive are of two types – Less than type ogive and more than type ogive. [self-
explanatory]
57
CA FOUNDATION STATISTICS
37. (c) Olives are considered for obtaining quartiles graphically. If a perpendicular is
drawn from point of intersection of two Olives on horizontal axis, then x-value of
this point gives us the value of median (2nd or middle quartile).
41. (b) We know, that the two curves viz. Less than Ogive & More than Ogive intersect
at a point called Median or we can say Second Quartile.
42. (c) Chronological Classification data are classified on the basis of ‘Time’.
Cubic-Diagram: Three Dimensional Diagram (3) are those in which three dimensions
viz length, breadth & height are taken into account used when these is wide range
of data and three different but inter-related features of data are to be represented
simultaneously.
58
CA FOUNDATION STATISTICS
44. (c)
Class Cumulative Freq. Frequency
0–10 5 5
10–20 13 13 – 5 = 8
20–30 28 28 –13 = 15
30–40 34 34 – 28 = 6
40–50 38 38 – 34 = 4
46. (b) We can calculate partition values with the help of O’Give Curve for graphical
representation.
47. (c) Converting the given “Less than” type frequency distribution to Normal
frequency distribution:
Class Interval (f) frequency
0 – 10 15
10 – 20 23
20 – 30 27
30 – 40 19
40 – 50 16
Hence,
The no. of students getting marks more than 30 is 19 + 16 = 35.
48. (c) In exclusive series, upper limit is not included in class frequency.
49.
240
= 720
X 360 = 120
50. (c) Difference between the maximum and minimum value of given data is called
Range.
59
CA FOUNDATION STATISTICS
First class is (10 – 0.5) – (14 + 0.5)
= 9.5 – 14.5
52. (a) The difference between the upper and lower boundary of class is called class
interval (class width).
55. (b) To Draw Histogram, the frequency distribution should be exclusive type.
57. (c) If the fluctuations in the observed value are very small as compared to the size
of the item, it is present by false base line.
60. (b) If we draw a perpendicular on x-axis from the point of intersection of both
‘less than’ and ‘more than’ frequency curve. We will get the value of ‘Median’.
60
CA FOUNDATION STATISTICS
61. (b) Histogram is used for the presentation to the continuous frequency distribution
of the series.
62. (d) Curve obtained by joining the points whose x co-ordinate are the upper limits of
the class intervals and y co-ordinates are the corresponding cumulative frequencies
is called ‘o’ give.
63. (b)
C.I. Frequency
100 – 150 76 – 63 = 13
150 – 200 63 – 28 = 35
200 – 250 28 – 05 = 23
250 – 300 05
The no. of observation b/w 150 and 200 is 35.
64. (c)
No. of Accident 0 1 2 3 4 5 6 7
Frequency 12 9 11 13 8 9 6 3
No. of Cases when 4 or more Accidents occurred
= 8 + 9 + 6 + 3 = 26
65. (a) The most common form of diagrammatic representation of a group frequency
distribution is Histogram.
67. (a) The chart that uses logarithm of variable is known as Ratio Chart.
68. (b)
C.I. Frequency
200 – 250 56 – 38 = 18
250 – 300 38 – 15 = 23
300 – 350 15 – 0 = 15
350 – 400 0–0=0
No. of observation b/w 250 and 350 = 23.
61
CA FOUNDATION STATISTICS
69. (b) Data collected on religion from the census reports are secondary data.
70. (d) Personal interview method and telephone interview method are the interview
method.
71. (c) Profit made by XYZ Bank is different years refer to a continuous variable.
72. (d) Mode of presentation data are textual presentation and tabulation.
73. (a) If the data represent cost spent on conducting an examination under various
heads then the most suitable diagram will be Pie diagram.
74. (c) The point of intersection of less than 0give and greater than 0give curve gives
us Median.
75. (d) ‘Stub’ of a table is the left part of the table describing the rows.
77. (d) Divided Bar Chart is considered for comparing different components of a
variable and the relation of different components to the table.
80. (a) Data are said to be Primary data if the Investigator himself is responsible for
the collection of the data.
81. (a) A suitable graph for representating the portioning of total into sub parts in
statistics is a Pie chart.
82. (b) The number of times a particular items occurs in a Class Interval is called its
Frequency.
62
CA FOUNDATION STATISTICS
84. (c)
C.I. F C.F.
0 – 10 4 4
10 – 20 6 10
20 – 30 20 30
30 – 40 8 38
40 – 50 3
Cumulative frequency of Class Interval ’20 – 30’ is 30.
63
CA FOUNDATION STATISTICS
64
CA FOUNDATION STATISTICS
10. Charts and graphs are the presentation of numerical facts by means of:
a) Symbols b) Points and Lines
c) Area and other Geometrical forms d) All of the above
11. The diagrams for the preparation of which both length and width are considered,
are called:
a) Sub-divided bar diagram b) Multi bar diagram
c) Percentage sub-divided diagram d) Two-dimensional diagram
65
CA FOUNDATION STATISTICS
15. When there is an increase in a series at constant rate, the graph will be:
a) Convex curve b) Concave curve
c) A straight line from left bottom to right top
d) A straight line from left top to right bottom
18. The headings of the rows given in the first column of a table are called:
a) Captions b) Sub-titles c) Stubs d) Prefatory notes
19. Ogives for more than type and less than type distribution intersect at:
a) Mode b) Quartile c) Mean d) None of the above
66
CA FOUNDATION STATISTICS
a) 12 b) 18 c) 21 d) 24
24. The given histogram shows a frequency distribution of marks obtained by 56 students
in statistics. Find the number of students securing marks between 70 and 100.
a) 2 b) 4 c) 6 d) 8
25. Study the following data. If data is presented on a pie-chart. Find the difference
between central angle of C and E.
City A B C D E
Population (in Cr) 10 6 1 0.5 0.5
67
CA FOUNDATION STATISTICS
EXPLANATORY
ANSWERS
1. In plural sense, statistics means a set of numerical figures, related to any sphere of
enquiry. Option C
3. Mistakes in collection of primary data can be due to error by the investigators and
informants both. Option D
5. Questionnaire is the best method to collected data from large educated population
with least cost and in minimum time. Option C
6. As the number of observations and class increase, the frequency polygon tends to
becomes flat or starts smoothening. Option A
7. All the three statements are correct. None of them is NOT CORRECT, Option D
10. Both charts and graphs use symbols, points, lines, area, and geometric forms.
Option D
11. Two-dimensional diagrams use both length and width of the data. Option D
12. In continuous data classes may be represented by fractions but not in discrete data
classes. Option C
68
CA FOUNDATION STATISTICS
13. Class width or length of a class interval is the difference between upper and lower
boundaries of the class. Option C
14. Class-mark or mid-point of a class is obtained by dividing the total of upper and
lower limit by 2. Option C
15. An increasing graph at a constant rate is shown by a upward sloping straight line
from left to right on a graph paper. Option C
16. With the help of histograms, one can draw a frequency curve, frequency polygon
and even a frequency distribution. Option D
18. The heading of the rows given in the first column are mentioned in stub of a table.
Option C
19. The intersecting point of the more than type and less than type Ogives gives Median
of the distribution. Option D
21. The given series is time based. It is a time series data. Option D
22. The given distribution is a cumulative frequency distribution for exclusive type class.
Option C
24. Total frequency = 56. Total frequency on graph = 0.5 + 1 + 1.5 + 1 + 0.5 + 0.5 + 0.2
+ 0.2 + 0.2 = 5.6. Thus 0.1 points on graph represents 1 unit of frequency. Students
securing marks between 70 and 100 = 0.2 + 0.2 + 0.2 = 0.6 points of graph = 6
students. Option C
25. Total population = 10 + 6 + 1 + 0.5 + 0.5 = 18 representing 360o on the pie chart.
Difference be-tween C and E = 1 – 0.5 = 0.5 Cr = 0.5 * 360o/18 = 10o Option B
69
CA FOUNDATION STATISTICS
Introduction:
• Central tendency is defined as the tendency of the data to concentrate towards the central
or middle most region of the distribution.
• The tendency of the variables to accumulate at the center of the distribution (data) is
known as measures of central tendency.
Average
Mathematical Avg. Positional Avg.
A. M G. M H. M Median Mode
1. It should be simple to understand. (Mean, Median & Mode are easy to compute)
2. It should be based on all the observations. (AM,GM,HM are based on all the observations)
70
CA FOUNDATION STATISTICS
4. It should not be affected by extreme values ( Median & Mode are not affected by
extreme values.
ARITHMETIC MEAN
• It is the best measure of central tendency and most commonly used measure
• The only drawback of this measure is that it gets highly affected by presence of extreme
values in the distribution.
• Calculation of AM
1. For Simple series: A.M. =
2.
71
CA FOUNDATION STATISTICS
Where,
X = mid-values or original values if it is a discreet series
a = Assumed Mean i.e., a value arbitrarily chosen from mid-values or any other
values
I = class width or any arbitrary value
PROPERTIES
1. If all values of the variable are constant, then AM is constant.
5. If the frequencies of variable increases or decreases by the same proportion, the value of
AM will remain unaltered.
6. Weighted AM of first “n” natural numbers, when the values are equal to their
corresponding weights, will be given by
7. Sum of squares of deviation is minimum when the deviation is taken from AM.
Where,
= mean of the first group
= mean of the second group
= number of samples in the first group
= number of samples in the second group
72
CA FOUNDATION STATISTICS
1.
2.
8. GM is particularly useful in cases where we have to find out average rates or ratios of
quantities which are changing at a cumulative rate, i.e., the change is related to the
immediate preceding data. For example, average rate of depreciation by WDV method or
average rate of growth of population.
10. GM is the most difficult average to calculate and understand because it involves the
knowledge of logarithms.
11. Logarithm of GM of “n” observations is equal to the AM of the logarithm of these “n”
observations.
13. If all the observations assumed by a variable constant, say K, then the GM of the
observations is also K
73
CA FOUNDATION STATISTICS
14. GM of the product of two variables is the product of their GM’s i.e.,
if z = xy,
then GM of z = (GM of x) . (GM of y)
15. GM of the ratio of two variables is the ratio of GM’s of two variables i.e.,
if z = x/y
then GM of
n1 log G1 + n2 log G2
16. Combined GM: log G12
n1 + n2
2.
4. HM has a very restricted use, and they are usually used for calculating average speed,
average rates of quantities, etc.
7. It is less affected by extreme values and sampling fluctuations compare to AM and GM.
8. If y = ax then
HM(y) = a HM (x) | GM(y) = a GM (x)
9. If all the observations are constant, HM is constant
74
CA FOUNDATION STATISTICS
When the average to be calculated is of the form a/b, where a and b are different quantities
then
i. Use HM when ‘a’ is constant
ii. Use AM when ‘b’ is constant
For eg,
Avg. speed = ? Distance = same (given)
Distance
Use H. M we know that Speed
Time
Avg. speed = ? Time = same (given)
Use A. M
3.
MEDIAN:
1. Median is defined as the positional average and is regarded as the second best average
after arithmetic mean.
2. Median is suitable when there is a wide range of variation in data or distribution pattern
is to be studied at a varying level.
75
CA FOUNDATION STATISTICS
6. The absolute sum of deviations is minimum when the deviations are taken from Median,
and this property of Median is known as “Minimal Property”.
Calculation
For Simple Series
Median = value corresponding to (n + 1)/2th term in the distribution
Note 2: If the value of (n+1)/2th term is a fraction then the average of the values within which
it is lying is the median.
Note 3: If n is odd median = simply the middle most value and if n is even median = average
of 2 mid values
l1 = Lower boundary of the median class i.e., the class where Cumulative Frequency N/2
falls
N = Total frequency
F = Cumulative frequency of the pre-median class.
fm = Frequency of the median class
i = Width of the median class
76
CA FOUNDATION STATISTICS
MODE
1. Mode is that value of the distribution which occurs with highest frequency.
2. Mode is a crude method of finding out average and it provides only a Bird’s Eye view of
the distribution.
3. It is the most unstable average and the quickest method of finding out the average where
we need to find out the most common value of the distribution
10. When the classes are of unequal width, we consider frequency densities instead of class
frequency to locate mode,
where frequency density = Class Frequency
Width of the Class
2. For simple frequency distribution Mode can be calculated by mere inspection. The variable
occurring with the highest frequency is the mode of the distribution. A distribution can be
uni-modal or bi-modal, but not multi-modal.
77
CA FOUNDATION STATISTICS
o If only one value of variable occurs with the highest frequency, then there is only one mode.
o If two values of variable occurs with the same highest frequency, then there are two modes.
o If all values of variable occurs with same frequency, then there is no mode.
o If more than two values of variable occurs with same highest frequency, then also
there is no mode.
L1 = Lower boundary of the modal class i.e., the class with highest frequency.
fm = Frequency of the modal class
f1 = Frequency of the pre-modal class
f2 = Frequency of the post-modal class
i = Class width
1. When in a distribution all the measures of central tendencies are equal, the distribution is
said to be symmetrical.
3. Any deviation from this symmetry makes the distribution asymmetrical or skewed.
• Quartiles
o There are 3 quartiles (Q1, Q2, Q3), which divides the distribution in 4 equal parts
representing 25%, 50% and 75% of the data respectively.
78
CA FOUNDATION STATISTICS
• Deciles
o There are 9 deciles (D1, D2, ......, D9), which divides the distribution in 10 equal parts
representing 10%, 20% ....... 90% of the data respectively.
• Percentiles
o There are 99 percentiles (P1, P2, ......, P99), which divides the distribution in 100 equal
parts representing 1%, 2% ..... 99% of the data respectively.
• Note
o All partition values are dependent on the change of Origin and Scale.
79
CA FOUNDATION STATISTICS
Simple
Frequency Dist
Group
Frequency Dist
80
CA FOUNDATION STATISTICS
CLASSWORK SECTION
AIRHTEMATIC MEAN
7. If there are two groups containing 30 and 20 observation and having 50 and 60 as
arithmetic means, then the combined arithmetic mean is:
a) 52 b) 54 c) 55 d) 56
9. The average salary of a group of unskilled workers is ` 10,000 and that of a group
of skilled workers is ` 15,000. If the combined salary is ` 12,000, then what is the
percentage of skilled workers?
(a) 40% (b) 50% (c) 60% (d) none of these
81
CA FOUNDATION STATISTICS
10. If the relationship between two variables u and v are given by 2u + v + 7 = 0, and if
AM of u is 10, then the AM of v is:
a) 17 b) –17 c) 27 d) – 27
11. The arithmetic mean of a variable y is 10. Find the mean of the variable 5y – 10.
a) 50 b) 40 c) 25 d) 30
GEOMETRIC MEAN
15. The interest paid on the same sum yielding 3%, 4%, and 5% compound interest for
3 consecutive year respectively. What is the average yield percent on the total sum
invested.
(a) 3.83% b) 4.83% c) 2.83% d) 3.99%
16. What is the GM for the numbers 8, 24 and 40?
3
(a) 24 (b) 12 (c) 8 x (d) 10
HARMONIC MEAN
82
CA FOUNDATION STATISTICS
19. An aeroplane flies from A to B at the rate of 500 km/hour and comes back from B
to A at the rate of 700 km/hour. The average speed of the aeroplane is
(a) 600 km. per hour (b) 583.33 km. per hour
(c) 100 km. per hour (d) 620 km. per hour.
20. The interest paid on each of three different sums of money yielding 3%, 4%, and 5%
simple interest p.a. respectively is the same. What is the average yield percent on
the total sum invested.
a) 3.83% b) 4.83% c) 2.83% d) None of these
21. If there are two groups with 75 and 65 as harmonic means and containing 15 and
13 observation then the combined HM is given by
(a) 65 (b) 70.36 (c) 70 (d) 71.
22. If the AM and HM for two numbers are 5 and 3.2 respectively then the GM will be
(a) 16.00 (b) 4.10 (c) 4.05 (d) 4.00.
23. If the AM and GM for 10 observations are both 15, then the value of HM is
(a) Less than 15 (b) More than 15
(c) 15 (d) Can not be determined
24. If the AM and GM for two numbers are 6.50 and 6 respectively then the two numbers
(a) 6 and 7 (b) 9 and 4 (c) 10 and 3 (d) 8 and 5.
MEDIAN
83
CA FOUNDATION STATISTICS
28. Two variables x and y are given by y = 2x – 3. If the median of x is 20, what is the
median of y?
a) 20 b) 37 c) 40 d) 35
PARTITION VALUE
29. What is the value of the first quartile for observations 15, 18, 10, 20, 23, 28, 12, 16?
(a) 17 (b) 16 (c) 12.75 (d) 12
30. The third decile for the numbers 15, 10, 20, 25, 18, 11, 9, 12 is
(a) 13 (b) 10.70 (c) 11 (d) 11.50
MODE
32. If x and y are related by x–y–10 = 0 and mode of x is known to be 23, then the mode
of y is
(a) 20 (b) 13 (c) 3 (d) 23.
33. If arithmetic mean is 26.8, median is 27.9, then what is the value of mode?
a) 29 b) 30.1 c) 31.1 d) 29.9
34. If the Mean and Mode of a certain set of numbers be 60.4 and 50.2 respectively, find
approximately the value of the Median.
a) 55 b) 56 c) 57 d) 58
84
CA FOUNDATION STATISTICS
MISCELLANEOUS SUM
35. What is the value of mean and median for the following data:
Marks: 5-14 15-24 25-34 35-44 45-54 55-64
No. of 10 18 32 26 14 10
Students:
add 5+14=19, then 19/2=9.5 So, 9.5 is the mid value for mean
(a) 30 and 28 (b) 29 and 30
(c) 33.68 and 32.94 (d) 34.21 and 33.18
36. The mean and mode for the following frequency distribution
Class 350-369 370-389 390-409 410-429 430-449 450-469
interval :
Frequency: 15 27 31 19 13 6
are
(a) 400 and 390 (b) 400.58 and 390
(c) 400.58 and 394.50 (d) 400 and 394.
37. The median and modal profits for the following data
Profit in ‘000 `: below 5 below 10 below below 20 below 25 below 30
15
No. of firms: 10 25 45 55 62 65
are
(a) 11.60 and 11.50 (b) ` 11556 and ` 11267
(c) ` 11875 and ` 11667 (d) 11.50 and 11.67.
85
CA FOUNDATION STATISTICS
39. The third quartile and 65th percentile for the following data are
Profits in ‘000 : less than 10 10–19 20–29 30–39 40–49 50–59
No. of firms: 5 18 38 20 9 2
40. For the following incomplete distribution of marks of 100 pupils, median mark is
known to be 32.
Marks: 0–10 10–20 20–30 30–40 40–50 50–60
No. of Students: 10 - 25 30 - 10
What is the mean mark?
(a) 32 (b) 31 (c) 31.30 (d) 31.50
THEORETICAL ASPECTS
42. While computing the AM from a grouped frequency distribution, we assume that
(a) The classes are of equal length
(b) The classes have equal frequency
(c) All the values of a class are equal to the mid-value of that class
(d) None of these.
86
CA FOUNDATION STATISTICS
45. For open-end classification, which of the following is the best measure of central
tendency?
(a) AM (b) GM (c) Median (d) Mode
50. Which of the following measure of the central tendency is difficult to compute?
(a) Mean (b) Median (c) Mode (d) GM
51. Which measure(s) of central tendency is(are) considered for finding the average rates?
(a) AM (b) GM (c) HM (d) Both (b) and (c)
87
CA FOUNDATION STATISTICS
54. Which of the following results hold for a set of distinct positive observations?
(a) AM GM HM (b) HM GM AM
(c) AM > GM > HM (d) GM > AM > HM
55. When a firm registers both profits and losses, which of the following measure of
central tendency cannot be considered?
(a) AM (b) GM (c) Median (d) Mode
56. Quartiles are the values dividing a given set of observations into
(a) Two equal parts (b) Four equal parts
(c) Five equal parts (d) None of these
59. Which of the following measure(s) satisfies (satisfy) a linear relationship between
two variables?
(a) Mean (b) Median (c) Mode (d) All of these
60. Which of the following measures of central tendency is based on only fifty percent
of the central values?
(a) Mean (b) Median (c) Mode (d) Both (a) and (b)
61. In the formula, Mode = L1 + (d1 X c)/ (d1 + d2 ), d1 is the difference of frequencies in
the modal class & the __________ class.
(a) preceding (b) following (c) both (d) none
88
CA FOUNDATION STATISTICS
62. In the formula, Mode = L1 + (d1 X c)/ (d1 + d2), d2 is the difference of frequencies in the
modal class & the ———————— class.
(a) preceding (b) succeeding (c) both (d) none
63. —————— of a set of observations is defined to be their sum, divided by the no. of
observations.
(a) H.M (b) G.M (c) A.M (d) none
64. The sum of the squares of deviations of a set of observations has the smallest
value, when the deviations are taken from their
(a) A.M (b) H.M (c) G.M (d) none
65. ———— is equal to the value corresponding to cumulative frequency 3 (N + 1)/4 from
simple frequency distribution
(a) Median (b) 1st quartile
(c) 3rd quartile (d) 1st decile
67. For 899, 999, 391, 384, 590, 480, 485, 760, 111, 240
Rank of median is
(a) 2.75 (b) 5.5 (c) 8.25 (d) none
68. The deviations from median are ——————— if negative signs are ignored as compared
to other measures of central tendency.
(a) minimum (b) maximum (c) same (d) none
89
CA FOUNDATION STATISTICS
90
CA FOUNDATION STATISTICS
HOMEWORK SECTION
2. A man travels at a speed of 20 km/hr and then returns at a speed of 30 km/hr. His
average speed of the whole journey is:
(a) 25 km/hr (b) 24.5 km/hr
(c) 24 km/hr (d) none
4. The sum of the squares of deviations of a set of observations has the smallest
value, when the deviations are taken from their
(a) A.M. (b) H.M. (c) G.M. (d) None
5. Which of the following result hold for a set of distinct positive observations?
(a) A.M. G.M. H.M.
(b) G.M. > A.M. > H.M.
(c) G.M. A.M. H.M.
(d) A.M. > G.M. > H.M.
6. If the A.M. and H.M. for two numbers are 5 and 3.2 respectively then the G.M. will
be
(a) 4.05 (b) 16 (c) 4 (d) 4.10
7. An aeroplane flies from A to B at the rate of 500 km/hr and comes back from B to
A at the rate of 700 km/hr. The average speed of the aeroplane is:
(a) 600 km/hr (b) 583.33 km/hr
(c) km/hr (d) 620 km/hr
91
CA FOUNDATION STATISTICS
11. The mean salary for a group of 40 female workers is ` 5200 per month and that for
a group of 60 male workers is ` 6800 per month. What is the combined salary?
(a)
` 6160 (b) ` 6280
(c) ` 6890 (d) ` 6920
12. If there are two groups with 75 and 65 as harmonic means and containing 15 and
13 observations, then the combined H.M. is given by:
(a) 70 (b) 80 (c) 70.35 (d) 69.48
92
CA FOUNDATION STATISTICS
16. The average salary of 50 men was ` 80 but it was found that salary of 2 of them
were ` 46 and ` 28 which was wrongly taken as ` 64 and ` 82. The revised average
salary is:
(a)
` 80 (b) ` 78.56 (c) ` 85.26 (d) ` 82.92
17. If A be the A.M. of two positive unequal quantities X and Y and G be their G.M. then:
(a) A < G (b) A>G
(c) A G (d) A G
18. When mean is 3.57 and mode is 2.13 then the value of median is _____.
(a)
3.09 (b) 5.01
(c)
4.01 (d) None of these
20. The mean weight of 15 students is 110 kg. The mean weight of 5 of them is 100
kg and of another five students is 125 kg. then the mean weight of the remaining
students is:
(a)
120 (b) 105
(c)
115 (d) none of these
21. In a class of 11 students, 3 students were failed in a test. 8 students who passed
secured 10, 11, 20, 15, 12, 14, 26 and 24 marks respectively. What will be the
median marks of the students:
(a) 12 (b) 15 (c) 13 (d) 13.5
22. A lady travel at a speed of 20 km/h and returned at quicker speed. If her average
speed of the whole journey is 24 km/hr, find the speed of return journey (in km/h)
(a) 25 (b) 30 (c) 35 (d) 38
23. Let the mean of the variable ‘x’ be 50, then the mean of u = 10 + 5x will be :
(a) 250 (b) 260 (c) 265 (d) 273
93
CA FOUNDATION STATISTICS
24. If the difference between mean and Mode is 63, then the difference between Mean
and Medium will be ____
(a) 63 (b) 31.5 (c) 21 (d) None of the above
25. If the Arithmetic mean between two numbers is 64 and the Geometric mean between
them is 16. The Harmonic Mean between them is ____
(a) 64 (b) 4 (c) 16 (d) 40
26. The average of 5 quantities is 6 and the average of 3 is 8 what is the average of the
remaining two.
(a) 4 (b) 5 (c) 3 (d) 3.5
27. The median of following numbers, which are given is ascending order is 25. Find the
value of X.
11, 13, 15, 19, (x + 2), (x + 4), 30, 35, 39, 46
(a) 21 (b) 20 (c) 15 (d) 30
28. The average age of a group of 10 students was 20 years. The average age increased
two years when two new students joined the group. What is the average age of two
new students who joined the group?
(a)
22 years (b) 30 years
(c)
20 years (d) 32 years
29. Geometric Mean of three observations 40, 50 and X is 10. The value of X is
(a)
2 (b) 4
(c)
1/2 (d) None of the above
30. The mean of first three term is 14 and mean of next two terms is 18. Then mean of
all five term is:
(a) 14.5 (b) 15 (c) 14 (d) 15.6
31. The mean salary of a group of 50 persons is ` 5850. Later on it is discovered that
the salary of one employee has been wrongly taken as ` 8000 instead of ` 7800.
The corrected mean salary is
(a)
` 5854 (b) ` 5846
(c)
` 5650 (d) None of the above
94
CA FOUNDATION STATISTICS
33. The point of intersection of the “less than” and “more than” ogives correspond to
(a)
Mean (b) Mode
(c)
Median (d) 10th Percentile
34. A man travels from Agra to Gwalior at an average speed of 30 km per hour and
back at an average speed of 60 km per hour. What is his average speed?
(a) 38 km per hour (b) 40 km per hour
(c) 45 km per hour (d) 35 km per hour
37. The average age of 15 students of a class is 15 years. Out of them, the average age
of 5 students is 14 years and that of the other 9 students is 16 years. The age of the
15th student is:
(a)
11 years (b) 14 years
(c)
15 years (d) None of these
95
CA FOUNDATION STATISTICS
40. The mean of the following data is 6. Find the value of ‘P’
x 2 4 6 10 P+5
f 3 2 3 1 2
(a) 4 (b) 6 (c) 8 (d) 7
41. The third decile for the numbers 15, 10, 20, 25, 18, 11, 9, 12, is :
(a)
13 (b)
10.70
(c)11 (d)
11.50
42. A random variable X has uniform distribution on the interval (–3, 7). The mean of
the distribution is:
(a) 2 (b) 4 (c) 5 (d) 6
43. If the arithmetic mean of two numbers is 10 and the geometric mean of these
numbers is 8, then the harmonic mean is:
(a) 9 (b) 8.9 (c) 6.4 (d) None of these
44. The harmonic mean H of two numbers is 4 and their arithmetic mean A and the
geometric mean G satisfy the equation 2A + G2 = 27, then the numbers are
(a) (1, 3) (b) (9, 5) (c) (6, 3) (d) (12, 7)
46. In a class of 50 students, 10 have failed and their average marks in 2.5. The total
marks secured by the entire class were 281. The average marks who have passed is:
(a)
5.32 (b) 7.25
(c)
6.40 (d) None of the above
47. If the mean of two numbers is 30 and geometric mean is 24 then what will be these
two numbers?
(a) 36 and 24 (b) 30 and 30
(c) 48 and 12 (d) None of these
96
CA FOUNDATION STATISTICS
48. For moderately skewed distribution of marks in commerce for a group of 200
students the mean marks and mode marks were found to be 55.60 and 46. What is
the median marks?
(a)
55.5 (b) 60.5
(c)
52.4 (d) None of these
49. Mean for the data 6, 4, 1, 6, 5, 10, 3 is 5 when each observation added by 2, what
is mean of the data
(a) 5 (b) 6 (c) 7 (d) 10
50. The average of 10 observations is 14.4. If the average of first 4 observations is 16.5.
The average of remaining 6 observations is:
(a) 13.6 (b) 13.0 (c) 13.2 (d) 12.5
51. The ordering of a particular design of a cloth show room, a ____ size be more
appropriate
(a) median (b) mean (c) mode (d) all of these
52. The rates of returns from three different shares are 100%, 200% and 400%
respectively. The average rate of return will be :
(a) 350% (b) 233.33% (c) 200% (d) 300%
53. If geometric mean is 6 and arithmetic mean is 6.5, then harmonic mean will be:
(a)
(b)
54. A company’s past 10 years average earning is ` 40 crores. To have the same average
earning for 11 years including these 10 years, how much earning must be made by
the company in the eleventh year?
(a)
` 40 crores (b) ` crores
(c) More than ` 40 crores (d) None of these
55. A person purchases 5 rupees worth of eggs from 100 different markets. You are to
find the average number of eggs per rupee purchased from all the markets taken
97
CA FOUNDATION STATISTICS
57. If the mean value of seven numbers 7, 9, 12, X, 4, 11 and 5 is 9, then the missing
number X will be:
(a) 13 (b) 14 (c) 15 (d) 8
58. When all observations occur with equal frequency ______ does not exist.
(a)
median (b) mode
(c)
mean (d) None of the above
59. If the variables x and z are so related that z = ax + b for each x = x1 where a and b
are constant, then
(a)
True (b) false
(c)
both (d) none
61. For 899, 999, 391, 384, 590, 480, 485, 760, 111, 240 Rank of median is
(a) 2.75 (b) 5.5 (c) 8.25 (d) none
62. The average of a series of overlapping averages, each of which is based on a certain
number of item within a series is know as
(a)
Moving average (b) Weighted average
(c)
Simple average (d) None
63. The median of the data 5, 6, 7, 7, 8, 9, 10, 11, 11, 12, 15, 18 and 19 is
(a) 10.5 (b) 10 (c) 11 (d) 11.5
98
CA FOUNDATION STATISTICS
64. The mean of 20 items of a data is 5 and if each item is multiplied by 3, then the new
mean will be
(a) 5 (b) 10 (c) 15 (d) 20
66. The Algebraic sum of the deviation of a set of values from their arithmetic mean is
(a) > 0 (b) = 0 (c) < 0 (d) None of the above
68. If total frequencies of three series are 50, 60 and 90 and their means are 12, 15 and
20 respectively, then the mean of their composite series is
(a) 16 (b) 15.5 (c) 16.5 (d) 14.5
69. If in a moderately skewed distribution the values of mode and mean are 32.1 and
35.4 respectively, then the value of the median is
(a) 34.3 (b) 33.3 (c) 34 (d) 33
99
CA FOUNDATION STATISTICS
ANSWER
1 b 21 a 41 b 61 b
2 c 22 b 42 a 62 a
3 d 23 b 43 c 63 b
4 a 24 c 44 c 64 c
5 d 25 b 45 c 65 b
6 c 26 c 46 c 66 b
7 b 27 a 47 c 67 a
8 c 28 d 48 c 68 c
9 a 29 c 49 c 69 a
10 c 30 d 50 b
11 a 31 b 51 c
12 a 32 c 52 c
13 d 33 c 53 a
14 a 34 b 54 a
15 a 35 a 55 c
16 b 36 a 56 a
17 b 37 a 57 c
18 a 38 a 58 b
19 b 39 c 59 a
20 b 40 d 60 a
100
CA FOUNDATION STATISTICS
HOMEWORK SOLUTION
1. (b) X – Y – 10 = 0
Mode(X) – Mode(Y) – 10 = 0
Mode(y) = Mode(X) – 10 = 23 – 10 = 13
3. (d) Arranging the given data in ascending order: 2, 4, 6, 8, 11, 13, 15, 18
Median is the value of (8 + 1)/2 = 4.5th item = ½(4th item + 5th item) = ½(8+11)
= 9.5
4. (a) The sum of Squares of deviations is least, when it is taken from its AM.
101
CA FOUNDATION STATISTICS
13. (d) (Find the cube of the options, the option which gives 192 is the answer)
GM3 = 4*6*8 = 192
GM = 5.77
14. (a) GM is the better measure than others when ratios and percentages are given.
19. (b) HM = n / (1/1 + 1/1/2 + 1/1/3 + .... + 1/1/n) = n/(1 + 2 + 3 + ... + n) = 2/(n+1)
20. (b) Mean of remaining 5 = 1/5 [15*110 – 5*100 – 5*125] = 1/5*525 = 105.
102
CA FOUNDATION STATISTICS
27. (a) 11, 13, 15, 19, (x + 4), 30, 35, 39, 46
Median is the 5th value
Rank = N+1 10 5th
2 5
x+ 4= 25
x = 25 – 4
= 21
28. (d) Average age of two new person = ½ [12*22 – 10*20] = ½ (264 – 200) = 32
30. (d) Mean = 1/5 [3*14 + 2*18] = 1/5(42 + 36) = 78/5 = 15.6
31. (b) Correct Mean = 1/50 [50*5850 – 8000 + 7800] = 1/50 [292300] = 5846
33. (c) The point of intersection of the “less than“ and “more than“ Ogives correspond
to the Median of the distribution.
103
CA FOUNDATION STATISTICS
37. (a) The age of 15th student = 15*15 – 5*14 – 9*16 = 225 – 70 – 144 = 11
39. (c) Median is not based on all observations. Mode is the most popular value and
not the mid-value. Median is the second quartile.
41. (b) Ascending order: 9, 10, 11, 12, 15, 18, 20, 25
3rd Decile = 3(8 + 1)/10 = 2.7th element = 2nd element + 0.7 (3rd – 2nd) = 10 +
0.7(11 – 10) = 10.7
43. (c) 82 = 10 * HM
HM = 64/10 = 6.4.
46. (c) Average marks of students who have passed = 1/40[281 – 10*2.5] = 1/40(256)
= 6.4.
104
CA FOUNDATION STATISTICS
50. (b) Average of remaining 6 = 1/6 [10*14.4 – 4*16.5] = 1/6[144 – 66] = 78/6 = 13.
51. (c) Mode is an appropriate average for a cloth show room, which shall want to
order that particular design, which sells more.
54. (a) Earning in the 11th year = 40 crores. (to maintain the average of last 10 years)
55. (a) AM is to be used to find average rate at which eggs are to be purchased.
58. (b) When all observations occur with equal frequency, we can’t calculate its Mode.
105
CA FOUNDATION STATISTICS
63. (b) 5 6 7 7 8 9 10 11 11 12 15 18 19
value
Median = 10
66. (b) The algebraic sum of the deviation of a set of values from their AM is always
zero.
69. (a) 3Median = 2Mean + Mode = 2*35.4 + 32.1 = 70.8 + 32.1 = 102.9
Median = 102.9/3 = 34.3
106
CA FOUNDATION STATISTICS
1. What is the major assumption we make when computing a mean from grouped
data?
a) No value occurs more than once
b) Each class contains exactly the same number of values
c) All values are discrete
d) Every value in a class is equal to the mid-point
2. Median is:
a) 50th percentile b) 2nd quartile c) 5th decile d) All of the above
3. Suitable average for finding out the mean size of sale of shoes shall be:
a) Arithmetic mean b) Median c) Mode d) None of the above
4. A cyclist goes from a place to another and returns by the same route. He pedals
his way uni-formly with speed U while going and with speed V while returning. The
average speed of his journey is:
a) (U + V)/2 b) 2/(U + V) c) 2UV/(U + V) d) None of the above
5. The mean of the value 1, 2, 3, ...., 120 with respective frequencies F, 2F, 3F,...., 120F
is equal to:
a) (60 + 1)2 b) (240 + 1)/6 c) 60 d) 241/3
107
CA FOUNDATION STATISTICS
10. Of the various measure of central tendency, which of the following can be used
when measurement are on an ordinal scale?
a) Arithmetic Mean and Median
b) Mode and Arithmetic Mean
c) Median and Mode
d) All of the above
11. The mean of 15 observations is 15. If the two numbers 18 and 22 are excluded, then
the mean of the remaining numbers is:
a) 10.53 b) 12.49 c) 14.23 d) 15.49
12. While dividing each entry in a data by a non-zero number A, the arithmetic mean
of the new data:
a) Does not change b) Increased
c) Is divided by A d) Is diminished by A
13. If arithmetic mean of two items is 16 and their harmonic mean is 9. Find their
geometric mean.
a) 10 b) 12 c) 14 d) 16
108
CA FOUNDATION STATISTICS
14. Find the median of 33, 86, 68, 80, 48, 70, 64.
a) 80 b) 68 c) 70 d) 48
15. The mean monthly salary of all employees in a company is Rs. 25,000. The mean
salaries of male and female employees are Rs. 27,000 and Rs. 17,000 respectively,
the percentage of males employed by the company is:
a) 80% b) 20% c) 30% d) 70%
20. The means of two samples of sizes 50 and 100 respectively are 54.1 and 50.3.
Obtain the mean of the sample of size 150 obtained by combining the two samples.
a) 50.47 b) 51.57 c) 52.37 d) 54.48
23. Find the median of 88, 72, 33, 29, 70, 86, 54, 91, 61, 57.
a) 65 b) 66 c) 65.5 d) None of the above
109
CA FOUNDATION STATISTICS
24. In a moderately symmetric distribution the mode and mean are 12.30 and 18.48
respectively. Find the median.
a) 16 b) 16.42 c) 16.24 d) None of the above
25. The average weight of A, B, C is 45 Kg. If the average weight of A and B be 40 Kg and
that of B and C be 43 Kg, then the weight of B is:
a) 15 Kg b) 20 Kg c) 25 Kg d) 31 Kg
26. The average attendance of a school for the first 3 days of the week is 325 and for
the first 4 days of the week it is 320. The students present on the fourth day were:
a) 305 b) 310 c) 315 d) 325
27. The AM calculated from the following frequency distribution is known to be 67.45.
Find the value of F.
Class Interval 60-62 63-65 66-68 69-71 72-74
Frequency 15 54 F 81 24
a) 120 b) 126 c) 134 d) None of the above
28. The median of the following incomplete frequency distribution is 4. Find the frequency
when X = 8.
X: 1 2 3 4 5 6 7 8
CF: 2 3 4 5 6 7 8 ?
a) 1 b) 2 c) 3 d) 4
110
CA FOUNDATION STATISTICS
30. The algebraic sum of the deviations of 50 observations measured from 90 is – 110.
Find the AM of the observations.
a) 90 b) 110 c) 88.7 d) 87.8
111
CA FOUNDATION STATISTICS
EXPLANATORY
ANSWERS
1. While calculating mean from the group distribution, it is assumed that every value
in a class is equal to the mid-point of that class. Option D
3. Mode is the suitable average to find the mean sale of shoes. Option C
4. Average speed is the HM between U and V = 2/(1/U + 1/V) = 2UV/(U + V). Option C
8. Geometric mean is the nth root of the product of n observations given in the data
set. Option C
9. Both Median and Mode are not affected by the presence of the extreme values.
Option B
10. Arithmetic mean is only possible for cardinal scale. For ordinal scale all partition
and positional values are possible. Option C
112
CA FOUNDATION STATISTICS
14. After arranging: 33, 48, 64, 68, 70, 80, 86. Median is (7 + 1)/2 = 4th item from any
end = 68. Option B
4 4
18. GM = 41.62.91 = 1296 = 6. Option A
23. After arranging: 29, 33, 54, 57, 61, 70, 72, 86, 88, 91. Median is ½ (5th + 6th value)
= ½ (61 + 70) = ½(131) = 65.5. Option C
113
CA FOUNDATION STATISTICS
26. Attendance of 4th day = (320 * 4) – (325 * 3) = 1280 – 975 = 305. Option A
114
CA FOUNDATION STATISTICS
MEASURES OF DISPERSION
(Average of Second Order)
THEORY
Introduction:
• Dispersion is defined as deviation or scattering of values from their central values i.e,
average (Mean, Median or Mode but preferably Mean or Median)
• In other words, dispersion measures the degree or extent to which the values of a
variable deviate from its average
• If all values are equal then any measure of dispersion is always zero
• All measures of dispersions are independent of the change of origin but dependent on the
change of scale
• All pre requisites of a good measure of central tendency are equally applicable for good
measure of dispersion
115
CA FOUNDATION STATISTICS
116
CA FOUNDATION STATISTICS
Measure of Dispersion
Absolute Relative
Range Quartile Mean Standard Coefficient Coefficient Coefficient Coefficient
Quartile Absolute
Range Deviation
RANGE
• It is the quickest measure, of finding out Dispersion
• If y=a±bx
R(y)=|b| ×R(x)
Calculation Of Range:
• For simple series and simple Frequency Distribution :
Range = Highest Value – Lowest Value (H – L).
117
CA FOUNDATION STATISTICS
• It is based on the upper and the lower Quartile and covers 50% of the observations.
• For distributions with the Open Ends Q.D is the best and only measure of dispersion.
• If y=a±bx
QD( y)=|b| ×QD(x)
• Quartile Deviation (QD) = , Where Q3 is the upper quartile and Q1 is the lower
quartile.
118
CA FOUNDATION STATISTICS
• Since sum of Deviations is least when Deviations are taken about Median hence MD
about Median will have the least value.
• MD is the independent of the change of origin but dependent on the change of scale
• If y=a±bx
MD( y)=|b| ×MD(x)
Where n = number of observation
∑f=N = Total frequency
=A.M
M = Median
X=Either actual values of the variables or mid values if it a group frequency distributions
Standard Deviation
• It is the best measure and the most commonly used Measure of Dispersion.
• It takes into consideration the magnitude of all the observations and gives the
minimum value of dispersion possible.
• SD has all the pre-requisites of a good measure of dispersion, except the fact
that it gets unduly affected by the presence of extreme values,
119
CA FOUNDATION STATISTICS
• It is denoted by
• SD2 = Variance=
• If all observations are equal variance =SD=0
• SD is the independent of the change of origin but dependent on the change of scale
• If y=a±bx
SD( y)=|b| ×SD(x)
V(y)=b2×v(x)
Definition of SD:
• SD of a set of values of a variable is defined as the positive Square Root of the AM of
the Square of Deviations of the values from their AM
Calculation of SD
• Where, ,
x= mid-values if it is a grouped frequency distribution or original values if it is a discrete
series
A = Assumed Mean i.e., a value arbitrarily chosen from mid-values or any other
value.
i = class width or any arbitrary value
120
CA FOUNDATION STATISTICS
Useful Results:
• SD of two numbers is the half of their absolute difference(Range), i.e., if numbers are a and
b, then SD =
Group I Group II
Numbers
Mean
Standard Deviation
• More C.V implies more variability indicating thereby less stability or consistency and vice
versa.
• Regarding choice of an item always choose that item which has less C.V, because the item
with lower C.V is more stable.
121
CA FOUNDATION STATISTICS
CLASSWORK SECTION
RANGE
1. What is the coefficient of range for the following wages of 8 workers?
` 80, ` 65, ` 90, ` 60, ` 75, ` 70, ` 72, ` 85.
(a)
` 30 (b) ` 20 (c) 30 (d) 20
QUARTILE DEVIATION
5. The quartiles of a variable are 45, 52 and 65 respectively. Its quartile deviation is
(a) 10 (b) 20 (c) 25 (d) 8.30.’
6. If x and y are related as 3x+4y = 20 and the quartile deviation of x is 12, then the
quartile deviation of y is
(a) 16 (b) 14 (c) 10 (d) 9.
122
CA FOUNDATION STATISTICS
MEAN DEVIATION
9. What is the value of mean deviation about mean for the following numbers?
5, 8, 6, 3, 4.
(a) 5.20 (b) 7.20 (c) 1.44 (d) 2.23
10. The coefficient of mean deviation about mean for the first 9 natural numbers is
(a) 200/9 (b) 80 (c) 400/9 (d) 50.
11. If the relation between x and y is 5y–3x = 10 and the mean deviation about mean
for x is 12, then the mean deviation of y about mean is
(a) 7.20 (b) 6.80 (c) 20 (d) 18.80.
12. If two variables x and y are related by 2x + 3y –7 =0 and the mean and mean
deviation about mean of x are 1 and 0.3 respectively, then the coefficient of mean
deviation of y about its mean is
(a) –5 (b) 12 (c) 50 (d) 4.
13. The mean deviation about mode for the numbers 4/11, 6/11, 8/11, 9/11, 12/11,
8/11 is
(a) 1/6 (b) 1/11 (c) 6/11 (d) 5/11.
14. What is the mean deviation about mean for the following distribution?
Variable: 5 10 15 20 25 30
Frequency: 3 4 6 5 3 2
(a) 6.00 (b) 5.93 (c) 6.07 (d) 7.20
123
CA FOUNDATION STATISTICS
15. What is the mean deviation about median for the following data?
X 3 5 7 9 11 13 15
F 2 8 9 16 14 7 4
(a) 2.50 (b) 2.46 (c) 2.43 (d) 2.37
16. What is the coefficient of mean deviation for the following distribution of heights?
Take deviation from AM.
Height in inches: 60-62 63-65 66-68 69-71 72-74
No. of students: 5 22 28 17 3
(a) 2.30 (b) 3.45 (c) 3.82 (d) 2.48
17. The mean deviation of weights about median for the following data:
Weight (lb) : 131-140 141-150 151-160 161-170 171-180 181-190
No. of persons : 3 8 13 15 6 5
Is given by
(a) 10.97 (b) 8.23 (c) 9.63 (d) 11.45.
STANDARD DEVIATION
18. What is the standard deviation of 5, 5, 9, 9, 9, 10, 5, 10, 10?
(a) (b) (c) 4.50 (d) 8
124
CA FOUNDATION STATISTICS
23. If the SD of the 1st n natural numbers is 2, then the value of n must be
(a) 2 (b) 7 (c) 6 (d) 5
24. If x and y are related by y = 2x+ 5 and the SD and AM of x are known to be 5 and 10
respectively, then the coefficient of variation of y is
(a) 25 (b) 30 (c) 40 (d) 20
25. The mean and SD for a, b and 2 are 3 and respectively, The value of ab would be
(a) 5 (b) 6 (c) 11 (d) 3
26. What is the standard deviation from the following data relating to the age distribution
of200 persons?
Age (year) : 20 30 40 50 60 70 80
No. of people: 13 28 31 46 39 23 20
(a) 15.29 (b) 16.87 (c) 18.00 (d) 17.52
27. What is the coefficient of variation for the following distribution of wages?
28. Which of the following companies A and B is more consistent so far as the payment
of dividend is concerned ?
Dividend paid by A : 5 9 6 12 15 10 8 10
Dividend paid by B : 4 8 7 15 18 9 6 6
(a) A (b) B
(c) Both (a) and (b) (d) Neither (a) nor (b)
29. The mean and SD for a group of 100 observations are 65 and 7.03 respectively. If
60 of these observations have mean and SD as 70 and 3 respectively, what is the SD
for the group comprising 40 observations?
(a) 16 (b) 25 (c) 4 (d) 2
125
CA FOUNDATION STATISTICS
30. If two samples of sizes 30 and 20 have means as 55 and 60 and variances as 16 and
25 respectively, then what would be the SD of the combined sample of size 50?
(a) 5.00 (b) 5.06 (c) 5.23 (d) 5.35
THEORETICAL ASPECTS
126
CA FOUNDATION STATISTICS
38. Which measures of dispersions is not affected by the presence of extreme observations?
(a) Range (b) Mean deviation
(c) Standard deviation (d) Quartile deviation
40. Which measure is based on only the central fifty percent of the observations?
(a) Standard deviation (b) Quartile deviation
(c) Mean deviation (d) All these measures
127
CA FOUNDATION STATISTICS
45. If the profits of a company remains the same for the last ten months, then the
standard deviation of profits for these ten months would be ?
(a) Positive (b) Negative (c) Zero (d) (a) or (c)
46. Which measure of dispersion is considered for finding a pooled measure of dispersion
after combining several groups?
(a) Mean deviation (b) Standard deviation
(c) Quartile deviation (d) Any of these
49. The standard deviation of 10, 16, 10, 16, 10, 10, 16, 16 is
(a) 4 (b) 6 (c) 3 (d) 0.
128
CA FOUNDATION STATISTICS
129
CA FOUNDATION STATISTICS
HOMEWORK SECTION
2. For a Symmetrical distribution, quartile deviation and the standard deviation are
related by
(a) 5 Q.D. = 4 S.D. (b) 4 Q.D. = 5 S.D.
(c) 2 Q.D. = 3 S.D. (d) 3 Q.D. = 2 S.D.
3. If two samples of sizes 30 and 20 have means as 55 and 60 and variances as 16 and
25 respectively, then what would be the S.D. of the combined sample size 50?
(a) 5.33 (b) 5.17 (c) 5.06 (d) 5
4. If two variables x and y are related by 2x + 3y – 7 = 0 and the mean and mean
deviation about mean of x are 1 and 0.3 respectively, then the co-efficient of mean
deviation of y about mean is
(a) –5 (b) 4 (c) 12 (d) 50
6. In a set of 100 observations, taking assumed mean as 4, the sum of the deviations
is –11 cm, and the sum of the squares of these deviations is 257 cm2. The coefficient
of variation is
(a) 41.13% (b) 42.13% (c) 40.13% (d) none
7. _____ are used for measuring central tendency, dispersion and skewness:
(a)
Median (b) Deciles
(c) Percentiles (d) Quartiles
130
CA FOUNDATION STATISTICS
10. A sample of 35 observations has the mean 80 and S.D. as 4. A second sample of
65 observations from the same population has mean 70 and S.D. 3. The S.D. of the
combined sample is:
(a) 5.85 (b) 5.58 (c) 10.23 (d) None of these
11. If x and y are related as 3x – 4y = 20 and the quartile deviation of x is 12, then the
quartile deviation of y is :
(a) 14 (b) 15 (c) 16 (d) 9
13. If the mean and S.D. of x and a and b respectively, then the S.D. of is :
(a) a/b (b) –1 (c) 1 (d) ab
14. Suppose a population A has 100 observations 101, 102, 103, . . . 200 and another
population B has 100 observations 151, 152, 153, . . . 250. If VA and VB represents
the variance of the two populations respectively, then VA / VB =
(a) 9/4 (b) 1 (c) 4/9 (d) 2/3
15. The mean and S.D. for group of 100 observations are 65 and 7.03 respectively. If 60
of these observations have mean and S.D. as 70 and 3 respectively, what is the S.D.
for the group comprising 40 observations?
(a) 2.03 (b) 4.03 (c) 8.03 (d) 9.33
131
CA FOUNDATION STATISTICS
17. If X and Y are two independent random variables then v(x + y) is:
(a) v(x) + v(y) (b) v(x) + v(y) – 2v(x, y)
(c) v(x) + v(y) + 2v(x, y) (d) v(x) – v(y)
18. Mean and S.D. of x is 50 and 5 respectively. Find mean and S.D. of .
(a) (1, 0) (b) (0, 1)
(c) (1, 1) (d) (0, –1)
19. Mean and S.D. of a given set of observations is 1,500 and 400 respectively. If there
is an increment of 100 in the first year and each observation is hiked by 20% in 2nd
years, then find new mean and S.D.
(a) 1920, 480 (b) 1920, 580
(c) 1600, 480 (d) 1600, 400
20. If 5 is subtracted from each observation of some certain item then its co-efficient of
variation is 10% and if 5 is added to each item then its coefficient of variation is 6%.
Find original coefficient of variation.
(a)
8% (b) 7.5%
(c) 4% (d) none of these
22. The sum of squares of deviation from mean of 10 observations is 250. Mean of the
data is 10. Find the co-efficient of variation.
(a)
10% (b) 25%
(c)
50% (d) 0%
132
CA FOUNDATION STATISTICS
(a) (b)
(c) (d)
24. The equation of a line is 5x + 2y = 17. Mean deviation of y about mean is 5. Calculate
mean deviation of x about mean
(a) –2 (b) 2 (c) –4 (d) none
27. Given the observations: 4, 9, 11, 14, 37. The Mean deviation about the median is
(a) 11 (b) 8.5 (c) 7.6 (d) 7.45
28. If all observations in a distribution are increased by 6, then the variance of the
series will be ____.
(a) Increased (b) Decreased
(c) Unchanged (d) None of these
29. The standard deviation of the weights (in kg) of the students of a class of 50 students
was calculated to be 4.5 kg. Later on it was found that due to some fault in weighing
machine, the weight of each student was under measured by 0.5 kg. The correct
standard deviation of the weight will be:
(a) Less than 4.5 (b) Greater than 4.5
(c) Equal to 4.5 (d) Can not be determined
30. For Normal distribution he relation between quartile deviation (Q.D.) and standard
deviation (S.D.) is
(a) Q.D. > S.D. (b) Q.D. < S.D.
(c) Q.D. = S.D. (d) None of the above
133
CA FOUNDATION STATISTICS
31. If standard deviation of first ‘n’ natural numbers is 2 then value of ‘n’ is
(a) 10 (b) 7 (c) 6 (d) 5
33 In a normal distribution, the relationship between the three most commonly used
measures of dispersion are:
(a) Standard Deviation > Mean Deviation > Quartile Deviation
(b) Mean Deviation > Standard Deviation > Quartile Deviation
(c) Standard Deviation > Quartile Deviation > Mean Deviation
(d) Quartile Deviation > Mean Deviation > Standard Deviation
35. Which of the following measures of dispersion is used for calculating the consistency
between two series?
(a) Quartile deviation (b) Standard deviation
(c) Coefficient of variation (d) None of the above
36. If sum of squares of the values = 3390, n = 30 and standard deviation = 7, find out
the mean.
(a) 113 (b) 210 (c) 8 (d) None of these
37. If the mean of a frequency distribution is 100 and coefficient of variation is 45%
then standard deviation is:
(a) 45 (b) 0.45 (c) 4.5 (d) 450
38. Find at the variance given that the Arithmetic Mean = (8 + 4)/2
(a) 2 (b) 6 (c) 1 (d) 4
39. Coefficient of mean deviation about mean for the first 9 natural numbers is
(a) 200/9 (b) 80 (c) 400/9 (d) 50
134
CA FOUNDATION STATISTICS
40. If mean = 5, Standard deviation = 2.6, median = 5 and quartile deviation = 1.5, then
the coefficient of quartile deviation equals
(a) 35 (b) 39 (c) 30 (d) 32
41. What will be the probable value of mean deviation? When Q3 = 40 and Q1 = 15.
(a) 17.50 (b) 18.75 (c) 15.00 (d) None of the above
42. The formula for range of middle 50% items of a series is:
(a) Q3 – Q1 (b) Q3 – Q2 (c) Q2 – Q1 (d)
43. If the first quartile is 142 and semi-inter quartile range is 18, then the value of
median is:
(a) 151 (b) 160 (c) 178 (d) None of these
45. The standard deviation of a variable x is known to be 10. The standard deviation of
50 + 5x is
(a) 50 (b) 100 (c) 10 (d) 500
135
CA FOUNDATION STATISTICS
48. What is value of mean deviation about mean from the number 5, 8, 6, 3 and 4?
(a) 5.20 (b) 7.20 (c) 1.44 (d) 2.23
50. If a variance of a random variable ‘x’ is 23, then what is variance of y= 2x + 10?
(a) 56 (b) 33 (c) 46 (d) 92
51. If variance = 148.6 and , =40 then the coefficient of variation is:
(a) 37.15
(b) 30.48
(c) 33.75
(d) None of the above
(a)
(b)
(c)
(d) None of these
54. If same amount is added to or subtracted from all the values of an individual series
then the standard deviation and variance both shall be ____
(a) changed
(b) unchanged
(c) same
(d) none of these
136
CA FOUNDATION STATISTICS
55. If arithmetic mean and coefficient of variation of x are 10 and 40, respectively then
the variance of y = –15 + will be:
(a) 64 (b) 81 (c) 49 (d) 36
56. Mean deviation is the least when deviations are taken from
(a) Mean (b) Median (c) Mode (d) Harmonic mean
57. is known as
58. If the S.D. of the 1st n natural nos. is . Then the value of n is
(a) 19 (b) 20 (c) 21 (d) None
59. If the range of a set of values 65 and maximum value in the set is 83, then the
minimum value in the set is
(a) 74 (b) 9 (c) 18 (d) None of the above
61. Standard Deviation for the marks obtained by a student in monthly test in mathematic
(out of 50) as 30, 35, 25, 20, 15 is
(a) 25 (b) (c) (d) 50
62. If the standard deviation for the marks obtained by a student in monthly test is 36,
then the variance is
(a) 6 (b) 36 (c) 1296 (d) None of the above
137
CA FOUNDATION STATISTICS
ANSWER
1 a 21 b 41 c 61 b
2 d 22 c 42 a 62 c
3 c 23 b 43 b
4 c 24 b 44 a
5 b 25 b 45 a
6 a 26 b 46 a
7 d 27 c 47 d
8 a 28 c 48 c
9 d 29 c 49 b
10 a 30 b 50 d
11 d 31 b 51 b
12 d 32 b 52 a
13 c 33 a 53 a
14 b 34 d 54 b
15 b 35 c 55 d
16 d 36 c 56 b
17 a 37 a 57 b
18 b 38 d 58 a
19 a 39 c 59 c
20 b 40 c 60 b
138
CA FOUNDATION STATISTICS
HOMEWORK SOLUTION
7. (d) Quartiles are used for measuring central tendency, dispersion and skewness.
139
CA FOUNDATION STATISTICS
8. (a) Company A:
Mean = (5+9+6+12+15+10+8+10)/8 = 9.375
∑x2 = 25+81+36+144+225+100+64+100 = 775
SD2 = 775/8 – (9.375)2 = 8.984375
SD = 3
COV = 3/9.375 * 100 = 32%.
Company B:
Mean = (4+8+7+15+18+9+6+6)/8 = 9.125
∑x2 = 16+64+49+225+324+81+36+36 = 831
SD2 = 831/8 – (9.125)2 = 20.609375
SD = 4.54
COV = 4.54/9.125 * 100 = 49.75%
Company A is more consistent. [As COV(A) < COV(B)]
140
CA FOUNDATION STATISTICS
14. (b) A = B – 50
V(A) = V(B)
V(A) / V(B) = 1
16. (d) QD =
On solving we get , Q1 = 3 and Q3 = 5
QD =
=
= 1
141
CA FOUNDATION STATISTICS
20. (b) Y = X – 5
10 = S /(X’ – 5) * 100
X’ – 5 = 10S
Z=X+5
6 = S/(X’ + 5) * 100
6X’ + 30 = 100S
10X’ – 50 = 100S
4X’ = 80
Mean(X) = 20
SD(X) = 1.5
Original COV = 1.5/20 * 100 = 7.5%
25. (b) Vx = 5
Vy= ?
Y = 2 -3x
Vy = b2 x Vx
= (-3)2 x 5
=9x5
= 45
142
CA FOUNDATION STATISTICS
= 7.6
29. (c) SD is independent of the change of origin. No change in the SD. SD = 4.5.
30. (b) SD > QD. Or, QD < SD.
31. (b) S2 = (n2 – 1)/12
4*12 + 1 = n2 = 49
n=7
32. (b) SD is independent of the change of Origin.
σy= σ
35. (c) Standard deviation is used to measure the consistency between two series
using Coefficient of variation.
143
CA FOUNDATION STATISTICS
Variance = σ2 = 4
Coefficient of MD =
42. (a) Range of middle 50% of items = Q3 –Q1 and it is called inter quartile range.
Whereas is called semi inter quartile range which is quartile deviation
144
CA FOUNDATION STATISTICS
47. (d) If all the observations are increased by 5, there is no change in any measure of
dispersion, as all measures of dispersion are independent of change of origin.
48. (c) Mean = (5+8+6+3+4)/5 = 26/5 = 5.2
|x – Mean| = 0.2 + 2.8 + 0.8 + 2.2 + 1.2 = 7.2
MD = 7.2/5 = 1.44.
56. (b) Mead deviation is least when deviations are taken from Median.
145
CA FOUNDATION STATISTICS
146
CA FOUNDATION STATISTICS
6. We shall compute the following to study the deviations of middle 40% portion of a
series:
a) Q2 – Q1 b) Q3 – Q1
c) P70 – P30 d) D6 – D2
147
CA FOUNDATION STATISTICS
9. What empirical relation would you expect to exist between the semi-inter quartile
range and the mean deviation for bell shaped distribution which are moderately
skewed?
a) Semi IQR = 5/6 * MD b) Semi IQR = 6/5 * MD
c) 5/6 * Semi IQR = MD d) None of the above
10. If the profits of a firm remain the same for the last 10 months, then the standard
deviation of profits for these 10 months would be?
a) Positive b) Zero
c) Negative d) Can’t be determined
14. To compare the variability between two series which also differ on their unit of
measurements, the measure usually used is:
a) Standard Deviation b) Mean Absolute Deviation
c) Coefficient of Variation d) Inter-Quartile Range
148
CA FOUNDATION STATISTICS
17. Find the Mean-Deviation about the Arithmetic Mean of the numbers 31, 35, 29, 63,
55, 72, 37.
a) 12.86 b) 14.86 c) 13.78 d) 13.86
18. Find the coefficient of mean deviation of the following numbers: 46, 79, 26, 85, 39,
65, 99, 29, 56, 72.
a) 33.27% b) 34.87% c) 34.23% d) 32.43%
19. The arithmetic mean is 25 and standard deviation is 6.25. Find the coefficient of
variation.
a) 20 b) 25 c) 30 d) 50
20. The mean and SD of 20 items were found to be 12 and 6 respectively. On checking
it was dis-covered that items which should correctly read as 11 and 21 had been
wrongly taken as 15 and 27 respectively. Find the correct SD.
a) 3.5 b) 4.3 c) 5.3 d) 4.5
22. The mean of 5 observations is 4.4 and the variance is 8.24. If three of the five
observations are 1, 2 and 6, find the other two.
a) 4, 8 b) 9, 4 c) 8, 6 d) 9, 5
23. If in the distribution, n = 10, ∑x = 20, ∑x2 = 200, then find the value of SD.
a) 2 b) 4 c) 3.5 d) 4.5
24. The two variables X and Y are related by Y = 10 – 3X. If the SD of X is 4, what is the
SD of Y.
a) 2 b) 12 c) – 2 d) None of the above
149
CA FOUNDATION STATISTICS
25. Find the Quartile Deviation of the following data: 12, 10, 17, 14, 19, 21, 27, 30, 32,
28, 34.
a) 8 b) 3.8 c) 8.3 d) 9
26. The standard deviation of 10, 16, 10, 16, 10, 10, 16, 16 is:
a) 6 b) 4 c) 3 d) 9
27. The maximum and minimum values of a series are 60 and 0 respectively. What is
the coeffi-cient of range?
a) 1 b) – 1 c) 0 d) None of the above
28. Find the first quartile, if coefficient of quartile deviation = 0.37 and the third quartile
is 46.25.
a) 46.62 b) 21.26 c) 21.07 d) 27.08
29. The mean of 200 items is 48 and standard deviation is 3. What is the sum of the
squares of these items?
a) 462600 b) 400000 c) 460000 d) None of the above
30. The means of two samples of sizes 50 and 100 respectively are 54.4 and 50.3 and
the standard deviation is 8 and 7. Obtain the standard deviation of the sample size
150 obtained by combining the two samples.
a) 5.79 b) 6.79 c) 7.59 d) 7.37
150
CA FOUNDATION STATISTICS
EXPLANATORY
ANSWERS
6. Middle 40% means to the left and right, we need to leave 60% data, 30% on both
sides. Thus middle 40% is correctly represented by P70 – P30. Option C
14. With different units of measurements, co-efficient of variation is the best relative
measure of dispersion. Option C
151
CA FOUNDATION STATISTICS
17. AM = (31+35+29+63+55+72+37)/7 = 46
MD = (15 + 11 + 17 + 17 + 9 + 26 + 9)/7 = 104/7 = 14.86. Option B
25. After arrangement: 10, 12, 14, 17, 19, 21, 27, 28, 30, 32, 34
Q3 = ¾(11+1) = 9th item = 30
Q1 = ¼(11+1) = 3rd item = 14
QD = (Q3 – Q1)/2 = (30 – 14)/2 = 16/2 = 8. Option A
152
CA FOUNDATION STATISTICS
153
CA FOUNDATION STATISTICS
• In other words, correlation measures the degree or extent to which two variables
move in sympathy.
• It is a pure number without any unit and the value of which lies between -1 and +1
a. When correlation coefficient is +1, perfect positive Correlation
b. When correlation coefficient is -1, perfect negative Correlation
c. When correlation coefficient is 0, no correlation
Note:
Concept of Spurious or Nonsense correlation:
Sometimes it is found that there is no casual relation between two variables but due to
presence of a third variable a correlation can be observed between the two. This variable
which is responsible for the correlation other two variable is called “Lurking variable”.
154
CA FOUNDATION STATISTICS
i.
Where,
Cov(x, y) = Covariance between x and y
ii. Thus,
iii. When deviations are taken from actual means say and such that u=x- and
v=y- in such a case r will be given by,
iv. When deviations are taken from assumed means say ‘a’ from X and ‘b’ from Y such
that u=X-a and v=Y-b in such a case ‘r’ is given by,
Note 1: Use (i) when you find that cov (x, y), and are provided
Note 2: Use (ii) when you find that the values of x and y are small
Note 3: Use (iii) when you find that and are whole numbers
Note 4: Use (iv) when you find that and are not whole numbers or the values of x and y are
large or the problems specifically directs that the deviations are to be taken from assumed
mean only.
155
CA FOUNDATION STATISTICS
Where,
n = Total number of individuals
D = Rank difference= Rx - Ry
Where,
t = number of variables are involved in tie.
n = total number of variables
D = Rx – Ry=Rank difference
156
CA FOUNDATION STATISTICS
• Let (x1,y1), (x2,y2), .... ...., (xn+1,yn+1) be a set of (n+1) pairs of values of x and y. Let
Cx and Cy denote the direction changes in the values of x and y i.e., Cx and Cy will
have positive signs if there is an increase in the values of x and y w.r.t its immediate
preceding value and will have negative signs in case of decrease.
If C denotes the number of concurrent deviations i.e., total number of positive signs in the Cx .Cy
column then the coefficient of concurrent deviation is given by,
Where,
n = pairs of deviations compared
c = number of concurrent deviations
i. If is positive, positive sign is to be assigned both inside and outside the square root.
ii. If is negative, negative sign is to be assigned both inside and outside the square
root.
iii. When C = 0, r = -1
iv. When C = n, r = 1
v. When C = ,r=0
• It gives a vague idea about the nature of correlation between two variables
• It helps us to distinguish between different types of correlation but fails to measure the
extent of relationship between the variables
• Through scatter diagram we can get an idea about the nature of correlation; positive,
negative, zero or curvilinear
157
CA FOUNDATION STATISTICS
158
CA FOUNDATION STATISTICS
Miscellaneous Properties:
• Coefficient of determination = r2
• Probable error and standard error both are used for determining the reliability of
correlation coefficient. For this purpose the following rule is followed,
1) If r < P.E. there is no significant correlation in population.
2) If r > 6 P.E. there is significant correlation in population and we can rely on the
value of r
3) Otherwise, in the intermediate interval there is no clear idea about the correlation in
the population and hence no inference can be drawn about the population correlation
coefficient ( ).
Using probable Error (P.E.), we can find the probable limits for population correlation
coefficient ( ) as follows
Probable limits = r P.E
= (r-P.E) to (r+ P.E)
159
CA FOUNDATION STATISTICS
• Let x and y are two uncorrelated variables, then Cov(x,y) = 0 and hence,
Bivariate Data
• When a set of data is collected for two variables simultaneously it is called a
Bivariate Data
• Following is a bivariate table for the data relating to marks in maths and statistics
Marks in Mathematics
Marks in 0-4 4-8 8-12 12-16 16-20 Total
Stats 0-4 1 1 2 0 0 4
4-8 1 4 5 1 1 12
8-12 1 2 4 6 1 14
12-16 0 1 3 2 5 11
16-20 0 0 1 5 3 9
Total 3 8 15 14 10 50
Observations:
• A bivariate frequency distribution having m rows and n columns has m x n cells
• Some of the cell frequencies may be zero
From a bivariate distribution we can have the following two types of Uni-variate distributions
i. Two Marginal Distributions
ii. m+n Conditional Distributions
160
CA FOUNDATION STATISTICS
From the above table the two marginal distributions are as follows,
Marks No of students
0-4 3
4-8 8
8-12 15
12-16 14
16-20 10
Total 50
Marks No of students
0-4 2
4-8 5
8-12 4
12-16 3
16-20 1
Total 15
Bivariate Relationship
Between two variables x and y there can exist any of the following three relationship
a. Direct or Positive – with change in one variable x, the other variable y will also
change in the same direction. Eg: Price and quantity supplied: amount of rainfall
and crop yield
b. Indirect or Inverse or Negative – With change in one variable, the other variable will
change in the opposite direction. Eg: Price and quantity demanded.
c. No relation – With change in one variable x, if another variable y doesn’t show any
specific trend (increasing or decreasing), then we say there exist no relation between
x and y.
161
CA FOUNDATION STATISTICS
CLASSWORK SECTION
a) 30 b) 40 c) 20 d) 10
2. The Cov (x, y) =15, what restrictions should be put for the standard deviations of x and y?
a) No restriction
b) The product of the standard deviations should be more than 15
c) The product of the standard deviations should be less than 15
d) The sum of the standard deviations should be less than 15
5. Calculate r, if Cov (x, y) = 10, Var (x) = 6.25 and Var (y) = 31.36.
a) 0.71 b) – 0.71 c) 0.61 d) – 0.61
7. Given that
X -3 0 3
Y 9 0 9
162
CA FOUNDATION STATISTICS
8. Which of the following is the value of correlation coefficient for the following data:
a) 0.987 b) – 0.75 c) 0.75 d) 0.85
9. The co-efficient of correlation between X and Y is 0.6, U and V are two variables
defined as , then the co-efficient of correlation between U and V is
a) 0.6 b) 0.4 c) 0.8 d) 1
12. The Cov (x, y) = 25; v(x) = 36, v(y) = 25 , then the coefficient of correlation is
a) 0.409 b) 0.416 c) 0.833 d) 0.277
13. If n = 10 and d2 = 280, then which of the following represents the value of rank
correlation coefficient?
a) 0.70 b) – 0.7 c) 0.645 d) None of the above
14. For two series we have, d2= 30 and N = 10, find the value of R. (symbols having
usual meanings).
a) 0.28 b) – 0. 82 c) 0. 82 d) – 0.28
163
CA FOUNDATION STATISTICS
15. While computing rank correlation coefficient between profit and investment for the
last 6 years of a company the difference in rank for a year was taken 3 instead of 4.
What is the rectified rank correlation coefficient of it is know that the original value
of rank correlation was 0.4?
a) 0.3 b) 0.2 c) 0.25 d) 0.28
16. The ranks according to two attributes in a sample are given below. The rank
correlation between them would be:
R1: 1 2 3 4 5
R2: 1 2 3 4 5
a) 1 b) 0 c) –1 d) None of the above
17. The coefficient of rank correlation between the marks in Statistics and Mathematics
obtained by a certain group of students is 2/3 and the sum of the squares of the
differences in ranks is 55. How many students are there in the group?
a) 10 b) 9 c) 12 d) more than 15
18. If the sum of squares of differences of rank is 50 and number of items is 8 then what
is the value of rank correlation co-efficient
a) 0.59 b) 0.40 c) 0.36 d) 0.63
19. From the following data calculate the value of coefficient of Rank correlation:
X: 75 88 95 70 60 80 81 50
Y: 120 134 150 115 110 140 142 100
a) 0.93 b) – 0.85 c) 0.85 d) 0.63
20. In a beauty contest there were 10 competitors, rank of these candidates are assigned
by two judges A and B. The sum of squares of difference of rank is 44. The value of
rank correlation is
a) 0.70 b) 0.73 d) 0.80 d) 0.60
164
CA FOUNDATION STATISTICS
21. Find the coefficient of rank correlation between marks obtained by students in
Auditing and Statistics:
22. What is the coefficient of concurrent deviations for the following data:
Supply: 68 43 38 78 66 83 38 23 83 53 48
Demand: 65 60 55 61 35 75 45 40 85 80 85
a) 0.82 b) 0.85 c) 0.89 d) – 0.81
25. What is the coefficient of concurrent deviations for the following data:
Year : 1996 1997 1998 1999 2000 2001 2002 2003
Price : 35 38 40 33 45 48 49 52
Demand : 36 35 31 36 30 29 27 24
a) -1 b) 0.43 c) 0.5 d)
165
CA FOUNDATION STATISTICS
26. The coefficient of concurrent deviation for p pairs of observation was found to be
1/ . If the number of concurrent deviations was found to be 6, then the value of p
a) 10 b) 9 c) 8 d) None of these
Change of Origin and Change of Scale
29. If rxy = 0.6, what will be the value of ruv, where u = 3x + 5 and v = - 4y + 3?
a) – 0.6 b) 0.6 c) 0.36 d) 0.66
30. If the correlation coefficient between the variables x and y is 0.5, then the correlation
coefficient between the variables u = 2x-4 and v = 3-2y is :
a) 1 b) 0.5 c) -0.5 d) 0
31. The coefficient correlation between x and y series is – 0.38. The linear relation
between x and u ; y and v are 3x+5u=3 and -8v-7y=44, what is the coefficient
between u and v?
a) 0.38 b) -0.38 c) 0.40 d) None of these
Miscellaneous Concepts
166
CA FOUNDATION STATISTICS
35. If coefficient of correlation x and Y variables is 0.90 then what will be co-efficient
of determination?
a) 0.30 b) 0.81 c) 0.94 d) None of these
37. If r = + 0.526 and no. of observation are 50, what is the probable error of “r”?
a) 0.069 b) 0.6623 c) 1.6623 d) –0.6629
Theoretical Aspects
38. If r = 0, then there is _______ correlation between the two variables.
a) Negative b) Positive
c) No d) Both positive and negative
41. The co-efficient of correlation between the temperature of environment and power
consumption in India is always:
a) Positive b) Negative c) Zero d) Equal to 1
42. If the variables are inversely proportional to each other, then statistically they are
_________ correlated.
a) Positively b) Not
c) Negatively d) None of the above
167
CA FOUNDATION STATISTICS
44. For which of the following statements the correlation will be positive?
a) Age and Income
b) Speed of an automobile and the distance travelled by it after applying brakes
c) Sale of cold-drinks and day temperature
d) All of the above are positively correlated
45. For which of the following statements the correlation will be negative?
a) Production and price per unit
b) Sale of woolen garments and day temperature
c) Neither (a) nor (b)
d) Both a) and b) above
46. Which of the following is the example of variables which are uncorrelated?
a) Profit & Investment
b) Price & Demand of an item
c) Shoe-size & Intelligence
d) Yield & Rainfall
168
CA FOUNDATION STATISTICS
Properties:
49. In case the correlation coefficient between two variables is 1, which of the following
would be the relationship between the two variables?
a) y = p + qx, q > 0
b) y = p + qx, q < 0
c) y = p + qx, p > 0, q < 0
d) Both a) and b) above
50. If the relationship between two variables x and y is given by 22x + 33y + 84 = 0, then
the value of correlation coefficient between x and y will be:
a) 1.00 b) 0 c) – 1.00 d) Between 0 and 1.00
52. Rank correlation is useful when we study the relationship between __________
characteristics.
a) Quantitative b) Qualitative
c) Both a) and b) above d) Either a) or b) above
53. Rank of beauty contest by two judges are in reverse orders the find the value of
spearmen’s rank correlation co-efficient
a) -1 b) 0 c) 1 d) 0.75
169
CA FOUNDATION STATISTICS
55. If x and y are two correlated variable, then which of the following is true?
a) Var(x+y)=Var(x)+Var(y)+ 2Cov(x,y)
b) Var(x+y) = Var(x) + Var(y)
c) Var(x – y) = Var(x) – Var(y)
d) None of the above is true
56. If x and y are two uncorrelated variables, then which of the following relation is
TRUE?
a) Var (x + y) = Var (x) + Var (y)
b) Var (x - y) = Var (x) + Var (y)
c) Both a) and b) above are true
d) Only a) above is true
57. In the method of concurrent deviations, only the direction of change (Positive/
Negative direction) in the variable are taken into account for calculation of
a) Co efficient of SD
b) Co efficient of regression
c) Co efficient of correlation
d) None
58. Relation between ranks _____
a) Linear b) May (or) may not be linear
c) Both (a) and (b) d) None
170
CA FOUNDATION STATISTICS
Application of r:
60. ________ of correlation co-efficient is a measure of reliability.
a) Standard Error b) Statistical Error
c) Probable Error d) Both a) and c) above
63. The ratio of coefficient of non-determination and the square root of the number of
observation is known as:
a) Probable Error b) Error
c) Standard Error d) None of the above
Scatter Diagram:
64. When the correlation coefficient r=+1, all the points in a scatter diagram would be
a) On a straight line directed from upper left to lower right
b) On a straight line directed from lower left to upper right
c) On a straight line
d) Both (a) and (b)
65. The co-efficient of correlation between x and y is 0.6. if x and y both are multiplied
by -1, then the co-efficient of correlation will be:
a) 0.6 b) - 0.6 c) d)
171
CA FOUNDATION STATISTICS
67. The scatter diagram gives us the ______ ideas about the correlation between the
two variables.
a) Exact b) Vague c) No d) None of the above
69. On a scatter diagram if one is not able to recognize a specific pattern, then the two
variables depicts _______ correlation.
a) No
b) Positive
c) Both negative and positive
d) Perfectly Negative
Bivariate Data:
70. For bivariate data, there exist how many types of relationship between the variables?
a) One b) Two c) Three d) More than three
71. If the data are collected for two variables simultaneously, it is known as:
a) Univariate Data
b) Bivariate Data
c) Conditional Data
d) Marginal Data
72. The frequency distribution related to two variables collected at the same point of
time is known as:
a) Bivariate Frequency Distribution
b) Joint Frequency Distribution
c) Two-way Frequency Distribution
d) All of the above
172
CA FOUNDATION STATISTICS
73. From the Bivariate Frequency Distribution, we can obtain which of the following
Univariate distribution?
a) Marginal distribution
b) Conditional distribution
c) Both a) and b) above
d) Neither a) nor b) above
77. In a (m x n) bivariate distribution table, some of the cell frequencies may be:
a) Zero b) Negative
c) Both a) and b) above d) None of the above
173
CA FOUNDATION STATISTICS
REGRESSION ANALYSIS
Introduction
• Regression is the average linear relationship between two or more variables.
• The word regression implies “estimation or prediction”. In other words through regression
equations we can quantify the relationship between two variables and we can predict the
average value of one variable corresponding to a specific value of the other.
• Regression equation enables us to find the nature and the extent of relationship between
two variables. Correlation can measure only the degree of association between the two
variables whereas regression quantifies such relationship.
• The two variables are dependent and independent variable. Thus, we try to estimate the
average value of dependent variable, for a specified value of independent variable using
regression analysis.
• If there are two variables, then the independent variable is called the “Regressor” or
“Explaining Variable” and the dependent variable is called the “Regressed” or “Explained
Variable”.
• This average linear relationship between two variables is expressed by means of two
straight line equation known as regression lines or regression equations.
• If there are two variables x and y we can have the following two types of regression lines,
i. Regression equation of y on x (y dependent, x independent)
ii. Regression equation of x on y (x dependent, y independent)
174
CA FOUNDATION STATISTICS
REGRESSION LINES
• byx stands for regression coefficient of • bxy stands for regression coefficient of
y on x x on y
• This equation will be of the form • This equation will be of the form
y = a + bx x = a + by
• This equation is used to estimate the • This equation is used to estimate the
value of y given the value of x value of x given the value of y
• The slope of this equation is byx • The slope of this equation is bxy
175
CA FOUNDATION STATISTICS
3. When deviations are taken from 3. When deviations are taken from
actual mean i.e., and such that actual mean i.e., and such that
4. When deviations are taken from 4. When deviations are taken from
assumed mean say A & B for x and y, assumed mean say A & B for x and y,
u=x-A, v = y-B u=x-A, v = y-B
5. Using ‘r’ 5. Using ‘r’
and r = Correlation co-efficient between and r = Correlation co-efficient between
x and y x and y
1. byx = slope of the regression line of y on x which measures the change in variable y for a unit
change in variable x.
176
CA FOUNDATION STATISTICS
2. bxy= slope of the regression line of x on y which measures the change in variable x for a unit
change in variable y.
3. Correlation coefficient is symmetric i.e., ryx= rxy but regression coefficients are not
symmetric byx bxy .
byx bxy r
+ + +
- - -
Note:
When byx and bxy are of opposite signs, data are inconsistent, r is imaginary.
8. Regression coefficients are independent of the Change of Origin but they are dependent on
Change of Scale. If and then
i. byx = bvu
9. There is no specific range within which two regression coefficients will lie but their values
should be such that the square root of the product of two regression coefficients must lie
between -1 and +1 (both inclusive). Thus, if one of the regression coefficient, is greater
than unity then the other must be less than unity.
177
CA FOUNDATION STATISTICS
• Two regression lines always intersect at their mean or average values ( . In other
words if we solve two regression equations we get the average values of x and y.
• When r = 0, then
i. byx = bxy=0
iii. Nothing can be predicted from the two regression lines since, the variables become
independent.
iv. The angle between the two regression lines becomes 90o i.e., they are perpendicular to
each other.
• When r = , then
i. The two regression lines become identical i.e., they coincide.
ii. byx=
iii. Perfect linear co-relationship is observed and the angle between the two regression
lines becomes 0o.
• As the angle between two regression lines numerically decreases from 90o to 0o, the
correlation increases from 0 to 1 and the two regression lines comes closer to each other.
• Angle between two regression lines; if A is the angle between two regression lines then
tan A =
178
CA FOUNDATION STATISTICS
Miscellaneous Properties:
• In regression analysis, the difference between the Observed value and the Estimated value
is known as Residue or Error.
v. All the points on the scatter diagram will lie on the regression line
viii. For a given value of one variable, we have a fixed value of the other variable
179
CA FOUNDATION STATISTICS
CLASSWORK SECTION
2. Taking data from the previous question, find the regression equation of Y on X.
a) Y = 0.9X – 1.3 b) Y = 0.9X – 0.6
c) Y = 0.9X + 1.3 d) Y = 0.9X + 0.6
3. The information given below relates to the advertisement expenses and sales
revenue of a company.
Adv Exp Rs. Lakh Sales Rs Lakh
Mean 20 100
S.D. 3 12
r = +0.8
180
CA FOUNDATION STATISTICS
5. Given below the information about the capital employed and profit earned by a
company over the last twenty five years:
Mean SD
Capital employed (0000 `) 62 5
Profit earned (000 `) 25 6
Correlation coefficient between capital employed and profit = 0.92. The sum of the
Regression coefficients for the above data would be:
(a) 1.871 (b) 2.358 (c) 1.968 (d) 2.346
6. With bxy = 0.5,. r = 0.8 and variance of y = 16, standard deviation of x equals to:
a) 6.4 b) 2.5 c) 10.0 d) 26.5
9. If Mean of x = 10, Mean of y = 50, SD(x) = 3, SD(y) = 15, r = 0.9, then find the
estimated value of x for corresponding y = 100.
a) 18 b) 19 c) 20 d) 21
10. While calculating the coefficient of correlation between two variables X and Y the
following results were obtained:
The number of observation N=25
It was however, later discovered at the time of checking that two pairs of observations
(X,Y) were copied (6,14) and (8,6) while the correct values (8,12) and (6,8) respectively.
The correct value of byx is:
a) 0.6 b) 0.7 c) 0.8 d) 0.65
181
CA FOUNDATION STATISTICS
17. If bxy = - 1.2 and byx = - 0.3, then the coefficient of correlation between x and y is:
a) – 0.698 b) – 0.36 c) – 0.51 d)– 0.6
19. Given bxy = 1.36, byx = 0.613, then the value of coefficient of determination is:
a) 0.734 b) 0.634 c) 0.534 d) 0.834
182
CA FOUNDATION STATISTICS
20. Given bxy = 0.756, byx = 0.659, then the value of coefficient of non-determination is
given by:
a) 0.402 b) 0.502 c) 0.602 d) 0.702
21. Find the mean of x and y, if the two regression lines are
3x – y – 5 = 0 and 2x – y – 4 = 0.
a) 1, - 2 b) – 1, 2 c) 2, - 1 d) – 2, - 1
24. If the Coefficient of non-determination is 0.502, then the Coefficient of alienation is:
a) 0.71 b) 0.61 c) 0.51 d) 0.81
26. Given =3, =12 and r=1 then the standard error of estimate of X on Y is:
a) 4 b) 3 c) 2 d) 0
183
CA FOUNDATION STATISTICS
Identification Problems
29. Which of the following regression equations represents regression lines of Y on X :
7x + 2y + 15 =0, 2x + 5y + 10 =0
a) 7x + 2y+15=0 b) 2x+5y+10=0
c) Both a) and b) above d) None of these
30. In a partially destroyed record, the following data are available: variance of
x = 9 and the regression lines are 8x – 10y + 66 = 0 and 40x – 18y = 214, find the
coefficient of correlation between x and y.
a) – 0.6 b) 0.36 c) – 0.36 d) + 0.6
31. Taking data from the previous question, what would be the standard deviation of y?
a) 9 b) 4 c) 6 d) 5
32. Two random variables have the regression lines 3x+2y=26 and 6x+y=31. The
coefficient of correlation between x and y is :
a) -0.25 b) 0.5 c) -0.5 d) 0.25
33. For a bivariate data, two types of regression are 40x – 18y =214 and 8x-10y+66=0,
then find the values of and
a) 17 and 13 b) 13 and 17 c) 13 and -17 d) -13 and 17
34. The equations of two regression lines are x + y = 6 and x + 2y = 10, then the value
of correlation between x and y is
a) b) c) d)
35. The two regression lines are 7x-3y-18=0 and 4x-y-11=0. Find the value of byx and
bxy
a) b) c) d) none of these
184
CA FOUNDATION STATISTICS
Theoretical Aspects
37. The word regression is used to denote ________ of the average value of one
variable for a specified value of the other variable.
a) Estimation b) Prediction
c) Either a) or b) above d) None of the above
42. If the curve plotted on a Scatter Diagram is a straight line, it is called the:
a) Line of correlation b) Line of scatter diagram
c) Line of regression d) Both a) and c) above
185
CA FOUNDATION STATISTICS
46. For two variables, the number of regression lines would be:
a) 1 b) 3 c) 2 d) Greater than 3
47. For the relationship between two variables X and Y, we have which of the
following’s lines of regression?
a) Y on X b) X on Y
c) Both a) and b) above d) Only b) above
48. Which of the following regression equations is used to estimate Y, when the value
of X is known?
a) X on Y b) Y on X
c) Either of the two can be used d) Neither of the two can be used
49. Since Yield of a crop depends upon amount of rainfall, we need to consider:
a) The regression equation of yield on rainfall
b) The regression equation of rainfall on yield
c) Any one of a) or b) above can be considered
d) Neither of a) or b) can be considered
186
CA FOUNDATION STATISTICS
51. The method applied for deriving the regression equations is:
a) Fitting of Normal Equations b) Rank Correlation Method
c) Least Square Method d) Product Moment Method
187
CA FOUNDATION STATISTICS
Properties:
57. When the slope of two regression lines are equal:
a) The lines are perpendicular to each other.
b) The lines will coincide.
c) The lines will be parallel to each other.
d) None of the above.
61. The sign analogy of correlation coefficient and two regression coefficients is:
a) -, +, + b) -, -, - c) +, +, + d) Both b) and c) above
63. As “r” increases numerically from 0 to 1, the angle between the regression lines:
a) Increases from 0o to 90o
b) Diminishes from 90o to 0o
c) Increases from 0o to 180o
d) Both a) and c) above
188
CA FOUNDATION STATISTICS
65. If correlation coefficient between two variables is zero, which of the following is
true?
a) Both regression coefficients are greater than one.
b) Both regression coefficients are negative.
c) One of the regression coefficient is zero.
d) Both regression coefficients are zero.
66. In regression analysis the difference between the observed value and the
estimated value is known as:
a) Residue b) Deviation c) Error d) Either a) or c) above
68. Which of the following statement/s is/are FALSE regarding the regression
coefficient?
a) If one of the regression coefficient is greater than unity the other one is less
than unity.
b) The product of two regression coefficient is equal to the square of the correlation
coefficient between the two variables.
c) The regression coefficient lies between – infinity to + infinity.
d) None of the above is FALSE.
189
CA FOUNDATION STATISTICS
a) b) c) d)
190
CA FOUNDATION STATISTICS
HOMEWORK SECTION
2. Take 200 and 150 respectively as the assumed mean for X and Y series of 11 values,
then dx = X – 200, dy = Y – 150, dx = 13 dx2 = 2667, dy = 42, dy2 = 6964, dx
dy = 3943. The value of r is:
(a) 0.77 (b) 0.98 (c) 0.92 (d) 0.82
3. For some bivariate data, the following results were obtained for the two variable x
and y:
= 53.2, = 27.9, byx = -1.5, bxy = -0.2
The most probable value of y when x = 60 is:
(a) 15.6 (b) 13.4 (c) 19.7 (d) 17.7
4. If the sum of squares of the rank difference in mathematics and physics marks of 10
students is 22, then the coefficient of rank correlation is:
(a) 0.267 (b) 0.867 (c) 0.92 (d) None
5. Two random variables have the regression lines 3x + 2y = 26 and 6x + y = 31. The
coefficient of correlation between x and y is:
(a) – 0.25 (b) 0.5 (c) – 0.5 (d) 0.25
6. The coefficient of correlation between X and Y is 0.6. U and V are two variables
defined as U = , V = , then the coefficient of correlation between U and
V is:
(a) 0.6 (b) 0.4 (c) 0.8 (d) 1
191
CA FOUNDATION STATISTICS
8. The following data is given, based on 450 students for marks in Statistics and
Economics at a certain examination:
Mean marks in statistics = 40
Mean marks in Economics = 48
S.D. of marks (Statistics) = 12
Variance of marks (Economics) = 256
Sum of the products of deviations of marks from their respective mean = 42075
The average marks in Economics of candidates who obtained 50 marks in Statistics
is:
(a) 45 (b) 54.5 (c) 54 (d) 47.5
10. If the covariance between two variables is 20 and the variance of one of the variables
is 16, what would be the variance of the other variable?
(a) More than 10 (b) More than 100
(c) More than 1.25 (d) Less than 10
11. Assume 69 and 112 as the mean values for X and Y respectively. dx = 47, dx2 =
1475, dy = 108, dy2 = 3468, dx dy = 2116 and N = 8. When dx = X – 69,
dy = Y – 112. Then the value of r is:
(a) 0.95 (b) 0.65 (c) 0.75 (d) 0.85
13. The lines of regression are as follows: 5x – 145 = –10y ; 14y – 208 = –8x. The mean
value is:
(a) (12, 5) (b) (5, 7) (c) (7, 12) (d) (5, 12)
192
CA FOUNDATION STATISTICS
14. The coefficient of rank correlation of marks obtained by 10 students, in English and
Economics was found to be 0.5. It was later discovered that the difference in ranks
in the two subjects obtained by one student was wrongly taken as 3 instead of 7.
The correct coefficient of rank correlation is:
(a) 0.32 (b) 0.26 (c) 0.49 (d) 0.93
16. The method applied for deriving regression equations is known as:
(a) Concurrent deviation (b) Product moment
(c) Least squares (d) Normal equation
17. The coefficient of correlation between x and y series from the following data:
X series Y series
Number of pairs of observations 15 15
Arithmetic Mean 25 18
Standard Deviation 3.01 3.03
Sum of the squares of deviation from mean 136 138
Sum of the product of the deviations of x and y series from their respective means
= 122, is:
(a) 0.89 (b) 0.99 (c) 0.69 (d) 0.91
19. If the correlation coefficient between two variables is 1, then the two lines of
regressions are:
(a) Parallel (b) At right angles
(c) Coincident (d) None of these
193
CA FOUNDATION STATISTICS
20. If the sum of square of differences of rank is 50 and number of items is 8 then what
is the value of rank of rank correlation coefficient
(a) 0.59 (b) 0.40 (c) 0.36 (d) 0.63
25. The correlation coefficient between x and y is – ½. The value of bxy = – 1/8.
find byx.
(a) – 2 (b) – 4 (c) 0 (d) 2
26. Ranks of two characteristics by two judges are in reverse order then find the value
of spearman rank correlation co-efficient.
(a) – 1 (b) 0 (c) 1 (d) 0.75
194
CA FOUNDATION STATISTICS
28. If the rank correlation co-efficient between marks in Management and Mathematics
for a group of students is 0.6 and the sum of the squares of the difference in ranks
is 66. Then what is the number of students in the group?
(a) 9 (b) 10 (c) 11 (d) 12
30. The two regression lines are 7x – 3y – 18 = 0 and 4x – y – 11 = 0. Find the values of
byx and bxy.
(a) 7/3, ¼ (b) –7/3, –1/4
(c) –3/7, –1/4 (d) None of these
31. If ‘P’ is the simple correlation coefficient, the quantity P2 is known as:
(a) Coefficient of determination
(b) Coefficient of Non-determination
(c) Coefficient of alienation
(d) None of the above
32. ______ of the regression Coefficients is greater than the correlation coefficient
(a) Combined mean (b) Harmonic mean
(c) Geometric mean (d) Arithmetic mean
195
CA FOUNDATION STATISTICS
35. If the sum of the product of deviations of x and y series from their means is zero,
then the coefficient of correlation will be
(a) 1 (b) –1 (c) 0 (d) None of these
40. The covariance between two variables X and Y is 8.4 and their variances are 25 and
36 respectively. Calculate Kari Pearson’s coefficient of correlation between them
(a) 0.82 (b) 0.28 (c) 0.01 (d) 0.09
196
CA FOUNDATION STATISTICS
41. For a bivariate data, two lines of regression are 40x–18y = 214 and 8x–10y+66 = 0,
then find the values of mean of x and y?
(a) 17 and 13 (b) 13 and 17
(c) 13 and – 17 (d) – 13 and 17
42. Three competitors in a contest are ranked by two judges in the order 1, 2, 3 and 2,
3, 1 respectively. Calculate the Spearman’s rank correlation coefficient.
(a) – 0.5 (b) – 0.8 (c) 0.5 (d) 0.8
43. Out of the following which one affects the regression co-efficient.
(a) Change of Origin only
(b) Change of scale only
(c) Change of scale & origin both
(d) Neither change of origin nor change of scale
44. For a bivariate data, the lines of regression of Y on X, and of X on Y are respectively
2.5Y – X = 35 and 10X – Y = 70, then the Correlation coefficient r is equal to:
(a) 0.2 (b) – 0.2 (c) 0.5 (d) – 0.5
45. If one of regression coefficient is ______ unity, the other must be _____ unity.
(a) more than, more then (b) Less than, Less then
(c) more than, less than (d) Positive, Negative
46. If Y is dependent variable and X is independent variable and the S.D. of X and Y are
5 and 8 respectively and Co-efficient of co-relation between X and Y is 0.8. Find the
Regression coefficient of Y on X.
(a) 0.78 (b) 1.28 (c) 6.8 (d) 0.32
47. If the regression lines are 8x – 10y + 66 = 0 and 40x – 18y = 214, the correlation
coefficient between ‘x’ and ‘y’ is:
(a) 1 (b) 0.6 (c) – 0.6 (d) –1
48. The coefficient of correlation between two variables x and y is the simple _____ of
the two regression coefficients.
(a) Arithmetic Mean (b) Geometric Mean
(c) Harmonic Mean (d) None of the above
197
CA FOUNDATION STATISTICS
50. If the covariance between variables X and Y is 25 and variance of X and Y are
respectively 36 and 25, then the coefficient of correlation is
(a) 0.409 (b) 0.416 (c) 0.833 (d) 0.0277
52. In Spearman’s Correlation Coefficient, the sum of the differences of ranks between
two variables shall be ____
(a) 0 (b) 1 (c) – 1 (d) None of the above
53. For certain x and y series which are correlated, the two lines of regression are
5x – 6y + 9 = 0
15x – 8y – 130 = 0
The correlation coefficient is
(a) 4/5 (b) 3/4 (c) 2/3 (d) 1/2
54. The Coefficient of correlation between x and y series is – 0.38. The linear relation
between x & u and y & v are 3x + 5 u = 3 and –8y–7v=44, what is the coefficient of
correlation between u & v?
(a) 0.38 (b) – 0.38 (c) 0.40 (d) None of the above
56. If ‘r’ be the Karis Pearson’s coefficient of correlation in a bivariate distribution then
the two regression lines are at right angle if:
(a) r = ± 1 (b) r=0
(c) r = ± any finite value whose numerical value is less than 1
(d) None of these
198
CA FOUNDATION STATISTICS
58. The coefficient of correlation between two variables x and y is 0.28. Their covariance
is 7.6. If the variance of x is 9, then the standard deviation of y is:
(a) 8.048 (b) 9.048
(c) 10.048 (d) 11.048
59. Two variables x and y are related according to 4x + 3y = 7. Then x and y are:
(a) Positively correlated (b) Negatively correlated
(c) Correlation is zero (d) None of these
62. If mean of x and y variables is 20 and 40 respectively and the regression coefficient
of y on x is 1.608, then the regression line of y on x is
(a) y = 1.608x + 7.84 (b) y = 1.5x + 4.84
(c) y = 1.608x + 4.84 (d) y = 1.56x + 7.84
63. When the value of correlation coefficient is + 1 or –1, then the two regression lines
will _____
(a) have 30° angle between them (b) have 45° angle between them
(c) coincide (d) be perpendicular to each other
199
CA FOUNDATION STATISTICS
65. When each individual gets the exactly opposite rank by the two Judges, then the
rank correlation will be ____
(a) 0 (b) –1 (c) + 1 (d)
66. If the mean of the two variables ‘x’ and ‘y’ are 3 and 1 respectively. Then the
equation of two regression lines are ______
(a) 5x + 7y – 22 = 0, 6x + 2y – 20 = 0
(b) 5x + 7y – 22 = 0, 6x + 2y + 20 = 0
(c) 5x + 7y + 22 = 0, 6x + 2y – 20 = 0
(d) 5x + 7y + 22 = 0, 6x + 2y + 20 = 0
67. The equation of two lines of regression for ‘x’ and ‘y’ are 5x = 22 + y and
64x = 24 + 45y then the value of regression coefficient of ‘y’ on ‘x’ will be _____
(a) 5 (b)
(c) (d)
68. If the correlation coefficient between two variables is zero, then the lines of regression
are:
(a) Parallel (b) Perpendicular (c) Coincide (d) None of these
69. If the value of correlation coefficient between x & y is 1, then the value of correlation
coefficient between u = x – 2 and v = is:
(a) 1 (b) –1 (c) –1/2 (d) 1/2
70. The equations of two regression lines are x + y = 6 and x + 2y = 10, then the value
of correlation coefficient between x and y is:
(a) –1/2 (b) +1/2 (c) (d)
200
CA FOUNDATION STATISTICS
72. When the correlation coefficient r is equal to + 1, all the points in a scatter diagram
would be
(a) On a straight line directed from upper left to lower right
(b) On a straight line directed from lower left to upper right
(c) On a straight line
(d) Both (a) and (b)
(a) (b)
(c) (d)
74. In case of “Insurance Companies” profits and the number of claims they have to pay
there is ____ correlation.
(a) Positive (b) Negative
(c) No correlation (d) None of the above
76. If the coefficient of correlation between X and Y variables is + 0.90 then what will
be the coefficient of determination?
(a) 0.30 (b) 0.81 (c) 0.94 (d) None of these
201
CA FOUNDATION STATISTICS
80. Out of the following the one which effects the regression coefficient is
(a) Change of origin only
(b) Change of scale only
(c) Change of scale and origin both
(d) Neither change in origin nor change of scale
82. in a beauty contest there were 10 competitors. Rank of these candidates are assigned
by two judges A and B. The sum of squares of differences of ranks is 44. The value
of rank correlation is:
(a) 0.70 (b) 0.73 (c) 0.80 (d) 0.60
83. The coefficient of correlation between the temperature of environment and power
consumption is always:
(a) Positive (b) Negative (c) Zero (d) Equal to 1
84. If two regression lies are x + y = 1 and x – y = 1 then mean values of x and y will be:
(a) 0 and 1 (b) 1 and 1 (c) 1 and 0 (d) – 1 and – 1
85. The coefficient of correlation between x and y is 0.6. If x and y values are multiplied
by –1, then the coefficient of correlation will be:
(a) 0.6 (b) – 0.6 (c) (d) 1 – 0.6
202
CA FOUNDATION STATISTICS
86. If two regression lines are 5y = 9x – 22 and 20x = 9y + 350, then the value of
correlation coefficient (r) will be:
(a) 0.10 (b) –0.10 (c) – 0.90 (d) 0.90
89. The correlation coefficient (r) is the _____ of the two regression coefficients (byx and
bxy)
(a) AM (b) GM (c) HM (d) Median
90. If there is a constant increase in a series, then the corresponding graph will be
(a) Convex curve
(b) Concave curve
(c) Parabola
(d) Straight line from the left to the right
91. If the plotted points is a scatter diagram are evenly distributed, then the correlation
is
(a) Zero (b) Negative (c) Positive (d) (a) or (b)
(a) (b)
203
CA FOUNDATION STATISTICS
94. In the method of Concurrent Deviations, only the directions of change (Positive
direction/Negative direction) in the variables are taken into account for calculation
of
(a) Coefficient of SD (b) Coefficient of regression
(c) Coefficient of correlation (d) none
96. In case speed of an automobile and the distance traveled by the car after applying
brakes correlation is
(a) Positive (b) Negative (c) Zero (d) None
380
97. A relationship r2 = 1 – 200 is not possible
(a) True (b) False (c) Both (d) None
101. If the two regression lines are 3X = Y and 8Y = 6X, then the value of correlation
coefficient is
(a) 0.5 (b) – 0.5 (c) 0.75 (d) – 0.80
204
CA FOUNDATION STATISTICS
103. If the correlation coefficient between the variables X and Y is 0.5, then the correlation
coefficient between the variables 2x – 4 and 3 – 2y is
(a) 1 (b) 0.5 (c) – 0.5 (d) 0
(a) (b)
(c) (d)
108. Determine Spearman’s rank correlation coefficient from the given data ,
n = 10:
(a) r = 0.82 (b) r = 0.32 (c) r = 0.40 (d) None of the above
205
CA FOUNDATION STATISTICS
HOMEWORK SOLUTION
2. (c) As we know
or y = 107.7 – 1.5x
when x = 60 then
y = 107.7 – 1.5 × 60 = 17.7
206
CA FOUNDATION STATISTICS
\ byx = –3/2
and 6x + y = 31
\ by = –1/6
Now r2 = byx . bxy
= – 0.25
r = – 0.5
Hence, our assumption hold and r = – 0.5 ( – 1 r 1)
Note: r is negative because byx and bxy < 0
\x<0
6. (a) Since correlation coefficient (Karl Pearson’s) is independent of both scale and
origin, therefore,
p(u, v) = p(x, y) = 0.6
It may be noted that if
µ1 = ax1 + b and V1 = cy1 + d then
r(u, v) = P(x, y) if a and c are of same signs
– p(x, y) if a and c are of opposite signs
207
CA FOUNDATION STATISTICS
7. (c)
S.No. Rank in Botany (x1) Rank in Chem (y1) d = x1y1 d2
1 1 2 –1 1
2 2 3 –1 1
3 3 1 2 4
4 4 5 –1 1
5 5 4 1 1
Total – – 0 8
Hence, coefficient of rank correlation
(x-x) (y-y)
bxy = 2
n5x
42075
bxy = = 0.65
450x144
a = y - bx
=48-0.65 (40) = 22
Now, y= a+bx
y = 22 + 0.65 x
When x = 50 then
y = 22 + 0.65 (50)
=54.5
208
CA FOUNDATION STATISTICS
Here (2c – n) is negative, so negative sign is taken at both the places so,
Now, by definition
11. (a)
12. (a) In spearman’s rank correlation, the association need not be linear since it is
based on ranks rather than on actual observations.
209
CA FOUNDATION STATISTICS
13. (d) As we know, that the mean values of 2 regression equations are their points of
intersection, therefore solving the equations simultaneously.
Given:
Subtracting (4) from (3), we get:
14. (b) Coefficient of rank correlation:
16. (c) When the relationship between two variables is linear, the method of least
squares is used for obtaining the best values of the constants in an appropriate
equation. Therefore, the above method is applied for deriving regression equation.
210
CA FOUNDATION STATISTICS
19. (c) As the value of r increases numerically from O to 1, the angle between regression
equations decreases from 90° to 0°. In other words, the farther the two regression
lines are from each other, the lesser is the degree of correlation (i.e. approaching
1) and nearer the two regression lines are to each other, the higher is the degree of
correlation (i.e. approaching + 1).
The above explanation clarifies if r = 1. The two lines of regressions are coincident.
211
CA FOUNDATION STATISTICS
ruv = 0.46
Note: The two correlation coefficients remain equal when b and d, the two scales
have similar signs.
Again, 3x + y – 13 = 0
Thus, r2 = byx × bxy
Since |r| 1 r2 1, our assumptions are correct. Thus, 2x + 5y – 20 = 0 truly
represents the regression line of y on x.
23. (c) The probable error of the coefficient of correlation helps in interpreting its
value. Since the coefficients of correlation are generally, computed from samples,
they, like other statistical quantities, are subjected to errors of sampling. So from
interpretation point of view probable error of the coefficient of correlation is
very useful. When r 6 × P.E., then coefficient of correlation is significant and the
correlation exists.
212
CA FOUNDATION STATISTICS
24. (b) To find the value of y when x’s value is given, regression equation of Y on X
should be known.
Let us assume that 2x + 3y + 18 = 0 represents the regression line of Y on X”
and x + 2y – 25 = 0 represents the regression line of X on Y.
Now, 2x + 3y + 18 = 0
Again x + 2y – 25 = 0
x = 25 – 2y
\ bxy = –2
Thus, r2 = byx × bxy
Since |r| 1 = r2 1, our assumption is wrong. Thus, 2x + 3y + 18 = 0 truly
represents the regression line of X on Y and x + 2y – 25 = 0 truly represents the
regression line of Y on X.
\ x + 2y – 25 = 0 . . . (1)
Substituting x = 9 in (1)
9 + 2y – 25 = 0
2y = 25 – 9
y=
y=8
\ When x = 9 then y = 8
byx = –2
213
CA FOUNDATION STATISTICS
26. (a) Lets solve this question by taking an example. Suppose the hypothetical value
of n be 5.
Then,
Rank by I Judge Ran by II Judge d = Rx Ry d2
(Rx) (Ry)
1 5 –4 16
2 4 –2 4
3 3 0 0
4 2 +2 4
5 1 +4 16
40
Spearman’s Rank Correlation Coefficient
Note:
Students may, however, solve this question by taking any value of n. But, they
should remember that the ranks given by two judges are in reverse order.
214
CA FOUNDATION STATISTICS
2x + 5y + 10 = 0
r2 = bxy × byx
r = –0.33
Since – 1 r 1 \ our assumption is correct So, 2x + 5y + 10 = 0 is the regression
line of Y on X.
n(n2 – 1) = 990
n = 10
Therefore, the number of students = 10
215
CA FOUNDATION STATISTICS
29. (b) When X and Y move in opposite direction, then correlation coefficient will be
negative.
4x – y – 11 = 0
4x = y + 11
r2 = bxy × byx
Since value of r is lying between – 1 and 1 therefore our assumption was correct.
So, byx = and bxy =
31. (a) Better measure for measuring correlation is provided by the square of correlation
coefficient, known as ‘coefficient of determination’ which is expressed as
32. (d) Correlation Coefficient (r) is the Geometric Mean (G.M.) between two co
regression coefficients.
Since, AM > GM > HM
Therefore, AM of regression coefficients is greater than correlation coefficient.
216
CA FOUNDATION STATISTICS
Here r2= bxy × byx = 0.33<1
Hence, x+2y-5 = 0 is regression line of y on x.
217
CA FOUNDATION STATISTICS
36. (c)
Judge 1 (r1) Judge 2 (r2) d d2
A 1 5 –4 16
B 2 4 –2 4
C 3 3 0 0
D 4 2 2 4
E 5 1 2 16
40
= 1 – 2 = –1
37. (d) Regression coefficient are independent of change of origin but not of scale (As
per Fundamental Principle)
Here r2= bxy × byx = 0.33<1
Hence, x+2y-5 = 0 is regression line of y on x.
218
CA FOUNDATION STATISTICS
= 0.28
42. (a)
1 2 –1 1
2 3 –1 1
1 +2
43. (b) The regression coefficients remain unchange due to a shift of origin but change
due to a shift of scale.
219
CA FOUNDATION STATISTICS
On comparing
y = a + bx
we get b =
Comparing from x = a + by
we get b = bxy
coefficient of correlation (r) =
= 0.2
45. (c) If one of regression co-efficient is more than unity, the other must be less than
unity.
220
CA FOUNDATION STATISTICS
y = 6.6 + 0.8x
on comparing y = a + bx
we get b = byx = 0.8
and IInd regression line
40x – 18y = 214
40x = 214 + 18y
x = 5.35 + 0.45y
on comparing x = a + by
we get b = bxy = 0.45
coefficient of correlation between x & y
= ± 0.6
\ r = 0.6 (\ bxy and byx are positive)
48. (b) The coefficient of correlation between two variables x and y is the simple
geometric mean of the two regression coefficient.
49. (b) If two variables are uncorrelated (i.e. r = 0) then regression lines are
perpendicular.
221
CA FOUNDATION STATISTICS
= 0.833
51. (a) Since the two lines of regression pass through the point ( ), the mean values
( ) can be obtained as the point of intersection of the two regression lines.
52. (a) In spearman’s correlation coefficient, the sum of the differences of ranks
between two variable shall be any number.
Coefficient of correlation
= ± 2/3
222
CA FOUNDATION STATISTICS
y
u = a + bx v = c + dy
We get We get
y
ruv = – 0.38
x = a + by
We get b = bxy = 1/18
223
CA FOUNDATION STATISTICS
x = a + 3y y = a + bx
We get We get
= + 0.87
sy = 9.048
224
CA FOUNDATION STATISTICS
y = a + bx x = a + by
We get We get
b = –4/3 = byx b = –3/4 = bxy
r = –1 (Negative correlated)
61. (c) Price and Demand is the example for negative correlation.
62. (a)
The Regression equation of line y on x
y – 40 = 1.608 (x – 20)
y – 40 = 1.608x – 32.16
y = 1.608x – 32.16 + 40
y = 1.608x + 7.84
63. (c) When the value of correlation coefficient is +1 or –1 then the two regression
line will coincide.
225
CA FOUNDATION STATISTICS
65. (b) When each individual gets the exactly opposite rank by the two Judges, then
the rank correlation will be – 1.
68. (b) If the correlation coefficient b/w two variables is zero, then the lines of
regression are perpendicular.
226
CA FOUNDATION STATISTICS
227
CA FOUNDATION STATISTICS
= + 0.6
72. (b) When the correlation coefficient r is equal to ‘+1’, all the points in a scatter
diagram on a straight line directed from lower left to upper Right.
228
CA FOUNDATION STATISTICS
79. (a) The two Regression lines passing through or (Interest) at their means.
229
CA FOUNDATION STATISTICS
= 1 – 0.267
= 0.733
= 0.73
83. (a) The coefficient of correlation between the temperature of environment and
power consumption is always positive.
x = 1 in equation (1) we get
1+y=1
y=0
Mean of x = x = 1
Mean of y = y = 0
230
CA FOUNDATION STATISTICS
85. (a) The coefficient of correlation between X and Y is 0.6. If X and Y values are
multiplied by – 1 then coefficient to correlation remains unchanged. Then are
coefficient of correlation will be 0.6.
90. (d) If there is a constant increase in a series, then the corresponding graph will be
straight line from the left to the right.
231
CA FOUNDATION STATISTICS
91. (a) If the plotted points in a scatter diagram are evenly distributed, then the
correlation is Zero.
92. (d) The Co-variance between two variables is either positive or negative or zero.
94. (c) The Method of concurrent Deviation, only the directions of change (Positive
direction/ Negative direction) in the variables are taken into account for
calculation of Coefficient of correlation.
96. (a) In case ‘speed of on automobile and the distance required to stop the car often
applying brakes’ correlation is positive.
is not possible
So, it is true.
98. (b) Rank correlation coefficient lies between – 1 to + 1 inclusive of both value.
99. (a) The two line of regression intersect at the point of Mean.
Here, byx × bxy 1 which is satisfied.
So 1st equation x + 2y – 5 = 0 is the Regression Equation y on x.
232
CA FOUNDATION STATISTICS
bxy =
Coeff. of correlation is given by
= 0.5
Here, b and d both have different sign so ruv = –rxy
= –0.5
233
CA FOUNDATION STATISTICS
= 0.067
i i
i=1
234
CA FOUNDATION STATISTICS
= 0.82
109. (c) The regression line of y on x is given by y = x + 2
x–y+2=0
by x = 1
coeff. of correlation (r) = 0.5
then Regression coefficient y or x
1
=2
0.5
235
CA FOUNDATION STATISTICS
3. When high values of one variable are associated with high values of the other & low
values of one variable are associated with low value of another, then they are said
to be
a) Positively correlated b) Directly correlated
c) Both d) None
4. A small value of r indicates only a ____ linear type of relationship between the
variables
a) Good b) Poor c) Maximum d) Highest
6. For finding degree of agreement about beauty between 2 judges in a Beauty Consent,
we use
a) Scatter diagram b) Coefficient of rank correlation
c) Coefficient of correlation d) Coefficient of concurrent deviation
7. If the plotted points in a scatter diagram are evenly distributed, then the correlation is
a) zero b) Negative c) Positive d) (a) or (b)
236
CA FOUNDATION STATISTICS
8. The correlation coefficient being +1 if the slope of the straight line a scatter diagram
is
a) Positive b) Negative c) Zero d) None
9. If the covariance between two variables is 20 and the variance of one of the variables
is 16, what would be the variance of the other variable?
a) More than 100 b) More than 10
c) Less than 10 d) More than 1.25
10. If the relationship between two variables x and y is given by 2x+3y+4=0, then the
value of the correlation coefficient x and y is
a) 0 b) 1 c) -1 d) Negative
12. Coefficient of correlation between x and y for 20 items is 0.4. The AM’s and the SD’s
of x and y are known to be 12,15, 3 and 4 respectively. Later on, it was found that
the pair (20, 15) was wrongly taken as (15, 20). Find the correct value of correlation
coefficient.
a) 0.28 b) 0.31 c) 0.53 d) 0.47
13. What is the value of correlation coefficient Karl Pearson on the basis of the following
data:
X -4 -3 -2 -1 0 1 2 3 4
Y 18 11 6 3 2 3 6 11 18
a) 1 b) -1 c) 0 d) –0.5
14. If u + 5x = 6 and 3y – 7v =20 and the correlation coefficient between x and y is 0.58
then what would be the correlation between u and v?
a) 0.58 b) –0.58 c) -0.84 d) 0.84
15. Great advantage of ______ is that it can be used to rank attributes which cannot be
expressed by way of numerical value
a) Concurrent correlation b) Regression
c) Rank Correlation d) None
237
CA FOUNDATION STATISTICS
16. For a group of 8 students, the sum of squares of difference in ranks for Maths and
Stats mark was found to be 50. What is the value of rank correlation coefficient?
a) 0.23 b) 0.40 c) 0.78 d) 0.92
17. If the rank correlation between marks in management and mathematics for a group
of students is 0.6 and the sum of square of the differences in ranks in 66, what is the
number of students in the group?
a) 10 b) 9 c) 8 d) 11
18. In method of concurrent deviations, only the directions of change (positive direction/
negative direction) in the variables are taken into account for calculation of
a) Coefficient of S.D b) Coefficient of regression
c) Coefficient of correlation d) None
19. For 10 pairs of observations, no of concurrent deviations was found to be 4. What is
the value of the coefficient of concurrent deviation?
a) b) c) d)
20. The Coefficient of concurrent deviation for p pairs of observation was found to be
. If the number of concurrent deviations was found to be 6, then the value of
p is
a) 10 b) 9 c) 8 d) None of these
Answer Key
1.
d 2.
b
4.
3.
a b
6.
5.
a b
8.
7.
a a
10.
9.
a c
12.
11.
b b
14.
13.
c b
16.
15.
c b
18.
17.
a c
20.
19.
d a
238
CA FOUNDATION STATISTICS
EXPLANATORY
ANSWERS
1. For a bi-variate frequency distribution, there exists two marginal distribution, one
for variable x and another for variable y. Option D
3. Both variables are moving in the same direction and association is high to high and
low to low. The two variables are positively correlated. Option A
4. A small value of r indicates a poor or low relationship between the two variables.
Option B
7. If the points are evenly distributed in a scatter diagram, the correlation is zero.
Option A
8. The correlation coefficient is +1, if the slope of the straight line is positive. Option A
239
CA FOUNDATION STATISTICS
Correct data:
∑x = 240 + 20 – 15 = 245
∑y = 300 + 15 – 20 = 295
∑x2 = 3060 + 202 – 152 = 3235
∑y2 = 4820 + 152 – 202 = 4645
∑xy = 3696 + (20*15) – (15*20) = 3696
X Y X2 Y2 XY
-4 18 16 324 -72
-3 11 9 121 -33
-2 6 4 36 -12
-1 3 1 9 -3
0 2 0 4 0
1 3 1 9 3
2 6 4 36 12
3 11 9 121 33
4 18 16 324 72
0 2 60 984 0
240
CA FOUNDATION STATISTICS
Since the values of x and y are equal in magnitude but opposite in sign.
Hence ΣXY = 0 and ΣX= 0.
∴ cov(x,y) =
=0–0
=0
∴R=0
14. The change of scale in x and y are in opposite direction. Thus correlation coefficient
between u and v = - 0.58. Option B
241
CA FOUNDATION STATISTICS
3. The difference between the observed value and the estimated value in regression
analysis is known as
a) Error b) Residue
c) Deviation d) (a) or (b)
4. If bxy and byx are the two regression coefficients, then – (Here ρ = r)
a) byx + bxy ≥ 2 r b)
c) d) None of these
242
CA FOUNDATION STATISTICS
a) b)
c) d)
10. The regression equation are 8x – 10y + = 60 and 40x – 18y = 214. Find the coefficient
of correlation
a) b)
c) d) -1
11. If the mean of two variables x & y are 3 and 1 respectively. Then the equation of two
regression lines are
a) 5x + 7y - 22 =0 & 6x + 2y - 20 =0
b) 5x + 7y - 22 =0 & 6x + 2y + 20 =0
c) 5x + 7y + 22 =0 & 6x + 2y - 20 =0
d) 5x + 7y + 22 =0 & 6x + 2y + 20 =0
12. if two regression lines are: Y = 4 + kx and x = 5 + 4y, then the range of k is –
a) k ≤ 0 b) k ≥ 0
c) 0 ≤ k ≤ 1 d) 0 ≤ 4k ≤ 1
13. For the variables x and y, the regression equations are given as 7x – 3y – 18 = 0 and
4x – y – 11=0.
If the variance of x is 9, find the SD of y.
a) 8.5642 b) 6.2453
c) 9.1647 d) 7.4789
243
CA FOUNDATION STATISTICS
14. If the regression line of y on x and that of x on y are given by y = -2x + 3 and
8x = -y + 3 respectively, what is the coefficient of correlation between x and y?
a) 0.5 b)
c) –0.5 d) None of these
a) b) c) d) 4
16. Find the two regression equation from the following data and estimate y when is
13, x when y is 15.
x 2 4 5 5 8 10
y 6 7 9 10 12 12
a) b)
c) Both d) None
18. The coefficient of correlation between two variables is 0.5, then the coefficient of
determination is
a) 0.5 b) 0.25
c) –0.5 d)
19. If the coefficient of correlation between two variables is 0.7 then the percentages of
variation unaccounted for is
a) 70% b) 30%
c) 51% d) 49%
244
CA FOUNDATION STATISTICS
20. Find the coefficient of correlation when its probable error is 0.2 and the number of
pairs of item is 9.
a) 0.505 b) 0.332
c) 0.414 d) 0.316
Answer Key
1.
a 2.
a
4.
3.
d b
6.
5.
b a
8.
7.
b a
10.
9.
a c
12.
11.
a d
14.
13.
c c
16.
15.
b b
18.
17.
c b
20.
19.
c d
245
CA FOUNDATION STATISTICS
EXPLANATORY
ANSWERS
3. The difference between the observed value and estimated value in regression
analysis is known as Error or Residue. Option D
5. The correlation coefficient (r), and the two regression coefficients must have the
same sign. Option B
7. The regression equations are based on the method of least squares. Option B
246
CA FOUNDATION STATISTICS
11. Mean of x & y is 3 and 1 respectively. For x = 3 and y = 1, the values fits in
5x + 7y – 22 = 0 and 6x + 2y – 20 = 0. Option A
12. y = 4 + kx
byx = k
x = 5 + 4y
bxy = 4
r2 = 4k
0 ≤ r2 ≤ 1
0 ≤ 4k ≤ 1
Option D
13. 7x – 3y = 18
byx = 7/3
4x – y = 11
bxy = ¼
r2 = 7/3 * ¼ = 7/12
byx2 = r2 * (Vy/Vx)
49/9 = 7/12 (Vy/9)
Vy = 84
SDy = √84 = 9.1647
Option C
247
CA FOUNDATION STATISTICS
X Y X2 Y2 XY
2 6 4 36 12
4 7 16 49 28
5 9 25 81 45
5 10 25 100 50
8 12 64 144 96
10 12 100 144 120
34 56 234 554 351
X = 34/6 = 5.67
Y = 56/6 = 9.33
Cov(x,y) = 351/6 – (34/6)(56/6) = 5.611
V(x) = 234/6 – (34/6)2 = 6.88
V(y) = 554/6 – (56/6)2 = 5.22
bxy = 5.611/5.22 = 1.07
byx = 5.611/6.88 = 0.81
Y, when X = 13
(Y – 9.33) = 0.81(13 – 5.67)
Y = 15.2673
X, when Y = 15
(X – 5.67) = 1.07(15 – 9.33)
X = 11.73
248
CA FOUNDATION STATISTICS
0.2 = 0.6745
0.2 = 0.6745
1 – r2 = 0.8895
r2 = 1 – 0.8895
r2 = 0.1104
r = 0.332
Option B
249
CA FOUNDATION STATISTICS
INDEX NUMBERS
Basic Concepts
• Index Numbers are special kind of averages, expressed in ratio, calculated as
percentage and used as numbers.
• Index number is a number which is used as a tool for comparing prices and quantities
of a particular commodity or a group of commodities in a particular time period
with respect to other time period or periods.
• The year in which the comparison is made is called the “Current Year” and the year
with respect to which the comparison is made is the “Base Year”.
250
CA FOUNDATION STATISTICS
Method of Aggregates
Case: 1 Case: 2
Simple Aggregate of prices Weighted Aggregate of prices
251
CA FOUNDATION STATISTICS
If w = Q0 + Q1
Marshall-Edgeworth Index
Relative Method
First calculate Price Relative (PR) of each commodity. Price Relative (PR) is defined as
the ratio of the current year’s price to the base year’s price, expressed as percentage
and is given by
Case: 1 Case: 2
Simple AM of Price Relative Weighted AM of Price Relative
Note :
• GM is the best average in the construction of index numbers but practically we use
AM, because G.M is difficult to compute.
• When a series of Index Numbers for different years are expressed in a tabular form to
compare the changes in different years, then this tabular representation of numbers
is known as “Index Time Series”.
it will be
252
CA FOUNDATION STATISTICS
• CLI is defined as the weighted AM of index numbers of few groups of basic necessities.
Generally for calculating CLI; food, clothing, house rent, fuel & lightning and miscellaneous
groups are taken into consideration.
o Increase in CLI implies increase in price index causing thereby an inflation i.e.
reduction in the purchasing power.
o Purchasing Power of 1 =
o Real Income =
Let the money income in Y2 is X2. If X2 is less than or equal to X1, then no allowances
are required to be given. But if X2 is greater than X1, then amount of Dearness
Allowances = (X2 – X1)
253
CA FOUNDATION STATISTICS
• Index Number for any year (with base year shifted) is given by:
• Time Reversal Test (TRT) – According to this test I01 x I10 = 1 (ignore 100). This test is
satisfied by:
o Simple Aggregate of Prices
o Weighted GM of Price Relative
o Marshall Edgeworth Index
o Fisher’s Ideal Index
• Factor Reversal Test (FRT) – According to this test Price Index x Quantity Index = Value
Index. Only Fisher’s Ideal Index satisfies this test.
• Circular Test – Circular Test is an extension of Time Reversal Test. According to this
test I01 x I12 x I23 x .... x I(n-1), n x In,0 = 1. This test is satisfied by:
o Simple Aggregate of Prices (ie. Weighted Aggregate of Prices with Fixed Weights)
o Simple GM of Price Relatives
• Under Chain Base Method (CBM), all the index numbers are calculated with respect
to the price of immediate preceding period.
• Under CBM, the index number for the first year will always be 100.
• For the first year, Chain Base Index = Fixed Base Index.
254
CA FOUNDATION STATISTICS
o Link Relative =
Note: Always start with one year preceding to the given years from which you are to
calculate the chain index numbers. In that year (i.e. the preceding year) take both the link
relative and the chain index to be 100.
• The factor which is multiplied for such conversion is called “Conversion Multiplier”.
• Let there are two series Y1 and Y2. When the series Y1 is merged into the series Y2,
it is known as “Forward Splicing” and when series Y2 is merged into series Y1, it is
known as “Backward Splicing”.
255
CA FOUNDATION STATISTICS
CLASSWORK SECTION
(i) Find out price relatives : [100, 116.67, 116.67, 133.33, 166.67, 233.33, 200,
216.67] Taking 1998 as base
(ii) Also calculate the link relatives. [100, 116.67, 100, 114.29, 125, 140, 85.71,
108.33]
3. From the following table by the method of relatives using Arithmetic mean the price
Index number is
256
CA FOUNDATION STATISTICS
4. Find the Index number by the method of Relatives (using Arithmetic Mean) from the
following data:
5. If the ratio between Laspeyre’s index number and Paasche’s Index number is 28 : 27.
Then the missing figure in the following table P is :
The general Index number is
(a) 270 (b) 269.2 (c) 268.5 (d) 272.5
257
CA FOUNDATION STATISTICS
7. Given below are the data on prices of some consumer goods and the weights
attached to the various items Compute price index number for the year 1985 (Base
1984 = 100)
258
CA FOUNDATION STATISTICS
13. Bowley’s index number is 150. Fisher’s index number is 149.95. Paasche’s index number is
(a) 158 (b) 154 (c) 148 (d) 156
14. In calculating a certain cost of living index number the following weights were used.
Food 15, Clothing 3, Rent 4, Fuel & Light 2, Miscellaneous 1. Calculate the index for
the data when the average percentages rise in prices of items in the various groups
over the base period were 32, 54, 47, 78 & 58 respectively.
a) 139.76 b) 141.99 c) 141.76 d) 139.87
15. Calculate the Cost of Living Index number from the following data:
Items Price Weights
Base Year Current Year
Food 30 47 4
Fuel 8 12 1
Clothing 14 18 3
House Rent 22 15 2
Miscellaneous 25 30 1
259
CA FOUNDATION STATISTICS
BASE SHIFTING
18. Shift the base period of the following series of index numbers from 1978 to 1985:
Year 1982 1983 1984 1985 1986 1987 1988
Index No. [Base 120 125 132 140 150 158 175
1978 =100]
a) 85.71, 89.29, 100, 94.29, 107.14, 112.86, 125
b) 85.71, 89.29, 94.29, 100, 107.14, 112.86, 125
c) 85.71, 89.29, 101.98, 94.29, 107.14, 112.86, 125
d) 85, 89, 94, 100, 107, 112, 125
Ans.
CBIN: 100 104.24 104.08 93.14 103.16 102.04
20. From the chain base index number given below, prepare fixed base index numbers.
Year 2000 2001 2002 2003 2004
CBI 80 110 120 90 140
Ans.
FBIN 80 88 105.6 95.04 113.06
(Base 1992 = 100) for the years 1993–97. The construction of chain index is :
(a) 103, 100.94, 107, 118.72 (b) 103, 108.15, 121.13, 130.82
(c) 107, 100.25, 104, 118.72 (d) None of these.
260
CA FOUNDATION STATISTICS
SPLICING
22. The price relative for the year 2000 with base 1995 = 100 is 135. The price relative
for the year 1995 with base 1990 = 100 is 120. Find the price relative for 2000 with
base 1990 = 100.
(a) 100 (b) 83.33 (c) 162 (d) 135
Read the following data and answer the questions that follow:
23. When PI A is spliced with PI B, the spliced index for the year 2000 is:
a) 83.33 b) 93.33 c) 103.33 d) 113.33
24. When PI A is spliced with PI B, the spliced index for the year 2002 is:
a) 83.33 b) 93.33 c) 103.33 d) 95.83
25. When PI B is spliced with PI A, the spliced index for the year 2002 is:
a) 100 b) 110 c) 115 d) 120
26. When PI B is spliced with PI A, the spliced index for the year 2004 is:
a) 100 b) 125 c) 135 d) 150
27. Net monthly salary of an employee was ` 3000 in 1980. The consumer price index
number in 1985 is 250 with 1980 as base year. If he has to be rightly compensated
then, dearness allowances to be paid to the employee is :
(a) ` 4.800.00 (b) ` 4,700.00 (c) ` 4,500.0 (d) None of these.
261
CA FOUNDATION STATISTICS
28. Net Monthly income of an employee was ` 800 in 1980. The consumer price Index
number was 160 in 1980. It is rises to 200 in 1984. If he has to be rightly compensated.
The additional dearness allowance to be paid to the employee is :
(a) ` 240 (b) ` 275 (c) ` 250 (d) 200
REAL INCOME
29. During a certain period the cost of living index number goes up from 110 to 200 and
the salary of a worker is also raised from ` 325 to ` 500. The worker does not get
really gain. Then the real wages decreased by :
(a) ` 45.45 (b) ` 43.25 (c) ` 100 (d) 90.91
MISCELLANEOUS SUMS
30. In 1996 the average price of a commodity was 20% more than in 1995 but 20% less
than in 1994; and more over it was 50% more than in 1997 to price relatives using
1995 as base (1995 price relative 100) Reduce the data is :
(a) 150, 100, 120, 80 for (1994–97)
(b) 135, 100, 125, 87 for (1994–97)
(c) 140, 100, 120, 80 for (1994–97)
(d) None of these.
31. The price of a commodity increases from ` 5 per unit in 1990 to ` 7.50 per unit
in 1995 and the quantity consumed decreases from 120 units in 1990 to 90 units
in 1995. The price and quantity in 1995 are 150% and 75% respectively of the
corresponding price and quantity in 1990. Therefore, the product of the price ratio
and quantity ratio is :
(a) 1.8 (b) 1.125 (c) 1.75 (d) None of these.
32. When the cost of Tobacco was increased by 50%, a certain hardened smoker, who
maintained his formal scale of consumption, said that the rise had increased his
cost of living by 5%. Before the change in price, the percentage of his cost of living
was due to buying Tobacco is
(a) 15% (b) 8% (c) 10% (d) None of these.
262
CA FOUNDATION STATISTICS
33. If the price index for the year, say 1960 be 110.3 and the price index for the year, say
1950 be 98.4, then the purchasing power of money (Rupees) of 1950 will in 1960 is
(a) ` 1.12 (b) ` 1.25 (c) ` 1.37 (d) None of these.
34. The consumer price Index for April 1985 was 125. The food price index was 120 and
other items index was 135. The percentage of the total weight for the food index is:
(a) 66.67 (b) 68.28 (c) 90.25 (d) None of these.
35. The total value of retained imports into India in 1960 was ` 71.5 million per month.
The corresponding total for 1967 was ` 87.6 million per month. The index of volume
of retained imports in 1967 compared with 1960 (= 100) was 62.0. The price index
for retained imports for 1967 with 1960 as base is
(a) 198.61 (b) 197.61 (c) 198.25 (d) None of these.
THEORETICAL ASPECTS
36. A series of numerical figures which show the relative position is called
a) index number b) relative number
c) absolute number d) none
38. _________ play a very important part in the construction of index numbers.
a) weights b) classes c) estimations d) none
40. Index numbers show _________ changes rather than absolute amounts of change.
a) relative b) percentage c) both d) none
263
CA FOUNDATION STATISTICS
51. The value at the base time period serves as the standard point of comparison
a) false b) true c) both d) none
264
CA FOUNDATION STATISTICS
58. When the product of price index and the quantity index is equal to the corresponding
value index then the test that holds is
(a) Unit Test (b) Time Reversal Test
(c) Factor Reversal Test (d) none holds
265
CA FOUNDATION STATISTICS
59. The formula should be independent of the unit in which or for which price and
quantities are quoted in
(a) Unit Test (b) Time Reversal Test
(c) Factor Reversal Test (d) none
64. Fisher’s Ideal Formula for calculating index numbers satisfies the _______ tests
(a) Unit Test (b) Factor Reversal Test
(c) both (d) none
266
CA FOUNDATION STATISTICS
69. Theoretically, G.M. is the best average in the construction of index numbers but in
practice, mostly the A.M. is used
(a) false (b) true (c) both (d) none
71. ___________ is concerned with the measurement of price changes over a period of
years, when it is desirable to shift the base
(a) Unit Test (b) Circular Test
(c) Time Reversal Test (d) (b) and (c) both
267
CA FOUNDATION STATISTICS
77. If the prices of all commodities in a place have increased to 1.25 times in comparison
to the base period, the index number of prices of that place now is
(a) 125 (b) 150 (c) 225 (d) None of these.
78. If the index number of prices at a place in 1994 is 250 with 1984 as base year, then
the prices have increased on average by
(a) 250% (b) 150% (c) 350% (d) None of these.
79. If the prices of all commodities in a place have decreased by 35% over the base
period prices, then the index number of prices of that place is now
(a) 35 (b) 135 (c) 65 (d) None of these.
80. Cost of Living Index (C.L.I.) numbers are also used to find real wages by the process
of
(a) Deflating of Index number. (b) Splicing of Index number.
(c) Base shifting. (d) None of these.
268
CA FOUNDATION STATISTICS
HOMEWORK SECTION
4. In the data group Bowley’s index number = 150 and Laspeyre’s index number = 180.
What is the Paasche’s index number?
a) 120 b) 30 c) 165 d) None of the above
5. If Laspeyre’s index number is 90 and Paasche’s index number is 160, then what is
the Fisher’s index number?
a) 125 b) 120 c) 144 d) None of the above
6. If Fisher’s index = 150 and Paasche’s index = 144, then what is the value of Laspeyre’s
index?
a) 138.25 b) 156.25 c) 104.25 d) 147.77
269
CA FOUNDATION STATISTICS
10. If the price of a commodity in a pace have decreased by 30% over the base period
prices, then the index number of that place is:
a) 130 b) 70 c) 90 d) 170
11. Calculate Fisher’s Index from the following data: ∑P0Q0 = 116, ∑P0Q1 = 140, ∑P1Q0 =
97 and ∑P1Q1 = 117.
a) 119.56 b) 83.59 c) 120 d) 110
16. The suitable index number for the comparison of changes in price level of every year
is:
a) Fixed base index number b) Fisher’s ideal index number
c) Chain base index number d) None of the above
270
CA FOUNDATION STATISTICS
19. The index number of prices at a place in the year 2018 is 225 with 2014 as the base
price, then there is:
a) Average 100% increase in prices b) Average 225% increase in prices
c) Average 125% increase in prices d) None of the above
22. Cost of living index numbers are also used to find real wages by the process of:
a) Base shifting b) Deflating of index numbers
c) Splicing of index numbers d) None of the above
23. The ratio of price of the single commodity in a given period to its price in another
period is called:
a) Price ratio b) Price relative
c) Period price d) (a) and (b) both
271
CA FOUNDATION STATISTICS
30. Compute the consumer price index for 2016 on the basis of 2015 from the following
data:
272
CA FOUNDATION STATISTICS
31. Suppose Mr. M was earning Rs. 20,500 in the base period, what should be his salary
in the current period if his standard of living is to remain the same? Given ∑W = 25
and ∑IW = 3544.
a) 20960 b) 29060 c) 23060 d) 21060
32. Net monthly salary of Mr. J was Rs. 300,000 in 2000. The consumer price index
number in 2015 is 250 with 2000 as base year. If he has to be rightly compensated,
then the Dearness Allowance to be paid to him in 2015 is:
a) Rs. 400,000 b) Rs. 450,000 c) Rs. 750,000 d) Rs. 300,000
33. In 2014 for working class people wheat was selling at an average price of Rs. 16
per 20 Kg, cloth at Rs. 2 per meter, house rent Rs. 30 per house and other items at
Rs. 10 per unit. By 2015 cost of wheat rose by Rs. 4 per 20 Kg, house rent by Rs. 15
per house and other items doubled in price. The working class cost of living index
for the year 2015 (with 2014 as base) was 160. By how much did cloth rose in price
during the period?
a) 0.99 b) 1.28 c) 1.30 d) 1.73
34. Calculate the Fisher’s Ideal Index from the following data:
Group A B C D E F
Group Index 120 132 98 115 108 95
Weight 6 3 4 2 1 4
36. The prices of commodity in the year 2005 and 2010 were 25 and 30 respectively.
Taking 2010 as the base year, what is the price relative?
a) 83.33 b) 113.25 c) 93.33 d) 87.77
273
CA FOUNDATION STATISTICS
37. Calculate the Marshall – Edgeworth Index from the following data:
38. An enquiry into the budgets of middle class families in a village gave the following
information:
Calculate the price index number based on Weighted Arithmetic Mean of Price
Relatives.
a) 111.015 b) 101.015 c) 101.970 d) 98.52
39. Given the following information, which of the statements given is TRUE?
274
CA FOUNDATION STATISTICS
40. Monthly salary of Ms. G was Rs. 100,000 in the year 2010 and it was increased to
Rs. 200,000 in the year 2019 while the consumer price index is 240 in year 2019
with the base year 2010. What should be her salary in comparison of consumer
price index in the year 2019?
a) Rs. 250,000 b) Rs. 240,000
c) Rs. 280,000 d) Rs. 300,000
41. If with an increase of 10% in prices, the rise in wages is 20%, then the real wage has
increased by:
a) 20% b) Less than 20%
c) More than 20% d) Less than 10%
42. Consumer price index number for the year 1977 was 313 with 1960 as the base year
and was 100 for the year 1960. The average monthly wages in 1977 of the workers
of a factory be Rs. 160, their real wages is:
a) Rs. 51.12 b) Rs. 52.11 c) Rs. 15.21 d) Rs. 25.21
43. If the 1970 index with base 1965 is 200 and 1965 index with base 1960 is 150, what
is the index for 1970 on base of 1960?
a) 300 b) 500 c) 600 d) 700
44. Find by the Arithmetic mean method, Index Numbers from the following data:
Commodity A B C D E
Base Price 30 25 90 15 96
Current Price 36 28 108 21 120
275
CA FOUNDATION STATISTICS
45. Calculate the Cost of Living Index number from the following data:
48. Compute price index for the following data by applying weighted average of price
relative method (Arithmetic Mean Method).
276
CA FOUNDATION STATISTICS
49. For the following data (base 2002 = 100), construct the chain index for the years
2003 to 2006:
Year 2002 2003 2004 2005 2006
Link Index 100 103 105 112 108
a) 103, 108, 120, 135 b) 103, 108.15, 121.13, 130.82
c) 103, 108.15, 125.5, 131.82 d) None of the above
ANSWER KEY
Q. No. Ans Q. No. Ans Q. No. Ans Q. No. Ans Q. No. Ans
1 d 11 b 21 d 31 b 41 d
2 a 12 c 22 b 32 b 42 a
3 b 13 c 23 b 33 c 43 a
4 a 14 a 24 a 34 a 44 d
5 b 15 c 25 c 35 b 45 c
6 b 16 c 26 b 36 a 46 b
7 a 17 a 27 b 37 a 47 b
8 b 18 c 28 d 38 b 48 b
9 c 19 c 29 c 39 c 49 b
10 b 20 c 30 c 40 b 50 b
277
CA FOUNDATION STATISTICS
HOMEWORK SOLUTION
1. Index number for the base period is always taken as 100. Option d
4. 2B = L + P.
P = 2B – L = 2(150) – 180 = 300 – 180 = 120. Option a
5. F2 = L x P = 90 x 160 = 32 x 42 x 102
F = 3 x 4 x 10 = 120. Option b
6. F2 = L x P
L = F2 / P = (150)2 / 144 = 22500/144 = 156.25
Option b
7. The purpose always determines the type of index number to be used. Option a
9. If prices are increased by 20%, the price index number = 100 + 20% of 100 = 120.
Option c
10. If prices have decreased by 30%, the price index = 100 – 30% of 100 = 100 – 30 =
70. Option b
278
CA FOUNDATION STATISTICS
13. For calculating the CPI, prices are generally collected from the retail traders. Option c
14. Splicing means constructing one continuous series from two or more indices.
Option a
15. Purchasing power of money is the reciprocal of the price index number. Option c
16. Chain based index number is suitable for the comparison of changes in price level
of every year over the previous year. Option c
17. Consumer price index is generally known as the Cost of Living Index. Option a
18. Wholesale Price Index is price index with base period quantity as weights. Thus
Laspeyre’s Index is the best fit. Option c
20. In construction of Paasche’s Price Index, current year’s quantities are used as weights.
Option c
21. In construction of Laspeyre’s Price Index, base year’s quantities are used as weights.
Option d
22. The process used to find the real wages using CLI is known as deflating. Option b
25. The simple GM of PR and the weighted aggregative with fixed weights follows the
circular test. It is not followed by either Laspeyre’s or Paasche’s index. Option c
26. Bowley’s index is the AM between Laspeyre’s and Paasche’s index. Option b
27. Circular Test is the extension of the Time Reversal Test. Option b
279
CA FOUNDATION STATISTICS
28. Both Laspeyre’s and Paasche’s index numbers do not satisfy either Factor Reversal
Test neither Time Reversal Test. But both satisfies the Unit Test. Option d
CPI
174
= x 100 = 118.77
146.5
280
CA FOUNDATION STATISTICS
35. ∑IW = 120*6 + 132*3 + 98*4 + 115*2 + 108*1 + 95*4 = 720 + 396 + 392 + 230 + 108
+ 380 = 2226
∑W = 6 + 3 + 4 +2 + 1 + 4 = 20
General Index = 2226 / 20 = 111.30
Option b
281
CA FOUNDATION STATISTICS
282
CA FOUNDATION STATISTICS
CPI
182.5
= x 100 = 117.74
155
49. Calculation of Chain Index
2002 = 100
2003 = 100 * 103 / 100 = 103
2004 = 103 * 105 / 100 = 108.15
2005 = 108.15 * 112 / 100 = 121.13
2006 = 121.13 * 108 / 100 = 130.82
Option b
283
CA FOUNDATION STATISTICS
a) 116.48 b) 118.35
c) 115.43 d) 117.30
2. The index number for the year 2012 taking 2011 as the base year from the data
given below by using simple average of price relative method is:
Commodity A B C D E
Price in 2011 115 108 95 80 90
Price in 2012 125 117 108 95 95
a) 111 b) 112
c) 117 d) 120
4. If all the values are of equal importance, the index numbers are called:
a) Weighted index number
b) Un-weighted index numbers
c) Composite index numbers
d) Value index numbers
284
CA FOUNDATION STATISTICS
13. Which of the following index number covers all those index numbers which measures
the changes in total money worth during some period compared with other period?
a) Quantity Index Numbers b) Value Index Numbers
c) Price Index Numbers d) Volume Index Numbers
285
CA FOUNDATION STATISTICS
15. An index increases from 105 to 110. The change in the index is:
a) 4.75 points b) 5 points
c) 110 points d) None of the above
16. It has been decided to change the base of an index from 2008 to 2015. If the index
in 2015 is 148, what would the index in 2008 become after the change?
a) 100 b) 32.4
c) 67.6 d) 148
17. The turnover by a company in 2012 was Rs. 54.5 M and in 2014 it was Rs. 75 M.
If the CPI has increased from 123 in 2012 to 128 in 2014, the real change in the
turnover has been:
a) Rs. 16 M b) Rs. 17.6 M
c) Rs. 20.5 M d) Rs. 18.1 M
18. If the price index for bananas was 100 in 2015 and that for apples was 110, then
the price of a “Rupee” of bananas was:
a) Less than for apples b) More than for apples
c) Impossible to compare with apples d) None of the above
19. From the following data calculate the Fisher’s Quantity Index:
Commodities Q0 P0 P1 Q1
A 2 2 18 6
B 5 5 2 2
C 7 7 24 4
a) 85.24 b) 37.87
c) 78.93 d) 24.85
286
CA FOUNDATION STATISTICS
20. The total value of retained imports into a country X in 1990 was 71.5 M units per
month. The corresponding total for 1997 was 87.6 M units per month. The index of
volume of retained imports in 1997 compared with 1990 was 62. The price index for
retained inputs for 1997 over 1990 as base is:
a) 196.71 b) 197.61
c) 191.76 d) 197.11
21. The price index number using simple GM of the n relatives is given by:
a) Log Ion = 1 – (1/n)∑Log(Pn/P0)
b) Log Ion = (1/n)∑Log(Pn/P0)
c) Log Ion = (1/2n)∑Log(Pn/P0)
d) Log Ion = (1/2n)∑Log(Pn/P0)
23. From the following data calculate the Paasche’s Price Index:
Commodity A B C D
Price 2012 3 5 4 1
Quantity 2012 18 6 20 14
Price 2013 4 5 6 3
Quantity 2013 15 9 26 15
a) 144.25 b) 148.25
c) 146.41 d) None of the above
ANSWER KEY
Q. No. Ans Q. No. Ans Q. No. Ans Q. No. Ans Q. No. Ans
1 d 6 d 11 c 16 c 21 b
2 a 7 c 12 b 17 b 22 d
3 d 8 b 13 b 18 c 23 c
4 b 9 d 14 b 19 c 24 d
5 d 10 b 15 b 20 b
288
CA FOUNDATION STATISTICS
EXPLANATORY
ANSWERS
4. If all values are of equal importance, then all weights can be reduced to 1, and
hence the index numbers are called Un-weighted index numbers. Option B
289
CA FOUNDATION STATISTICS
6. When the prices of a commodity are to be compared, we use Price Index. Option D
10. False. When weights are of equal importance, it is known as un-weighted index
numbers. Option B
11. Index numbers are generally (often) constructed from the samples. Option C
13. Total money means price x quantity. Value index numbers measures the changes
in total money worth during some period compared with other period. Option B
14. The base period under study should be a normal period. Option B
15. When index increases from 105 to 110, there is change of (110 – 105) = 5 points.
Option B
290
CA FOUNDATION STATISTICS
291
CA FOUNDATION STATISTICS
292
CA FOUNDATION STATISTICS
a) 117.30 b) 115.43
c) 116.48 d) 118.35
5. _______ measure the general changes in the price level from one period to another.
a) Value Index Numbers b) Quantity Index Numbers
c) Volume Index Numbers d) Price Index Numbers
6. During the certain period the CLI goes up from 110 to 200 and the Salary of Ms. G
is also raised from 330 to 500, then in the real terms:
a) Loss by Rs. 90 b) Loss by Rs. 55
c) Loss by Rs. 75 d) Loss by Rs. 100
293
CA FOUNDATION STATISTICS
a) 224.50 b) 223.91
c) 225.32 d) None of the above
8. From the following data calculate the Marshall Edgeworth Index number:
Commodity A B C D
Base Price 7 6 11 4
Base Quantity 17 23 14 10
Current Price 13 7 13 6
Current Quantity 25 25 15 8
a) 148.25 b) 147.25
c) 140.49 d) None of the above
9. The index number in whole sale prices is 152 for July 2009 compared to July 2008.
During the year there is net increase in prices of whole sale commodities to the
extent of:
a) 45% b) 25%
c) 152% d) 52%
294
CA FOUNDATION STATISTICS
13. How many methods are used for the construction of index numbers?
a) 5 b) 2
c) 4 d) 3
16. The ratio of price of single commodity in a given period to its price in another period
is called the:
a) Relative price b) Base price
c) Price ratio d) (a) and (c) both
17. Which of the following is a statistical measure used to compare the average level
of magnitude of a group of distinct but related items in two or more situations?
a) Relative Numbers b) Index Numbers
c) Time Series Numbers d) Absolute Numbers
18. The CPI in July 2012 was 124.4 and by July 2014 it was 128.2. This represents a rise
in the price of goods by:
a) 3.38% b) 3.05%
c) 2.96% d) 3.80%
295
CA FOUNDATION STATISTICS
19. The CPI and RPI for July 2010 are 112.4 and 217.9 respectively. In July 2011 these
two indices have become 116.9 and 229.0. The difference in the rate of inflation
given by these two indices are:
a) 4.00%
b) 5.10%
c) 1.10%
d) Can’t compare as they are different indices
20. For constructing consumer price index, which of the following methods is used?
a) Marshall – Edgeworth Method b) Laspeyre’s Method
c) Bowley’s Method d) Paasche’s Method
22. The index number of prices at a place in the year 2018 is 325 with 2004 as the base
price, then there is:
a) Average 100% increase in prices b) Average 225% increase in prices
c) Average 325% increase in prices d) None of the above
25. The ________ Mean of group indices gives the General Index.
a) Geometric b) Arithmetic
c) Harmonic d) None of the above
296
CA FOUNDATION STATISTICS
ANSWER KEY
Q. No. Ans Q. No. Ans Q. No. Ans Q. No. Ans Q. No. Ans
1 a 6 b 11 c 16 d 21 a
2 d 7 d 12 a 17 b 22 b
3 b 8 c 13 b 18 b 23 a
4 c 9 d 14 d 19 c 24 b
5 d 10 b 15 b 20 b 25 b
297
CA FOUNDATION STATISTICS
EXPLANATORY
ANSWERS
2 14 2 18 7 9 18 14
3 6 5 5 2 1 3 10
52 59 63 52
5. Price index numbers measure the general changes in the price level from one period
to another. Option D
298
CA FOUNDATION STATISTICS
12. The value index is the total sum of all values of a given year divided by the sum of
the values of the base year. Option A
13. Method of aggregates or relative method is used for the construction of index
numbers. Option B
14. In chain base method, the base period keep on changing. It is not fixed. Option D
17. Index number is a statistical measure used to compare the average level of
magnitude of a group of distinct but related items in two or more situations.
Option B
18. (128.2 – 124.4) = 3.8. Rise in price by 3.8/124.4 * 100 = 3.05% Option B
299
CA FOUNDATION STATISTICS
20. For constructing consumer price index, Laspeyre’s method is used. Option B
21. Option A
22. (325 – 100) = 225. There is average 225% increase in prices. Option B
23. An index number is called a simple index when it is computed from single
variable. Option A
25. The arithmetic mean of group indices gives the general index. Option B
300
CA FOUNDATION STATISTICS
Probability
Subjective Objective
It is influenced by personal belief, bias, attitude, etc and this is used in decision
making management.
Definitions
e) Mutually Exclusive Events : Events are said to be mutually exclusive when the
occurrence of any one event excludes the occurrence of other or otherwise e.g. if a
coin is tossed occurrence of head and tail are mutually exclusive events because of
head will automatically exclude the occurrence of tail or vice versa.
f) Equally likely events: Events are said to be equality likely when they are equi-
probable i.e. the event should occur with same chance of occurrence (None can be
preferred over the other).
301
CA FOUNDATION STATISTICS
g) Exhaustive events: The events are said to be exhaustive when they include all
possible outcomes. Events will necessarily occur.
Note1:
a) Probability as defined above will always lie between 0 and 1, both inclusive i.e.,
0 ≤ P(A) ≤ 1 and P(A) ≥ 0.
Note2:
Complementary Probability
Let P(A) be the Probability of occurrence of event A.
Then = Probability of non-occurrence of event A.
Note3:
a) , which implies that A and Ac are collectively exhaustive.
302
CA FOUNDATION STATISTICS
3. Axiomatic definition
It is totally dependent on set theory
(i) P(A) ≥ 0 for all A S
(ii) P(S) = 1
(iii) If A & B are mutually exclusive events P(A B) = 0
P(A B) = P(A) + P(B).
Examples:
a) 2 coins are tossed. Total outcomes = 22 = 4
b) A coin is tossed five times. Total outcomes = 25 = 32
c) 2 dice are rolled together. Total outcomes = 62 = 36
• At least
Let x = 0, 1, 2, 3, ... ..., n
Then, x is at least k, implies x ≥ k, which implies that x = k, (k+1), (k+2), ... ... n
303
CA FOUNDATION STATISTICS
• At most
x is at most k implies x ≤ k, which means x = 0, 1, 2, ..., k
• At least One
x is at least one implies that x ≥ 1, i.e., x = 1, 2, 3, ... .., n
Hence, P(at least 1) = 1 – P(none) = 1 – P(0)
Observations:
A. Sum of faces on two Dice and the no. of ways of getting sum
Sum 2 3 4 5 6 7 8 9 10 11 12
No. of ways 1 2 3 4 5 6 5 4 3 2 1
304
CA FOUNDATION STATISTICS
• Leap Year
A leap year contains 52 weeks and 2 extra days. These two extra days can be
either of the following out-comes:
(M, T) (T, W) (W, Th) (Th, F) (F, Sat) (Sat, Sun) (Sun, M)
Partitioning of events
305
CA FOUNDATION STATISTICS
1. A and B (A B) or AB
2. A and not B A but not B A BC A - (A B)
3. B but not A B and not A B AC B - (A B)
4. Neither A nor B A “not” and B “not” AC BC
5. AC = (3) + (4)
6. BC = (1) + (4)
7. AC BC = (1) + (3)+ (4) = [2]C = (A B)C
8. AC BC = [4] =[1 + 2+ 3]C = (A B)C
Proof of P (A B):
Hence proved
Note 1:
For 3 events, A, B and C, not mutually exclusive, +
Note 2:
When A and B are mutually exclusive, the two sets are disjoint and accordingly
and
Note 3:
When 3 events A, B and C are mutually exclusive then
=0 and accordingly
306
CA FOUNDATION STATISTICS
Note 4:
When 3 events A, B and C are mutually exclusive and collectively exhaustive then,
Note 5:
Working Rules:
i. P(A BC) = P(A) – P(A B)
ii. P(AC B) = P(B) – P(A B)
iii. P(AC U BC) = P(A B)C = 1 – P(A B)
iv. P(AC BC) = P(A U B)C = 1 – P (A U B)
v. P(AC U B) = P(AC) + P(B) – P(AC B)
vi. P(A U BC) = P(A) + P(BC) – P (A BC)
307
CA FOUNDATION STATISTICS
Statement:
If A and B are two events, not mutually independent, then the probability of joint
or simultaneous occurrence of the two events A and B would be given by the
product of the probability of event A and the conditional probability of event B
assuming that, A has already occurred,
308
CA FOUNDATION STATISTICS
Note 2:
When the events A and B are independent, in such a case
Note 3:
a. When the events A and B are independent, then,
Hence, proved
Hence, proved
Note 4:
For three events, A, B and C which are not independent,
Note 5:
When 3 events, A and B and C are independent,
309
CA FOUNDATION STATISTICS
Note 6:
Two events A and B are,
i. Mutually exclusive, if
ii. Independent, if
iii. Equally likely, if
iv. Exhaustive, if
v. Mutually exclusive and exhaustive e, if
Note 7:
Two events with non-zero probability cannot be simultaneously mutually
exclusive and independent.
Note 8:
If two events A and B are independent, then
i. AC and BC are independent
ii. A and BC are independent
iii. AC and B are independent
Note 9:
If are n events, then the number of conditions to be satisfied for
proving their mutual independence are
310
CA FOUNDATION STATISTICS
CLASSWORK SECTION
Tossing of Coins
1. All heads.
a) 1/8 b) 6/8 c) 7/8 d) 4/8
2. All tails.
a) 1/8 b) 6/8 c) 7/8 d) 4/8
3. No Head.
a) 1/8 b) 6/8 c) 7/8 d) 4/8
4. No tail.
a) 1/8 b) 6/8 c) 7/8 d) 4/8
311
CA FOUNDATION STATISTICS
A card is drawn from a well-shuffled pack of 52 cards. Find the probability that:-
18. It is red.
a) ¼ b) ½ c) 1/5 d) 33/56
19. It is an Ace.
a) 2/13 b) 3/13 c) 1/13 d) 4/13
20. It is a spade.
a) 2/8 b) 6/8 c) 1/8 d) 4/8
312
CA FOUNDATION STATISTICS
Rolling of Dice
313
CA FOUNDATION STATISTICS
Children in a Family
In a family of three children there is at least one girl. Find the probability that;
314
CA FOUNDATION STATISTICS
Leap Year
Find the probability that a leap year selected at random will contain;
46. 53 Sundays.
a) 3/7 b) 1/7 c) 4/7 d) 2/7
315
CA FOUNDATION STATISTICS
Addition Theorem
A number is selected at random from a set of first 120 natural numbers. What is the
probability that it is divisible by:
54. 5 or 6
a) 1/3 b) 1/4 c) 2/12 d) None of the above
55. 7 or 9
a) 31/120 b) 30/120 c) 29/120 d) None of the above
56. A constructive company is bidding for two contracts, A and B. The probability that
the company will get contract A is 3/5, the probability that the company will get
contract B is 1/3 and the probability that the company will get both the contracts
is 1/8. What is the probability that the company will get at least one contact.
a) 79/120 b) 97/120 c) 89/120 d) None of these
57. The probability that the contractor will get one electric contract is 3/5, and the
probability that the company will not get a plumbing contract is 7/10. If the
probability of getting at least one contract is 4/7, what is the probability that he
will get both the contracts.
a) 32/70 b) 65/70 c) 23/70 d) None of these
58. P(A B)
a) 3/20 b) 1/10 c) ¼ d) ½
60. P(Ac B)
a) 3/20 b) 1/10 c) ¼ d) ½
316
CA FOUNDATION STATISTICS
62. P(Ac B)
a) 9/10 b) 7/10 c) 3/10 d) None of the above
Multiplication Theorem
64. If P(A) = 2/3, P(B) = 3/4, P(A / B) = 2/3, then what is P(B / A)?
a) 1/3 b) 1/3 c) 3/4 d) None of the above
65. P(A/B)
a) 2/8 b) 5/8 c) 3/8 d) None of the above
66. P(B/A)
a) 2/5 b) 3/5 c) 1/5 d) None of the above
67. P(Ac/B)
a) 5/8 b) 8/5 c) 3/5 d) None of the above
68. P(Bc/A)
a) 1/5 b) 2/5 c) 3/5 d) None of the above
69. P(B/Ac)
a) 2/3 b) 1/3 c) 1 d) None of the above
70. P(Ac/Bc)
a) 4/10 b) 5/10 c) 6/10 d) None of the above
71. P(Bc/Ac)
a) 1/3 b) 2/3 c) 4/5 d) None of the above
317
CA FOUNDATION STATISTICS
Independent Events
72. For two independent events A and B, what is P(A + B), given P(A) = 3/5 and P(B) =
2/3?
a) 11/15 b) 13/15 c) 7/15 d) None of the above
73. If for two independent events A and B, P(A U B) = 2/3 and P(A) = 2/5, what is P(B)?
a) 4/15 b) 4/9 c) 5/9 d) 7/15
74. A speaks truth in 60% and B in 75% of the cases. In what percentage of cases are
they likely to contradict each other in stating the same fact
a) 0.45 b) 0.54 c) 0.55 d) 0.65
78. It is solved.
a) 1/60 b) 2/5 c) 3/5 d) 13/30
318
CA FOUNDATION STATISTICS
83. The odds against a certain event are 5:2 and odds in favour of another event,
independent of the former, are 6:5. Find the chance that at least one of the events
will happen.
a) 25/77 b) 35/77 c) 52/77 d) 65/87
84. The odds in favour of an event are 4:3. The odds against another independent event
are 2:3. What is the probability that at least one of the events will occur?
a) 35/36 b) 28/35 c) 29/36 d) 29/35
85. The odds against 2 events are 2:7 and 7:5 respectively. If the events are independent,
find the probability that at least one of them will occur?
a) 45 / 74 b) 35 / 54 c) 45 / 54 d) 47 / 54
86. The odds that a book will be favorably reviewed by three independent critics are
5 to 2, 4 to 3, and 3 to 4 respectively. What is the probability that majority of the
critics reviewed the book favorably?
a) 225 / 343 b) 209 / 343 c) 391 / 400 d) 420 / 840
Case: 1 – Two bags are given, a ball is drawn from each bag
One bag contains 4 red and 2 black balls, another bag contains 3 red and 5 black balls.
If one ball is drawn from each bag, determine the probability that;
319
CA FOUNDATION STATISTICS
Case: 2 – Two bags are given, a bag is chosen at random, then ball(s) is/are drawn
A bag contains 5 red and 3 black balls and another bag contains 4 red and 5 black balls.
A bag is selected at random and a ball is selected. Find the chance that:
90. It is red.
a) 77 / 177 b) 87 / 144 c) 97 / 854 d) 77 / 144
91. It is black.
a) 67 / 144 b) 77 / 144 c) 97 / 144 d) 87 / 145
92. If 2 balls are drawn from the selected bag, find the chance that one red and one
black is drawn.
a) 275 / 957 b) 597 / 5149 c) 275 / 504 d) None of the above
Case: 3 – Two bags are given, 1 ball is chosen from Bag 1 and transferred to Bag 2. Now
a ball is drawn from Bag 2
There are two bags. The first contains 2 red and 1 white ball, whereas the 2nd bag
contains 1 red and 3 white balls. One ball is taken out at random from the 1st bag and
put into second bag. Then a ball is chosen at random from the second bag. What is the
probability that;
Miscellaneous Cases
95. What is the probability that 4 children selected at random would have different
birthdays?
a) 364 x 363 x 362 b) 6 x5 x4 c) 1/365 d) (1/7)3
(365)3 73
320
CA FOUNDATION STATISTICS
96. A bag contains 8 red and 5 white balls. Two successive draws of 3 balls are made
without replacement. The probability that the first draw will produce 3 white balls
and the second 3 red balls is
(a) 5/223 (b) 6/257
(c) 7/429 (d) 3/548
97. There are three persons aged 60, 65 and 70 years old. The survival probabilities for
these three persons for another 5 years are 0.7, 0.4 and 0.2 respectively. What is the
probability that at least two of them would survive another five years?
(a) 0.425 (b) 0.456
(c) 0.392 (d) 0.388
98. For a group of students, 30 %, 40% and 50% failed in Physics , Chemistry and at
least one of the two subjects respectively. If an examinee is selected at random,
what is the probability that he passed in Physics if it is known that he failed in
Chemistry?
(a) 1/2 (b) 1/3 (c) 1/4 (d) 1/6
100. X and Y stand in a line with 6 other people. What is the probability that there are 3
persons between them?
(a) 1/5 (b) 1/6 (c) 1/7 (d) 1/3
101. 12 persons amongst whom X and Y are included are seated at a round table. Find
the probability that there are 3 persons between X and Y.
a) 1/11 b) 3/11 c) 2/11 d) None of these
102. Four digits 1, 2, 4 and 6 are selected at random to form a four digit number. What
is the probability that the number so formed, would be divisible by 4?
(a) 1/2 (b) 1/5 (c) 1/4 (d) 1/3
321
CA FOUNDATION STATISTICS
Theoretical Aspects
322
CA FOUNDATION STATISTICS
112. If an unbiased coin is tossed once, then the two events Head and Tail are
(a) Mutually exclusive
(b) Exhaustive
(c) Equally likely
(d) All these (a), (b) and (c).
114. If for two events A and B, P(A B) P(A) x P(B), then the two events A and B are
(a) Independent (b) Dependent
(c) Not equally likely (d) Not exhaustive.
323
CA FOUNDATION STATISTICS
119. If a coin is tossed twice, then the events ‘occurrence of one head’, ‘occurrence of 2
heads’ and ‘occurrence of no head’ are
(a) Independent (b) Equally likely
(c) Not equally likely (d) Both (a) and (b).
324
CA FOUNDATION STATISTICS
123. If p : q are the odds in favour of an event, then the probability of that event is
p p
(a) q (b) p+q
q
(c) p+q (d) none of these.
125. If A, B and C are mutually exclusive and exhaustive events, then P(A) + P(B) + P(C)
equals to
1
(a) 3 (b) 1
(c) 0 (d) any value between 0 and 1.
126. If A denotes that a student reads in a school and B denotes that he plays cricket, at
the same time then
(a) P(A B) = 1 (b) P(A B) = 1
(c) P(A B) = 0 (d) P(A) = P(B).
129. For two events A and B, P(A B) = P(A) + P(B) only when
(a) A and B are equally likely events
(b) A and B are exhaustive events
(c) A and B are mutually independent
(d) A and B are mutually exclusive.
325
CA FOUNDATION STATISTICS
130. Addition Theorem of Probability states that for any two events A and B,
(a) P(A B) = P(A) + P(B)
(b) P(A B) = P(A) + P(B) + P(A B)
(c) P(A B) = P(A) + P(B) – P(A B)
(d) P(A B) = P(A) x P(B)
135. The Theorem of Compound Probability states that for any two events A and B.
(a) P(A B) = P(A) x P(B/A) (b) P(A B) = P(A) x P(B/A)
(c) P(A B) = P(A) x P(B) (d) P(A B) = P(B) + P(B) – P(A B).
326
CA FOUNDATION STATISTICS
137. If P(A–B) = P(B–A), then the two events A and B satisfy the condition
(a) P(A) = P(B). (b) P(A) + P(B) = 1
(c) P(A B) = 0 (d) P(A B) = 1
138. The number of conditions to be satisfied by three events A, B and C for complete
independence is
(a) 2 (b) 3 (c) 4 (d) any number.
140. A, B, C are three mutually independent with probabilities 0.3, 0.2 and 0.4 respectively.
What is P (A B C)?
(a) 0.400 (b) 0.240 (c) 0.024 (d) 0.500
327
CA FOUNDATION STATISTICS
HOMEWORK SECTION
1. All heads.
a) ¼ b) ½ c) ¾ d) None of these
2. All tails.
a) ¼ b) ½ c) ¾ d) None of these
3. No heads.
a) ¼ b) ½ c) ¾ d) None of these
4. No tails.
a) ¼ b) ½ c) ¾ d) None of these
328
CA FOUNDATION STATISTICS
329
CA FOUNDATION STATISTICS
A number is selected at random from a set of first 120 natural numbers. What is the
probability that it is divisible by:
21. 10 or 13
a) 7/40 b) 13/40 c) 1/10 d) None of the above
22. 8 or 10
a) 1/5 b) 2/13 c) 3/120 d) None of the above
23. One counter is drawn at random from a bag containing 70 counters marked with the
first 70 numerals. Find the chance that it is a multiple of 8 or 9.
a) 1/14 b) 2/14 c) 4/14 d) 3/14
24. From a set of 18 balls marked 1, 2, 3, ... ... ..., 17, 18, one ball is drawn at random.
What is the probability that its number is either a multiple of 3 or of 4?
a) 0.5 b) 0.67 c) 0.33 d) None of these
25. There are three men aged 60, 65 and 70 years. The probability to live 5 years more
is 0.8 for a 60 year old, 0.6 for a 65 year old and 0.3 for a 70 year old person. Find
the probability that at least 2 of the 3 persons will remain alive 5 years hence.
a) 0.216 b) 0.500 c) 0.612 d) None of the above
27. A person is known to hit 4 out of 5 shots, whereas another person is known to hit 3
out of 4 shots. Find the probability of hitting a target if they both try.
a) 0.29 b) 0.92 c) 0.95 d) 0.98
28. An urn contains 4 white and 5 black balls. A second urn contains 5 white and 4 black
balls. One ball is transferred from the 1st urn to the 2nd urn. What is the probability
that it is white?
a) 49/90 b) 59/90 c) 69/90 d) None of these
330
CA FOUNDATION STATISTICS
29. A bag contains 5 white and 4 black balls. A ball is drawn at random from the bag
and put into another bag, which contains 3 white and 7 black balls. A ball is drawn
at random from the 2nd bag. What is the probability that it is white?
a) 23/99 b) 43/99 c) 54/99 d) 32/99
30. A bag contains 3 red & 5 white balls and the 2nd bag contains 4 red and 6 white
balls. One ball is drawn at random from the first bag and put into 2nd bag. If now,
a ball is drawn from the 2nd bag, find the probability that it is red.
a) 35/88 b) 53/88 c) 16/89 d) None of the above
A nine digit number is to be formed with digits (1, 2, 3, ...., 9), without any digit being
repeated. Find the probability that:
36. Four married couples have gathered in a room. Two persons are selected at random
amongst them, find the probability that selected persons are a gentleman and a
lady but not a couple.
a) 1/7 b) 3/7 c) 1/8 d) 3/8
331
CA FOUNDATION STATISTICS
37. A card is drawn out of a standard pack of 52 cards. What is the probability of
drawing a king or red color card?
a) 4/13 b) 7/13 c) 1/5 d) 1/3
38. A team of 5 is to be selected from 8 boys and 3 girls. Find the probability that it
includes two particular girls.
a) 1/5 b) 2/11 c) 2/30 d) 8/9
39. One card is drawn from a pack of 52 cards, what is the probability that it is a king
or a queen?
a) 1/13 b) 2/13 c) 11/13 d) 5/13
40. In a packet of 500 pens, 50 are found to be defective. A pen is selected at random,
find the probability that it is non defective.
a) 1/100 b) 9/10 c) 9/500 d) 45/500
41. A letter is taken out at random from the word RANGE and another is taken out from
the word PAGE. The probability that they are the same letters is:
a) 1/20 b) 4/20 c) 3/20 d) 5/20
42. A box contains 9 balls, two of which are red, three blue and four black. Three balls
are drawn at random. What is the probability that they are of same color?
a) 3/27 b) 5/84 c) 20/31 d) None of the above
43. A card is drawn from a well-shuffled pack of 52 cards. Let E1 is “a king or a queen
is drawn” and E2 is “a queen or a jack is drawn”. Then which of the following is
CORRECT:
a) E1 and E2 are not independent
b) E1 and E2 are mutually exclusive
c) E1 and E2 are independent
d) None of the above is correct
44. From a pack of cards, two are drawn, the first being replaced before the second is
drawn. The chance that the first is a diamond and the second is king is:
a) 5/52 b) 1/52 c) 4/52 d) 3/52
332
CA FOUNDATION STATISTICS
45. The probability of getting qualified in Test I and Test II by a student are respectively
1/5 and 3/5. What is the probability that the student gets qualified for one of the
tests?
a) 3/25 b) 22/25 c) 17/25 d) 8/25
46. Suppose X and Y are two events of a random experiment. If the probability of
occurrence of X is 1/5 and the probability of occurrence of Y given X is 1/10, then
the probability of non-occurrence of at least one of the events X and Y is:
a) 49/50 b) 1/50 c) 48/50 d) 2/50
47. A bag contains 8 red and 5 white balls. Two successive draws of 3 balls are made
without replacement. The probability that the first draw will produce 3 white balls
and second 3 red balls is:
a) 7/429 b) 3/429 c) 5/429 d) 6/256
48. Three identical dice are rolled. What is the probability that the same number will
appear on each of them?
a) 1/6 b) 1/36 c) 1/216 d) 7/216
49. Among the examinees in an examination; 30%, 35% and 45% failed in Statistics,
in Mathematics and in at least one of the subjects respectively. An examinee is
selected at random. What is the probability that he failed in Mathematics only?
a) 0.15 b) 0.25 c) 0.225 d) 0.55
50. An article consists of two parts X and Y. The manufacturing process of each part is
such that probability of defect in X is 0.08 and that Y is 0.05. What is the probability
that the assembled product will not have any defect?
a) 0.864 b) 0.874 c) 0.748 d) 0.648
52. The probability that a trainee will remain with a company is 0.8. The probability
that an employee earns more than Rs. 30,000 per month is 0.4. The probability that
an employee, who was a trainee and remained with the company or who earns
more than Rs. 30,000 per month is 0.9. What is the probability that an employee
333
CA FOUNDATION STATISTICS
earns more than Rs. 30,000 per month given that he is a trainee, who stayed with
the company?
a) 1/8 b) 3/8 c) 7/8 d) 5/8
54. If A and B are two independent events and P(A U B) = 2/5; P(B) = 1/3, find P(A).
a) 1/9 b) 1/10 c) 2/9 d) 2/10
55. A bag contains 12 balls of which 3 are red. 5 balls are drawn at random. Find the
probability that in 5 balls, 3 are red.
a) 5/396 b) 1/22 c) 1/36 d) 3/123
59. What is the probability that a word selected at random from the words formed by
arranging letters of the word REGULATION, such that vowels come at odd places in
the chosen word?
a) 1/144 b) 1/252 c) 144/252 d) 4/5
60. In a pack of playing cards with two jokers, probability of getting king of spade is:
a) 2/54 b) 1/54 c) 2/52 d) 1/52
61. Consider two events A and B, not mutually exclusive, such that P(A) = ¼, P(B) = 2/5,
P(AUB) = ½, then P(AB) is:
a) 1/10 b) 3/20 c) 2/10 d) 3/7
334
CA FOUNDATION STATISTICS
62. If X be the sum of two numbers obtained when two die are thrown simultaneously,
then P(X ≥ 7) is:
a) 6/12 b) 7/12 c) 7/24 d) 6/36
64. A bag contains 3 white and 5 black balls and second bag contains 4 white and 2
black balls. If one ball is taken from each bag, what is the probability that both the
balls are white?
a) ½ b) ¼ c) 1/3 d) 1/5
65. The odds in favor of A solving a problem is 5:7 and odds against B solving the same
problem is 9:6. What is the probability that if both of them try, the problem will be
solved?
a) 147/180 b) 117/180 c) 118/180 d) 119/180
66. Consider Bag I – 2 white balls, 3 black balls and Bag II – 4 white balls, 6 black balls.
One ball is transferred from Bag I to Bag II, then one ball is drawn from Bag II. What
is the probability that drawn ball is white?
a) 22/65 b) 22/55 c) 22/46 d) 21/55
68. A bag contains 5 red balls, 4 blue balls and “X” green balls. If the random probability
of picking two green balls is 1/7, what is the number of green balls (X)?
a) 5 b) 6 c) 7 d) 8
69. The probability of Girl getting scholarship is 0.6 and the same probability for Boy is
0.8. Find the probability that at least one of the categories getting scholarship.
a) 0.32 b) 0.92 c) 0.44 d) 0.54
70. A coin is tossed 5 times, what is the probability that exactly 3 heads will occur?
a) 1/32 b) 5/16 c) 5/32 d) 3/32
335
CA FOUNDATION STATISTICS
71. What is the probability of selecting 3 girls from a group of 5 girls and 3 boys?
a) 15/28 b) 5/28 c) 1/28 d) 1/56
72. A and B appear for an interview for two vacancies. The probability of A’s selection is
1/3 and that of B’s selection is 1/5. Find the probability that only one of them will
be selected.
a) 3/5 b) 2/5 c) 4/5 d) 5/6
73. Two dice are thrown together. Find the probability of getting a multiple of 2 on 1st
dice and multiple of 3 on the 2nd dice.
a) 2/6 b) 1/6 c) 2/3 d) 1/3
74. A bag contains 6 red balls and some blue balls. If the probability of drawing a blue
ball from the bag is twice that of a red ball, find the number of blue balls in the
bag.
a) 10 b) 12 c) 16 d) 14
75. The probability of selecting a sample of size “n” out of a population of size “N” by
simple random sampling with replacement is:
a) 1/N b) 1/Nn c) 1/NCn d) 1/N!
76. A box contains 2 red, 3 green and 2 blue balls. Two balls are drawn at random.
What is the probability that none of the balls drawn is blue?
a) 11/21 b) 10/21 c) 5/7 d) 2/7
77. The odds that a book will be favorably received by 3 independent reviewers are 5
to 2, 4 to 3 and 3 to 4 respectively. What is the probability that out of 3 reviewers a
majority will be favorable?
a) 209/434 b) 209/343 c) 209/350 d) 209/334
78. Find the probability of drawing an ace on each of two consecutive draws from a well
shuffled pack of cards, without replacement:
a) 2/221 b) 1/221 c) 5/221 d) 5/51
336
CA FOUNDATION STATISTICS
79. The probability of a cricket team winning match in Kolkata is 2/5 and losing match
at Delhi is 1/7. What is the probability of the team winning at least one match?
a) 3/35 b) 32/35 c) 2/35 d) 1/7
80. Two coins are tossed simultaneously. Find the probability of getting exactly one
head.
a) ¼ b) 2/4 c) ¾ d) None of the above
81. If a pair of dice is thrown then the probability that the sum of the digits is neither 7
nor 11 is:
a) 5/9 b) 7/9 c) 6/9 d) 3/9
82. From 6 positive and 8 negative numbers, 4 numbers are chosen at random without
replacement and are then multiplied. The probability that the product of the chosen
numbers will be positive number is:
a) 70/1001 b) 409/1001 c) 420/1001 d) 505/1001
83. If an unbiased die is tolled once, the odds in favor of getting a point which is multiple
of 3 is:
a) 1 : 1 b) 1 : 2 c) 1 : 3 d) 1 : 4
84. A bag contains 15 one rupee coins, 25 two rupees coins and 10 five rupees coins. If
a coin is selected at random then probability for not selecting a one rupee coin is:
a) 0.20 b) 0.30 c) 0.35 d) 0.70
85. If two letters are taken at random from the word “HOME”, what is the probability
that none of the letters would be vowels?
a) ½ b) 1/6 c) 1/3 d) ¼
86. In a game, cards are thoroughly shuffled and distributed equally among four
players. What is the probability that a specific player gets all the four kings?
a) b)
c) d)
337
CA FOUNDATION STATISTICS
87. A bag contains 4 red and 5 black balls. Another bag contains 5 red and 3 black
balls. If one ball is drawn at random from each bag, then the probability that one
red and one black ball is drawn is:
a) 12/72 b) 37/72 c) 25/72 d) 13/72
30% of the students failed in Maths, 20% in Chemistry and 10% in both. Find the
probability that
95. He has failed in Maths known that he had not failed in Chemistry.
a) 2 / 5 b) 3 / 5 c) 1 / 7 d) 1 / 4
96. He has failed in Chemistry known that he has not failed in Maths
a) 1 / 7 b) 3 / 4 c) 1 / 5 d) 1 / 10
338
CA FOUNDATION STATISTICS
97. He has not failed in Chemistry known that he has failed in Maths.
a) 2 / 5 b) 2 / 3 c) 1 / 5 d) 1 / 10
98. He has not failed in Maths known that he has failed in Chemistry.
a) 2 / 5 b) 3 / 5 c) 1 / 3 d) 5 / 10
99. He has not failed in Chemistry, known that he has not failed in Maths.
a) 2 / 7 b) 3 / 7 c) 6 / 7 d) 1 / 10
100. He has not failed in Maths known that he has not failed in Chemistry.
a) 2 / 4 b) 3 / 4 c) 1 / 5 d) 1 / 10
339
CA FOUNDATION STATISTICS
HOMEWORK SOLUTION
Total Cases = 4
1. Favorable Cases = 1. P = 1 / 4. Option A
340
CA FOUNDATION STATISTICS
Total balls = 3 + 8 = 11
2 balls are drawn. Total cases = 11C2 = 55
16. One green one white = 3C1 x 8C1 = 24 cases. P = 24/55/ Option C
Total balls = 3 + 8 = 11
19. Two green one white = 3C2 x 8C1 = 24 cases. P = 24/165. Option C
Multiple of 13 = 9 cases
LCM of 10 and 13 is 130
Total cases = 12 + 9 = 21
P(10 or 13) = 21/120 = 7/40. Option A
341
CA FOUNDATION STATISTICS
Total cases = 9!
31. Begin with 5 = 1*8! = 8! Cases. P = 8!/9! = 1/9. Option D
342
CA FOUNDATION STATISTICS
33. Begin with 5 and end with 9 = 1*7!*1 = 7! Cases. P = 7!/9! = 1/72. Option C
34. Begin with 5 but not end with 9 = 8! – 7! = 7!*7 cases. P = 7!*7/9! = 7/72. Option A
35. 5 and 9 are at two extremes = 7!*2! = 2*7! Cases. P = 2*7!/9! = 2/72. Option B
36. Total ways = 8C2 = 28; Favorable cases = 4C1*3C1 = 12. P = 12/28 = 3/7. Option B
39. P(K) = 4/52; P(Q) = 4/52. P(K or Q) = 4/52 + 4/52 = 8/52 = 2/13. Option B
343
CA FOUNDATION STATISTICS
48. P(same number) =6 favorable case (111, 222, ... ...,666). P = 6/216 = 1/36. Option B
51. The probability of an event can assume any value between 0 and 1 option A
52. P(A) = 0.8, P(B) = 0.4, P(A or B) = 0.9 P(AB) = 0.8 + 0.4 – 0.9 = 0.3. P(B/A) = 0.3/0.8 =
3/8. Option B
53. The limiting relative frequency of probability is the Statistical Definition of probability.
Option B
56. P(AB) = 2/3+3/5 – 5/6 = (20 + 18 – 25)/30 = 13/30. P(B/A) = P(AB)/P(A) = 13/30 / 2/3
= 13/20. Option B
344
CA FOUNDATION STATISTICS
59. Total words = 10!. Words with vowels at odd places = 5P5*5P5 = 5!*5!. P=5!*5!/10! =
1/252. Option B
61. P(AB) = ¼ + 2/5 – ½ = 0.25 + 0.40 – 0.50 = 0.15 = 15/100 = 3/20. Option B
62. P(X ≥ 7) = P(X = 7, 8, 9, 10, 11, 12) = (6+5+4+3+2+1)/36 = 21/36 = 7/12. Option B
63. P(A/B) = P(AB)/P(B) = P(A). Thus, P(AB) = P(A).P(B). A and B are independent events.
Option B
65. P(A) = 5/12, P(B) = 6/15. P(A’B’) = 7/12 * 9/15 = 63/180. P(Solved) = 1 – 63/180 =
117/180. Option B
67. P(AB) = P(A) + P(B) – P(A U B) = P(A) + P(B) – P(A) = P(B). Option B
69. P(A’B’) = 0.4 * 0.2 = 0.08. P(at least one) = 1 – 0.08 = 0.92. Option B
345
CA FOUNDATION STATISTICS
1 4 2 1 6 2
= x + x = =
3 5 3 5 15 5
73. (23) (26) (43) (46) (63) (66) = 6 cases. P = 6/36 = 1/6. Option B
79. P(A’B’) = 3/5 * 1/7 = 3/35. P(At least 1 win) = 1 – 3/35 = 32/35. Option B
81. P(7) = 6/36, P(11) = 2/36, P(7 or 11) = 8/36. P(7’ or 11’) = 1 – 8/36 = 28/36 = 7/9.
Option B
83. Cases multiple of 3 are 3 and 6. P(A) 2/6 = 2/6 = 1/3 ∴P (A) = 2/3 ∴ odds in favour
= 1:2 Option B
346
CA FOUNDATION STATISTICS
85. 2 letters at a time. Cases – HO, HM, HE, OM, OE, ME = 6 cases
Favorable cases = HM = 1 case. P = 1/6. Option B
91. P(M C’) = P(M) – P(CM) = 0.30 – 0.10 = 0.20 = 2/10 = 1/5. Option C
347
CA FOUNDATION STATISTICS
1. A bag contains 2 red and 1 green balls. Another bag contains 2 red and 2 green
balls. A bag is selected at random and then a ball was drawn from it. If it was found
to be red, then the probability that it has been drawn from the first bag is:
a) 4/7 b) 3/7
c) 2/3 d) 14/24
2. For any two events A, B; let P(A) = 2/3; P(B) = 3/8 and P(A ∩ B) = ¼, then A and B
are:
a) Independent and mutually exclusive events
b) Independent and non mutually exclusive events
c) Mutually exclusive and non independent events
d) Non mutually exclusive and non independent events
3. An unbiased die is thrown twice. The probability of the sum of numbers obtained on
the two faces being divisible by 4 is:
a) ½ b) ¼
c) 1/3 d) 2/5
(For Q. No.: 5 – 8) The following table shows the qualification of 100 employees in
a firm according to their age and qualification?
Below 30 30 to 40 Above 40 Total
Graduate 10 20 15 45
Post-Graduate 15 25 15 55
Total 25 45 30 100
If an employee is selected at random, find the probability that
348
CA FOUNDATION STATISTICS
6. He is a graduate,
a) 8/20 b) 10/20
c) 13/20 d) 9/20
(For Q. No.: 9 & 10) A die is loaded in such a way that each odd number is as twice
likely to occur as even number. Find:
10. The probability that the number rolled is a perfect square provided that it is greater
than 3.
a) 0.22 b) 0.11
c) 0.33 d) 0.54
(For Q. No. 11 – 15) 100 students randomly selected from a group of students are
cross-classified by age and educational qualification as below:
349
CA FOUNDATION STATISTICS
A student is selected at random from this group. Find the probability that:
12. He is a graduate.
a) 0.45 b) 0.65
c) 0.85 d) 0.54
16. Three groups of children contain 3 girls and 1 boy, 2 girls and 2 boys and 3 boys and
1 girl. One child is selected at random from each of the group. The probability that
the three selections consist of 1 girl and 2 boys is:
a) 31 / 32 b) 16 / 32
c) 13 / 32 d) 15 / 32
17. In a group of equal number of men and women, 10 % men and 45% women are un-
employed. What is the probability that a person selected at random is employed.
a) 29 / 40 b) 28 / 41
c) 27 / 42 d) 29 / 41
350
CA FOUNDATION STATISTICS
18. A and B throw with a pair of dice. A wins if he throws 6 before B throws 7 and B
wins if he throws 7 before A throws 6. If A begins, the chance of his winning is:
a) 29 / 62 b) 31 / 70
c) 30 / 62 d) 30 / 61
19. In a card game, 4 cards are drawn at random. Find the probability that all 4 cards
are from the same suit.
a) 0.01 b) 0.021
c) 0.008 d) 0.012
20. A value is chosen at random from a population. What is the probability that the
value is larger than the median of the population?
a) 0.67 b) 0.5
c) 0.25 d) 0.23
21. The probability that the sum 8 appears in a single toss of a pair of fair dice is
a) 4/36 b) 5/36
c) 6/36 d) 7/36
(For Q. No.: 22 – 24) A bag contains 5 red and 3 black balls and another bag
contains 4 red and 5 black balls. A bag is selected at random and a ball is selected.
Find the chance that:
22. It is red.
a) 77 / 177 b) 87 / 144
c) 97 / 854 d) 77 / 144
23. It is black.
a) 67 / 144 b) 77 / 144
c) 97 / 144 d) 87 / 145
24. If 2 balls are drawn from the selected bag, find the chance that one red and one
black is drawn.
a) 275 / 957 b) 597 / 5149
c) 275 / 504 d) None of the above
351
CA FOUNDATION STATISTICS
(For Q. No.: 25 – 27) Out of the 20 games of chess played between two players A
and B, A won 12, B won 4 and 4 ended in a tie. In a tournament of three games find
the probability that:
28. Four cards are drawn from a pack of 52 cards. What is the probability that they are
from four different suits?
a) 219/2082 b) 2197/20285
c) 2197/20825 d) 2179/20825
29. The 1st 12 letters of the alphabet are written at random. Find the probability that
there are exactly 4 letters between C and D.
a) 6/66 b) 8/66
c) 7/66 d) 13/66
30. The letters of the word ‘EDUCATION’ are arranged at random. Find the probability
that there will be exactly four letters between A and D.
a) 1/7 b) 1/3
c) 1/9 d) 1/3
31. From 8 counters marked 1, 2, 3,...,8, 4 counters are selected at random. Find the
chance of getting at least one odd and one even counter.
a) 31/35 b) 34/35
c) 3/35 d) 1/35
352
CA FOUNDATION STATISTICS
32. 40% of students in a class are girls. If 60% and 70% of the boys and the girls
respectively of the class pass a test. What is the probability that a student selected
from this class will have passed the test?
a) 16/25 b) 15/36
c) 61/85 d) 53/65
33. In a group of 14 males and 6 females, 8 and 3 of the males and the females
respectively are aged above 40 years. What is the probability that a person selected
at random from his group is aged 40, given that selected person is a female?
a) 0.29 b) 0.33
c) 0.67 d) 0.50
34. The odds against a certain event are 5:2 and odds in favour of another event,
independent of the former, are 6:5. Find the chance that at least one of the events
will happen.
a) 25/77 b) 35/77
c) 52/77 d) 65/87
35. A person is known to hit 4 out of 5 shots, whereas another person is known to hit 3
out of 4 shots. Find the probability of hitting a target if they both try.
a) 0.29 b) 0.92
c) 0.95 d) 0.98
36. What is the probability that over a two day period the number of requests would be
either 11 or 12 if at a motor garage the records of service requests along with their
probabilities are given below:
Daily Demand 5 6 7
Probability 0.25 0.65 0.10
a) 0.7579 b) 0.8589
c) 0.7975 d) 0.7957
353
CA FOUNDATION STATISTICS
(For Q. No.: 37 & 38) Out of the numbers 1 to 120, one is selected at random. What
is the probability that it is divisible by:
37. 10 or 13?
a) 8/40 b) 7/40
c) 6/40 d) 13/40
38. 8 or 10?
a) 8/40 b) 7/40
c) 5/40 d) None
(For Q. No.: 39 & 40) A man has 2 French, 5 German and 4 Spanish friends. He invites
them in his birthday party. Find the probability that there will be:
41. Out of the 5 players of which 2 are members of a certain club, 3 are to be selected
to represent the country at an international tournament. Find the probability that
less than two of those selected to represent are members of the club.
a) 0.75 b) 0.60
c) 0.80 d) 0.70
42. The weekly wages of 6 worker in a factory are (Rs.) 62, 90, 78, 85, 79 & 68. Two of
the laborers are to be selected at random to serve as representatives. What is the
probability that at least one will have a wage lower than the average?
a) 0.40 b) 0.50
c) 0.60 d) None of these
354
CA FOUNDATION STATISTICS
43. There are 4 hotels in a town and 3 persons entered the town. Find the probability
that they check into different hotels.
a) 0.475 b) 0.325
c) 0.375 d) None of these
(For Q. No.: 44 & 45) A bag contains 3 green and 8 white balls. If one ball is drawn
at random, find the chance that:
44. It is green.
a) 1/11 b) 2/11
c) 3/11 d) 4/11
45. It is white.
a) 2/11 b) 4/11
c) 6/11 d) 8/11
(For Q. No.: 46 & 47) The independent probability that the three sections of the
costing department will encounter error is 0.1, 0.3 and 0.3, each week respectively.
Calculate the probability that there will be;
47. One and only one computer error encountered by the costing department.
a) 0.247 b) 0.427
c) 0.924 d) None of these
(For Q. No.: 48 – 50) Two balanced dice are thrown together. Find the probability of
obtaining:
355
CA FOUNDATION STATISTICS
49. Doublets.
a) 5/6 b) 5/36
c) 1/6 d) 1/36
1 2 3 4 5 6 7 8 9 10
A B B B A D A B A B
11 12 13 14 15 16 17 18 19 20
C D B D A C A D A B
21 22 23 24 25 26 27 28 29 30
B D A C B A A C C C
31 32 33 34 35 36 37 38 39 40
B A D C C C B A C D
41 42 43 44 45 46 47 48 49 50
D C C C D D B B C A
356
CA FOUNDATION STATISTICS
EXPLANATORY
ANSWERS
8. P(post graduate, known that age is between 30-40 years) = 25/45 = 5/9.
Option B
357
CA FOUNDATION STATISTICS
15. P(he is graduate / age is between 26-28 years) = 19/35 = 0.54. Option A
16. P(1 girl & 2 boys) = (3/4*2/4*3/4) + (1/4*2/4*3/4) + (1/4*2/4*1/4) = 26/64 = 13/32.
Option C
17. P(employed person) = (90% of 100 + 55% of 100)/200 = 145/200 = 29/40. Option A
19. P(4 cards from the same suit) = 4C1 * 13C4 / 52C4 = 2860 / 270725 = 0.01
Option A
26. P(B wins at least one) = 1 – P(B wins none) = 1 – (4/5)3 = 1 – 64/125 = 61/125
Option A
358
CA FOUNDATION STATISTICS
27. P(2 games ended in a tie) = (3C2)(4/20)2[12/20 + 4/20] = 3 * 1/25 * 4/5 = 12/125
Option A
28. P(4 cards from different suits) = 134/52C4 = 28561/270725 = 2197/20825 Option C
359
CA FOUNDATION STATISTICS
46. P(at least 1 error) = 1 – P(no error) = 1 – (0.9 * 0.7 * 0.7) = 1 – 0.441 = 0.559 Option
D
47. P(only one error) = (0.1 * 0.7 * 0.7) + (0.9 * 0.3 * 0.7) + (0.9 * 0.7 * 0.3)
= 0.049 + 0.189 + 0.189 = 0.427 Option B
360
CA FOUNDATION STATISTICS
361
CA FOUNDATION STATISTICS
1. A speaks truth in 60% and B in 75% of the cases. In what percentage of cases are
they likely to contradict each other in stating the same fact
a) 0.45 b) 0.54 c) 0.55 d) 0.65
2. A six faced die is so biased that it is twice likely to show an even number as compared
to an odd number when thrown. It is thrown twice. Find the probability that the sum
of the faces thrown will be even.
a) 7 / 9 b) 5 / 9 c) 8 / 9 d) 9 / 9
3. A bag contains 8 red and 5 white balls. Two successive draws of 3 balls are made
without replacement. Find the probability that the 1st drawing will give 3 white and
second drawing will give 3 red balls.
a) 4 / 294 b) 7 / 597 c) 7 / 429 d) 159 / 429
4. With respect to the above question; however, if 3 balls are replaced before the 2nd
draw, find the probability that the 1st drawing will give 3 white and the 2nd will
give 3 red.
a) 140 / 20449 b) 1598 / 20449
c) 140 / 20944 d) None of the above
5. The manufacturing process of an article consists of two parts X and Y. The probabilities
of defects in part X and Y are 10% and 15% respectively. What is the probability
that an assembled product will not have any defects?
a) 0.765 b) 0.756 c) 0.523 d) 0.532
6. A pair of fair dices is thrown. Find the probability of getting a sum of 7, when it is
know that the digit in the first die is greater than that of the 2nd.
a) 0.198 b) 0.200 c) 0.549 d) 0.259
362
CA FOUNDATION STATISTICS
8. In a single cast with 3 dices, find the chance of getting different digits.
a) 4/9 b) 4/13 c) 5/9 d) 5/13
9. In a single cast with 3 dices, find the probability of getting same digits in two dice
and different digit in another.
a) 6/12 b) 5/12 c) 5/13 d) 5/14
10. 5 students A, B, C, D, and E occupy their seats at random in a bench. Find the
probability that the students A and B are not consecutive.
a) 0.55 b) 0.35 c) 0.25 d) 0.60
11. 12 dissimilar balls are distributed at random into 5 boxes A, B, C, D, E. Find the
probability that these boxes contain respectively 3, 4, 1, 4, 0 balls.
a) 136/225 b) 136800/62589
c) 138569/36258 d) 5544/9765625
12. 5 Commerce and 4 science students are arranged at random in a row. Find the
probability so that the commerce and science students are placed alternately?
a) 1/126 b) 2/126 c) 1/216 d) 19/216
13. If the number 1, 2, … N are arranged in a line at random. What is the probability
that the number 1 and 2 will appear next to each other?
a) 2 / N b) N c) 1 / N d) 9 / N
14. The odds against 2 events are 2:7 and 7:5 respectively. If the events are independent,
find the probability that at least one of them will occur?
a) 45 / 74 b) 35 / 54 c) 45 / 54 d) 47 / 54
15. A class consists of 30 boys and 20 girls of which half the boys and half the girls
have blue eyes. What is the probability that a students selected at random is not a
girl with blue eyes?
a) 0.75 b) 0.98 c) 0.56 d) 0.80
363
CA FOUNDATION STATISTICS
16. The probability that a teacher will take a surprise test during any class meeting is
1/5. If a student is absent on two days, what is the probability that he will miss at
least 1 test?
a) 16 / 25 b) 1 / 25 c) 4 / 25 d) 9 / 25
17. A number is chosen at random from the first n natural numbers and suppose A and
B denotes the events the chosen number is divisible by 2 and 3 respectively. A and
B are independent if n = ?
a) 96 b) 97 c) 99 d) 100
(For Q. No.: 18 & 19) An urn contains 2 white & 2 black balls. A 2nd urn contains 2
white and 4 black balls.
18. If one ball is chosen from each urn, what is the probability that the selected balls
will be of the same colour?
a) 0.33 b) 0.50 c) 0.67 d) None
19. If an urn is selected at random and one ball is drawn from it, what is the probability
that it will be a white ball?
a) 5/12 b) 6/12 c) 7/12 d) 8/12
20. There are three men aged 60, 65 and 70 years. The probability to live 5 years more
is 0.8 for a 60 year old, 0.6 for a 65 year old and 0.3 for a 70 year old person. Find
the probability that at least 2 of the 3 persons will remain alive 5 years hence.
a) 0.216 b) 0.500 c) 0.612 d) None
21. A bag contains 5 white and 4 black balls. A ball is drawn at random from the bag
and put into another bag, which contains 3 white and 7 black balls. A ball is drawn
at random from the 2nd bag. What is the probability that it is white?
a) 23/99 b) 43/99 c) 54/99 d) 32/99
22. One shot is fired from each of the three guns. Let A, B, and C denote the events that
the target is hit by the first, second and the third gun respectively. Assuming that A,
B & C are mutually independent events and that P(A) = 0.5, P(B) = 0.6 and P (C) =
0.8. Find the probability that at least one hits the target.
a) 0.96 b) 0.69 c) 0.33 d) None
364
CA FOUNDATION STATISTICS
(For Q. No.: 23 & 24) Mr. X is called for an interview of 3 separate posts. At the first
interview there are 5 candidates, at the second 4 candidates and at the third 6
candidates. If selection of each candidate is equally likely, find the probability that
Mr. X will be selected for;
25. A bag contains 3 red & 5 white balls and the 2nd bag contains 4 red and 6 white
balls. One ball is drawn at random from the first bag and put into 2nd bag. If now,
a ball is drawn from the 2nd bag, find the probability that it is red.
a) 35/88 b) 53/88 c) 16/89 d) None
26. A certain player is known to win with probability 0.3 if the track is fast and 0.4 if the
track is slow. For Monday, there is 0.7 probability of a fast track 0.3 of a slow track.
What is the probability that player X will win on Monday?
a) 0.67 b) 0.99 c) 0.33 d) None
27. What is the probability of obtaining a T.V. owner if a household is selected at random?
a) 0.40 b) 0.60 c) 0.20 d) 0.30
365
CA FOUNDATION STATISTICS
28. If a household has income Rs. 18000 or less and is a telephone subscriber, what is
the probability he has a T.V.?
a) 0.40 b) 0.20 c) 0.60 d) 0.30
29. What is the conditional probability of drawing a household that owns a T.V. given
that the household owns a Telephone?
a) 0.20 b) 0.10 c) 0.50 d) 0.60
30. Are the events, ‘ownership of a T.V.’ and ‘telephone subscriber’ independent?
a) No b) Yes
c) Can’t Say d) None of the above
Two workers are selected at random. Find the probability that the wages of both are:
33. Equal to or greater than Rs. 200 but lesser than Rs. 300.
a) 27639/63275 b) 27639/62375
c) 19857/59847 d) 15987/62357
(For Q. No.: 34 & 35) Three lots contain respectively 4%, 5% and 10% defective
articles. One article is chosen at random from each lot. Find the probability of
getting;
366
CA FOUNDATION STATISTICS
36. A pair of dice is thrown. Find the probability that the sum is 10 or greater if a 5
appears on the first dice.
a) 0.67 b) 0.99 c) 1.00 d) 0.33
37. In supplies of 3 components viz. base, neck and switch for an electric lamp, the
% of defective on a day were 5, 20 and 10 respectively. An assembled lamp is
considered defective if at least 1 of the three components is defective. If components
are selected randomly, what is the probability that an assembled lamp would be
defective?
a) 79 / 250 b) 97 / 250 c) 25 / 250 d) 129 / 250
(For Q. No.: 38 – 42) Four balls are drawn from a bag containing 5 black and 8 white
balls. Find:
367
CA FOUNDATION STATISTICS
43. What are the odds against throwing an ace or six in a single cast with a dice?
a) 1 : 3 b) 1 : 4 c) 2 : 1 d) 1 : 2
44. A can solve 80% of the problems in statistics and B can solve 70%. What is the
probability that at least one of them will solve a problem selected at random?
a) 0.29 b) 0.18 c) 0.81 d) 0.94
45. How many tosses of a coin are needed so that the probability of getting at least one
head is 87.5%?
a) 2 b) 4 c) 3 d) 8
46. In a game of cards, four cards are drawn at random. What is the probability that it
will contain all 4 kings?
a) 11/4156 b) 11/5146 c) 11/4165 d) None
(For Q. No.: 47 & 48) Three fair coins are tossed once. Find the probability of
49. A bag contains 10 red and 6 green balls. Two successive drawings of three balls are
made without replacement. Find the probability that the first drawing will give 3
red balls and second will give three greens balls.
a) 51/1001 b) 25/1001 c) 51/1000 d) 15/1001
50. A bag contains 6 white and 9 black balls. 4 balls are drawn at a time. Find the
probability of the 1st draw to give 4 white balls and the 2nd to give 4 black balls
when the balls are not replaced before the 2nd draw.
a) 7/315 b) 3/715 c) 6/615 d) None
368
CA FOUNDATION STATISTICS
1 2 3 4 5 6 7 8 9 10
A B C A A B C C B D
11 12 13 14 15 16 17 18 19 20
D A A D D D A B A C
21 22 23 24 25 26 27 28 29 30
D A C B A C B C D B
31 32 33 34 35 36 37 38 39 40
B C B C D D A A A B
41 42 43 44 45 46 47 48 49 50
D A C D C D D B D B
369
CA FOUNDATION STATISTICS
EXPLANATORY
ANSWERS
2. P(Even) = 2 P(Odd)
3x + 6x = 1; x = 1/9; O(Odd) = 1/9, P(Even) = 2/9
P(Sum is even) = P(E).P(E) + P(O).P(O) = 9(4/81) + 9(1/81) = 45/81 = 5/9 Option B
6. P(Sum is 7) = 6/36
P(F > S) = 15/36
P(7 and F>S) = 3/36
Required probability = 3/36 / 15/36 = 3/15 = 1/5 = 0.200 Option B
370
CA FOUNDATION STATISTICS
371
CA FOUNDATION STATISTICS
33. P(200 < Wages < 300) = C2 / 500C2 = 55278/124750 = 27639/62375 Option B
333
36. P(S = 10,11,12 / 5 on first dice) = 2/36 / 6/36 = 1/3 = 0.33 Option D
372
CA FOUNDATION STATISTICS
373
CA FOUNDATION STATISTICS
RANDOM VARIABLE
Theory of Expectation - THEORY
A. RANDOM VARIABLES
X 0 1 2 3
P(x) 1/8 3/8 3/8 1/8
These values of ‘x’ {0,1,2,3} are called the values of the random variables
which are the outcomes of a random experiment.
4. Random Variables can be divided into the following two categories. They are
a. Discrete Random Variable
b. Continuous Random Variable
374
CA FOUNDATION STATISTICS
375
CA FOUNDATION STATISTICS
E.g. Calculation of Expectation of ‘x’ (where ‘x’ are the random variables
generated as a result of throwing an unbiased die)
X P XP
1 (x1) 1/6 (p1) 1/6 (x1p1)
2 (x2) 1/6 (p2) 2/6 (x2p2)
3 (x3) 1/6 (p3) 3/6 (x3p3)
4 (x4) 1/6 (p4) 4/6 (x4p4)
5 (x5) 1/6 (p5) 5/6 (x5p5)
6 (x6) 1/6 (p6) 6/6 (x6p6)
2. E(x) can assume any real number since ‘x’ can assume any real value.
3. If all the value of the random variable ‘x’ are equal then E(x) will be equal to
constant. i.e. E(c) = c
376
CA FOUNDATION STATISTICS
9. E(x - )=0
Proof
E(x - ) = E(x) – E( ) = - (since E(x) = = mean is constant) = 0
B. Symbolically
= A. M of (x – )2
= expectation of (x – )2 [Since expectation = A. M]
Where = E(X) = Mean of the random variable X
C. or variance of x is also denoted by Var (X) or V(X) and V(X) = E(x – )2 = E(x2)
– [E(x)]2
Proof :
377
CA FOUNDATION STATISTICS
2. Var(aX) = a2V(X)
e.g. Given V(X) = 3
Calculate V(3X)
Solution :
V(3X) = 9V(X) = 9(3) = 27
378
CA FOUNDATION STATISTICS
CLASSWORK SECTION
6. If a random variable x assumes the values x1, x2, x3, x4 with corresponding probabilities
p1, p2, p3, p4, then the expected value of x is:
a) p1 + p2 + p3 + p4 b) x1p1 + x2p2 + x3p3 + x4p4
c) x1 + x2 + x3 + x4 d) None of the above
7. For discrete random variable x, E(x) is defined as the sum of products of the different
values and the corresponding probabilities.
a) True b) False c) Both d) None
379
CA FOUNDATION STATISTICS
11. _______ is the expected value of (x – m)2, where m is the mean of the distribution.
a) Median b) Variance c) Standard deviation d) Mode
17. Let X be a random variable assuming values -3, 6 and 9 with probabilities 1/6, ½
and 1/3 respectively. Then find the value of E(X), E(X2) and E(2X+1)2
a) 5.5, 46.5, 209 b) 6.5, 45.5, 207 c) 6, 40, 200 d) None of these
380
CA FOUNDATION STATISTICS
18. A person tosses a coin once and is to receive Rs.4 for head and is to loss Rs.2 for tail.
Find the expectation and variance of his gain?
a) 1, 8 b) 1, 9 c) 2, 9 d) 2, 8
19. A player tossed two coins. If two heads show he wins Rs. 4. If one head shows he
wins Rs. 2, but if two tails show he pays Rs. 3 as penalty. The expected value of the
game to him (in rupees) is:
a) 1.25 b) 3.25 c) 4.25 d) 2.25
20. A player tosses 3 coins. He wins Rs. 16, if 3 heads appear, Rs.8 if two heads appear,
Rs. 4 if one head appears and Rs. 2 if no heads appear. Find his expected amount of
winning.
a) Rs. 5 b) Rs. 8.75 c) Rs. 6.75 d) Rs. 10
21. A player tosses three fair coins. He wins Rs. 12 if three tails occur, Rs. 7 if two tails
occur and Rs. 2 if only one tail occurs. If the game is fair, how much should he win
or lose in case no tail occurs?
a) Loss of Rs. 39 b) Income of Rs. 39
c) Neither Income nor Loss d) None of the above
22. A throws a coin 3 times. If he gets a head all the three times he is to get a prize of
Rs. 160. The entry fee for the game is Rs. 16. The mathematical expectation of A is:
a) 4 b) 7 c) 9 d) 10
23. If a player plays a game of chance where he can win Rs. 5000 with probability 0.6,
win Rs. 2500 with probability 0.3 and lose Rs. 15000 with probability 0.1, what is
his expected gain in one play of the game?
a) 2250 b) 2350 c) 2650 d) 2450
24. In a business venture, a man can make a profit of Rs. 2000 with a probability of 0.4
or a loss of Rs. 1000 with a probability of 0.6. What is his expected profit?
a) 100 b) – 100 c) – 200 d) 200
25. If it rains a taxi driver can earn Rs. 100 per day if it is fair, he can lose Rs. 10 per day.
If the probability of rain is 0.4 what is his expected gain?
a) 60 b) 34 c) – 6 d) – 40
381
CA FOUNDATION STATISTICS
26. A man runs an ice-cream parlor in a holiday resort. If the summer is mild he can
sell 2500 cups of ice cream; if it is hot, he can sell 4000 cups; if it is very hot, he can
sell 5000 cups. It is known that the probability, in any year, for the summer to be
mild is 1/7 and hot is 4/7. A cup of ice cream cost Rs. 2 and sells for Rs. 3.50. What
is his expectation?
a) 6100 b) 6107 c) 5489 d) 5987
27. The probability that a boy fishing at a particular place will catch 5, 6, 7, 8 fish are
0.4, 0.3, 0.2 and 0.1 respectively. What is the expected number of fish caught?
a) 3 b) 4 c) 5 d) 6
28. The probability that there is at least one error in an accounts statement prepared
by A is 0.2 and for B and C it is 0.25 and 0.4 respectively. A, B and C prepared 10,
16 and 20 statements respectively. Find the expected number of correct statements
in all
a) 32 b) 33 c) 34 d) None of the above
29. A man draws 2 balls from a bag containing 3 white and 6 black balls. If he is
to receive Rs. 14 for every white ball and Rs. 7 for every black ball; what is his
expectation?
a) 18.67 b) 19.25 c) 20.25 d) 25.19
30. A box contains 15 items of which 5 are defective. A man selects 3 items at random.
What is the expected number of defective items he has drawn?
a) 3 b) 2 c) 1 d) 0
31. Three items are drawn at random from a box containing 2 defective and 6 non
defective items. The expected number of non-defective items drawn is:
a) 4.25 b) 3.25 c) 2.25 d) 1.25
32. A number is chosen at random from the set 1, 2, 3, ...., 100 and another number
is chosen at random from the set 1, 2, 3 ..., 50. What is the expected value of their
product?
a) 5151 b) 5151/4 c) 5151/2 d) None of the above
382
CA FOUNDATION STATISTICS
33. If a person gains or loses an amount equal to the number appearing when a
balanced die is rolled once, according to whether the number is even or odd, how
much money can he expect per gain on long run?
a) 0.50 b) 0.75 c) 1 d) 2
34. In a random throw of 2 dice is the expectation of the sum of the points on them is :
a) 2.50 b) 3.50 c) 5.00 d) 7.00
35. A and B play for a prize of Rs.99. The prize is to be won by a player who first throw a
“3” with one dice. A first throws and if he fails, B throws and if he fails again throws
and so on. Find their respective expectation. (A’s expectation and B’s expectation)
a) 54 and 45 respectively b) 60 and 39 respectively
c) 45 and 54 respectively d) 39 and 60 respectively
36. A Bag Contains 2 white balls 3 black balls, four persons A,B,C,D in that order each
draws one balls and does not replace it. The first to draw a while balls receives Rs.
10. Determine their expectations.
a) 4, 3, 2, 1 respectively b) 3, 4, 2, 1 respectively
c) 4, 3, 1, 2 respectively d) 2, 3, 2, 1 respectively
383
CA FOUNDATION STATISTICS
44. If X and Y are two independent variables with means 50 and 120 and variances 10
and 12 respectively. Then the expectation of Z = 4X + 3Y is:
a) 360 b) 460 c) 560 d) 660
45. If x and y are random variables having expected values as 4.5 and 2.5 respectively,
then the expected value of (x - y) is
a) 2 b) 7 c) 6 d) 0
47. If two random variable x and y are related as y = – 3x + 4 and standard deviation
of x is 2, then the standard deviation of y is
a) -6 b) 6 c) 18 d) 3.50
384
CA FOUNDATION STATISTICS
a) b)
c) d)
50. The probability that there is at least one error in an account statement prepared by
A, B and C are 0.20, 0.30 and 0.10 respectively. If A, B and C prepare 60, 70 and 90
such statements, then the expected number of correct statements is:
a) 178 b) 170 c) 176 d) 180
385
CA FOUNDATION STATISTICS
HOMEWORK SECTION
1. There are six slips in a box and numbers 1, 1, 2, 2, 3, 3 are written on these slips.
Two slips are taken at random from the box. The expected values of the sum of
numbers on the two slips is:
a) 3 b) 5 c) 4 d) 7
2. The probability that there is at least one error in an account statement prepared by A
is 0.3 and for B and C, they are 0.4 and 0.45 respectively. A, B and C prepared 20, 10
and 40 statements respectively. The expected number of correct statements in all is:
a) 42 b) 24 c) 25 d) 52
3. Ms. G plays a game of tossing a dice. If the number is less than 3, she is getting Rs.
X, otherwise he has to pay Rs. 100. If the game is fair, find X.
a) 100 b) 150 c) 250 d) 200
4. A box contains 12 electric lamps of which 5 are defectives. A man selects three
lamps at random, what is the expected number of defective lamps in his selection?
a) 1.05 b) 1.25 c) 1.33 d) 2.50
6. An urn contains 6 white and 4 black balls. 3 balls are drawn without replacement.
What is the expected number of black balls that will be obtained?
a) 1/5 b) 4/5 c) 6/5 d) 7/5
386
CA FOUNDATION STATISTICS
10. E(13X + 9) = ?
a) E(13X) b) 13 E(X) + 9 c) 13 E(X) d) 13X + E(9)
11. A die is thrown once. What is the mathematical expectation of the number on the
die?
a) 1.67 b) 3.50 c) 2.50 d) 2.67
12. Two unbiased dice are thrown. The expected value of the sum of numbers on the
upper side is:
a) 3.5 b) 7 c) 10.5 d) 12
13. A player tosses two fair coins, he wins Rs. 5 if 2 heads appear, Rs. 2 if one head
appear and Re. 1 if no head occurs. Find his expected amount of winning.
a) Rs. 5.00 b) Rs. 2.50 c) Rs. 1.25 d) Rs. 10.00
15. A player tosses 3 fair coins. He wins Rs. 5 if 3 heads appear, Rs. 3 if two head
appears, Re. 1 if one head occurs. On the other hand, he losses Rs. 15 if three tail
occur. Find the expected gain of the player.
a) 0.15 b) 0.25 c) 0.35 d) 0.55
387
CA FOUNDATION STATISTICS
16. Let the distribution function of a random variable X be F(X) = P(X ≤ X). Then F(5) –
F(2) is:
a) P (2 < X < 5) b) P (2 < X ≤ 5)
c) P (2 ≤ X ≤ 5) d) P (2 ≤ X < 5)
17. A discrete random variable X takes three values -1, 2 and 3 with probabilities:
P(- 1) = 1/3, P(2) = 1/3 and P(3) = 1/3, then E(|X|) is:
a) 1 b) 2 c) 3 d) 3/2
18. An unbiased coin is tossed three times. The expected value of the number of heads
is:
a) 1.0 b) 1.5 c) 2.0 d) 2.5
19. There are 8 bolts in a packet of which 2 are defective. If 2 bolts are taken at random,
find the expected number of defective bolts.
a) 0.50 b) 1.00 c) 1.50 d) 2.00
20. There are 5 identical balls in a bag numbered 1, 1, 2, 2, 2. Two balls are selected
at random from the bag. Find the expectation of the total of the numbers on the
selected balls.
a) 3.00 b) 3.2 c) 5.40 d) 7.40
21. There are 100 tickets in a lottery of Re. 1 each. There is only one ticket in the lottery
bearing a price of Rs. 80. A person purchases 1 ticket. What is his expectation?
a) – 0.20 b) – 0.40 c) 2.00 d) 0.20
22. A person takes an insurance of Rs. 1000 and pays premium of Rs. 20. The probability
that any person of his age group dies within a year is 0.01. What is the expected
gain of the insurance company?
a) Rs. 20 b) Rs. 14 c) Rs. 12 d) Rs. 10
23. Two tickets are taken at random from 5 tickets numbered 1, 2, 3, 4, 5. What is the
expected value of the sum obtained on the two tickets?
a) 5 b) 6 c) 7 d) 36
388
CA FOUNDATION STATISTICS
24. There are 3 black and 2 white balls in a box. Two balls are taken from it. Rs. 24 is
given for each black ball drawn. What amount should be charged for each white
ball so that the game is fair?
a) Rs. 33 b) Rs. 36 c) Rs. 54 d) Rs. 74
25. There are 4 black and 2 white balls in a box. Two balls are taken at random from
it. If a person receives Rs. 4 for each white ball and loses Rs. 2 for each black ball
drawn, find the mathematical expectation of the amount received by him.
a) – 3 b) 0 c) 1 d) 5
26. There are 5 white and 3 black balls in a box. Three balls are taken at random from
the box. Find the expected number of black balls.
a) 2 / 3 b) 9/8 c) 2 d) 3
27. Two coins are tossed simultaneously. A person receives Rs. 8 for each head and
loses Rs. 10 for each tail. What is the expected value of the amount gained by him?
a) – 2.00 b) 2.00 c) 3.00 d) –3.00
28. Four coins are tossed simultaneously. What is the expected number of heads?
a) 1 b) 2 c) 2.5 d) 3
389
CA FOUNDATION STATISTICS
HOMEWORK SOLUTION
X P XP X2P
1 0.10 0.10 0.10
2 0.15 0.30 0.60
3 0.20 0.60 1.80
4 0.25 1.00 4.00
5 0.18 0.90 4.50
6 0.12 0.72 4.32
3.62 15.32
V(X) = E(X2) – [E(X)]2 = 15.32 – (3.62)2 = 15.32 – 13.1044 = 2.2156 = 2.22. Option b
390
CA FOUNDATION STATISTICS
8. E(X) = . Option d
9. 0 + 2K + 3K + K = 1
6K = 1
K = 1/6
P(X < 3) = P(X = 0) + P(X = 1) + P(X = 2) = 5K = 5/6. Option b
12. E(X) = 1/36 (2*1 + 3*2 + 4*3 + 5*4 + 6*5 + 7*6 + 8*5 + 9*4 + 10*3 + 11*2 + 12*1) =
252/36 = 7. Option b
14. E(X) = (-20 * 3/20) + (-10 * 1/5) + (30 * 0.5) + (75 * 1/10) + (80 * 1/20) = -3 – 2 + 15
+ 7.5 + 4 = 21.5. Option b
391
CA FOUNDATION STATISTICS
20. SS = {(1,1) (1,2) (1,2) (1,2) (1,2) (1,2) (1,2) (2,2) (2,2) (2,2)}
Sum = 2, 3, 4 – With probability 1 10, 6 10, 3 10 respectively.
E(X) = 2 x 1 + 3 x 6 + 4 x 3 = 2 + 18 + 12 = 3.2 . Option b
10 10 10 10
23. SS = (11) (12) (13) (14) (15) (22) (23) (24) (25) (33) (34) (35) (44) (45) (55)
Sum = 2, 3, 4, 5, 6, 4, 5, 6, 7, 6, 7, 8, 8, 9, 10 – With probability 1/15 each
E(Sum) = 1/15 (2+3+4+5+6+4+5+6+7+6+7+8+8+9+10) = 90/15 = 6. Option b
392
CA FOUNDATION STATISTICS
Balls X P XP
Two White 2*4=8 2C2 / 6C2 = 1/15 8/15
1 White 1 Black 4–2=2 4C1 * 2C1 / 6C2 = 8/15 16/15
Two Black 2 * -2 = - 4 4C2 / 6C2 = 6/15 -24/15
0
393
CA FOUNDATION STATISTICS
2. An unbiased coin is tossed three times. Find the expected value of the number of
heads.
a) 1.00 b) 1.50 c) 1.67 d) 2.50
4. An unbiased coin is tossed three times. Find the standard deviation of the number
of heads.
a) 0.67 b) 0.87 c) 0.97 d) 1.00
394
CA FOUNDATION STATISTICS
8. In a business venture, a man make a profit of Rs. 50,000 or incur a loss of Rs. 20,000.
The probabilities of making profit or incurring loss, from the past experience, are
known to be 0.75 and 0.25 respectively. What is his expected profit?
a) Rs. 50,000 b) Rs. 32,500 c) Rs. 25,300 d) Rs. 52,300
9. A box contains 12 electric lamps of which 5 are defectives. A man selects three
lamps at random. What is the expected number of defective lamps in his selection?
a) 1.25 b) 2.25 c) 1.55 d) 2.55
10. G draws 2 balls from a bag containing 3 white and 5 red balls. She gets Rs. 500 if
she draws a white ball and Rs. 200 if she draws a red ball. What is her expectation?
a) Rs. 400 b) Rs. 625 c) Rs. 225 d) Rs. 465
11. A number is selected at random from a set containing the first 100 natural numbers
and another number is selected at random from another set containing the first 200
natural numbers. What is the expected value of the product?
a) 5075.25 b) 5750.25 c) 7550.75 d) 2575.75
12. A die is thrown repeatedly till a “three” appears. Find the expected number of
throws, if X denotes the probability of getting a “Three” in a throw.
a) X b) X! c) 1/X d) XX
13. If a random variable X assumes the values 0, 1 and 2 with probabilities 0.30, 0.50
and 0.20 respectively, then find its expected value.
a) 0.90 b) 1.00 c) 1.50 d) 2.00
395
CA FOUNDATION STATISTICS
16. A bag contains 6 white and 4 red balls. If a person draws 2 balls and receives Rs.
10 and Rs. 20 for a white and red balls respectively, then his expected amount is:
a) Rs. 25 b) Rs. 26 c) Rs. 28 d) Rs. 29
For Q. No. 17 - 20
A random variable X had the following probability distribution:
X 0 1 2 3 4 5 6 7
P(X) 0 2K 3K K 2K K2 7K2 2K2+K
19. P (X ≥ 4)
a) 0.50 b) 0.40 c) 0.45 d) 0.55
ANSWER KEY
1 2 3 4 5 6 7 8 9 10
B B B B B A A B A B
11 12 13 14 15 16 17 18 19 20
A C A B D C B A B C
396
CA FOUNDATION STATISTICS
EXPLANATORY
ANSWERS
6. If all the values taken by a random variable are equal, its variance or standard
deviation is zero. Option a
397
CA FOUNDATION STATISTICS
14. 2X + 3Y + 14 = 0
4.V(X) = 9.V(Y)
24 = 9 V(Y)
V(Y) = 24/9 = 8/3
Option b
15. K + 2K + 3K + 5K + K = 12K = 1
K = 1/12
∑X.P(X) = K + 4K + 12K + 30K + 8K = 55K
∑X2.P(X) = K + 8K + 48K + 180K + 64K = 301K
V(X) = 301K – (55K)2 = 25.08 – 20.97 = 4.11
Option d
398
CA FOUNDATION STATISTICS
399
CA FOUNDATION STATISTICS
THEORETICAL DISTRIBUTION
THEORETICAL DISTRIBUTION
(Exist in theory as well as real life)
1. Theoretical Distribution is a distribution where the values of a variable are distributed
according to some definite mathematical laws.
THEORETICAL DISTRIBUTION
400
CA FOUNDATION STATISTICS
2. This distribution is a discrete probability Distribution where the variable ‘x’ can
assume only discrete values i.e. x = 0, 1, 2, 3,....... n
(i) Each trial must be associated with two mutually exclusive & exhaustive
outcomes – SUCCESS and FAILURE. Usually the probability of success is denoted
by ‘p’ and that of the failure by ‘q’ where q = 1-p and therefore p + q = 1.
(ii) The trials must be independent under identical conditions.
(iv) Probability of success and failure remains unchanged throughout the process.
Note 1 : A ‘trial’ is an attempt to produce outcomes which is neither sure nor impossible
in nature.
Note 2 : The conditions mentioned may also be treated as the conditions for Binomial
Distributions.
401
CA FOUNDATION STATISTICS
Case I : When (n + 1).p is not an integer the distribution is uni-modal and the
greatest integer contained in (n+1) p is the value of the mode.
E.g. n = 6; p = 1/3; find modal value.
Case II: When (n + 1)p is an integer; the distribution is bi-modal and the modal
values are (n+1)p and (n+1)p – 1 respectively.
E.g. n = 7 and p = 0.5; find mode or modes.
(x) The method applied for fitting a binomial distribution to a given set of data is
called “Method of Moments”.
402
CA FOUNDATION STATISTICS
1. If a discrete random variable ‘x’ assumes n possible values namely x1 x2, ....xn
with equal probabilities, then the probability of its taking any particular value is
always constant and is equal to (1/n). The p.m.f (Probability Mass Function) of such
distribution is given by f(x) = 1/n where x = x1 x2, .... xn. These distributions are known
as Uniform Distribution because the probability is uniform for all values of x.
x 1 2 3 4 5 6
p
403
CA FOUNDATION STATISTICS
CLASSWORK SECTION
1. A discrete random variable x follows uniform distribution and takes only the values
5, 7, 12, 15, 18. Find P (x >10).
a) 2/5 b) 3/5 c) 4/5 d) None of the above
2. If a discrete random variable x follows uniform distribution and assumes only the
values 8, 9, 11, 15, 18, 20. Then find P(|x – 14| < 5).
a) 1 b) ½ c) 2/3 d) 1/3
3. A random variable x follows Binomial distribution with mean 2 and variance 1.2.
Then the value of n is:
a) 3 b) 5 c) 7 d) 6
7. A student obtained the following results: For the binomial distribution mean = 4,
variance = 3. Comment on the accuracy of his results.
a) Correct results, with n = 12
b) Correct results, with n = 16
c) Wrong calculations, as q > 1
d) None of the above
8. An unbiased dice is tossed 500 times. The standard deviation of the number of
‘sixes’ in these 500 tosses are:
a) 50/6 b)51/6 c)15/6 d) None of these
404
CA FOUNDATION STATISTICS
b)
c)
d)
11. If mean and variance are 5 and 3 respectively then relation between p and q is
a) p > q b) p < q c) p = q d) p is symmetry
12. If x is a binomial variable with a parameter n = 15 and p = 1/3, then the value of
mode of the distribution
a) 5 b) 5 and 6 c) 5.5 d) 6
13. The mode of Binomial distribution for which mean is 4 and variance is 3, is equal to?
a) 4 b) 4.25 c) 4.5 d) 4.1
405
CA FOUNDATION STATISTICS
20. The probability that a student is not a swimmer is 4/5, then the probability that out
of five students four are swimmers is
a) b) c) d) None of these
21. The overall percentage of failures in a certain examination is 60. What is the
probability that out of a group of 6 candidates at least 5 passed the examination?
a) b) c) d)
22. If 15 dates are selected at random, then the probability of getting two Sundays is:
a) 0.29 b) 0.39 c) 0.49 d) 0.23
23. Assuming that half the population is vegetarian and each of 128 investigators
takes a sample of 10 individuals to see whether they are vegetarian. How many
investigators would you expect to report that 2 people or less vegetarians.
a) 5 b) 7 c) 9 d) 11
An experiment succeed twice as often as it fails. What is the probability that in the
next 5 trials, there will be:
24. 3 successes.
a) 77 / 243 b) 81 / 243 c) 80 / 243 d) None of the above
A man takes a step forward with a probability 0.6 and a step backward with a
probability of 0.4. Find the probability that at the end of 11 steps, the man is:
406
CA FOUNDATION STATISTICS
c) d)
c) d)
c) d)
29. If x and y are 2 independent binomial variable with parameters 6 and ½, 4 and ½
respectively, what is P(x + y ≥ 1)?
a) 1023/1024 b) 1056/1923
c) 1234/2678 d) None of the above
30. X is binomial variable with n = 20, what is the mean of X if it is known that x is
symmetric?
a) 5 b) 10 c) 2 d) 8
31. If X ~ B (n, p), what would be greatest value of the variance of x when n =16?
a) 2 b) 4 c) 8 d) 6
32. if x is a binomial variate with parameter 15 and 1/3, what is the value of mode of
the distribution
a) 5 and 6 b) 5 c) 5.50 d) 6
33. What is the probability of getting 3 heads if 6 unbiased coins are tossed
simultaneously?
a) 0.50 b) 0.20 c) 0.3125 d) 0.6875
407
CA FOUNDATION STATISTICS
34. What is the probability of making 3 correct guesses in 5 True – False answer type
questions?
a) 0.3125 b) 0.5676 c) 0.6875 d) 0.4325
35. X is binomial variable such that 2P(x=2) = P(x=3) and mean of X is known to be 10/3.
What would be the probability that X assumes at most the value 2?
a) 16/81 b) 17/81 c) 47/273 d) 26/243
36. Assuming that one-third of the population is tea drinkers and each of 1000
enumerators takes a sample of 8 individuals to find out whether they are tea
drinkers or not, how many enumerators are expected to report that five or more
people are tea drinkers?
a) 100 b) 95 c) 88 d) 90
37. Out of 128 families with 4 children each, how many are expected to have at least
one boy and one girl?
a) 100 b) 105 c) 108 d) 112
38. In 10 independent rollings of a biased die, the probability that an even number will
appear 5 times is twice the probability that an even number will appear 4 times.
What is the probability that an even number will appear twice when the die is rolled
8 times?
a) 0.0304 b) 0.1243 c) 0.2315 d) 0.1926
Calculation of Parameters
39. A binomial random variable x satisfies the relation 9P(x = 4) = P(x =2) when n = 6.
Find the value of the parameter ‘P’?
a) 1 / 2 b) 1 / 3 c) 1 / 4 d) 1 / 5
408
CA FOUNDATION STATISTICS
Theoretical Aspect
41. The probability distribution whose frequency function f(x) = 1/n,
x = x1, x2, ..., xn is known as:
a) Binomial distribution b) Poisson distribution
c) Normal distribution d) Uniform distribution
43. The no. of points obtained in a single throw of an unbiased die follows
a) Binomial Distribution b) Poisson Distribution
c) Uniform Distribution d) None
44. The no. of points obtained in a single throw of an unbiased, die has frequency
function
a) f(x) = ¼ b) f(x) = 1/5 c) f(x) = 1/6 d) None
45. If for any distribution f(x) = 1/n, then mean of the distribution would be:
a) n/2 b) (n + 1)/2 c) (n – 1)/2 d) None of the above
409
CA FOUNDATION STATISTICS
410
CA FOUNDATION STATISTICS
56. The maximum value of the variance of a Binomial distribution with parameters
and pis :
a) b) c) d)
411
CA FOUNDATION STATISTICS
63. For n independent trials in Binomial distribution, the sum of the powers of p and q
is always n,whatever be the number of successes.
a) True b) False
c) both of a) and b) above d) None of the above
64. The results of ODI matches between India and Pakistan follows:
a) Binomial distribution b) Poisson distribution
c) Normal distribution d) (b) or (c)
66. For a binomial distribution if variance = mean/2, then the values of n and p will be
a) 1 and 1/2 b) 2 and 1/2 c) 3 and ½ d) Any value and 1/2
412
CA FOUNDATION STATISTICS
75. When there are a fixed number of repeated trial of any experiments under identical
conditions for which only one of two mutually exclusive outcomes, success or failure
can result in each trail then, we use
a) Normal Distribution
b) Binomial Distribution
c) Poisson Distribution
d) None
413
CA FOUNDATION STATISTICS
B. POISSON DISTRIBUTION
1. The probability of ‘x’ no. of success or the p.m.f (Probability Mass Function) of a
Poisson Distribution is given by
P(x) = ( = גm)
where x = desired no. of success.
e 2.71828
Note1 : ( = m) Mean = variance = parameter of the distribution
Note2: is a constant and the value of which can be obtained from the table.
Note3: When the parameter ‘m’ is not provided but n and p are provided we shall
use m = np for evaluating the parameter.
2. It is a discrete probability distribution where the variable ‘x’ can assume values ‘x’=
0, 1, 2, 3,......∞.
3. This distribution is a limiting case of Binomial Distribution when
(i) n → ∞ (i.e. no. of trials become very large)
(ii) p → 0, (i.e. probability of success is very small)
(iii) q → 1, (i.e. probability of failure is very high)
(iv) np is finite and constant which is denoted by ‘m’ i.e. np = m or
5. The conditions under which the Poisson Distribution is used or the condition for
Poisson Model are as follows:
(i) The probability of having success in a very small time interval (t, t + dt) is K. dt
(where k > 0 and is constant)
In other words, probability of success in a very small time interval is directly
proportional to time internal dt.
(ii) The probability of having more than one success in this time interval is very
low.
(iii) Statistical independence is assumed i.e. the probability of having success in
414
CA FOUNDATION STATISTICS
this time interval is independent of time ‘t’ as well as of the earlier success.
Case I : When ‘m’ is not an integer then the distribution is uni-modal and the
value of the mode will be highest integral value contained in ‘m’.
E.g. m = 5.6 then modal value is 5 (greatest integer contained in 5.6)
Case II: When ‘m’ is an integer; the distribution is bimodal and the modal values
are m, m – 1
E.g . if ‘m’ = 4 (an integer, hence the distribution is bimodal and the modes
are 4 and 4 – 1 i.e. 4 and 3)
(viii)
Additive Property of Poisson Distribution: If ‘x’ and ‘y’ are two independent
Poisson Variates with parameters(m1) and (m2) respectively then (x + y) will
also follow a Poisson Distribution with parameter (m1 + m2). Symbolically the
fact is expressed as follows: X ~ P (m1), Y ~ P (m2)
X + Y ~ P(m1 + m2) provided x and y are independent
415
CA FOUNDATION STATISTICS
CLASSWORK SECTION
1. If a random variable X follows Poisson Distribution, such that P(X = 1) = P(X= 2), then
its mean and variance are:
a) 4, 4 b) 3, 3 c) 2, 2 d) 5, 5
3. For a Poisson Variate X, if P (X= 0) = 0.2, then the variance of the distribution is:
a) b) c) d) None of the above
7. P (X = 1)
a) 0.1464 b) 0.0732 c) 0.3725 d) 0.3679
8. P (X is at least = 3)
a) 0.7621 b) 0.2671 c) 0.6721 d) 0.08025
9. P(X is almost 2)
a) 0.3297 b) 0.2549 d) 0.2379 d) 0.91975
416
CA FOUNDATION STATISTICS
Between 4 and 5 PM, the average number of phone calls per minute coming into the
switchboard of the company is 3. Find the probability that in one particular minute
there will be: (Given e-3 = 0.0498)
16. Find the probability that the number of accident in a week exceeds 2.
a) 0.3235 b) 0.523 c) 0.352 d) None of the above
17. A radioactive source emits on the average 2.5 particles per second. Calculate that
2 or more particles will be emitted in an interval of 4 seconds.
a) b) c) d) None of the above
18. If the standard deviation of a Poisson variate X is 2, what is P (1.5 < X <2.9)?
a) 0.231 b) 0.158 c) 0.15 d) 0.1464
19. If the mean of a Poisson variable X is 1, what is P (X = takes the value at least 1)?
a) 0.456 b) 0.821 c) 0.632 d) 0.254
417
CA FOUNDATION STATISTICS
20. If X ~ P (m) and its coefficient of variation is 50, what is the probability that x would
assume only non-zero values?
a) 0.018 b) 0.982 c) 0.989 d) 0.976
21. If 1.5 per cent of items produced by a manufacturing unit are known to be defective,
what is the probability that a sample of 200 items would contain no defective item?
a) 0.05 b) 0.15 c) 0.20 d) 0.22
23. A renowned hospital usually admits 200 patients every day. One per cent patients,
on an average, require special room facilities. On one particulars morning, it was
found that only one special room is available. What is the probability that more
than 3 patients would require special room faculties?
a) 0.1428 b) 0.1732 c) 0.2235 d) 0.3450
25. 0 (Zero)
a) 0.3679 b) 0.2498 c) 0.3598 d) None of the above
27. In a company manufacturing toys, it is found that 1 in 500 is defective. Find the
probability that there will be at the most two defectives in a sample of 2000 units.
[Given e-4 = 0.0183]
a) 0.2597 b) 0.3549 c) 0.2549 d) 0.2379
418
CA FOUNDATION STATISTICS
Miscellaneous Problems
28. A local electric appliance has found from experience the demand for Tube light is
distributed as Poisson distribution with a mean of 4-tube light per week. [Given
e-4 = 0.0183] if the shop keeps 6 tubes during a particular week, what is the probability
that the demand will exile the supply during that week
a) 0.1114 b) 0.2224 c) 0.1525 d) 0.1254
29. A car hire firm has 2 cars which is hired out every day. The number of demand per
day for a car follows Poisson distribution with mean 1.20. What is the proportion of
days on which some demand is refused?
(Given e1.20 = 3.32)
a) 0.25 b) 0.3012 c) 0.12 d) 0.03
Theoretical Aspects
31. Poisson distribution is a _____________ probability distribution.
a) Continuous b) Discrete
c) Both of a) and b) above d) Neither a) nor b) above
419
CA FOUNDATION STATISTICS
38. When the number of trials is large and probability is small then the distribution
used is in place of binomial distribution is:
a) Poisson Distribution b) F – Distribution
c) t – Distribution d) Normal Distribution
420
CA FOUNDATION STATISTICS
421
CA FOUNDATION STATISTICS
1. It is a continuos probability distribution where the variable ‘X’ can assume any value
between –
where = mean
= Standard Division
Note 1 : and are the two parameters of Normal Distribution and hence it is
bi-parametric in nature.
f(z) =
Note2 : N(0,1) implies Standard Normal Distribution with Mean = 0 and S.D. = 1.
422
CA FOUNDATION STATISTICS
423
CA FOUNDATION STATISTICS
424
CA FOUNDATION STATISTICS
1. It is a bell shaped curve symmetrical about the line x = and assymptotic to the horizontal
axis (x = axis)
3. As the distance from the mean increases, The curve comes closer to the horizontal axis (x
= axis)
5. The two points of inflection of the normal curve are at x = and x = respectively
where the normal curve changes its curvature.
6. The same points of inflection under standard normal curve are at z = – 1 and z = 1.
8. The distribution has two parameters and . Where = mean = standard deviation.
Hence normal is bi-parametric distribution.
9. The normal curve has a single peak. Hence it is unimodal and mean. Median and mode
coincide. at x = .
425
CA FOUNDATION STATISTICS
15. QD : MD : SD = 10 : 12 : 15
16. (i) The total area under the Normal or Standard Normal Curve = 1 ( Total Probability =
1), Symbolically,
19. Symbilically
426
CA FOUNDATION STATISTICS
20. The curve being symmetrical area of portions cut off from right and left of X = (or z = 0)
are equal.
21. The probability that a normal variate Z will take a value less than or equal to a particular
value (say Z = K) will be denoted by (K) = P( Z K)
Note : The probability of success is calculated by evaluating the areas from the
standard normal curve, and the areas are obtained from normal table.
22. % Distribution of areas under Normal Curve / Standard Normal Curve
C-I
P(– 1 Z 0) = .3413,
P(0 Z 1) = .3413.
P(– 1 Z 1) = .6826.
68.26% of total area lies between Z = – 1 and Z = + 1 or X = and Z =
427
CA FOUNDATION STATISTICS
C-II
P(– 2 Z 0) = .4772.
P(0 Z 2) = .4772.
P(– 2 Z 2) = .9544.
95.44% of total area lies between Z = – 2 and Z = + 2 or X = and X =
C-III
P(– 3 Z 0) = .4987.
P(0 Z 3) = .4987.
P(– 3 Z 3) = .9974.
99.74% of total area lies between Z = – 3 and Z = + 3 or X = and X =
26. F(X) = P(X C) will imply the area under the probability curve to the left of vertical line at
C.
428
CA FOUNDATION STATISTICS
D. Probability that X lies between any two specified values C and D within the range (“b
and a”) is given by :
C-II
C-III
429
CA FOUNDATION STATISTICS
C-IV
C-V
C-VI
430
CA FOUNDATION STATISTICS
C-VII
C-VIII
NOTE:
1) If the -ve and +ve values happen to be identical .i.e P in such a case the total
area will be = 2P
2) When in the problem the magnitude of the given area is greater than “.5” it implies
that area from - to that particular value of ‘z’ is provided, for evaluating the area
from 0 to that particular value of ‘z’ subtract .5 from it.
431
CA FOUNDATION STATISTICS
30. Condition under which “Binomial” and “Possion” approaches “Normal Distribution”
Case I
Normal Distribution as a limiting case of Binomial Distribution when
a) n, the number of trials is infinitely large I.e. n
b) Neither p(or q) is very small, i.e. p and q are fairly near equal
c) In other words, if neither p nor q is very small but n is sufficiently large Binomial
Distribution approaches Normal Distribution.
d) In such a case, the Standard Normal Variate is given by
Case II
Poission Distribution tends to Normal Distribution with standardised Variable
Where m = Mean = = Variance
432
CA FOUNDATION STATISTICS
CLASSWORK SECTION
2. If the mean deviation of a normal variable is 16, what is its quartile deviation?
a) 10 b) 15 c) 13.5 d) 12.05
3. If the inflexion points of normal distribution are 6 and 14, find its standard deviation
a) 4 b) 6 c) 10 d) 12
4. If the quartile deviation of a normal curve is 4.05, then its mean deviation is:
a) 5.26 b) 6.24 c) 4.24 d) 4.80
5. If the 1st quartile and mean deviation about median of a normal distribution are
13.25 and 8 respectively, then the mode of the distribution is:
a) 10 b) 12 c) 15 d) 20
6. If the two quartiles of normal distribution are 14.6 and 25.4 respectively, what is
the standard deviation of the distribution?
a) 6 b) 8 c) 9 d) 10
7. For normal distribution with mean =150 and S.D = 45; find Q1 and Q3
a) 119.35 and 190.65 b) 119.65 and 180.35
c) 180.35 and 119.65 d) 123.45 and 183.65
a) 10 b) 4 c) 40 d) None of these
433
CA FOUNDATION STATISTICS
9. If for normal distribution Q1 = 54.52 and Q3 = 78.86 then the median of the
distribution is
a) 12.17 b) 39.43 c) 66.69 d) None of these
10. What is the first quartile of x having the following probability density function?
a) 4 b) 5 c) 5.95 d) 6.75
11. If x and y are 2 independent normal variable with mean 10 and 12 and SD 3 and
4 respectively, then (x + y) is also a normal distribution with mean ____ and SD
_____.
a) 22, 7 b) 22, 25 c) 22, 5 d) 22, 49
12. If the mean deviation of a normal variable is 16, what is the standard deviation?
a) 9 b) 6 c) 20 d) 8
13. If the two quartiles of N (µ, ) are 14.6 and 25.4 respectively, what is the standard
deviation of the distribution?
a) 9 b) 6 c) 10 d) 8
Area under Normal / Standard Normal Curve
Find the area under the standard normal curve for the following values of standard
normal variate :
14. Between
a) 0.5000 b) 0.4192 c) 0.1942 d) 0.2192
15. Between
a) 0.2323 b) 0.2159 c) 0.2594 d) 0.2123
16.
a) 0.0245 b) 0.3256 c) 0.0392 d) 0.0540
434
CA FOUNDATION STATISTICS
17.
a) 0.9750 b) 0.9580 c) 0.9980 d) 0.9780
18. If the standard normal curve between z = 0 to z =1 is 0.3413, then the value of (1)
is:
a) 0.5000 b) 0.8413 c) - 0.5000 d) 1
20. If X ~ N (3, 36) and Y ~ N (5, 64) are two independent normal variate then if (x+y) ~
N (8, A) also follows normal distribution, then value of A will be
a) 100 b) 10 c) 64 d) 36
21. If follows the Normal Distribution with Mean 12 and Variance 16, find
a) 0.22750 b) 0.25789 c) 0.02275 d) 0.03357
22. If a random variable x follows normal distribution with mean as 120 and standard
deviation as 40, what is P (x≤ 150 / x > 120)?
a) 0.85 b) 0.90 c) 0.95 d) 0.5468
23. For certain normal variate x, the mean is 12 and S.D is 4 find P(X≥20):
[Area under the normal curve from z=0 to z=2 is 0.4772]
a) 0.5238 b) 0.0472 c) 0.7272 d) 0.0228
24. if X follows normal distribution with µ = 50 and = 10, what is the value of
P (x≤60 / x>50)?
a) 0.8413 b) 0.6828 c) 0.1587 d) 0.7256
25. In a sample of 800 students, the mean weight and standard deviation of weight are found
to be 50 kg and 20 kg respectively. On the assumption of normality, what is the number
of students weighting between 46kg and 62kg? Given area of the standard normal curve
between z = 0 to z = 0.2 = 0.0793 and area between z = 0 to z = 0.60 = 0.2257.
a) 250 b) 244 c) 240 d) 260
435
CA FOUNDATION STATISTICS
26. The salary of workers of a factory is known as to follow normal distribution with
an average salary of ` 10,000 and standard deviation of salary as ` 2,000. If 50
workers receive salary more than ` 14,000, then the total no of workers in the
factory is
a) 2193 b) 2000 c) 2200 d) 2500
27. For a normal distribution with mean as 500 and SD as 120, what is the value of
k so that the interval [500, k] covers 40.32 per cent area of the normal curve?
Given (1.30) = 0.9032
a) 740 b) 750 c) 656 d) 800
28. If the weekly wages of 5000 workers in a factory follows normal distribution with
mean and SD as `700 and `50 respectively, what is the expected number of workers
with wages between ` 660 and ` 720?
a) 2050 b) 2200 c) 2218 d) 2300
29. 50 per cent of a certain product have weight 60kg or more whereas 10 per cent gave
weight 55 kg or less. On assumption of normality, what is the variance of weight?
Given (1.28) = 0.90.
a) 15.21 b) 9.00 c) 16.00 d) 22.68
30. The I.Q. of army volunteers in a given year are normally distributed with mean
=110 and standard deviation =10. The army wants to give advance training to 20%
of those recruits with the highest IQ. Find that lowest score acceptable for the
advanced training? [ø (0.84) = 0.80]
a) 118.4 b) 116.4 c) 108.4 d) 101.6
31. The sample of 100 dry battery cell tested to find the length of life produced the
following results:
What is percentage of battery cells are expected to have life less than 6 hours (Area
under normal curve from z = 0 to z=2 is 0.4772).
a) 2.28% b) 2.56% c) 4.56% d) 1.93%
436
CA FOUNDATION STATISTICS
Theoretical Aspects
32. If X ~ N (µ, ) than the range of the variable X is ___________
(a) 0 < X < (b) - < X <
(c) - < X < 0 (d) None
33. In Normal distribution, the maximum value of the p.d.f. f(x) is at X = __________
(a) µ (b) (c) 0 (d) None
ii.
iii.
a) All of I, II & III above b) All but II above
c) All but I above d) None of I, II, III above
437
CA FOUNDATION STATISTICS
37. Probability density function is associated with which of the following variable?
a)
Discrete
b) Continuous
c) Both Discrete and Continuous
d) Neither Discrete nor Continuous
a) Normal b) Gaussian
c) Rectangular d) Both of a) and b) above
c) d)
438
CA FOUNDATION STATISTICS
43. What are the mean and standard deviation of standard normal distribution?
a) 0 and 1 b)
c) d) The distribution has no parameters
44. If a random variable is normally distributed with mean and standard deviation
is called:
a) Normal Variate b) Standard Normal Variate
c) Chi-square Variate d) Uniform Variate
46. The curve of which of the following distribution is uni-modal and bell shaped with
the highest point over the mean
a) Poisson b) Binomial c) Normal d) All of the above
48. The curve of which of the following distribution(s) has single peak?
a) Poisson b) Binomial
c) Normal d) Both a) and b) above
51. In Normal distribution as the distance from the _______ increases, the curve comes
closer and closer to the horizontal axis.
a) Standard Deviation b) Mean
c) Both a) and b) above d) Neither a) nor b) above
439
CA FOUNDATION STATISTICS
54. Because of the symmetry of Normal distribution the median and the mode have the
______ value as that of the mean.
a) Greater b) Smaller c) Same d) Nothing can be said
55. In Normal distribution the probability has the maximum value at the
a) Mode b) Median c) Mean d) All of the above
56. The mean deviation about median of a Standard Normal Variate is:
a) 0.675 b) 0.675 c) 0.80 d) 0.80
60 If neither p nor q is very small but n is sufficiently large, then Binomial distribution
is very closely approximated by distribution.
a) Poisson b) Normal c) t d) None.
440
CA FOUNDATION STATISTICS
64. The symbol (a) indicates the area of standard normal curve between
a) 0 to a b) a to c) - to a d) - to
65. If the area of standard normal curve between z = 0 to z = 1 is 0.3413, the value of
(1) is
a) 0.5000 b) 0.8413 c) -0.5000 d) 1
66. For which distribution, whatever may be the parameter of distribution, it has same
shape,
a) Normal b) Binomial c) Poisson d) None
69. An approximate relation between Quartile deviation (QD) and Standard Deviation
(SD) of normal distribution is:
a) 5 QD = 4 SD b) 4 QD = 5 SD
c) 2 QD = 35 SD d) 3 QD = 2 SD
441
CA FOUNDATION STATISTICS
71. The probability that x assumes a specified value in continues probability distribution
is _________.
(a) 1 (b) 0
(c) -1 (d) None
Where a is constant find value of a?
(a) (b)
442
CA FOUNDATION STATISTICS
32 b 33 a 34 c 35. a 36. c
37. b 38. b 39. b 40. d 41. c
42. b 43. d 44. b 45. c 46. c
47. a 48. c 49. d 50. d 51. b
52. c 53. c 54. c 55. d 56. d
57. a 58. d 59. b 60. b 61. a
62. a 63. b 64. c 65. b 66. a
67. b 68. c 69. d 70. c 71. b
443
CA FOUNDATION STATISTICS
Central Moments
A central moment is the expected value of a specified integral power of the deviation
of the variable from the mean. Using central moments of a variable (or probability
distribution), different properties can be studied.
In general rth central moment can be obtained by the following formula. It is denoted
by µr.
Where r = 1, 2, 3, 4 ... ... ... ... ... .
Skewness:
In simple words skewness means “Lack of symmetry”. It is asymmetry in the data or
probability distribution. If skewness is present, the curve appears skewed either to
the left or to the right, unlike bell-shaped in the case of symmetrical distribution.
Kurtosis:
In simple words “Kurtosis is a measure of peakedness of the curve” Kurtosis is also
said to be a measure of “tailedness” of the data or probability distribution.
444
CA FOUNDATION STATISTICS
Measure of skewness:
Measure of kurtosis:
Note:
1. Generally ß1 and ß2 are used for measuring skewness and kurtosis.
2. The Sign outside in the formula of γ1 is same as that of µ3.
Formula for finding first four central moments for the Binomial distribution, Poisson
distribution and Normal distribution are as follows:
Central Moments
µ2 = npq µ2 = m µ2 = σ2
Note:
1. First central moment for any probability distribution is always 0.
2. Second central moments (µ2) for any probability distribution is always the variance of
the distribution.
3. For normal distribution, its odd numbered central moments (i.e. µ1, µ3... ...) are always
0.
4. For normal distribution µ4 = 3
5. For Poisson distribution, its second and third central (i.e. µ2 and µ3) are always equal.
445
CA FOUNDATION STATISTICS
Measures of skewness and kurtosis for Binomial distribution, Poisson distribution and
Normal distribution are as follows:
kurtosis
446
CA FOUNDATION STATISTICS
Examples :
1. For a binomial distribution with parameters 9 and . Find measure of skewness (γ1) and
measure of Kurtosis (γ2). Also decide the type of distribution from the value of γ2.
Sol. Here n = 9 and p =
.
Since, the value at γ2 is negative (i.e. γ2 < 0), therefore it is Platykurtic distribution.
3. For normal distribution having mean = 2 and variance = 4, find the central moment µ 4
.
Also find µ3.
Sol. Here, Variance σ2 = 4
Now, for normal distribution,
µ4 = 3 σ2 = 3 (4) = 12
µ3 = 0 (Since odd numbered central moments of normal distribution are always 0.)
4. For binomial distribution with = 4 and variance = 3.Find third central moment.
Sol. Here, mean = np = 4 & variance = npq = 3
Now, for binomial distribution,
µ3 = npq (1 - 2p)
447
CA FOUNDATION STATISTICS
448
CA FOUNDATION STATISTICS
HOMEWORK SECTION
BINOMIAL DISTRIBUTION
1. Seven coins are tossed simultaneously the probability of at least five heads is
(a) 12/128 (b) 17/128
(c) 21/128 (d) 29/128
2. The probability that a person hits a target is 1/3, the probability that he will hit the
target in 3 times out of 5 trials is
(a) 12/243 (b) 17/243
(c) 40/243 (d) 60/243
3. The probability of a smoker from a group of persons is 2/3. Five persons are selected
at random from the group, the probability that at least 4 of them are smokers is
(a) 112 / 243 (b) 117/243
(c) 221/243 (d) 32/243
4. Assuming that boys and girls are equally probable, the number of families out of
1600 families each having 5 children, with 3 boys are
(a) 800 (b) 500
(c) 200 (d) 100
5. Assuming that boys and girls are equally probable, the number of families out of
1600 families each having 5 children, with 5 girls are
(a)
50 (b) 100
(c)
800 (d)
25
6. Assuming that boys and girls are equally probable, the number of families out of
1600 families each having 5 children, with 2 or 3 boys are
(a) 800 (b) 600
(c) 1000 (d) 100
7. Assuming that boys and girls are equally probable, the number of families out of
1600 families each having 5 children, with all children of the same sex are
(a)
800 (b) 600
(c)
1000 (d) 100
449
CA FOUNDATION STATISTICS
8. Five dice are thrown simultaneously for 96 times. An even number is regarded as
success, the expected frequencies of all five successes are
(a) 8 (b) 16
(c) 3 (d) 32
9. A and B play a game in which the probability of winning of A is 2/3, the probability
that A will win at least 6 times out of 8 trials is
(a) 512/ 2187 (b) 64/2187
(c) 32/2187 (d) 1024/2187
10. The mean of Binomial distribution is 4 and its variance is 2.4, the value of p is
(a) 0.4 (b) 0.5
(c) 0.3 (d) 0.2
11. The mean of Binomial distribution is 4 and its variance is 2.4, the value of n is
(a) 4 (b) 5
(c) 8 (d) 10
12. It is given that on an average it rains for 10 days out of 30 days. The probability that
it will rain for at least 3 days in a week is
(a) 219/729 (b) 313/729
(c) 330/729 (d) 335/729
13. A and B play 12 games of chess of which 6 are won by A, 4 by B and 2 end in a tie.
They agree to play 3 more games, the probability that A wins all the three games is
(a) 7/8 (b) 5/8
(c) 6/8 (d) 1/8
14. A and B play 12 games of chess of which 6 are won by A, 4 by B and 2 end in a tie.
They agree to play 3 more games, the probability that two games end in a tie is
(a) 5/36 (b) 5/72
(c) 5/144 (d) 5/60
450
CA FOUNDATION STATISTICS
15. A and B play 12 games of chess of which 6 are won by A, 4 by B and 2 end in a tie.
They agree to play 3 more games, the probability that A and B win alternatively is
(a) 5/36 (b) 5/72
(c) 5/144 (d) 5/60
16. A and B play 12 games of chess of which 6 are won by A, 4 by B and 2 end in a tie.
They agree to play 3 more games, the probability that B wins at least one game is
(a) 19/27 (b) 9/27
(c) 9/18 (d) 5/27
17. What is the standard deviation of the number of recoveries among 48 patients
when the probability of recovering is 0.75?
(a) 36 (b) 81
(c) 9 (d) 3
18. X is a binomial variable with n = 20. What is the mean of X if it is known that x is
symmetric?
(a) 5 (b) 10
(c) 2 (d) 8
19. If X ~ B (n, p), what would be the greatest value of the variance of x when n = 16?
(a) 2 (b) 4
(c) 8 (d) √5
20. If x is a binomial variate with parameter 15 and 1/3, what is the value of mode of
thedistribution
(a) 5 and 6 (b) 5
(c) 5.50 (d) 6
21. If the overall percentage of success in an exam is 60, what is the probability that out
of agroup of 4 students, at least one has passed?
(a) 0.6525 (b) 0.9744
(c) 0.8704 (d) 0.0256
451
CA FOUNDATION STATISTICS
22. If it is known that the probability of a missile hitting a target is 1/8, what is the
probabilitythat out of 10 missiles fired, at least 2 will hit the target?
(a) 0.4258 (b) 0.3968
(c) 0.5238 (d) 0.3611
23. X is a binomial variable such that 2 P(X = 2) = P(X = 3) and mean of X is known to
be10/3. What would be the probability that X assumes at most the value 2?
(a) 16/81 (b) 17/81
(c) 47/243 (d) 46/243
24. In 10 independent rollings of a biased die, the probability that an even number will
appear5 times is twice the probability that an even number will appear 4 times.
What is theprobability that an even number will appear twice when the die is rolled
8 times?
(a) 0.0304 (b) 0.1243
(c) 0.2315 (d) 0.1926
ANSWERS
452
CA FOUNDATION STATISTICS
HOMEWORK SOLUTION
BINOMIAL DISTRIBUTION
P=
q=
P (x > 5) = P (x = 5) + P (x = 6) + P (x = 7)
Option D
P=
q=
Option C
453
CA FOUNDATION STATISTICS
P=
q=
P (x > 4) = P (x = 4) + P (x = 5) +
Option A
P=
q=
= 500 Option B
454
CA FOUNDATION STATISTICS
P=
q=
= 50 Option A
= 1000 Option C
455
CA FOUNDATION STATISTICS
q=
P (all boys) =
=
Success getting Girl
n=5
x=5
P=
q=
P (all girls) =
= 100 Option D
456
CA FOUNDATION STATISTICS
P=
q=
P (x = 5) =
Expected frequency =
= 3 Option C
P=
q=
P (x > 6) = P (x = 6) + P (x = 7) + P (x = 8)
Option D
457
CA FOUNDATION STATISTICS
10. Mean = np
np = 4 ....(i)
Variance = npq
npq = 2.4 ....(ii)
from (i) and (ii)
4 (q) = 2.4
q=
Were now
np = 4
n = 10 Option D
458
CA FOUNDATION STATISTICS
P=
q=
P (x 3) = P (x = 3) + P (x = 4) + P (x = 5) + P (x = 6) + P (x = 7)
= 1 – [P (x = 0) + P (x = 1) + P (x = 2)]
Option B
P=
q=
P (x = 3) =
Option D
459
CA FOUNDATION STATISTICS
P=
q=
P (x = 2) =
= Option B
B’s win
P (A B A) + p (B A B)
Option A
460
CA FOUNDATION STATISTICS
P=
q=
P (x 1) = 1 – P (x = 0)
Option A
17.
= 3 Option D
n = 20
mean = np
= 10 Option B
461
CA FOUNDATION STATISTICS
20. n = 15
= 5.33 Not integer
Mode = 5 } Integral part of (n + 1)P
P=
q=
P (x 1) = 1 – P (x = 0)
= 0.9744 Option B
462
CA FOUNDATION STATISTICS
= 0.3611 Option D
23.
463
CA FOUNDATION STATISTICS
Option B
24.
= 0.0304 Option A
464
CA FOUNDATION STATISTICS
HOMEWORK SECTION
POISSON DISTRIBUTION
1. A manufacturer produces switches and experiences that 2 per cent switches are
defective. The probability that in a box of 50 switches, there are at most two defective
is :
(a) 2 e–1 (b) e–1
(c) 2.5e–1 (d) 3e–1
2. A person has some cars, and the average demand of cars per day is 3, the probability
that on any day not more than 2 cars are in use is (e–3 = 0.0498)
(a) 0.5533 (b) 0.4233
(c) 0.7533 (d) 0.2533
3. The probability that a match stick is found without head is 1/100. Each match box
contains 50 sticks. Using Poisson distribution we can say that the percentage of
number of boxes having 0 sticks without head is (e–0.5 = 0.61)
(a) 40 (b) 51
(c) 61 (d) 25
4. The probability that a match stick is found without head is 1/100. Each match box
contains 50 sticks. Using Poisson distribution we can say that the percentage of
number of boxes having 1 stick without head is
(a) 40.5 (b) 50.5
(c) 75.5 (d) 30.5
5. The probability that a match stick is found without head is 1/100. Each match box
contains 50 sticks. Using Poisson distribution we can say that the percentage of
number of boxes having 2 sticks without heads is
(a) 8.5 (b) 7.63
(c) 8.75 (d) 10.25
6. In the production of electric fuses 2% are defective, the probability of getting at the
most 2 defective fuses in a box containing 200 fuses is (e–4 = 0.0183)
(a) 0.238 (b) 0.763
(c) 0.875 (d) 0.252
465
CA FOUNDATION STATISTICS
8. In the manufacturing of cotter pins it is known that 5% of the pins are defective. The
pins are sold in boxes of 100 and it is guaranteed that not more than 4 pins will be
defective in a box. What is the probability that a box will meet this guarantee?
(e–5 = 0.0067)
(a) 0.4480 (b) 0.5480
(c) 0.6480 (d) 0.4380
10. In one hospital 3 percent of the patients demand special rooms. On a particular day
3 special rooms were vacant. If 50 patients were admitted in the hospital on that
day, the probability that the demands for special room were not met is
(e–1.5 = 0.2231)
(a) 0.0658 (b) 0.1952
(c) 0.1304 (d) 0.1316
11. In a book, on an average there are 3 misprints in 5 pages. The number of pages
having more than 2 misprints in that book of 100 pages are (Use Poisson distribution)
(e–0.6 = 0.5488)
(a) 3.31 (b) 4.31
(c) 2.31 (d) 5.31
12. A factory produces 0.5% defective articles. If a sample of 100 articles is taken from
the production, the probability of getting 2 or more defective articles is
(e–0.5 = 0.6065)
(a) 0.0902 (b) 0.1902
(c) 0.1302 (d) 0.1102
466
CA FOUNDATION STATISTICS
13. A random variable x follows Poisson distribution with mean 2 then P(X > 0) is equal
to(e–2 = 0.1353)
(a) 0.1353 (b) 0.2706
(c) 0.8647 (d) none of them
17. x is a Poisson variate such that P(x = 3) = P(x = 4), its S.D. is
(a) 4 (b) 3
(c) 2 (d) none of them
20. x is a Poisson variate and P(x = 2) = 9.P(x = 4) + 90.P(x = 6) then its mean is
(a) 4 (b) 3
(c) 1 (d) none of them
467
CA FOUNDATION STATISTICS
21. The number of accidents in a year attributed to taxi drivers in a city follows poisson
distribution with mean 3. Out of 1000 taxi drivers, the number of drivers with no
accident in a year is (e–3 = 0.0498)
(a) 40 (b) 30
(c) 50 (d) none of them
22. The number of accidents in a year attributed to taxi drivers in a city follows poisson
distribution with mean 3. Out of 1000 taxi drivers, the number of drivers with at
least 3 accidents in a year is(e–3 = 0.0498)
(a) 477 (b) 377
(c) 177 (d) none of them
23. The average number of customers, who appear at a counter of a certain bank per
minute is 2, the probability that during a given minute three or more customers
appear is (e–2 = 0.1353)
(a) 0.32354 (b) 0.1353
(c) 1 (d) none of them
24. If the standard deviation of a Poisson variate X is 2, what is P (1.5 < X < 2.9)?
(a) 0.231 (b) 0.158.
(c) 0.15. (d) 0.144.
25. If the mean of a Poisson variable X is 1, what is P (X = takes the value at least 1)?
(a) 0.456. (b) 0.821.
(c) 0.632. (d) 0.254.
26. If X ~ P (m) and its coefficient of variation is 50, what is the probability that X would
assume only non-zero values?
(a) 0.018. (b) 0.982.
(c) 0.989. (d) 0.976.
468
CA FOUNDATION STATISTICS
28. A random variable x follows Poisson distribution and its coefficient of variation is
50. What is the value of P (x > 1 / x > 0)?
(a) 0.1876 (b) 0.2341
(c) 0.9254 (d) 0.8756
29. A car hire firm has 2 cars which is hired out every day. The number of demands per
day for a car follows Poisson distribution with mean 1.20. What is the proportion of
days on which some demand is refused? (Given e1.20 = 3.32).
(a) 0.25 (b) 0.3012
(c) 0.12 (d) 0.03
ANSWERS
469
CA FOUNDATION STATISTICS
HOMEWORK SOLUTION
POISSON DISTRIBUTION
1. P (defective) = 2% = 0.02
n = 50 P (Atmost 2 defective) =?
m = np
= 50 x 0.02
(c)
2. m=3
P (not more than two) = P (x 2) =?
P (y 2) = P (x = 0) + P (x = 1) + P (x = 2)
= e3 (8.5)
= 8.5 x 0.0498 = 0.4233 (b)
470
CA FOUNDATION STATISTICS
3. P=
n = 50
m = np
= 0.5
e-0.5 = 0.61
Number of boxes with zero stick without head
= e-0.5
= 0.61
= 61% (c)
4. P =
n = 50
m = np
= 0.5
Number of boxes having 1 stick without head
= 0.5 x e-0.5
= 0.5 x 0.61
= 0.305 (d)
471
CA FOUNDATION STATISTICS
5. P=
n = 50
m = np
= 0.5
2 sticks of without head
= 0.076
= 7.63% (a)
6. P (defective) = 2% = 0.02
n = 200
m = np
= 200 x 0.02
m=4
P (Atmost 2 defective)
P (x 2) = P (x = 0) + P (x = 1) + P (x = 2)
= 13 x 0.0183
= 0.2379 ≈ 0.238 (a)
472
CA FOUNDATION STATISTICS
7. P = 2% = 0.02
n = 200
m = np
= 200 x 0.02
m=4
P (x = 3 defective)
= 0.1952 (b)
8. P (defective) = 5% = 0.05
n = 100
m = np
= 100 x 0.05
m=5
P (x = not more than 4 defective) = P (x 4)
= P (x = 0) + P (x = 1) + P (x = 2) + P (x = 3) + P (x = 4)
= 0.0067 (1 + 5 + 12.5 + 20.83 + 26.041)
= 0.43798 ≈ 0.4380
(d)
473
CA FOUNDATION STATISTICS
9. P (defective) = 3% = 0.03
n = 100
m = np
= 100 x 0.03
m=3
e-3 = 0.0498
= 0.1008 (b)
= 1 – e-1.5 [(1) + (1.5) + (1.125) + (0.5625)]
= 1 – 0.2231 [4.1875]
= 1 – 0.93423
= 0.0658 (a)
474
CA FOUNDATION STATISTICS
11. P =
n = 100
m = np
= 0.6
P (x > 2) = [P(x = 0) + P(x = 1) + P (x = 2)]
More than 2 mix prints
= 1 – 0.54688 (1.78)
= 1 – 0.9768
= 0.0231
Number of paper = N x P
= 100 x 0.0231
= 2.31 (c)
m = 0.5
P (2 or more defective)
P (x 2) = 1 – [P(x = 0) + P(x = 1)]
=1 – e-0.5 [1 + 0.5]
=1 – 0.6065 (1.5)
=1 – 0.90975
= 0.0902 (a)
475
CA FOUNDATION STATISTICS
13. m = 2
P(x > 0) = ?
(e2 = 0.1353)
P (x > 0) = 1 - P(x = 0)
= 1 – e-2
= 1 – 0.1353
= 0.8647 (c)
14. m = 0.81
then S.D = ?
= 0.9 (b)
mean = m = 4 (c)
4=m
476
CA FOUNDATION STATISTICS
m=4
= 2 (c)
= 0.3681 (a)
477
CA FOUNDATION STATISTICS
= e-2 (a)
20. P (x = 2) = 9 P (x = 4) + 90 P (x = 6)
m=?
Take 1 cm
= 12m2 = 9m4 + 3m6
Use Pinto m = 1
12(1)2 = 9(1)2 + 3(1)6
12 = 9 + 3
12 = 12
m=1
478
CA FOUNDATION STATISTICS
21. m = 3
e-3 = 0.0498
n = 1000
P (x = drivers with no accident)
= 0.0498
Number of drivers = N x P
= 1000 x 0.0498
= 49.8
≈ 50 (c)
22. m = 3
n = 1000
P (x = At least 3 Accidents) = P (x 3)
= P (x 3) = 1 – [P (x = 0) + P (x = 1) + P (x = 2)]
= 1 – e-3 [8.5]
= 1 – 0.0498 x 8.5
= 0.5767
Number of drivers = 1000 x 0.05767
= 576.7
= 578 (d)
479
CA FOUNDATION STATISTICS
23. m = 2
e-2 = 0.1353
P (3 more customers)
= P (x 3) = 1 – [P (x = 0) + P (x = 1) + P (x = 2)]
= 1 – e-2 [1 + 2 + 2]
= 1 – 5e-2
= 1 – 5 x 0.1353
= 0.3235 (a)
24. S.D. = 2
4=m
P (1.5 < x < 2.9) = ?
= 0.1465 ≈ 0.15
(c)
480
CA FOUNDATION STATISTICS
25. m = 1
P (x = takes value at least 1)
P (x 1) = 1 – P (x = 0)
= 0.632 (c)
26.
481
CA FOUNDATION STATISTICS
27.
28.
482
CA FOUNDATION STATISTICS
29.
2 cars available
m=1.2 (given)
P(Demand is refused)
(The demand is refused when demand is more than 2)
483
CA FOUNDATION STATISTICS
484
CA FOUNDATION STATISTICS
HOMEWORK SECTION
NORMAL DISTRIBUTION
3. The area under the standard normal curve beyond the lines 1.96 is
(a) 95% (b) 90%
(c) 99.73% (d) 5%
5. If X is a normal variate representing the income in ` per day with mean = 50 and S.D.
= 10 of the workers in a factory. The total number of workers is 1000. The number
of workers having income more than ` 62.00 per day is
(a) 246 (b) 150
(c) 738 (d) 115
485
CA FOUNDATION STATISTICS
7. The mean of a normal variate is 20 and its variance is 9. The area between the
values of the variate 15.5 and 21.5 is
(a) 0.6247 (b) 0.2857
(c) 0.0228 (d) 0.9332
8. The mean of a normal variate is 20 and its variance is 9. The area between the
values of the variate 14 and 18.5 is
(a) 0.6247 (b) 0.3072
(c) 0.0228 (d) 0.9332
9. The mean of a normal variate is 20 and its variance is 9. The area for the values of
variate more than 26 is
(a) 0.6247 (b) 0.2857
(c) 0.0228 (d) 0.9332
10 . In an intelligence test administered to 1000 children the average score is 42 and its
s.d. is 24. Assuming that the scores are normally distributed, the minimum score of
the most intelligent 100 students is
(a) 371 (b) 383
(c) 72.7 (d) none of them
11. The customer accounts at a certain departmental store have an average balance
of ` 120 and a standard deviation of ` 40. Assuming that account balances are
normally distributed, what percentage of the accounts are over `150?
(a) 25.5 (b) 22.66
(c) 72.7 (d) 46.49
12. The customer accounts at a certain departmental store have an average balance of
`120 and a standard deviation of `40. Assuming that account balances are normally
distributed,what percentage of the accounts are between `100 and `150?
(a) 25.5 (b) 22.66
(c) 72.7 (d) 46.49
486
CA FOUNDATION STATISTICS
13. The average life of a battery is 400 hours and its standard deviation is 50 hours.
Assuming normal distribution, the percentage of batteries having life more than
350 hours is
(a) 84.13 (b) 22.66
(c) 72.7 (d) 46.49
14. The average life of a battery is 400 hours and its standard deviation is 50 hours.
Assuming normal distribution, approximate after how many hours 25% of the
batteries will still be working?
(a) 284 (b) 422
(c) 372 (d) 434
15. The average life of a battery is 400 hours and its standard deviation is 50 hours.
Assuming normal distribution, the percentage of batteries with life time between
300 and 500 hours is
(a) 84.13 (b) 22.66
(c) 95.44 (d) 19.35
16. In a normal distribution 50% of the observations are less than 35 and 89% of the
observations are less than 63. The mean of the distribution is
(a) 63 (b) 35
(c) 89 (d) none of them
17. In a normal distribution 50% of the observations are less than 35 and 89% of the
observations are less than 63. The standard deviation of the distribution is
(a) 63 (b) 35
(c) 89 (d) none of them
18. The mean and S.D. of a normal variate are 30 and 5 respectively, the probability of
| x – 30 | < 5 is
(a) 0.6826 (b) 0.0013
(c) 0.7653 (d) none of them
487
CA FOUNDATION STATISTICS
19. The distribution of marks of the students in a class is normal with mean 20 and
s.d. 5. The number of students getting more than 25 marks is 50. The number of
students of that class is
(a) 682 (b) 653
(c) 315 (d) none of them
21. What is the first quartile of X having the following probability density function?
(a) 4. (b) 5.
(c) 5.95. (d) 6.75.
22. In a sample of 800 students, the mean weight and standard deviation of weight
are found to be 50 kg and 20 kg respectively. On the assumption of normality, what
is the number of students weighing between 46 Kg and 62 Kg? Given area of the
standard normal curve between z = 0 to z = 0.20 = 0.0793 and area between z = 0
to z = 0.60 = 0.2257.
(a) 250 (b) 244
(c) 240 (d) 260
23. The salary of workers of a factory is known to follow normal distribution with an
average salary of ` 10,000 and standard deviation of salary as ` 2,000. If 50 workers
receive salary more than ` 14,000, then the total no. of workers in the factory is
(a) 2,193 (b) 2,000
(c) 2,200 (d) 2,500
24. For a normal distribution with mean as 500 and SD as 120, what is the value of k
so that the interval [500, k] covers 40.32 per cent area of the normal curve? Given
φ (1.30) = 0.9032.
(a) 740 (b) 750
(c) 656 (d) 800
488
CA FOUNDATION STATISTICS
25. 50 per cent of a certain product have weight 60 kg or more whereas 10 per cent
haveweight 55 kg or less. On the assumption of normality, what is the variance of
weight?
Given φ (1.28) = 0.90.
(a) 15.21 (b) 9.00
(c) 16.00 (d) 22.68
ANSWERS
489
CA FOUNDATION STATISTICS
HOMEWORK SOLUTION
NORMAL DISTRIBUTION
1. QD =
2. MD =
3.
4.
490
CA FOUNDATION STATISTICS
5. µ = 50
σ = 10
N = 1000
P (x 62) = ?
z = 1.2
P (x 62) =? = 0.5 - 0.3849
= 0.1151
Number of workers = N x P
= 1000 x 0.1151 ≈ 115
(d)
491
CA FOUNDATION STATISTICS
6. µ=5
σ = 0.05 Tolerance limit 4.90 cm to 5.10 cm.
N = 200 x1 x2
z1= -2
z2
z2= +2
% of shafts within the limits = 0.4772 + 0.4772
= 0.9544
Number of shafts within limits = N x P
= 200 x 0.9544
= 190.88
= 191
Number of shafts out of tolerance limits = 200 – 191
= 9
(b)
492
CA FOUNDATION STATISTICS
7. µ = 20
σ2 = 9
σ = 3
Area between 15.5 and 21.5 = ?
z2= +0.5
493
CA FOUNDATION STATISTICS
8. µ = 20
σ2 = 9
σ = 3
Area between 14 and 18.5 =?
494
CA FOUNDATION STATISTICS
9. µ = 20
σ2 = 9
σ = 3
Area about more than 26?
z1
z1
z1 = 2
Area above 26 = 0.5 – 0.4772
= 0.0228 (c)
10. N = 1000
µ = 42
σ = 24
z=0 z =?
µ = 42 Most intelligent 100 students
Area =
z for value (Area) 0.4 = 1.28
1.28 x 24 + 42 = x
x = 72.72 (c)
495
CA FOUNDATION STATISTICS
11. µ = 120
σ = 40
P (x 150) = ?
z = 0.75
496
CA FOUNDATION STATISTICS
12. µ = 120
σ = 40
Percentage of accounts between 100 and 150 = ?
P (100 x150) = ?
z1
z1
z1 = - 0.5
z2
z2
z2 = 0.75
497
CA FOUNDATION STATISTICS
13. µ = 400
σ = 50
P (x 350) = ?
z
z
z=-1
P (x 350) = 0.5 + 0.3413
= 0.8413
= 84.13% (a)
14. µ = 400
σ =50
0.67 x 50 + 400 = x
X = 433.5 ≈ 434
After 434 hrs 25% of batteries will be still working.
498
CA FOUNDATION STATISTICS
15. µ = 400
σ = 50
P (300 x 500) = ?
z1
z1
z1 = - 2
z2
z2
z2 = 2
499
CA FOUNDATION STATISTICS
16.
Find the mean µ?
1.22 σ = 63 - µ
µ + 1.22 σ = 63 (1)
0 = 35 - µ
µ = 35 (b)
500
CA FOUNDATION STATISTICS
18.
z2 = 1
(a)
501
CA FOUNDATION STATISTICS
19. µ = 20
σ = 5
Marks more than 25 there are 50 students.
Let N be the no. of students of the class.
Probability of std getting more than 25 marks =
=z=1
P (x > 25) = 0.5 – 0.3414
= 0.1587
P (x > 25) =
0.1587 =
502
CA FOUNDATION STATISTICS
20.
Standard function
On comparing σ = 4, µ = 10
= 40% (c)
21.
Q1 = µ - 0.6745 σ
= 10 – 0.6745 (6)
= 5.95 (c)
503
CA FOUNDATION STATISTICS
22. N = 800
µ = 50
σ = 20
P (46 x 62) = ?
= +0.6
504
CA FOUNDATION STATISTICS
23. µ = 10000
σ =2000
50 Workers are more than 14000
Total no. of workers =?
σ = 3.906
Variance = σ2
= (3.906)2
= 15.256
= 15.21
505
CA FOUNDATION STATISTICS
z
z=2
for z = 2
area from z = 0 to z = 2 is 0.4772
P (x 14000) = 0.5 – 0.4772
= 0.0228
= 2192.98 ≈ 2193
(a)
24.
µ = 500
σ = 120 [500, k] area between the two limits is 40.32%
k=?
ᶲ (1.30) = 0.9032
1.30 x 1.20 + 500 = k
k = 656
(c)
506
CA FOUNDATION STATISTICS
0 = 60 - µ
µ = 60
σ = 3.906
Variance = σ2
= (3.906)2
= 15.256
= 15.21 (a)
507
CA FOUNDATION STATISTICS
1. When there are a fixed number of repeated trial of any experiments under identical
conditions for which only one of two mutually exclusive outcomes, success or failure
can result in each trial then
a) Normal distribution b) Binomial distribution
c) Poisson distribution d) None is used
4. In Binomial distribution
a) Mean is greater than variance b) Mean is less than variance
c) Mean is equal to variance d) None
6. If in Binomial distribution np=9 and npq=2.25 then p and n are equal to is equal to
a) 0.25,36 b) 0.75, 12
c) 1,9 d) None
508
CA FOUNDATION STATISTICS
7. A random variable x follows Binomial distribution with E(X) = 2 and V(X)=1.2. Then
the value of n is
a) 8 b) 2 c) 5 d) None
10. What is the probability of making 3 correct guesses in 5 True-False answer type
questions?
a) 0.3125 b) 0.5676 c) 0.6875 d) 0.4325
11. In 10 independent rollings of a biased dice, the probability that an even number will
appear 5 times is twice the probability that an even number will appear 4 times.
What is the probability that an even number will appear twice when the die is rolled
8 times?
a) 0.0304 b) 0.1243 c) 0.2315 d) 0.1926
12. What is the probability of having at least one ‘six’ from 3 throws of a perfect die?
a) 5/6 b) (5/6)3
c) 1 - (1/6)3 d) 1 - (5/6)3
13. A coin is tossed 10 times. Assuming the coin to be unbiased, what is the probability
of getting at least 4 heads?
a) b)
c) d)
14. If 15 dates are selected at random, what is the probability of getting two Sundays’
a) 0.36 b) 0.44 c) 0.29 d) 0.57
509
CA FOUNDATION STATISTICS
16. X is a binomial variable such that 2 P(X=2) = P(X=3) and mean of X is known to be
10/3. What would be the probability that X assumes at most the value 2?
a) 16/81 b) 17/81 c) 47/2473 d) 46/243
18. For a Binomial distribution mean is 4 and variance is 3 then, 3rd central moment is
a) 1.5 b) 0.15 c) 15 d) 0.75
Answer Key
1. b 2. b
3. c 4. a
5. c 6. b
7. c 8. b
9. b 10. a
11. a 12. d
13. c 14. c
15. a 16. b
17. c 18. b
19. b 20. d
510
CA FOUNDATION STATISTICS
EXPLANATORY
ANSWERS
1. Binominal Distribution (b)
511
CA FOUNDATION STATISTICS
8. p(x = 0) = given n = 4
(b)
9. x is symmetric data
n = 20 given
mean = np
mean = 10 (b)
512
CA FOUNDATION STATISTICS
10. n = 5
x=3
(a)
11. p(x = 5) = 2p(x = 4) given n = 10
3p = 5 (1 - p)
3p = 5 – 5p
8p = 5
Success getting even no.
n=8
x=2
= 0.0304 (a)
513
CA FOUNDATION STATISTICS
P (x 1) = 1 – p(x = 0)
(d)
P (x 4) = 1 – [P (x = 0) + P (x = 1) + P (x = 2) + P (x = 3)
(c)
514
CA FOUNDATION STATISTICS
= 0.29 (c)
15. p = 3(q)
q = 3(1 - p)
p = 3 – 3p
4p = 3
(a)
515
CA FOUNDATION STATISTICS
16. mean =
n=5
p (x 2)
= p (x = 0) + p (x = 1) + p (x = 2)
(b)
17. SD = (c)
516
CA FOUNDATION STATISTICS
3rd central moment = np (1 - p) (1 – 2p)
(a)
19. n = 4
Variance = npq
(b)
20. mean = 20
np = 20
SD = 4
Variance = 16
npq = 16
20(q) = 16
(d)
517
CA FOUNDATION STATISTICS
3. In Poisson distribution -
a) Mean and SD are equal b) Mean, Variance are equal
c) SD and Variance are equal d) Both (a) and (b)
5. Find the mean and variance of x where x is a Poisson variate satisfying the condition
P (x = 2) = P (x = 3).
a) 2 b) 3 c) 4 d) 5
6. X is a Poisson variate satisfying the following relation:
P (X = 2) = 9 P (x = 4) + 90P (X = 6).
What is the standard deviation of X?
a) 1 b) 2 c) 1.55 d) 1.87
518
CA FOUNDATION STATISTICS
10. If 2 per cent of electric bulbs manufactured by a company are known to be defectives,
what is the probability that a sample of 150 electric bulbs taken from the production
process of the company would contain exactly one defective bulb?
a) 0.15 b) 0.86
c) 0.74 d) 0.23
11. The standard deviation of Poisson variate is 1.732. what is the probability that the
variable lies between -2.3 to 3.68?
a) 0.55 b) 0.65
c) 0.75 d) 0.85
12. The manufacturer of a certain electronic component is certain that two per cent of
his product is defective. He sells the components in boxes of 120 and guarantees
that not more than two per cent in any box will be defective. Find the probability
that a box, selected at random, would fail to meet the
guarantee? Given that e-2.40 = 0.0907.
a) 0.26 b) 0.52
c) 0.43 d) 0.86
519
CA FOUNDATION STATISTICS
13. Between 9 and 10 AM, the average number of phone calls per minute coming into
the switchboard of the company is 4. Find the probability that during one particular
minute there will be no phone calls.
a) 0.045445 b) 0.02454
c) 0.018316 d) 0.047251245
14. The number of accidents in a year attributed to taxi drivers in a locality follows
Poisson distribution with an average 2. Out of 500 taxi drivers of that area, what is
the number of drivers with atleast 3 accidents in a year?
a) 162 b) 180
c) 201 d) 190
15. A random variable X follows Poisson Distribution, such that P(X = k) = P(X = k + 1),
then its mean and variance is:
a) k – 1, k – 1 b) k + 2, k + 2
c) k + 3, k + 3 d) k + 1, k + 1
520
CA FOUNDATION STATISTICS
Answer Key
1. d 2. a
3. b 4. b
5. b 6. a
7. c 8. a
9. a 10. a
11. b 12. c
13. c 14. a
15. d 16. b
17. b 18. c
19. a 20. a
521
CA FOUNDATION STATISTICS
EXPLANATORY
ANSWERS
m=3
mean = np m = 3
SD =
Variance = np = m = 3 (b)
6. P (x = 2) = 9P (x = 4) + 90P (x = 6)
m4 + 3m2 – 4 = 0
m4 + 4m2 – m2 – 4 = 0
m2 (m2 + 4) – 1 (m2 – 4) = 0
m2 = 4 or m2 = –1
not possible m= 1
m = 1 or m = -1 not possible
m=1
mean = 1
S.D. = = 1 (a)
522
CA FOUNDATION STATISTICS
7. E(x) = 2.20
mean = m = 2.20
e-2.20 = 0.1108
p (at least one)
p (x 1)
= 1 – p (x = 0)
= 1 – 0.1108
= 0.8892
= 0.89 (c)
8. P (x 2/x 1)
= 0.57568
= 0.58 (a)
9. m=nxp
m=3
p (x = 0)
= 0.0498
= 0.05 (a)
523
CA FOUNDATION STATISTICS
10. m=nxp
m=3
p (x = 1)
= 0.1494
= 0.15 (a)
11. SD = = 1.732
m = 3
P (-2.3 < x < 3.68)
P (x = 0) + P (x = 1) + P (x = 2) + P (x = 3)
= 0.0498 [1 + 3 + 4.5 + 4.5]
= 0.6474
= 0.65 (b)
12. If x > 2.4 then guarantee would fail
m = 2.4
P (x > 2.4)
= 1 – P (x = 0) + P (x = 1) + P (x = 2)]
= 1 – 0.0907 [1 + 2.4 + 2.88]
= 0.4304
= 0.43 (c)
524
CA FOUNDATION STATISTICS
13. m=4
p (x = 0)
= 0.018316 (c)
14. m=2
P (x 3)
= 1 – [P (x = 0) + P (x = 1) + P (x = 2)]
= 0.3233
Expected no. of taxi drivers
= 500 x 0.3233
= 161.66
≈ 162 (a)
15. P (x = k) = P (x = k + 1)
m = k + 1
mean = k + 1
Variance = k + 1 (d)
20. ( ) (a)
525
CA FOUNDATION STATISTICS
7. for the two quartiles of N (µ, ) are 14.6 and 25.4 respectively, what is the standard
deviation of the distribution?
a) 9 b) 6
c) 10 d) 8
526
CA FOUNDATION STATISTICS
a)
b)
c)
d) None of these
a) 7 and 13 b) 8 and 12
c) 6 and 14 d) 5 and 15
12. The distribution of wages of a group of workers Is known to be normal with mean
` 500 and SD ` 100. If the wages of 100 workers in the group are less than ` 430,
what is the total number of workers in the group?
a) 289 b) 413
c) 568 d) 318
13. A discrete random variable x follows uniform distribution and takes the values 6, 8,
10, 12, 18. The probability of P(x≤12) is
a) 1/5 b) 4/5
c) 3/5 d) None
527
CA FOUNDATION STATISTICS
14. A continuous random variable x follows uniform distribution with probability density
function f(x)=1/2, (4<x<6). Then P(4≤x≤5)
a) 0.1 b) 0.5
c) 0 d) None
15. If the mean deviation of a normal variable is 16, what is its quartile deviation?
a) 10.00 b) 13.50
c) 15.00 d) 12.05
16. If X and Y are 2 independent normal variables with mean as 10 and 12 and SD as 3
and 4, then (X+Y) is normally distributed with
a) Mean = 22 and SD =7 b) Mean = 22 and SD = 25
c) Mean = 22 and SD =5 d) Mean = 22 and SD =49
17. The salary of workers of a factory is known to follow normal distribution with an
average salary of `10,000 and standard deviation of salary as `2,000. If 50 workers
receive salary more than `14,000, then the total no. of workers in the factory is
a) 2,193 b) 2,000
c) 2,200 d) 2,500
18. 50% of a certain product have weight 60 kg or more whereas 10% have weight 55%
or less assumption of normality, what is the variance of weight? Given f (1.28) =
0.90.
a) 15.21 b) 9.00
c) 16.00 d) 22.68
528
CA FOUNDATION STATISTICS
1. a 11. b
2. c 12. a
3. c 13. c
4. c 14. a
5. d 15. a
6. c 16. b
7. d 17. b
8. b 18. b
9. a 19. a
10. c 20. a
529
CA FOUNDATION STATISTICS
EXPLANATORY
ANSWERS
3. Continuous (b)
4. Normal (a)
5. Symmetrical. (c)
6. -1 and 1 (c)
7. 8 (d)
10. (a)
530
CA FOUNDATION STATISTICS
12.
s
z = - 0.7
z=0
Area between z = 0 and z = -0.7 = 0.2580
Area of wages below ` 430 = 0.5 – 0.2580
= 0.2420
= 0.5
531
CA FOUNDATION STATISTICS
15. (b) QD : MD = 10 : 12
= 13.33
≈ 13.50
17. (a)
σ = 2000
z=2
z=0
Area between z = 0 and z = 2 = 0.4772
Area of salary more than ` 14000 = 0.5 – 0. 4772
= 0.0228
532
CA FOUNDATION STATISTICS
18. (a)
z = -1.28
Variance = 15.21
20. -1 to +1 (a)
533
CA FOUNDATION STATISTICS
APPENDIX
534
CA FOUNDATION STATISTICS
3.1 .4990 .4991 .4991 .4991 .4992 .4992 .4992 .4992 .4993 .4993
3.2 .4993 .4993 .4994 .4994 .4994 .4994 .4994 .4995 .4995 .4995
3.3 .4995 .4995 .4995 .4996 .4996 .4996 .4996 .4996 .4996 .4997
3.4 .4997 .4997 .4997 .4997 .4997 .4997 .4997 .4997 .4997 .4998
535
CA FOUNDATION STATISTICS
536
CA FOUNDATION STATISTICS
537
CA FOUNDATION STATISTICS
36 5563 5575 5587 5599 5611 5623 5635 5647 5658 5670 1 2 4 5 6 7 8 10 11
37 5682 5694 5705 5717 5729 5740 5752 5763 5775 5786 1 2 3 5 6 7 8 9 10
38 5798 5809 5821 5832 5843 5855 5866 5877 5888 5899 1 2 3 5 6 7 8 9 10
39 5911 5922 5933 5944 5955 5966 5977 5988 5999 6010 1 2 3 4 5 7 8 9 10
40 6021 631 6042 6053 6064 6075 6085 6096 6107 6117 1 2 3 4 5 6 8 9 10
41 6128 6138 6149 6160 6170 6180 6191 6201 6212 6222 1 2 3 4 5 6 7 8 9
42 6232 6243 6253 6263 6274 6284 6294 6304 6314 6235 1 2 3 4 5 6 7 8 9
43 6335 6345 6355 6365 6575 6385 6395 6405 6415 6425 1 2 3 4 5 6 7 8 9
44 6435 6444 6454 6464 6474 6484 6493 6503 6513 6522 1 2 3 4 5 6 7 8 9
45 6532 6542 6551 6561 6571 6580 6590 6599 6609 6618 1 2 3 4 5 6 7 8 9
46 6628 6637 6646 6656 6665 6675 6684 6693 6702 6712 1 2 3 4 5 6 7 7 8
47 6721 6730 6739 6749 6758 6767 6776 6785 6794 6803 1 2 3 4 5 5 6 7 8
48 6812 6821 6830 6839 6848 6857 6866 6875 6884 6893 1 2 3 4 4 5 6 7 8
49 6902 6911 6920 6928 6037 6946 6955 6964 6972 6981 1 2 3 4 4 5 6 7 8
Example:
Log 2 = 0.3010: Log 20 = 1.3010: Log 200 = 2.3010: Log 2,000 = 3.3010 etc.
Log 2 = 0.3010 - 1 – (–) 0.699
Log 0.02 = 0.3010 - 2 – (–) 1.699
538
CA FOUNDATION STATISTICS
0 1 2 3 4 5 6 7 8 9 1 2 3 4 5 6 7 8 9
50 6990 6998 7007 7016 7024 7033 7042 7050 7059 7067 1 2 3 3 4 5 6 7 8
51 7076 7084 7093 7101 7110 7118 7126 7135 7143 7152 1 2 3 3 4 5 6 7 8
52 7160 7166 7177 7185 7193 7202 7210 7218 7226 7235 1 2 2 3 4 5 6 7 7
53 7243 7251 7259 7267 7275 7284 7292 7300 7306 7314 1 2 2 3 4 5 6 6 7
54 7324 7332 7340 7348 7358 7364 7372 7380 7388 7396 1 2 2 3 4 5 6 6 7
55 7404 7412 7419 7427 7435 7443 7451 7459 7466 7474 1 2 2 3 4 5 5 6 7
56 7452 7490 7497 7505 7513 7520 7528 7536 7543 7551 1 2 2 3 4 5 5 6 7
57 7559 7566 7574 7582 7589 7597 7604 7612 7619 7627 1 2 2 3 4 5 5 6 7
58 7634 7642 7649 7657 7664 7672 7679 7686 7694 7701 1 1 2 3 4 4 5 6 7
59 7709 7716 7723 7731 7738 7745 7752 7760 7767 7774 1 1 2 3 4 4 5 6 7
60 7782 7789 7796 7803 7810 7818 7825 7832 7839 7848 1 1 2 3 4 4 5 6 6
61 7853 7860 7868 7875 7882 7889 7896 7903 7910 7917 1 1 2 3 4 4 5 6 6
62 7924 7931 7938 7945 7952 7958 7966 7973 7980 7987 1 1 2 3 3 4 5 6 6
63 7993 8000 8007 8014 8021 8028 8035 8041 8048 8055 1 1 2 3 3 4 5 5 6
64 8062 8069 8075 8082 8089 8096 8102 8109 8116 8122 1 1 2 3 3 4 5 5 6
65 8129 8136 8142 8149 8158 8162 8169 8176 8182 8189 1 1 2 3 3 4 5 5 6
66 8195 8202 8209 8215 8222 8228 8235 8241 8248 8254 1 1 2 3 3 4 5 5 6
67 8261 8267 8274 8280 8287 8293 8299 8306 8312 8319 1 1 2 3 3 4 5 5 6
68 8325 8331 8338 8344 8351 8357 8363 8370 8376 8382 1 1 2 3 3 4 4 5 6
69 8388 8395 8401 8407 8414 8420 8428 8432 8439 8445 1 1 2 2 3 4 4 5 6
70 8451 8457 8463 8470 8476 8482 8488 8494 8500 8506 1 1 2 2 3 4 4 5 6
71 8513 8519 8525 8531 8537 8543 8549 8555 8561 8567 1 1 2 2 3 4 4 5 5
72 8573 8579 8585 8591 8597 8603 8609 8615 8621 8627 1 1 2 2 3 4 4 5 5
73 8633 8639 8645 8651 8657 8663 8669 8673 8681 8686 1 1 2 2 3 4 4 5 5
74 8692 8698 8704 8710 8716 8722 8727 8733 8738 8745 1 1 2 2 3 4 4 5 5
75 8751 8756 8762 8768 8774 8779 8785 8791 8797 8802 1 1 2 2 3 3 4 5 5
76 8808 8814 8820 8825 8831 8837 8842 8848 8854 8859 1 1 2 2 3 3 4 5 5
77 8865 8871 8876 8882 8887 8893 8899 8904 8910 8915 1 1 2 2 3 3 4 4 5
78 8921 8927 8932 8938 8943 8949 8954 8960 8965 8971 1 1 2 2 3 3 4 4 5
79 8976 8982 8987 8993 8998 9004 9009 9015 9020 9025 1 1 2 2 3 3 4 4 5
80 9031 9036 9042 9047 9053 9058 9063 9069 9074 9079 1 1 2 2 2 3 4 4 5
81 9085 9090 9096 9101 9106 9112 9117 9122 9128 9133 1 1 2 2 2 3 4 4 5
82 9138 9143 9149 9154 9159 9165 9170 9175 9180 9186 1 1 2 2 2 3 4 4 5
83 9191 9196 9201 9206 9212 9217 9222 9227 9232 9238 1 1 2 2 2 3 4 4 5
84 9243 9248 9253 9258 9263 9269 9274 9279 9284 9289 1 1 2 2 2 3 4 4 5
85 9294 9299 9304 9309 9315 9320 9325 9330 9335 9340 1 1 2 2 3 3 4 4 5
86 9345 9350 9355 9360 9365 9370 9375 9380 9385 9390 1 1 2 2 3 3 4 4 5
87 9395 9400 9405 9410 9415 9420 9425 9430 9435 9440 0 1 1 2 2 3 3 4 4
539
CA FOUNDATION STATISTICS
88 9445 9450 9450 9455 9460 9469 9474 9479 9484 9489 0 1 1 2 2 3 3 4 4
89 9494 9499 9504 9509 9513 9518 9523 9528 9533 9538 0 1 1 2 2 3 3 4 4
90 9542 9547 9552 9557 9562 9566 9571 9576 9581 9586 0 1 1 2 2 3 3 4 4
91 9590 9595 9600 9605 9609 9614 9619 9624 9628 9633 0 1 1 2 2 3 3 4 4
92 9638 9643 9647 9652 9657 9661 9666 9671 9675 9680 0 1 1 2 2 3 3 4 4
93 9685 9689 9694 9699 9703 9708 9713 9717 9722 9727 0 1 1 2 2 3 3 4 4
94 9731 9736 9741 9745 9750 9754 9759 9763 9768 9773 0 1 1 2 2 3 3 4 4
95 9777 9782 9786 9791 9795 9800 9805 9809 9814 9818 0 1 1 2 2 3 3 4 4
96 9823 9827 9832 9836 9841 9845 9850 9854 9859 9863 0 1 1 2 2 3 3 4 4
97 9868 9872 9877 9881 9886 9890 9894 9899 9903 9908 0 1 1 2 2 3 3 4 4
98 9912 9917 9921 9926 9930 9934 9939 9943 9945 9952 0 1 1 2 2 3 3 4 4
99 9958 9961 9965 9969 9974 9978 9983 9987 9991 9996 0 1 1 2 2 3 3 3 4
540
CA FOUNDATION STATISTICS
Table IV - ANTILOGARITHM
0 1 2 3 4 5 6 7 8 9 1 2 3 4 5 6 7 8 9
100 1000 1002 1005 1007 1009 1012 1014 1016 1018 1021 0 0 1 1 1 1 2 2 2
101 1023 1026 1028 1030 1033 1035 1038 1040 1042 1045 0 0 1 1 1 1 2 2 2
102 1047 1050 1052 1054 1057 1059 1062 1064 1067 1069 0 0 1 1 1 1 2 2 2
103 1072 1074 1076 1079 1081 1084 1086 1089 1091 1094 0 0 1 1 1 1 2 2 2
104 1096 1099 1102 1104 1107 1109 1112 1114 1117 1119 0 1 1 1 1 2 2 2 2
105 1122 1125 1127 1130 1132 1135 1138 1140 1143 1146 0 1 1 1 1 2 2 2 2
106 1148 1151 1153 1156 1159 1161 1164 1167 1169 1172 0 1 1 1 1 2 2 2 2
107 1175 1178 1180 1183 1186 1189 1191 1194 1197 1199 0 1 1 1 1 2 2 2 2
108 1202 1205 1208 1211 1213 1216 1219 1222 1225 1227 0 1 1 1 1 2 2 2 3
109 1230 1233 1236 1239 1242 1245 1247 1250 1253 1256 0 1 1 1 1 2 2 2 3
110 1259 1262 1265 1268 1271 1274 1276 1279 1282 1285 0 1 1 1 1 2 2 2 3
111 1288 1291 1294 1297 1300 1303 1306 1309 1312 1315 0 1 1 1 2 2 2 2 3
112 1381 1321 1324 1327 1330 1334 1337 1340 1342 1348 0 1 1 1 2 2 2 2 3
113 1349 1352 1355 1358 1361 1365 1368 1371 1374 1377 0 1 1 1 2 2 2 3 3
114 1380 1384 1387 1390 1393 1396 1400 1403 1406 1409 0 1 1 1 2 2 2 3 3
115 1413 1416 1419 1422 1426 1429 1432 1435 1439 1442 0 1 1 1 2 2 2 3 3
116 1445 1449 1452 1455 1459 1462 1466 1469 1472 1476 0 1 1 1 2 2 2 3 3
117 1479 1483 1486 1489 1493 1496 1500 1503 1507 1510 0 1 1 1 2 2 2 3 3
118 1514 1517 1521 1524 1528 1531 1535 1538 1542 1545 0 1 1 1 2 2 2 3 3
119 1549 1552 1556 1560 1563 1567 1570 1574 1578 1581 0 1 1 1 2 2 3 3 3
120 1585 1589 1592 1596 1600 1603 1607 1611 1614 1618 0 1 1 1 2 2 3 3 3
121 1622 1626 1629 1633 1637 1641 1644 1648 1652 1656 0 1 1 2 2 2 3 3 3
122 1660 1663 1667 1671 1675 1679 1683 1687 1690 1694 0 1 1 2 2 2 3 3 3
123 1698 1702 1706 1710 1714 1718 1722 1726 1730 1734 0 1 1 2 2 2 3 3 4
124 1738 1742 1746 1750 1754 1758 1762 1768 1770 1774 0 1 1 2 2 2 3 3 4
125 1778 1782 1786 1791 1795 1799 1803 1807 1811 1816 0 1 1 2 2 2 3 3 4
126 1820 1824 1828 1832 1837 1841 1845 1849 1897 1858 0 1 1 2 2 3 3 3 4
127 1862 1866 1871 1875 1879 1884 1888 1892 1941 1901 0 1 1 2 2 3 3 3 4
128 1905 1910 1914 1919 1923 1928 1932 1936 1941 1945 0 1 1 2 2 3 3 4 4
129 1950 1954 1959 1963 1968 1972 1977 1982 1986 1991 0 1 1 2 2 3 3 4 4
130 1995 2000 2004 2009 2014 2018 2023 2028 2032 2037 0 1 1 2 2 3 3 4 4
131 2042 2046 2051 2056 2061 2065 2070 2075 2080 2084 0 1 1 2 2 3 3 4 4
132 2089 2094 2099 2104 2109 2113 2118 2123 2128 2133 0 1 1 2 2 3 3 4 4
133 2138 2143 2148 2153 2158 2163 2168 2173 2178 2183 0 1 1 2 2 3 3 4 4
134 2188 2193 2198 2203 2206 2213 2218 2223 2228 2234 1 1 2 2 3 3 4 4 5
135 2239 2244 2249 2254 2259 2265 2270 2275 2280 2256 1 1 2 2 3 3 4 4 5
541
CA FOUNDATION STATISTICS
136 2291 2286 2301 2307 2312 2317 2323 2328 2333 2339 1 1 2 2 3 3 4 4 5
137 2344 2350 2355 2359 2366 2271 2377 2382 2388 2393 1 1 2 2 3 3 4 4 5
138 2399 2404 2410 2415 2421 2427 2432 2438 2443 2449 1 1 2 2 3 3 4 4 5
139 2455 2460 2466 2472 2477 2483 2489 2495 2500 2506 1 1 2 2 3 3 4 5 5
140 2512 2518 2523 2529 2535 2541 2547 2553 2559 2564 1 1 2 2 3 4 4 5 5
141 2570 2576 2582 2588 2594 2600 2606 2612 2618 2624 1 1 2 2 3 4 4 5 5
142 2630 2636 2642 2649 2655 2661 2667 2673 2679 2624 1 1 2 2 3 4 4 5 6
143 2692 2698 2704 2710 2716 2723 2729 2735 2742 2748 1 1 2 3 3 4 4 5 6
144 2754 2761 2767 2773 2780 2786 2793 2799 2805 2812 1 1 2 3 3 4 4 5 6
145 2818 2825 2831 2838 2844 2851 2858 2864 2871 2877 1 1 2 3 3 4 5 5 6
146 2884 2891 2897 2904 2911 2917 2924 2931 2938 2944 1 1 2 3 3 4 5 5 6
147 2951 2958 2965 2972 2979 2985 2992 2999 3006 3013 1 1 2 3 3 4 5 5 6
148 3020 3027 3034 3041 3048 3055 3062 3069 3076 3083 1 1 2 3 4 4 5 6 6
149 3090 3097 3105 3112 3118 3126 3133 3141 3148 3155 1 1 2 3 4 4 5 6 6
542
CA FOUNDATION STATISTICS
0 1 2 3 4 5 6 7 8 9 1 2 3 4 5 6 7 8 9
150 3162 3170 3177 3184 3192 3199 3206 3214 3221 3228 1 1 2 3 4 4 5 6 7
151 3236 3243 3251 3258 3268 3273 3281 3289 3296 3304 1 2 2 3 4 5 5 6 7
152 3311 3319 3327 3334 3342 3350 3357 3365 3373 3381 1 2 2 3 4 5 5 6 7
153 3388 3396 3404 3412 3420 3428 3436 3442 3451 3459 1 2 2 3 4 5 6 6 7
154 3467 3475 3483 3491 3499 3508 3516 3524 3532 2540 1 2 2 3 4 5 6 6 7
155 3548 3556 3565 3573 3581 3589 3597 3606 3614 3622 1 2 2 3 4 5 6 6 7
156 3631 3639 3648 3656 3664 3673 3681 3690 3698 3707 1 2 3 3 4 5 6 7 8
157 3715 3724 3733 3741 3750 3758 3767 3776 3784 3793 1 2 3 3 4 5 6 7 8
158 3802 3811 3819 3828 3837 3846 3855 3864 3873 3882 1 2 3 4 4 5 6 7 8
159 3890 3899 3908 3917 3926 3936 3945 3954 3963 3972 1 2 3 4 4 5 6 7 8
160 3981 3990 3999 4009 4018 4027 4036 4046 4055 4065 1 2 3 4 5 6 6 7 8
161 4074 4083 4093 4102 4111 4121 4130 4140 4150 4159 1 2 3 4 5 6 7 8 9
162 4169 4178 4188 4198 4207 4217 4227 4236 4246 4256 1 2 3 4 5 6 7 8 9
163 4266 4276 4285 4295 4305 4315 4325 4335 4345 4355 1 2 3 4 5 6 7 8 9
164 4365 4375 4385 4395 4406 4416 4426 4436 4446 4457 1 2 3 4 5 6 7 8 9
165 4467 4477 4487 4498 4508 4519 4529 4539 4550 4560 1 2 3 4 5 6 7 8 9
166 4571 4581 4592 4603 4613 4624 4634 4645 4656 4667 1 2 3 4 5 6 7 9 10
167 4677 4688 4699 4710 4721 4732 4742 4753 4764 4775 1 2 3 4 5 7 8 9 10
168 4788 4797 4808 4819 4831 4842 4853 4864 4875 4887 1 2 3 4 6 7 8 9 10
169 4898 4909 4920 4932 4943 4955 4986 4977 4989 5000 1 2 3 5 6 7 8 9 10
170 5012 5023 5035 5047 5058 5070 5082 5093 5105 5117 1 2 4 5 6 7 8 9 11
171 5129 5140 5152 5164 5176 5188 5200 5212 5224 5236 1 2 4 5 6 7 8 10 11
172 5248 5260 5272 5284 5297 5309 5321 5333 5346 5358 1 2 4 5 6 7 9 10 11
173 5370 5383 5395 5408 5420 5433 5445 5458 5470 5483 1 3 4 5 6 8 9 10 11
174 5495 5508 5521 5534 5546 5559 5572 5585 5598 5610 1 3 4 5 6 8 9 10 12
175 5632 5636 5649 5662 5675 5689 5702 5715 5728 5741 1 3 4 5 7 8 9 10 12
176 5754 5768 5781 5794 5808 5821 5834 5848 5861 5875 1 3 4 5 7 8 9 11 12
177 5858 5902 5916 5929 5943 5957 5970 5984 5998 6012 1 3 4 5 7 8 10 11 12
178 6028 6039 6053 6067 6081 6095 6109 6124 6138 6152 1 3 4 6 7 8 10 11 13
179 6166 6180 6194 6209 6223 6237 6252 6266 6281 6295 1 3 4 6 7 9 10 11 13
180 6310 6324 6339 6353 6368 6383 6397 6412 6427 6442 1 3 4 6 7 9 10 12 13
181 6457 6471 6486 6501 6516 6531 6546 6561 6577 6592 2 3 5 6 8 9 11 12 14
182 6607 6622 6637 6653 6668 6683 6699 6714 6730 6745 2 3 5 6 8 9 11 12 14
183 6761 6776 6792 6808 6823 6839 6855 6871 6887 6902 2 3 5 6 8 9 11 13 14
184 6918 6934 6950 6965 6982 6598 7015 7031 7047 7063 2 3 5 6 8 10 11 13 15
185 7079 7096 7112 7129 7145 7161 7178 7194 7211 7228 2 3 5 7 8 10 12 13 15
186 7244 7261 7278 7295 7311 7328 7345 7362 7379 7396 2 3 5 7 8 10 12 13 15
187 7413 7430 7447 7464 7482 7499 7516 7534 7551 7568 2 3 5 7 9 10 12 14 16
543
CA FOUNDATION STATISTICS
188 7586 7603 7621 7638 7656 7674 7691 7709 7727 7745 2 4 5 7 9 11 12 14 16
189 7762 7780 7796 7816 7834 7852 7870 7889 7907 7925 2 4 5 7 9 11 13 14 16
190 7943 7962 7980 7998 8017 8035 8054 8072 8091 8110 2 4 6 7 9 11 13 15 17
191 8128 8147 8166 8185 8204 8222 8241 8260 8279 8299 2 4 6 8 9 11 13 15 17
192 8318 8337 8356 8375 8395 8414 8433 8453 8472 8492 2 4 6 8 10 12 14 15 17
193 8511 8531 8551 8570 8590 8610 8630 8650 8670 8690 2 4 6 8 10 12 14 16 18
194 8710 8730 8750 8770 8790 8810 8831 8851 8872 8892 2 4 6 8 10 12 14 16 18
195 8913 8933 8954 8974 8995 9016 9036 9057 9078 9099 2 4 6 8 10 12 15 17 19
196 9120 9141 9162 9183 9204 9226 9247 9268 9290 9311 2 4 6 8 11 13 15 17 19
197 9333 9354 9376 9397 9419 9441 9462 9484 9506 9528 2 4 7 9 11 13 15 17 20
198 9550 9572 9594 9616 9638 9661 9683 9705 9727 9750 2 4 7 9 11 13 16 18 20
199 9772 9795 9817 9840 9836 9886 9908 9931 9954 9977 2 5 7 9 11 14 16 18 20
Example:
If Log x = 0.301. then x = Antilog 0.301 = 2
If Log x = 1.301. then x = (Antilog 0.301) × 10 = 20
If Log x = 2.301. then x = (Antilog 0.301) × 100 = 200
If Log x = (–) 0.699, then we can write Log x = (– 1 + 0.301) : Thus x = Antilog (0.301) / 10 = 0.2
If Log x = (–) 1.699, then we can write Log x = (– 2 + 0.301) : Thus x = Antilog (0.301) / 100 = 0.02
544