Professional Documents
Culture Documents
Observarion + (+1) TH Median of Continuous Frequency9
Observarion + (+1) TH Median of Continuous Frequency9
Observarion + (+1) TH Median of Continuous Frequency9
B. MEAN DEVIATION
-
-
frequency distribution, x=a+| xh M.D. (a) = -
where ' a is the assumed mean and d , = Xi - A
2-M
Median
h observarion +[+1]th observaio M.D. (M)=
.
MEDIAN OF CONTINUOUS FREQUENCY9
DISTRIBUTION
MEAN DEVIATION FOR GROUPED DATA
(a) Mean Deviation for Discrete Frequency
Distribution
Then, Median (M)- 2 h ) Mean deviation about the mean , for
where, = lower limit o f m e d i a n class, h = class size, observation x, I2 .-.. , OCcurring with
frequenciesJ1J2 Jn 15
frequencyofmedian
Softhe class, CCumulative frequency
class just betore the median class,
Zsk-
N=2 M.D. (T) =
N
- , where N = 2.
(11) Mean deviation about the median M, (iv) Shortcut Method to Find Variance (o
and Standard Deviation (o) for Discrete
or Continuous Frequency Distribution
M.D.(M)=
k-M
N whare A
i=|
N h
A is the assumed mean.
(1) Mean deviation about median M,
11. cOEFFICIENT OF VARIATION
M.D (M= i - l
l-M To compare the variability or dispersion of two or more
N
distribution, we calculate the coefficient of variation (
Limitations of mean devia tion: In a series, C.v.)ofcach distribution. The serics hav ing greater C.V.
where the degree of variability is very high, is said to be more variable (or less consistent) than the
the median is not a representative central other. The series having lesser C.V. is said to be more
consistent (or less variable) than the other.
ten
dency. Thus, the mean deviation about
median calculated for such series cannot be
fully relied. 12 CORRELATION ANALYSIS
10. VARIANCE AND STANDARD DEVIATION If two quantities vary in such a way that fluctuation in
one are accompanicd by fluctuation in other, these
() Variance ( ) and Standard Deviation
quantities are said to be correlated. The statistical tool
(o) for Ungrouped Data bywhich the relationship between two or more than two
be observations with the
Let x, A2 ,
mean, then
n as variables studied is called correlation.
The measures of correlation called the coeflicient of
correlation and denoted by r.
() Covariance: The covariance between two vanables
Where, =and =
where
N=2Sandf
i=l
s are the frequencies of x; s
(i) Karl Pearson's correlation coefficient:
(ii) Variance ( ) and Standard Deviation
(o) of a Continuous Frequency Cov,) -T-
Distribution (Grouped Data) Var (r).Var(y) E(-F-z{-
nEy-(2x)
where x, are the midpoints of the dasses and f their o-({-(E9
respective frequencies.
(i) Coefficient of rank correlation: This formula is 14. REGRESSION ANALYSIS
applhed to the problems in which data cannot be
measured quantitatively but qualitative assessment L i n e of regression ofy on x: The line of
is possible such as beauty, honesty etc. In this regression ofy on x gives the best estimate of the
case,
the best individual is given rank number, value of y given value of r and is given by
next rank 2 and so on. The coeflicient of rank
correlation is given by the formula
y-=b-X);b,, =rs, /o,
(i) Line of regression of x on y: The line of
R-l
R=1-4
) regression ofr on y gives the best estimate of the
vaule of r for given value of y is given by
the rank
where, d, is difference of corresponding
and n is the number of pairs of observations.
x-=by-F}:b =r.
13. CHARACTERISTICs OF CORRELATION
(i) of of
COEFFICIENT Coefficient regression
coefficient of regression
x
and y: The
denoted
of y onr by b
and is given by
) -1srsl
(i) Ifr=-1, then there is perfoct negative correlation Cov(x,y)
betwcen x and y i.e. corresponding to an inerease
(or decrease) in one variable, there is a proportional
decrease (or increase) in the aher variable.
(n) Ifr=1, then there is perfect positive correlation
between x and y i.e. corresponding to an inerease
nEx-(E)
of ofr
(or decrea se) in one variable, there is proportional v) Coefficiet regression on y: The coefficient
of regress ion of r on y denoted by b, and is given
increase (or decrease) in the aher variable.
)_n2V-Zxy
(iv) Ifr = 0, then x and y a r e not correlated i.e. the
by = h ==oVN,
changes in one variable are not followed by Oy y
nEy-(y)
changes in the other.
()Ifo<r<1, then there is a positive correlation 15. PROPERTIES OF REGRESsION
COEFFICIENTS
between x and y i.e. and increase (or decrease) in
one variable corresponds to an increase (or Both regression cocficients have the same sign i.e.
decrease) in the other. either both are positive or both are negative.
(vi) If-I <r<0, then there is negative correlation ci) The sign of correlation coeficient is same as that
between x and y i.e. an increase (or decrease) in
of regression cocficient i.e.
one variable corresponds to decrease (or increase)
in the other. r>0, if b 0 and bw0 and r 0, if b,< 0
and b,, <0.
(vii) The valuer is called coefticient of determination.
Ifr = 0.5, then r = 0.25, which means only 25% (i) The coefficients of corelation is the geonmetric
ofvariations are explained and remaining 75% are mean between the two regression coetticients
unexplained. They may due to external causes.
r by
The sign to be taken outside the square root is that
(vii)Standard Error (SE) is defined as SE =
Practice MCQs
1. Themode of'the followingseries3,4,2, 1, 7,6,7,6, 8,6,5 is 7. Coefficient of variation of twodistributions are 50 and 60
(a) b) (c) (d) 8 and their arithmetic means are 30 and 25, respectively.
Then, difference of their standard deviations is
2. Aset of numbers consists ofthree 4's, five 5's, six 6's, eight (c) I5
(a) b) d) 25
8's and seven 10's. The mode of this set of numbers is is
8
(a)6 (b) 7 (c) (d) 10
Theproduction of food grains in Maharashtra given for
the 12 years from 1992 to 2003. Which one of the following
3. Consider the following data which represents the runs representations is most suitable to depict the data ?
scored by two batsmen in their last ten matches as a) A simple bar diagram
(b) A pie diagram
Batsman A: 30,91,0,64, 42, 80, 30, 5, 117,71 ()A component bar diagram with the components
Batsman B :53,46, 48, 50, 53, 53, 58, 60, 57, 52 arranged in chronological order
Which of the following is/are true about the data? (d) A broken line graph
L Mean ofbatsman A runs is 53. 9. Mean of 100 items is49. It was discovered that three items
which should have been 60, 70, 80 were wrongly read as
L Median of batsman A runs is 42.
40, 20, 50 respectively. The correct mean is
L Mean of batsman B runs is 53. LA
V Median of batsman B runs is 53.
(a) OnlyI is true (b) I and IIl are true
(a) 48 (b) 825 (c) 50 d)
10. The obscrvations 29,32, 48, 50, x, x + 2, 72, 78, 84, 95 are
(c) I, Il and IV are true (d) All are true
arranged in ascending order. What is the value of x if the
The mean of 13 observations is 14. If the mean of the first median ofthe data is 63?
observations is 12 and that of the last 7 observations is
(a) 61 (b) 62 (c) 62.5 (d) 63
16, what is the value of the 7th observation ? LVI 1l. Given (i) 85 observations which are not sorted and (iî) 150
observati ons which are sorted and arranged in an incrcasing
(a) 12 b) 13 14 d) I5
order. The modian values of (i) & (i) respactivedy can be föund
5. When tested, the Iives(in hours) of 5 bulbs were noted as as
follows (a) (i) 43 observat ion (ii) A.M. of 75h and 76th
1357, 1090, 1666, 1494, 1623 observation
The mean deviations (in hours) from their mean is (b) (i)43 observation (i) 76" observation
(a) 178 (b) 179 C) 220 (d) 3 (c) (i) can not be found (i) can not be found
regression lines between height (x) and weight (c) The mean and median both will decrease
(y) a r e 4 y - 1 5 x + 4 1 0 = 0 and 30x - 2 y - 825 = 0, then
(d) The mean remains the same but median will decrea se
what will be the correlation coefficient between height
16. If the comelation coefficient between x and y is 0.6,
and weight? INDANA 2017|
covariance is 27 and variance of y is 25, then what is the
variance ofx? INDANA 2018
(a) b) ( d) 4
9 B1
9. In an examination, 40% of candidates got secon d class. a) (b) 25
(c) 9 (d)
When the data are represented by a pie chart, what is the
17. Lat g be the mean of lfx,at cy, for
angle corresponding
(a) 40 6)
tosecond()class?
144
INDANA 20171
d ) 320
x.*2*g
some constants a and c, then what will be the mean
of y.
10. Consider the following statements: NDA/NA 2017| 2 Y3 Y? INDANA 2018
Statementl: Rangeis not agood measure of dispersion.
Statement 2: Rangeishighly affected by the ex istence of
(a) a+cr (b) a- ( x-a (d)
extreme values.
Which one of the following is correct in respect of the 18. Consider the following statements:
above statements?
NDANA2018
(a) Both Statementl and Statement 2 are correct and LIf the correlation coefficient ry 0, then the two
lines of regression are parallel to each other.
2 is I
Statement the
correct explanation of Slatement 2 If the correlation cocficient ry+1, then the two
(b) Both Statement and Statement 2 are correet but
lines ofregression are perpendicular to each other.
2 is not the correct explanation of
Statement
Statementi Which of the above statements is/are correct?
(a) 1only (b) 2only
(C) Statement
(d)
I correct
is but Statement 2 is not correct
Statement 2 is correct but Statement I is not correct (c) Bothl and 2 (d) Ncither I nor 2
1. Ifthe data are modeately non-symmetrical, then which one of 19. If 4x 5 y +33 =0 and 20x -9y 107 are two lines of
the following empirical relaticnships is oaract? [NDA/NA 2017| regression, then what are the values of and y
(a) 2xStandard dev1atians* Mean deviation
(6) 5*Standard deviation= 2 x Mean deviation respectively? INDANA2018
4x 5 x Mean deviation (a) 12 and 18 (b) 18 and 12
(c) Standard deviation =
(d) 17and 13
(d)5x Standard deviatin =4x Mean deviation (c) 13and17
12. Data can be represented in which ofthe following forms? 20. Consider the following statements: NDANA 2018
L Textual form 2. Tabular form 3. Graphical fom independent of change in scale an d change
Select the correct answer using the code given below.
Mean is
inorigin.
(a) I and 2 only (b)
INDA/NA 2017
2 and 3 only
Variance is indepandent of change in scalebut not in arigin.
Which of the above statements is/are correct?
(c) I and 3 only (d) 1,2 and 3 (a) 1only (b) 2only
13. For given statistical data, the graphs for less than ogive
() Both I and 2 (d) Nither I nor 2
and more than ogi ve are
drawn. the pont ar wieu u 21. Consider the following statements: INDANA 2018
wO curves intersect 1sP,
then abscissa
of point P gives** T h e sum ofdeviations fYom mean is always zero.
the value of which one ofthe following measures of central
2 The sum of absolute deviations is minimum when
tendency? INDANA 2017|
(a) Median (b) Mean taken around median.
(c) Mode (d) Geometric mean Which of the above statements is/are correct?
(a) 1only (b) 2only
14. If the regression coeflicient of x on y and y on x are5 (c) Both I and 2 (d) Nither I nor 2
andrespetively, then what is the correlation 22. What is the modianofthe numbers 4,6,0,93. 48.7.6.2.3.
coe flicient between x and y?
12.7, 3.5, 8.2, 6.1,3.9,
S.2 INDA/NA 2018|
INDANA 20171
(a) 3.8 b) 49 (c) 5.7 (d) 6.0
23. 20%% of the 32. For the variables r and y, the regression lines are
In test
a
in Mathematics, students obtained
hrst class. It the data are represented by a Pre-Chart,
two
6x+ y = 30 and 3x + 2y = 25. What are the values of x ,
I1.25, 1.25
315s7 3
What is the value of median of the distribution?
(c)11.25,2.5 d) 12.5,2.5 NDA/NA 2019-1
27. Consider the following statements: NDA/NA 2019-1|I (a) 4 (b) () 6 (d)
I. The algebraic sum of deviat ions of a set or values 36. Mean of 100 observations is 50 and standard deviation is
from their arithmetic mean is always zero. 10. If5 is added to each observation, then what will be the
Arithmetic mean > Median> Mode for a symmetric
new mean and new standard deviation respectively?
distribution.
Which of the above statements is/are correct ? INDANA 2019-1
(a) s0,10 (b) 50,15 () , 10 (d) 55, 15
(a) 1 only 2only
(c) Both I and 2 (d) Neither I nor 2 37. If the range of a set of observations on a variable X is
known to be 25 and i f Y = 40+3X, then what is the range of
28. Let the correlation cocfficient between X and Y be 0.6.
variables Z and W are defined as Z X+ 5 and the set of corresponding observation.s on Y?
Random INDANA 2019-11]
W-.What isthe correlationcoefficient betwoen Zand W? (a) 25
(b) 40 (c) (d) 15
NDANA 2019-1 38. If V is the variance and M is the mean of first 15 natural
() 036 (d) 06 numbers, then what is V+M* cqual to?
(a) . (b) 2
29. Ifall the natural numbers between I and 20 are multiplied NDANA2019-1]
48
by 3, then what is the variance of the resulting series ? (a) 124 (b) ()
INDANA 2019-1 39. A car travels first 60 km ata speed of 3V km/hr and travels
(a) 99.75 (b) 19.75 (c) 299.25 (d) 399.25 next 60 km at 2vkm/hr. What is the average speod of thecar?
30. The modian ofthe observations 22,24, 33, 37,x+ I,x+3,46, INDA/NA 2019-11
47,57, 58 in ascend ing order is 42. What are the values of (a) 2.5v km/hr (b) 2.4vkm/hr
Sth and 6th observations respectively? [NDA/NA 2019-1| (c) 2.2vkm/hr (d) 2.1 vkm/hr
(c) 43,46 (d) 40,40 0. h e mean weighi ot 150 Sudents in a certain class is 60 kg.
(a) 42,45 (b) 41,43
The mean weight of boys is 70 kg and that of ginis is 55 kg.
31. Arithmetic mean of 10 observations is 60 and sum of squares
What are the number of boys and grls respectively in the
from S0 is S00. What is the standard deviation
of deviations class? INDANA 2019-1|
of the observations? NDANA2019-11] (a) 75 and 75 (6) 50 and 100
(a) 20 (6) 21 C) 22.36 (d) 24.70 (c) 70 and 80 (d) 100 and S0
DIRECTIONS (Qs. 41-43): Read the folloing information 46. The arthmeie mean of 100abservations is 40. Later, itwas
and answer the three ilems that Jollow: found that an observation '53' was wrongly read as 83.
What is the correct arithmeic mean? INDA/NA 2020-1|
INDA/NA 2020-11 a) 398 (b) 39.7 ( c ) 39.6 (d) 39.5
Number of stud enis 47. LetXand Yrepresent prices (in) ofa commodity in Kolkata
Marks and Mumbai respectively. It is given that X= 65, Y= 67,
Physics Mathematics
2 . 5 , ay = 3.5 and r{X, Y)= 0.8. What is the
10-- 20 10 equation of regress ion of Yon X? NDA/NA 2020-11
-30 21 (a) Y=0.175X-5 (6) Y=1.12X-5.8
(c) Y=1.12X-5 (d) Y-0.17X+5.8
30-40
48. Consider a random variable X which follows Binomial
40-50
distribution with parameters n = 10 and p = Then
50-60 15 10
ANSWER KEY
Practice Ouestions
b)5a)9 136) 17 (c)21 (e)25 (b)29 (b)33|(a)37(d)
2()6d)106)14 (c)18()22 (b)26a30dy34 a) 38(6)
3 7)11d)1sa)19 23|(d 27 (31(a3s d) 39 (
4c) 8(a)12 d16|(d)|20 a) 24 (d) 28 (a) 32 ( |36 (d) 40 (d)
Past Year Questions
1(a)1|b13 (a)19 (c25 b31(a)37(c43 ()49(b
2 (c)|8b)|14 (a) 20 (d)|26 (c)32 (c)|38 (c)|44(b)50 d)
d9(c)156)| 21|(c)|27(a)|33 | (6)396)|45 | (d)51(d
4 d ) 10(a)16 d)22b)28b)34 b)40(b)46 (b
5(c) 11 (c)|17 (d)|23(C)|29(c)|35(b)41(c)|47(b)
L6 (d)|12 | (d)|18 | (d)|24|(b)|30 | 6)|36 | (c)| 42|(a) | 48 | (d |