Professional Documents
Culture Documents
Module 11
Module 11
Module 11
DATA ANALYSIS
MEASURES OF CENTRAL TENDENCY
Measures of
central
tendency
Median
For ordinal, interval and ratio scale data
*
+
,-.
!"#$%& = ( + Xh
/
Measures of
dispersion
¡ Variance is defined as the mean squared deviation of a variable from its arithmetic
mean.
" ∑'
$%& ($ )*
"
! = for a population of size N.
+
¡ The positive square root of the variance is called standard deviation.
∑'
$%& ($ )*
"
s.d= !=
+
∑% (
"#$ /" &" '&
)
¡ In case we have grouped data, s=
*'+
Where, ,- = mid-point of the ith class
0- = Frequency of the ith class
MEASURES OF DISPERSION: COEFFICIENT OF VARIATION
¡ In case of ordinal scale data, the measure of association between two variables is
obtained through Spearman’s rank order correlation coefficient.
Given by,
6 ∑ ()*
!" = 1 −
+(+* − 1)
Where, !" = Spearman’s rank order correlation coefficient
() = (.//0!0+10 .+ 2ℎ0 !4+5.+6 7/ 2ℎ0 .2ℎ !0897+(0+2
n= sample size
MEASURE OF CORRELATION FOR INTERVAL/ RATIO SCALE
¡ Pearson’s correlation coefficient given by:
%&'(), +) 6 1 − +)
∑4123()1 − ))(+ 6
!"# = =
-. -/
6 7 ∑4123(+1 − +)
∑4123()1 − )) 6 7
Y= f(X)
If Y and X are assumed to have a linear relationship, they can be expressed as,
!" =# + %& +'"
Where, # ()* % are parameters that are to be estimated.
#= intercept
%= slope/ coefficient
e= stochastic error term
SIMPLE LINEAR REGRESSION
ESTIMATING THE ALPHA AND BETA
Error square term= ∑ "#$ = ∑('# − ')# )2 You could now solve for (1) and
is to be minimized. (2) and get,
- #
')# = +* + ./
+* = '8 − .- /8
We could write this as,
- # )2
∑ "#$ = ∑('# − +* + ./ And
To minimize, 9
∑ ;2 <2 −;8 <8 =>?(;,<)
-
.= :
9 =
0 ∑ 123 ∑ ;2 $ −;
8 $ ?BC(;)
=0 ----(1) :
04
5
0 ∑ 123
6
07
=0. -----(2)
ASSUMPTIONS OF CLASSICAL LINEAR REGRESSION MODEL
Classical Linear Regression Model (CLRM): !" = # + ∑ &" ∗ (" + )"