A2 StatsNotes
AL STATISTICS
* Poisson Distribution
Representation: $X \sim \mathrm{Po}(\lambda)$, where $\lambda$ = mean
Formula: $P(X = x) = \dfrac{e^{-\lambda}\lambda^{x}}{x!}$, where $\lambda > 0$
3) Occur independently
Variance: $\sigma^2 = \mathrm{Var}(X) = \lambda$
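A minimal Python sketch to check the Poisson formula numerically. The function name `poisson_pmf` and the value of the mean (3) are illustrative assumptions, not part of the notes.

```python
import math

def poisson_pmf(x: int, lam: float) -> float:
    """P(X = x) for X ~ Po(lam), using e^(-lam) * lam^x / x!."""
    return math.exp(-lam) * lam**x / math.factorial(x)

lam = 3.0  # illustrative mean, chosen only for this example
probs = [poisson_pmf(x, lam) for x in range(60)]  # tail beyond 59 is negligible

total = sum(probs)                                           # close to 1
mean = sum(x * p for x, p in enumerate(probs))               # close to lam
var = sum(x**2 * p for x, p in enumerate(probs)) - mean**2   # close to lam
```

The mean and the variance both come out equal to the parameter, which is the defining property used later for sums of Poisson variables.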
BINOMIAL TO POISSON APPROXIMATION
If $X \sim B(n, p)$ and $n > 50$ and $np < 5$,
then X can be approximated by $X \sim \mathrm{Po}(\lambda)$, where $\lambda = np$.
No continuity correction (cc) required, as discrete → discrete.
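A short Python sketch comparing an exact binomial probability with its Poisson approximation. The values n = 100 and p = 0.02 are assumptions chosen so that n > 50 and np < 5; the helper names are hypothetical.

```python
import math

def binomial_pmf(x, n, p):
    """Exact P(X = x) for X ~ B(n, p)."""
    return math.comb(n, x) * p**x * (1 - p)**(n - x)

def poisson_pmf(x, lam):
    """P(X = x) for X ~ Po(lam)."""
    return math.exp(-lam) * lam**x / math.factorial(x)

n, p = 100, 0.02   # illustrative: n > 50 and np = 2 < 5
lam = n * p

# P(X <= 2): sum the probabilities directly, no continuity correction
exact = sum(binomial_pmf(x, n, p) for x in range(3))
approx = sum(poisson_pmf(x, lam) for x in range(3))
```

For these values the two answers differ by far less than 0.01.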
POISSON TO NORMAL APPROXIMATION
If $X \sim \mathrm{Po}(\lambda)$ and $\lambda > 15$, then
X can be approximated by $X \sim N(\lambda, \lambda)$.
Continuity correction (cc) required, as discrete → continuous.
Eg.
$P(X > 55) \to P(X > 55.5)$
$P(X \ge 45) \to P(X > 44.5)$
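A hedged Python sketch of the first example, P(X > 55) becoming P(X > 55.5). The mean of 50 is an assumption made for illustration (any mean above 15 would do).

```python
import math
from statistics import NormalDist

lam = 50.0  # illustrative Poisson mean, greater than 15
normal = NormalDist(mu=lam, sigma=math.sqrt(lam))  # N(lam, lam): variance = lam

# Normal approximation with continuity correction: P(X > 55) -> P(X > 55.5)
approx = 1 - normal.cdf(55.5)

# Exact Poisson value for comparison: 1 - P(X <= 55)
exact = 1 - sum(math.exp(-lam) * lam**x / math.factorial(x) for x in range(56))
```

Note that `NormalDist` takes the standard deviation, so the variance has to be square-rooted.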
* LINEAR COMBINATION OF RANDOM VARIABLES :
1) $Y = X \pm b$
2) $Y = aX$
3) $Y = aX + b$: $E(Y) = aE(X) + b$, $\mathrm{Var}(Y) = a^2\,\mathrm{Var}(X)$
4) $Y = aX + bW$ (X, W independent): $E(Y) = aE(X) + bE(W)$, $\mathrm{Var}(Y) = a^2\,\mathrm{Var}(X) + b^2\,\mathrm{Var}(W)$
IMPORTANT: If two values are not independent, e.g. 2X = X + X: $\mathrm{Var}(2X) = 2^2\,\mathrm{Var}(X)$, not $2\,\mathrm{Var}(X)$
NOTE: Variance is always added.
If X & Y are independent and Poisson distributed then only X + Y is Poisson distributed (as mean & variance are equal).
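A small Python sketch that checks the rules for Y = aX + bW by listing every outcome. Two independent fair dice and the constants a = 2, b = -3 are assumptions chosen for illustration.

```python
from fractions import Fraction
from itertools import product

def mean_var(dist):
    """Mean and variance of a discrete distribution {value: probability}."""
    mean = sum(v * p for v, p in dist.items())
    var = sum(v**2 * p for v, p in dist.items()) - mean**2
    return mean, var

die = {v: Fraction(1, 6) for v in range(1, 7)}  # illustrative X and W
a, b = 2, -3

# Build the distribution of Y = aX + bW for independent X and W
y = {}
for (x, px), (w, pw) in product(die.items(), die.items()):
    y[a * x + b * w] = y.get(a * x + b * w, 0) + px * pw

ex, vx = mean_var(die)   # E(X) = 7/2, Var(X) = 35/12
ey, vy = mean_var(y)     # E(Y) and Var(Y) from the full distribution
```

Even with a negative b the variances add, because b is squared.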
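CONTINUOUS RANDOM VARIABLES
1) $\int_{-\infty}^{\infty} f(x)\,dx = 1$ (Area under the curve = 1)
2) $f(x)$ is never negative
$P(a \le X \le b) = \int_a^b f(x)\,dx$
Median (M): $P(X \le M) = \int_{-\infty}^{M} f(x)\,dx = \tfrac{1}{2}$
Mean ($\mu$) / Expectation, $E(X)$: $\mu = E(X) = \int_{-\infty}^{\infty} x f(x)\,dx$
Variance, $\mathrm{Var}(X)$: $\sigma^2 = \int_{-\infty}^{\infty} x^2 f(x)\,dx - \mu^2$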
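A Python sketch applying these integrals numerically. The density f(x) = 3x² on [0, 1] and the helper `integrate` (a simple midpoint rule) are assumptions for illustration only.

```python
def integrate(g, lo, hi, steps=100_000):
    """Midpoint-rule approximation of the integral of g over [lo, hi]."""
    h = (hi - lo) / steps
    return h * sum(g(lo + (i + 0.5) * h) for i in range(steps))

def f(x):
    """Illustrative density: 3x^2 on [0, 1], zero elsewhere."""
    return 3 * x**2

area = integrate(f, 0, 1)                               # total area, should be 1
mu = integrate(lambda x: x * f(x), 0, 1)                # mean, exact value 3/4
var = integrate(lambda x: x**2 * f(x), 0, 1) - mu**2    # variance, exact value 3/80
median = 0.5 ** (1 / 3)                                 # solves M^3 = 1/2
p_below_median = integrate(f, 0, median)                # should be 1/2
```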
SAMPLING
Entire collection is the POPULATION
Subsection of the population is a SAMPLE
Investigating the entire population is a CENSUS
WHY IS CENSUS NOT PREFERRED
• MORE TIME AND EFFORT REQUIRED
• MORE ERRORS LIKELY DURING MEASUREMENT OR ANALYSIS
• TASK IS MORE COMPLEX AND CAN BE EXPENSIVE
• MAY LEAD TO DESTRUCTION OF POPULATION.
SAMPLING
1) BIASED SAMPLING → OVER REPRESENTING OR UNDER REPRESENTING A FEATURE OF THE POPULATION.
2) RANDOM SAMPLING → EVERY POSSIBLE SAMPLE OF SIZE N WHICH COULD BE TAKEN FROM THE POPULATION HAS THE SAME CHANCE OF BEING PICKED.
SAMPLE MEAN & VARIANCE
$E(\bar{X}) = \mu$ and $\mathrm{Var}(\bar{X}) = \dfrac{\sigma^2}{n}$
CENTRAL LIMIT THEOREM
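For independently distributed random variables $X_1, X_2, \dots, X_n$ with mean $\mu$ and variance $\sigma^2$, if n is
sufficiently large (n > 30), then $\bar{X}$ is approximately normally distributed with mean $\mu$ & variance $\dfrac{\sigma^2}{n}$.
In short,
If n > 30, $\bar{X} \sim N\!\left(\mu, \dfrac{\sigma^2}{n}\right)$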
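A simulation sketch in Python of the sample mean of n values. The Uniform(0, 1) population, the seed, and the sizes n = 40 and 5000 trials are all assumptions picked for illustration.

```python
import random
from statistics import fmean, pvariance

random.seed(0)          # fixed seed so the run is repeatable
n, trials = 40, 5000    # illustrative sizes, with n > 30

# Uniform(0, 1) has mean 1/2 and variance 1/12
means = [fmean(random.random() for _ in range(n)) for _ in range(trials)]

mean_of_means = fmean(means)      # should be near 1/2
var_of_means = pvariance(means)   # should be near (1/12) / n
```

The population here is far from normal, yet the sample means cluster around the population mean with variance close to the population variance divided by n.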
PLEASE NOTE
Note : when all the individual elements of n have If all the elements have not been sampled ,
ESTIMATION
① If V is the variance of a sample of size n taken from a population with an unknown variance $\sigma^2$, then:
$E(V) = \dfrac{n-1}{n}\,\sigma^2$
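② If random values $X_1, X_2, \dots, X_n$ are taken from a population then:
unbiased estimate of population mean = $\bar{x} = \dfrac{\sum x}{n}$
unbiased estimate of population variance = $\dfrac{n}{n-1}$ × sample variance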
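A brief Python sketch of the unbiased estimates, assuming V means the sample variance with divisor n. The five data values are made up for illustration.

```python
from statistics import fmean, pvariance, variance

data = [4.0, 7.0, 5.0, 9.0, 5.0]   # illustrative sample, not from the notes
n = len(data)

x_bar = fmean(data)        # unbiased estimate of the population mean
v = pvariance(data)        # sample variance with divisor n (biased)
s2 = n / (n - 1) * v       # rescaled to divisor n - 1 (unbiased)
```

`statistics.variance` already uses the divisor n - 1, so it should agree with `s2`.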
③ Confidence Interval
A confidence interval for the mean is a range of values which has a given probability of trapping the population mean $\mu$.
[Figure: normal curve centred on $\mu$, with the α% confidence interval running from $\mu - c$ to $\mu + c$, the limits marked at $\pm z$.]
Confidence Interval: $\left[\bar{x} - z\dfrac{\sigma}{\sqrt{n}},\ \bar{x} + z\dfrac{\sigma}{\sqrt{n}}\right]$
where,
$\bar{x}$ = sample mean
$\sigma^2$ = population variance
n = sample size
z = value with $\Phi(z) = \dfrac{1 + \alpha}{2}$, where α is the confidence level as a decimal (e.g. z = 1.96 for 95%)
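A hedged Python sketch of a confidence interval for the mean with known population variance. The function name and the numbers (sample mean 50, standard deviation 4, n = 64, 95% level) are assumptions for illustration.

```python
import math
from statistics import NormalDist

def mean_confidence_interval(x_bar, sigma, n, level=0.95):
    """Interval x_bar +/- z * sigma / sqrt(n), with sigma the population s.d."""
    z = NormalDist().inv_cdf((1 + level) / 2)   # about 1.96 for 95%
    half_width = z * sigma / math.sqrt(n)
    return x_bar - half_width, x_bar + half_width

lo, hi = mean_confidence_interval(x_bar=50.0, sigma=4.0, n=64)
```

The interval is symmetric about the sample mean, and quadrupling n halves its width.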
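(proportion)
If $p_s$ is the proportion of members with an attribute in a sample of size n, the confidence interval for the population proportion is
$\left[p_s - z\sqrt{\dfrac{p_s(1-p_s)}{n}},\ p_s + z\sqrt{\dfrac{p_s(1-p_s)}{n}}\right]$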
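A matching Python sketch for a proportion. The function name and the values (sample proportion 0.4, n = 150, 95% level) are illustrative assumptions.

```python
import math
from statistics import NormalDist

def proportion_confidence_interval(p_s, n, level=0.95):
    """Interval p_s +/- z * sqrt(p_s * (1 - p_s) / n) for a sample proportion p_s."""
    z = NormalDist().inv_cdf((1 + level) / 2)
    half_width = z * math.sqrt(p_s * (1 - p_s) / n)
    return p_s - half_width, p_s + half_width

lo, hi = proportion_confidence_interval(p_s=0.4, n=150)
```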
The confidence interval is approximate as:
• Discrete data is approximated to continuous
• Population variance is estimated from the sample and the estimate used is biased.
• The distribution of $p_s$ is approximately normal
Hypothesis Testing
$H_0$ = null hypothesis (theory of no change)
$H_1$ = alternate hypothesis (theory of change)
One tail:
$H_0: \mu = x$
$H_1: \mu > x$ or $H_1: \mu < x$
Reject $H_0$ if Probability ≤ significance level.
Two tail:
$H_0: \mu = x$
$H_1: \mu \ne x$
Reject $H_0$ if Probability ≤ ½ × significance level.
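A Python sketch of the rejection rule for a test on a normal mean with known standard deviation. The helper name and all numbers (hypothesised mean 50, sample mean 51.2, standard deviation 4, n = 64, 5% significance) are assumptions for illustration.

```python
import math
from statistics import NormalDist

def upper_tail_probability(x_bar, mu0, sigma, n):
    """P(sample mean >= x_bar) when H0: mu = mu0 is true."""
    z = (x_bar - mu0) / (sigma / math.sqrt(n))
    return 1 - NormalDist().cdf(z)

significance = 0.05
p = upper_tail_probability(x_bar=51.2, mu0=50.0, sigma=4.0, n=64)

reject_one_tail = p <= significance       # H1: mu > 50
reject_two_tail = p <= significance / 2   # H1: mu != 50, half the level in each tail
```

Here the tail probability is about 0.008, so the null hypothesis is rejected in both versions of the test.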
* Errors
Type I error: Reject null hypothesis if it is true
P(Null hypothesis rejected | true) = significance level
Type II error: Accept null hypothesis if it is false
P(Null hypothesis accepted | false)
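A Python sketch of both error probabilities for a one-tail test on a normal mean. The setup is assumed for illustration: hypothesised mean 50, true mean 51, standard deviation 4, n = 64, 5% significance.

```python
import math
from statistics import NormalDist

significance, mu0, mu_true, sigma, n = 0.05, 50.0, 51.0, 4.0, 64
se = sigma / math.sqrt(n)   # standard deviation of the sample mean

# Upper-tail test: reject H0 when the sample mean exceeds the critical value
critical = NormalDist(mu0, se).inv_cdf(1 - significance)

type_1 = 1 - NormalDist(mu0, se).cdf(critical)   # P(reject H0 | H0 true)
type_2 = NormalDist(mu_true, se).cdf(critical)   # P(accept H0 | true mean is mu_true)
```

The Type I probability equals the significance level by construction, while the Type II probability depends on the assumed true mean.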
Notes by Jay Sha