Professional Documents
Culture Documents
Statistic Cheat Sheet
Statistic Cheat Sheet
Statistic Cheat Sheet
Statistics
Categorical bar chart,
pie chart
frequency
table,
relative
frequency
table,
proportion
Quantitative dotplot,
histogram,
boxplot
mean,
median, max,
min, standard
deviation,
z-score,
range, IQR,
five number
summary
Categorical vs
Categorical
side-by-side
bar chart,
segmented bar
chart
two-way
table,
difference in
proportions
Quantitative
vs Categorical
side-by-side
boxplots
statistics by
group,
difference in
means
Quantitative
vs
Quantitative
scatterplot correlation
Z* 95% = 1.96
90% = 1.645
99% = 2.58
- Single Proportion
- Difference in proportions
- Difference in means
- Chi Square 2
We are 95% confident that the average y value for x=x* lies in this interval
We are 95% confident that the y value for x = x* lies in this interval
1) Linearity
2) Constant Variability of Residuals
3) Normality of Residuals
R
2
is the proportion of the variability in Y that is explained by the model
*
sample statistic z SE
2
*
(1 )
z
n p p
ME
| |
=
|
\ .
1 1 2 2
1 2
(1 ) (1 ) p p p p
SE
n n
= +
2 2
1 2
1 2
SE
n n
o o
= +
*
0 1
y x | | = +
2
"Variability in Y explained by the model"
"Total variability in Y"
SSM
R
SST
= =
0 1 2
: ... 0
: At least one 0
k
a i
H
H
| | |
|
= = = =
=
Adjusted R
2
is like R
2
, but takes into account the number of explanatory variables
As the number of explanatory variables increases, adjusted R
2
gets smaller than R
2
P(A and B) is the probability that both events A and B will happen
P(A or B) is the probability that either event A or event B will happen
Events A and B are disjoint or mutually exclusive if only one of the two events can happen
Bayes Rule -
The more explanatory variables in the model, the more uncertainty.
( if ) ( )
( if )
( )
P B A P A
P A B
P B
=> =