Professional Documents
Culture Documents
Probability Distribution: - Discrete - Continuous
Probability Distribution: - Discrete - Continuous
Probability Distributions
- A table consisting of values a random variable can assume and the corresponding probabilities of the values. The probabilities
are determined theoretically or by observation.
REQUIREMENTS
- The sum of the probabilities of all the events in the sample space must equal to 1
- ∑P(X) = 1 or 100%
- The probability of each event in the sample space must be between or equal to 0 and 1
- 0 ≤ P(X) ≤ 1
- X HAS NO REQUIREMENT (can be negative)
X P(X) XP(X) X-µ (𝑋 − µ)
2 2
(𝑋 − µ) 𝑃(𝑋)
Variance:
Farther from 0 = more variable µ = ∑𝑋 · 𝑃(𝑋)
Closer to 0 = less variable (more uniform) 2 2
σ = ∑[(𝑋 − µ) · 𝑃(𝑋)]
No variability 0 = constant
A box contains four red and three blue marbles. Michele picks three marbles at random from this box. If Z is the random variable
representing the number of blue marbles picked from the box, complete the table below.
4𝐶3
7𝐶3
(4 red marbles choose 3)
4𝐶2•3𝐶1
7𝐶3
(4 red choose 2 • 3 blue choose 1)
4𝐶1•3𝐶2
7𝐶3
(4 red choose 1 • 3 blue choose 2)
3𝐶3
7𝐶3
(3 blue marbles choose 3)
2. Mean
- It is the most unbiased estimator
- All values have a contributing factor to the mean
5. Confidence interval
- is a specific interval estimate of a parameter determined by using data obtained from a sample and by using the specific
confidence level of the estimate.
* If no exact value for z-score, choose the one that’s farther from the mean
When σ is unknown (SAMPLE MEAN)
- Most of the time, the value of 𝝈 is not known, so it must be estimated by using s, namely, the standard deviation of the sample.
- When s is used, critical values greater than the values for 𝑧 α are used in confidence intervals. These values are taken from the
2
Degrees of freedom
- are the number of values that are free to vary after a sample statistic has been computed.
- df = n -1
HYPOTHESIS TESTING
- It is a decision-making process for evaluating claims about a population.
STEP 1: Hypotheses and Claim
STEP 2: Critical Value(s)
STEP 3: Test Value
STEP 4: Decide
STEP 5: Conclusion
Statistical Hypothesis
- Is a conjecture about a population parameter. (±) (+) (-)
- May or may not be true
Statistical test
- Uses data obtained from a sample to make a decision about whether the null hypothesis should be rejected
Test value
- Numerical value obtained from a statistical test
- If the test value is the same as the critical value, the null
hypothesis should be rejected.
P-VALUE METHOD
- probability of the test value in the direction of the alternative hypothesis
(ALWAYS) ↓
𝐻0: p = 0; This null hypothesis means that there is no correlation between the x and y variables in the population.
𝐻1: p ≠ 0; This alternative hypothesis means that there is a significant correlation between the variables in the population.
df = n - 2
Residual
- The difference between the actual value y and the
predicted value y’ (that is, the vertical distance) is called - If the residual is positive, the actual data is above the
a residual or a predicted error. regression line.
- y - y’ (use the value of y that corresponds to the x that - If the residual is negative, the actual data is below the
was used) regression line.
- If the residual is zero, the actual data is on the regression
line.