Professional Documents
Culture Documents
Introduction To Hypothesis Testing: Breakthrough Management Group
Introduction To Hypothesis Testing: Breakthrough Management Group
Module Objectives
To introduce statistical hypothesis testing
To provide a roadmap for selecting the appropriate hypothesis test for a given problem
DMAIC Roadmap
Define
Project Scope & Problem Validation
Measure
Refine the Project
Analyze
Failure Modes & Effects Analysis
Improve
Design & Execute An Experiment
Control
Optimize & Refine Solutions
Problem Statement
Define Y=f(x)
Project Metrics
Recommended Changes
Objective Statement(s) Data Collection Systems Team Members Process Capability Plan for DOE
Process Analysis
Analyze
Failure Modes & Effects Analysis
Real-time data
10
The original assumptions are usually applied unless the real-time data is convincing in suggesting a change.
Could this data have come from a process where time of day is not significant?
Time Of Day
Is this enough evidence to conclude that cycle time depends on time of day? How much evidence is enough?
Cycle Time
Considering the assumptions to be true, determine what the real-time data should look like.
Averages of 10 cycle times should range between 84 and 96 minutes. Frequency distribution should be symmetrical Average cycle time for each time-of-day should be close to 90 minutes
Collect data and organize it into appropriate statistics. If the real-time data is different than expected, modify the assumptions.
Example:
Assume that a process cycle time averages 90 minutes. Averages of 10 cycle times should range between 84 and 96 minutes. A real-time sample of ten averages..
94.3 minutes 97.6 minutes Keep the assumption. Conclude that mean cycle time has increased.
Or
Or
Population the entire group of objects about which one wishes to draw an inference
Sample Statistics
X
s
^ p
Populations have parameters which are fixed, and known only to God. Statistics is the science of guessing, from sample data, what God knows about population parameters.
Hypotheses
Hypotheses:
Hypotheses are statements about population parameters, not statements about samples.
Hypothesis Example
To meet an established performance standard the average daily balance of non-interest generating funds must be less than $10K.
What is H0 in common language? The average daily balance of non-interest generating funds is equal to $10K. What is H0 in statistical terms?
What is Ha in common language? The average daily balance of non-interest generating funds is less than $10K. What is Ha in statistical terms?
What is H0 in common language? There is no difference in average processing time between the old and the new processes. What is H0 in statistical terms? New = Old
What is Ha in common language? The new process completes orders faster, on average, than the old process. What is Ha in statistical terms? New < Old
What is H0 in common language? There is no difference in order time variability between distributors 1 and 2. What is H0 in statistical terms?
What is Ha in common language? The order time variation for distributor 1 is different than distributor 2. What is Ha in statistical terms?
2 1 = 2 2
2 1 2 2
What is H0 in common language? There is no difference in the proportion of buy orders between the historical population and on Mondays. What is H0 in statistical terms?
What is Ha in common language? The proportion of buy orders on Mondays is greater than the historical population. What is Ha in statistical terms?
PMondays = Ppopulation
True State of Nature Did Not Commit Crime Commited A Crime Guilty Goes Free
Correct Verdict
Commited A Crime
Correct Verdict
Rejecting the null hypothesis when it is, in fact, true Also known as producer risk The probability of a Type I error is denoted by . (0< <1) Accepting the null hypothesis, when it is, in fact, false Also known as consumer risk The probability of a Type II error is denoted by (0< <1)
Type II error
True State of Nature Ho Ha Conclusion Drawn Correct Decision Type I Error Type II Error Correct Decision Ho Ha
H0: There is no difference between the new document design time and the historical. new = std Ha: The 50 new documents were faster to design than the historical average.
2-sample t-test
Alternative Hypotheses
1 proportion test
Definitions
Null Hypothesis (H0) Statement of no change or difference; assumed to be true until sufficient evidence is presented to reject it. Alternative Hypothesis (Ha) Statement of a change or difference; assumed to be true if the null hypothesis is rejected. Type I Error The error that occurs when the null hypothesis is rejected when, in fact, it is true. Type II Error The error that occurs when the null hypothesis is not rejected when it is, in fact, false. Alpha Risk The maximum probability of making a Type I error. This probability is established by the experimenter and often set at 5%.
Definitions
Beta Risk The risk or probability of making a Type II error. Significant Difference The term used to describe the results of a statistical hypothesis test where a difference is too large to be reasonably attributed to chance Power (1-) The ability of a statistical test to detect a real difference when there is one; the probability of correctly rejecting the null hypothesis. Determined by alpha and sample size. Test Statistic A standardized value (Z, t, F, etc.) which represents the likelihood of H0, and is distributed in a known manner such that the probability for this value can be determined.