Professional Documents
Culture Documents
04 - ISI Six Sigma Analysis
04 - ISI Six Sigma Analysis
Hypothesis
Develop a Organize
Testing and Design of
focused problem Process Analysis potential
Regression Experiments
statement causes
Analysis
Brainstorm on X’s
Find change of which X’s affect Y and in what manner
Ultimately find which X’s are critical to move the Y in the desired
direction
Yes
Decision Step 6 End
No
Step 4 Step 5
Activity flowcharts
are specific about 1
Approach front desk
2
Is there
YES 3
Wait
a line?
Clear
what happens in a direction of
NO flow (top to
process. They often Numbered 4
bottom or
steps Step up to desk left to right)
capture decision
points, rework 5
Clerk NO
6
loops, complexity, available?
Wait
Key of symbols
etc. YES
7 Consistent
Give room number
Start/End level of
detail
Action/Task
8
Check bill
Decision
9
Charges NO 10
Sequence correct? Correct charges
YES
11
Clear starting
Pay bill and ending
Date of creation points
or update &
name of creator
between people or 4 10
10 days
12
Pays bill
6
Receives and
records payment
Horizontal lines
7
clearly identify
Reviews weekly
report of overdue
handoffs
accounts
No
Yes
Yes
No
Adjust Find
Help
Yes Another Fix No
Page? Problem?
No Yes
activity 9.
1.
flowchart into Stop
packing line
Adjust
settings
No
an opportunity
2. 5. 6. 7. 8. Yes 10.
Yes
flowchart by Same
product?
Change
length
Adjust
speed
Run test Speed
OK?
Adjust
stapler
cartons
highlighting No
Yes
the steps that 3.
Pick-up
4.
4.
Timing
Clean
No 12.
Closed?
11.
Staple
appropriate okay? test carton
add waste and tools machine
Yes
complexity to 18. 13.
Stop Load carton
the process. line
No
Yes 17. 16. No 15. 14.
Right Call Product? Refill
product? foil
filling
Yes
19.
Start
production
What is Brainstorming ?
• Brainstorming is a simple but effective technique for
generating many ideas of a group of people within a
short span of time to solve a given problem
FANTASIZE FREELY
• Don’t operate with your brakes on. The participants
are encouraged, urged to let themselves go and
generate ideas, no matter how fanciful these ideas are.
BUILD ON IDEAS
• Idea of one participant is more effectively built up by
another participant.
Man Machine
Problem
/ Effect
Material Method
Spark plugs
Impatience Heavy Contacts
Poor Bad Life
Craze
anticipation attitude
Body Technical
Wrong Poor details
skill Shape
Always gears Fuel mix
late Lack of Inexperience High H.P Carburetor
awareness Wrong
Riding on culture Engine
clutch
Cylinders High Petrol
Spurious Consumption
Crossings Spares
Restrictions Impurities
Traffic Incorrect
One way
No turn Tyres Inferior Octane no.
Frequent Petrol
Faulty
Circuitous stops Negligence
Speed Breakers pressure Additives
Road
Ignorance
Potholes Irregular Incorrect viscosity
Low pressure
Poor servicing
Clogged
condition Oil
False filters
Steep economy Not changed
Low level
Road Maintenance Materials
No Relationship Yes
Known?
Establish
Relationship Yes
Desired Yes Use Investigation
State Method (GEMBA)
Known?
No
Use Test of
Hypothesis
H a
: o
< x
Hypotheses of
H o
: A
= Means
H a
: A
> B
Hypotheses of
Standard
Deviations
1. What are the average weights of red and green apples in the garden?
2. I think the weights of red and green apples are not same.
3. I think red apples are more uniform than the green apple.
Statistical Inference
y = f ( x1 , x2 ,..., x k )
Key Terms
Ho = Null Hypothesis ,Ha = Alternative
Hypothesis ,P-Value = Probability Value
Note : Hypothesis is all about population or process parameters, it is not about the sample statistics.
Therefore, one should use the symbol µ, σ, σ2 while writing hypothesis.
Ho = _______________________________________
Ha = _______________________________________
256 Indian Statistical Institute,SQC & OR Unit, Mumbai
S
tatistic
a lp
rob
lem S
tatistic
a lS
olu
tio
n
In test of hypothesis we collect the evidence in the form of sample data. Then,
appropriate statistical test are used to examine whether the evidence is
against or in-favour of the H0 .
3. Paired t-test
Compares the mean 3. Equal variance or 3. Chi-square test
between two dependent Bartllet's -test Compares the proportion
samples . Compares the variance among multiple samples .
among multiple samples .
4. ANOVA
Compares the mean Note : Test for means and variance mentioned in this diagram
among multiple multiple assume data follow normal distribution. For non-normal data
samples . equivalent non-parametric tests are used.
260 Indian Statistical Institute,SQC & OR Unit, Mumbai
Hypothesis testing using Minitab and Excel
Y = f( x1, x 2 , x 3 ,..., xk )
Remember this simple equation ?
DATA TYPE :Discrete
Counts of Discrete Events ( 1, 2, 3, 4 Defects)
Qualitative Descriptions
Democrat / Republican
Good / Bad
Machine 1 / Machine 2
Continuous
Decimal sub-divisions are meaningful
Time, Weight, Thickness, etc...
Y = Continuous Y = Discrete
Comparison Type Mean Variance Defective Defects
Against Standard 1 Sample t Chi-Square Test 1 sample p 1 sample defect
rate
Between Two 2 Sample t F-test 2 Sample p 2 sample defect
OR rate
Paired t
Among Many ANOVA Bartlett's Test Chi-Square test Chi-square
Note: The test mentioned for Y (Continuous) is applicable only when Y follows
Normal Distribution. In case Y does not satisfy the Normality, then we need to use
Non Parametric tests. For carrying out ANOVA, the condition of ‘Equality of variance’
to be satisfied.
Comparing Means
1 Factor 2 Factors 3 or
more
factors
1 Sample 2 Samples 2 or
more
samples
known not known independent paired
The test statistic for the one sample case is given by:
X −
t =
s/ n
The current rate for producing 5 amp fuses at General Electric Co. is
250 per hour. A new machine has been purchased and installed that,
according to the supplier, will increase the production rate. A sample of
10 randomly selected hours from last month revealed the mean hourly
production on the new machine was 256, with a sample standard
deviation of 6 per hour. At the .05 significance level can General
conclude that the new machine is faster?
X −
Step 3: Select Test Statistic…. t=
s/ n
Step 4: Decision Rule...H0 is rejected if t >1.833, df=9 or p
value less than 0.05
Step 5: Compute t, p value using software and decide...
Commands:
Graph > Time Series Plot
Stat > Basic Statistics > Display Descriptive Statistics (graphical
summary)
Stat > Basic Statistics > 1-Sample t
Stat > Basic Statistics > 1-Sample t > Graphs
X1 − X 2
t =
1 1
Thus…. s
2
+
p
n1 n2
Test Statistic:
Step 1: H0 : 2 1 H1: 2 1
X1 − X 2
Step 3: Select Test Statistic… t=
1 1
s +
2
p
n1 n2
Step 4: Formulate Decision Rule...
H0 is rejected if t > 1.708, df = 25 or if p < 0.05
d=
d
n
The standard deviation of the differences, sd, is computed using
the formula:
(d ) 2
d −
2
sd = n
n −1
1 13.2(L) 14.0(R)
2 8.2(L) 8.8(R)
3 10.9(R) 11.2(L)
4 14.3(L) 14.2(R)
5 10.7(R) 11.8(L)
6 6.6(L) 6.4(R)
7 9.5(L) 9.8(R)
8 10.8(L) 11.3(R)
9 8.8(R) 9.3(L)
10 13.3(L) 13.6(R)
Oserved Value
Prob
0.2
0.1
0.0
-4 -3 -2 -1 0 1 2 3 4
T-V alue
Ho: 1 = 2 = 3 = 4
HA: At least one µk is different
Advantage:
To test the NULL HYPOTHESIS (all 4 mean values are equal), we would have
to test hypotheses for 6 combinations using the technique previously described
(t-test). Using the ANOVA technique, we can decide whether to “reject the null
hypothesis” or “keep the null hypothesis” with a single test.
Ho : 1 = 2 = 3 = 4
Comparing Variances
For the two tail test, the test statistic is given by:
2
S
F = 1
2
S 2
S12 and S22 are the sample variances for the two samples.
The null hypothesis is rejected if the computed test
statistic is greater than the critical (table) value with
confidence level / 2 and numerator and denominator
degrees of freedom.
Fcalc = s12/s22
where : s12 = variance of one distribution , s22 = variance of the other
distribution. The larger variance always serves as the numerator.
The critical f-value is read from an f-table, where n-1 equals the
number of degrees of freedom for the numerator and denominator.
If the numerator and denominator have different sampling sizes, the
correct value must be used for each factor.
Levene's Test
Because the samples are distributed
normally we interpret the F-test.
Test Statistic: 0,775 We can see no deviation in
variance homogeneity.
P-Value : 0,390
Attribute Data
target
vs . value
P
Practical Question ‘‘Is the population proportion
(example) statistically different from the
target value?”
Statistical Question
Ho : P = target value
Ha : P ≠ target value
32 135
Product A
Product B 32 80
Product C
42 98
Got Hired
What’s the Y ? _____________ Discrete
Type of Data ? ______________
Age
What’s the X ? _____________ Discrete
Type of Data ? ______________
Chi-Square
What type of tool would you use ? ________________________
Old 30 150
Young 45 230
Old
Young
How do we do that?
Young 45.3
___ ___
229.7 275
Young ___
-0.3 ___
0.3 275
Young ___
.09 ___
.09 275
Young ___
.002 ___
.0004 275
0 1 2 3 4 5
Collect Data
Run Minitab
Tables
Chi-Square
Command
Evaluate The
P Value
Examine
Contingency
Table
Make
Decision
2 45 230 275
45.33 229.67
A P-Value !
Many people confuse these rules, mistakenly believing that a high p-value
is reason for rejection. One can avoid this confusion by remembering the
following:
NOTE:
Not Guilty Innocent
336 Indian Statistical Institute,SQC & OR Unit, Mumbai
Decision Errors
Your Decision
Accept Ho Reject Ho
Type I
Ho True Correct Error
The (-Risk)
Truth
Type II Error
Correct
Ho False ( -Risk)
Jury’s Decision
25.01 ml, 24.89 ml, 25.10 ml, 24.95 ml, 24.97 ml,
25.04 ml, 25.08 ml, 24.91 ml, 25.07 ml, 24.85 ml
Test the null hypothesis that says the dispenser provides 25 ml agent
against the test hypothesis that this is not the case.
Test the null hypothesis that the warm and cold water have the same calcium
concentration, against the test hypothesis that warm water has a higher
concentration.
4.2, 5.3, 3.5, 4.3, 3.7, 3.2, 3.5, 2.8, 3.5, 3.7,
2.8, 3.3, 2.7, 3.0, 3.1, 3.0, 3.7, 3.3, 3.4, 2.3
Test the null hypothesis that the process is unchanged (3% ethyl
alcohol), against the test hypothesis that the mean values of the
process are more than 3%.
Two brands (A and B) of air conditioner dust filters were tested to determine
whether one was better than the other. All filters were tested on the same system,
and the dust quantity (in grams) filtered over a 6 hour period was measured. The
data obtained for the two filters is as follows:
Filter A: 9.1, 11.8, 1.5, 7.2, 4.2, 9.6, 8.7, 10.2, 4.4, 7.8, 4.3
Filter B: 15.6, 9.3, 16.9, 5.1, 14.5, 19.0, 10.3, 12.5, 13.3, 16.1, 2.6
Test the null hypothesis there there is no difference in average dust quantity
filtered against the test hypothesis that one filter is better than the other.
35
Mileage (km/Lit)
30
25
20
15
25 35 45 55 65 75
Speed (km/h)
Y Y Y
X X X
Strong Positive Correlation Moderate Positive Correlation No Correlation
r = .95 r = .70 r = .006
R2 = 90% R2 = 49% 2
R = .0036%
Y Y Y
X X X
Strong Moderate Other Pattern -
Negative Correlation Negative Correlation No Linear Correlation
r = -.90 r = -.73 r = -.29
R2 = 81% R2 = 53% R2 = 8%
1050
1045
1040
Severed Length (mm)
1035
1030
1025
1020
1015
1010
1005
1000
5 5.5 6 6.5 7 7.5 8 8.5 9
Conveyor Speed (cm/sec)
Drive Slower
Tune-up Car
What if it works?
What if it doesn’t?
How many more Combinations would you need to figure out the best
combination of variables? (3 Variables at two settings; 2x2x2 = 8 total)
How can you explain the above results? (Combination 2 is the answer)
If there were more variables, how long would it take to get a good solution?
(Multiply by another 2 for each one)
What if there’s a specific combination of two or more variables that leads to
the best mileage? (Too hard for me to figure out; What do you think?)
F
I
N
I
S
H
Speed X Speed Y
Tools like brainstorming and cause & effect diagrams helps in identification of
factors and preparing a complete list of the factors involved in any experiment.
Factors listed can be classified into three categories :
1. Experimental Factors
Experimental factors are those which we really experiment with by varying them at
various levels.
2. Control Factors
Control Factors are those which are kept at a constant (controlled) level throughout
experimentation.
3. Error or Noise Factors
Error or Noise factors are those which can neither be changed at our will nor can
be fixed at one particular level. Effect of these factors causes the error component
in the experiment and as such these factors are termed as error or noise factors.
Note : At the planning stage itself all the factors viz. Experimental, Control and error should
be recognized. This will help to tackle them appropriately during experimentation.
1. RANDOMISATION
2. REPLICATION AND
3. LOCAL CONTROL OR ERROR CONTROL
Definition
Replication means repeating all the experimental conditions (or
running a combination) two or more times.
− This does not mean measuring an experimental unit twice
− It does mean repeating a certain set of conditions and measuring the new
output
− Two replicates means that for an 8-run design you will do 16 runs in one
experiment
Minitab will randomize all the runs (including replicates) at the same time
If for some reason you cannot, or choose not to, do all the runs at the
same time, you need to be concerned about blocking (a topic we’ll
discuss later in this module)
− One replicate really means no replication
Definition
To assign the order in which the experimental trials will be run
using a random mechanism
− It is not the standard order
− It is not running trials in an order that is convenient
− To create a random order, you can “pull numbers from a hat” or have
Minitab randomize the sequence of trials for you
Why?
− Averages the effect of any lurking variables over all of the factors in
the experiment
Prevents the effect of a lurking variable from being mistakenly
attributed to another factor
− Helps validate statistical conclusions made from the experiment
REMEMBER:
-1 signifies one level (normally lower) and +1 signifies the other level
(normally higher)
-1 signifies one level (normally lower) and +1 signifies the other level
(normally higher)
WHICH EXPERIMENT?
23 FULL FACTORIAL EXPERIMENT
Click Design
Click OK
Indian Statistical Institute,SQC & OR Unit, Mumbai
Designing the Experiment
The Design out put along with the data obtained after conducting experiment.
Note here the second column gives the run order on which the experiment
has to be conducted.