Professional Documents
Culture Documents
Topic 1.0 Statistics QC
Topic 1.0 Statistics QC
Topic 1.0 Statistics QC
Objectives
Analyze data by calculating min, range
and standard deviation
Construct repetetive distribution table and
sketch histogram.
Identify normal curve and calculate the
area under the normal curve by using
table of area under normal curve and be
able to solve problem related to quality
control.
Populations and Samples
A population is a complete set of all of
the possible instances of a particular
object
for example, the People in this room.
A sample is a subset of the population
for example, any one of the Teams.
Why use samples?
The population may to too large to count
the number of people on earth, the number
of stars in the sky.
The population may be too dangerous to
observe
automobile wrecks, explosions, etc.
The population may be too difficult to measure
sub atomic particles.
Team Exercise (5 min.)
Estimate the average age in months of the
persons in the room to three figure accuracy
based upon your Team.
Measures of Central Tendency
If you have to declare a single value to
represent a population or a sample, what do
you use?
The most common value is the mean, also
called the average or the expected value.
What is the Mean?
The mean is the sum of all of the values in
the data set divided by the number of values.
The equation for calculating the mean is the same
for both samples and populations.
Mean (Ungrouped Data)
n
1
x xi
n i 1
Where:
X-bar is the mean
xi are the data points
n is the sample size
Mean(Grouped Data)
x1 f1 x 2 f 2 ......
x = .
f1 f 2 ......
f i xi
=
fi
Where:
fi is the frequency in a cell of an observed
value
xi are the data points
Team Exercise (5 min.)
Have 20 to 30 students go to the board and
write their age in months.
Calculate the mean, mode, and median for
the data set on the chalk board in the
classroom.
Have the teams calculated the same statistics
for their team.
What Is the Range?
range: the distance between the lowest and
the highest values in the set.
For Ungrouped data:
R= XH- XL
For example, the time to drive to Houston is
2-hours plus or minus 15 minutes. Or, 105 to
135 minutes. Thus the range is 30 minutes.
For Grouped data:
( xi ) @ xi
2
N i 1
S= n
n 1
Grouped data
2
2 f i xi
f i xi
S= fi
fi 1
Team Exercise (5 min.)
Compute the range, variance and
standard deviation for the data set
(student ages) on the board.
Histogram Defined
A histogram is a bar graph that shows
frequency data.
Histograms provide the easiest way to
evaluate the distribution of data.
Histograms
Creating a Histogram
Collect data and sort it into categories.
Then label the data as the independent set or the
dependent set.
The characteristic you grouped the data by would be the
independent variable.
The frequency of that set would be the dependent variable.
Each mark on either axis should be in equal
increments.
For each category, find the related frequency and
make the horizontal marks to show that frequency.
Ungrouped data
i) Calculate the number of cell;
k= 1+ 3.3 log n, n= number of observed value
ii) Calculate the cell interval;
= Range/k
iii) Construct a table and tally sheet
iv) Post the cell frequency
v) Construct the histogram
The Normal Curve
1 2
1 2
x / 2
RF e
2
Standard Normal
Define: Z x /
Area = 1.00
Then
0.5
0.4
1 2
1 2
Z
rf e 0.3
2 0.2
0.1
0.0
-4.0 -3.0 -2.0 -1.0 0.0 1.0 2.0 3.0 4.0
Standard Normal
Mean = 0
Std. Deviation = 1
Total Area Under Curve = 1
Curve perfectly symmetrical about
mean
Thus, area under half the curve = 0.5
Some handy things to know.
50% of the area lies on either side of
the mid-point for any normal curve.
A standard normal distribution (SND)
has a total area of 1.00 or 100%
“Z-Tables” which show the area under
the SND for all of usual values of Z, can
be used to find the area between any
two points on the Z-axis.
Using Z Tables
Finding the area between -1.0 and plus 2.0
From the Z table in the text, a Z of -1.0
corresponds to an area of 0.3413 or 34.13%.
Similarly, a Z of 2.0 equals and area of
47.72%
Add the two together for a total of 81.85%.
The “tails” contain 18.15%
Example:
A sample was taken of ENGR 112 students and it
was found that the cost of the calculators they use in
the class was normally distributed. The mean cost
of a calculator is $100 with a standard deviation of
$10. Assuming that 1000 students will be enrolled
in ENGR 112 this fall, how many students would
you predict spent: a) more than $110; b) less than
$80; c) between $80 and $110.