Professional Documents
Culture Documents
Chapter12 Datahandling
Chapter12 Datahandling
1
CONTENT
12.1 Data Handling in Analytical Chemistry
2
12.1 DATA HANDLING IN ANALYTICAL
CHEMISTRY
3
Data Handling in Analytical Chemistry
▪ Since analytical chemistry is the science of making quantitative measurements, it is important that raw
data is manipulated and reported correctly to give a realistic estimate of the uncertainty in a result.
▪ Simple data manipulations may only require keeping track of significant figures . More complicated
calculations require propagation-of-error methods.
▪ The uncertainty in a result can be categorized into random error and systematic error.
4
Data Handling in Analytical Chemistry
• Any measurement of a physical quantity always involves some uncertainty or experimental
error. This means that if measurement of some quantity and then repeat the measurement,
we will most certainly obtain a different value the second time around.
The question then is: Is it possible to know the true value of a physical
quantity?
5
Data Handling in Analytical Chemistry
▪ However, with greater care during measurements and with the application of
more experimental methods, errors can be reduced and, thereby gain better
confidence that the measurements are closer to the true value.
▪ Thus, one should not only report a result of a measurement but also give some
indication of the uncertainty of the experimental data.
6
Data Handling in Analytical Chemistry
It is important to note that no matter how keenly planned and
executed, all experiments have some degree of error or
uncertainty. Thus, one should learn how to identify, correct, or
evaluate sources of error in an experiment and how to express
the accuracy and precision of measurements when collecting
data or reporting results.
7
Why do we need statistics in analytical
chemistry?
Each step of the chemical analysis process has some random error associated
with it. It is therefore clear that if a major error is made in any single step of
the analysis it is unlikely that the result of the analysis can be correct even if
the remaining steps are performed with little error.
8
Why do we need statistics in analytical
chemistry?
9
What Does Statistics Involve?
10
Some Useful Statistics Terms
11
12.2 ACCURACY AND PRECISION
12
ACCURACY AND PRECISION
Accuracy and Precision: Is There a Difference?
Accuracy: degree of agreement between measured Precision: degree of agreement between replicate
value and the true value. measurements of same quantity.
• Absolute true value is seldom known • Repeatability of a result
• Realistic Definition: degree of agreement • Standard Deviation
between measured value and accepted true
value.
• Coefficient of Variation
• Range of Data
• Confidence Interval about Mean Value
13
ACCURACY AND PRECISION
Accuracy can be defined as the degree of agreement between a measured value and
the true or accepted value. As the two values become closer, the measured value is
said to be more accurate.
▪ Accuracy measures how close the measured value is to the true value or accepted
value. In other words, how correct the measurement is.
▪ Quite often however, the true or accepted value of a physical quantity may not be
known, in which case it is sometimes impossible to determine the accuracy of a
measurement.
14
ACCURACY AND PRECISION
Precision is defined as the degree of agreement between replicate measurements of the
same quantity.
15
ACCURACY AND PRECISION
The concepts of precision and accuracy are demonstrated by the series of targets shown in the figure below.
16
ACCURACY AND PRECISION
Assuming the correct or accepted value is represented by the center of the circles;
If all values occurred close together within, for example, the red or
blue circles, results are precise but not accurate.
If all values occurred within the yellow circle, results are both
accurate and precise.
If results were scattered randomly in one direction of the center,
results are neither precise nor accurate.
17
ACCURACY AND PRECISION
18
ACCURACY AND PRECISION
Example
19
19
12.3 DETERMINATE ERRORS
20
DETERMINATE ERRORS- TYPE OF ERRORS
Types of experimental errors
21
DETERMINATE ERRORS- TYPE OF ERRORS
1. Random/indeterminate Errors
▪ Random (or indeterminate) errors arises due to some uncontrollable fluctuations in variables that
affect experimental measurements and therefore has no specific cause.
▪ These errors cannot be positively identified and do not have a definite measurable value; instead,
they fluctuate in a random manner.
▪ These errors affect the precision of a measurement and are sometimes referred to as two-sided
errors because in the absence of other types of errors, repeated measurements yield results that
fluctuate above and below the true value.
▪ With sufficiently large number of experimental measurements, an evenly distributed data scattered
around an average value or mean is achieved.
23
DETERMINATE ERRORS- TYPE OF ERRORS
2. Systematic/Determinate Errors
24
DETERMINATE ERRORS- TYPE OF ERRORS
2. Systematic/Determinate Errors
Constant determinate error (Ecd) gives the same amount of error independent of the
concentration of the substance being analyzed, whereas proportional determinate error (Epd)
depends directly on the concentration of the substance being analyzed .
(i.e., Epd = K C), where K is a constant and C is the concentration of the analyte.
Therefore, the total determinate error (Etd) will be the sum of the proportional
and constant determinate errors.
25
DETERMINATE ERRORS- TYPE OF ERRORS
Determinate Errors
Are They Systematic?
26
DETERMINATE ERRORS- TYPE OF ERRORS
3. Gross Errors
27
DETERMINATE ERRORS- EXPRESSING AND CALCULATING ERRORS
28
DETERMINATE ERRORS- EXPRESSING AND CALCULATING ERRORS
Significant Figures
▪ Except in situations where numbers or quantities under investigation are integers it
is often impossible to get or obtain the exact value of the quantity under
investigation.
▪ It is precisely for this reason that it is important to indicate the margin of error in a
measurement by clearly indicating the number of significant figures, which are
really the meaningful digits in a measurement or calculated quantity.
▪ When significant figures are used usually the last is understood to be uncertain.
29
DETERMINATE ERRORS- EXPRESSING AND CALCULATING ERRORS
Significant Figures
For example:
The average of the experimental values 51.60, 51.46, 51.55, and 51.61 is 51.555. The
corresponding standard deviation of the sum is ± 0.069. It is clear from the above that the
number in the second decimal place of the experimental values is subject to uncertainty. This
implies that all the numbers in succeeding decimal places are without meaning, and we are
therefore forced to round the average value accordingly.
However, consider the question of taking 51.55 or 51.56, given that 51.555 is equally spaced
between them. As a guide, when rounding a 5, always round to the nearest even number so
that any tendency to round in a set direction is eliminated, since there is an equal likelihood that
the nearest even number will be the higher or the lower in any given situation.
30
DETERMINATE ERRORS- EXPRESSING AND CALCULATING ERRORS
Ways of Expressing Accuracy
31
DETERMINATE ERRORS- EXPRESSING AND CALCULATING ERRORS
Percent Error (% Error): This is sometimes referred to as
fractional difference and measures the accuracy of a
measurement by the difference between a measured
value or experimental value (E) and a true or accepted
value (A).
Percent Difference (% Difference): This measures
precision of two measurements by the difference
between the measured experimental values E1 and E2
expressed as a fraction of the average of the two values.
32
DETERMINATE ERRORS- EXPRESSING AND CALCULATING ERRORS
Mean and Standard Deviation
33
DETERMINATE ERRORS- EXPRESSING AND CALCULATING ERRORS
The mean ( x̄ ) is the sum of the individual
measurements (xi) of some quantity divided by the
number of measurements (N). where x̄ is
the ith measured value of x.
34
DETERMINATE ERRORS- EXPRESSING AND CALCULATING ERRORS
Standard Deviation of a Finite Set of Experimental Data
35
DETERMINATE ERRORS- EXPRESSING AND CALCULATING ERRORS
Example: Assume that the following values were obtained in the analysis of the weight of iron in 2.0000g
portions of an ore sample: 0.3791, 0.3784, 0.3793, 0.3779, and 0.3797 g. Calculate the mean, standard
deviation, Relative standard deviation and % of relative standard deviation.
36
DETERMINATE ERRORS- EXPRESSING AND CALCULATING ERRORS
Check your answers!
1) The mean = x̄ = 1.8944g / 5 = 0.37888g
2) The standard deviation = σx = 2.09 x 10-6 g2)/4 = 5.2 x 10-7 g2
3) Rel. std. dev. = sr = 0.00072g/0.37888g = 0.0019
4) % Rel. std. dev. = (0.0019) x 100 = 0.19%
37
DETERMINATE ERRORS- STATISTICAL TEST
▪ Sometimes a value in a data set appears so far removed from the rest of the
values that one suspects that that value (called an outlier) must have been the
result of some unknown large error not present in any of the other trials.
▪ Statisticians have devised many rejection tests for the detection of non-random
errors.
▪ We will consider only one of the tests developed to determine whether an outlier
could be rejected on statistical rather than arbitrary grounds and it is the Q test.
38
DETERMINATE ERRORS- STATISTICAL TEST
The Q test: Rejecting data.
Note that we can always reject a data point if something is known to be “wrong” with with the
data or we may be able to reject outliers if they pass a statistical test that suggests that the
probability of getting such a high (or low) value by chance is so slight that there is probably an
error in the measurement and that it can be discarded.
The values of Qcrit are then examined in statistical tables. These values depend on the number of trials in the
data set, in this case 4. For example, the table below shows the values of Q for rejection of data at the 90%
confidence level.
40
DETERMINATE ERRORS- STATISTICAL TEST
The values are as follows:
Since Qexp > Qcrit at 90% confidence, the value of 25.61 can be rejected with 90% confidence.
• What does this mean? It means that in rejecting the datum the experimentalist will be right an average
of 9 times out of 10, or that the chances of the point actually being bad are 90%.
• Is this the one time out of 10 that the point is good? This is not known! When data are rejected, there
is always a risk of rejecting a good point and biasing the results in the process. Since Qexp < Qcrit at the
96% and 99% levels, the datum cannot be rejected at these levels. What this says is that if one wants to
be right 96 times or more out of 100, one cannot reject the datum. It is up to you to select the level of
confidence you wish to use
41
END OF CHAPTER
THANK YOU
42