
American University of Beirut, Physics Department, Phys210L

ANALYSIS OF EXPERIMENTAL DATA

CONTENTS

1. Significant Figures, Accuracy and Precision
2. Rules for Rounding Off Numerical Values
3. Errors of Observation
4. Theory of Errors
5. Propagation of Errors
6. Least-Squares Fitting of a Curve
7. References

1. SIGNIFICANT FIGURES, ACCURACY AND PRECISION

Physical quantities are measured in the laboratory with the help of various instruments. These
instruments have scales with divisions. The digits that are read and estimated on the scale are
called significant figures.

Suppose the length AB of an object is measured with a meter stick (Fig. 1). The smallest
subdivision on the meter stick is 1 cm. Point A is aligned with the 0 graduation mark and point B
falls between the 7 and 8 graduation marks. Therefore the length AB is somewhere between 7
and 8 cm. A reasonable estimate is that B is located at 7.6 cm.

Figure 1. (A meter stick graduated from 0 to 11 cm; point A at the 0 mark, point B between the 7 and 8 cm marks.)

Of those two digits the first one is certain, the last one is doubtful. The length AB can therefore be given to two significant figures. The digit preceding the doubtful digit represents in general the smallest subdivision on the scale. If the length AB had been recorded as 7.65 cm, the impression would have been given that the scale of the ruler was divided into tenths of a centimeter and that point B was located between the 7.6 and 7.7 graduation marks. Thus the length AB would

be closer to the value of 7.65 cm than to 7.64 or 7.66 cm. More information is therefore given
and there are now three significant figures.

Very often in the process of calculation, extra figures are accumulated and sometimes reported in
the result. This is a meaningless procedure. No figures should be included beyond the
precision of the original data.

In the case of random errors the root-mean-square error in the arithmetic mean (see the following sections) determines the number of significant figures. As an example,
L = (2.56 ± 0.03) m and not L = (2.556 ± 0.034) m or L = (2.56 ± 0.034) m.

If a resistance of 105 Ω is written as 0.000105 MΩ, there are still only three significant figures; the resistance can equally be written as 105 × 10⁻⁶ MΩ. A figure such as 25600 has five significant figures: it means that the true value is located somewhere between 25599 and 25601. There are only three significant figures if the figure is reported as 256 × 10². In this case the number has been specified to the nearest hundred. Some other examples are

30.31       Four significant figures
35          Two significant figures
15.5        Three significant figures
20.000      Five significant figures
0.0001059   Four significant figures
5.0001050   Eight significant figures

In publications and reports, the results of the measurements in the laboratory are given. However, no measurement is ever made with absolute accuracy. For example, the speed of light c has a well-defined value, but rulers and stopwatches are used to measure it. Since these instruments are not ideal and their scales cannot be read exactly, the resulting measured value of the speed of light is not exact. The better the measuring instruments are, the more exact the resulting value of the speed of light can be.

A distinction should be made between accurate and precise measurements. A precise measurement of the speed of light can yield a number with many significant figures, such as c = 3.09035 × 10⁸ m/s. However, this result may not be accurate.

Precision is measured by the uncertainty in the end result and, for example, can be reported as follows

c = (2.997925 ± 0.000003) × 10⁸ m/s

This means that if the measurement is repeated, the resulting value of c will have a certain probability of being located between 2.997928 × 10⁸ m/s and 2.997922 × 10⁸ m/s. The smaller the uncertainty, the more precise the measurement is.

Accuracy gives the closeness of the experimental result to the actual or correct value of the physical quantity. In contrast, precision is the closeness with which repeated measurements agree with one another. Both accuracy and precision are important. In a well-designed experiment, the measured value should fall close to the accepted value (accurate) and within the experimental uncertainty (precise).
2. RULES FOR ROUNDING OFF NUMERICAL VALUES

When the number of digits of a numerical value has to be reduced, the remaining least significant
digit has to be rounded off, according to the following rules.

1. If the most significant digit to be eliminated is 0, 1, 2, 3, or 4, then the remaining least significant digit is left unaltered. Example: 12.34 → 12.3.
2. If the most significant digit to be eliminated is 6, 7, 8, 9, or 5 followed by at least one nonzero digit, then the remaining least significant digit is incremented by one. Examples: 12.36 → 12.4; 12.355 → 12.4.
3. If the most significant digit to be eliminated is 5 followed only by zeros, then the remaining least significant digit is left unaltered if it is even; it is incremented by one if it is odd. Examples: 12.45 → 12.4; 12.35 → 12.4.
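Rule 3 is the "round half to even" convention. As an aside (not part of the original handout), Python's decimal module implements this convention exactly; the helper name round_half_even below is ours. Plain floats are avoided because, for example, the float literal 12.35 is actually stored as 12.3499…, which would round the wrong way.

```python
from decimal import Decimal, ROUND_HALF_EVEN

def round_half_even(value: str, places: str) -> Decimal:
    """Round 'value' to the precision of 'places' using the
    round-half-to-even rule (rules 1-3 above)."""
    return Decimal(value).quantize(Decimal(places), rounding=ROUND_HALF_EVEN)

# Examples from the rules above:
# round_half_even("12.34", "0.1")  -> 12.3  (rule 1)
# round_half_even("12.355", "0.1") -> 12.4  (rule 2)
# round_half_even("12.45", "0.1")  -> 12.4  (rule 3, even digit kept)
# round_half_even("12.35", "0.1")  -> 12.4  (rule 3, odd digit incremented)
```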

3. ERRORS OF OBSERVATION

All measurements contain errors. A study of errors is therefore important as a step in finding
ways to reduce them and also as a way of estimating the reliability of the final result of the
experiment.

There are two kinds of errors that are related to observations or measurements:

1. Systematic errors
2. Random errors

Systematic errors are due to the wrong calibration or wrong construction of the instruments used
in making the measurement. External conditions (such as temperature, humidity, magnetic fields)
and observational errors can contribute to systematic errors as well. The results of the
measurement are then consistently too large or too small. Careful planning of the experiment can
in general eliminate systematic errors.

On the other hand, random errors have no known cause; they are of a statistical nature. They can be thought of as being due to a large number of independent causes, each producing small fluctuations in the measurement.

If N measurements of the quantity x are made, then the best estimate for x is the average or mean of x₁, x₂, x₃, …, x_N. That is,

x̄ = ∑ xᵢ / N     (1)

The differences

dᵢ = xᵢ − x̄     (2)

are called the deviations and indicate how much the i-th measurement xᵢ differs from the average x̄. The mean deviation is of course equal to zero. The standard deviation or root-mean-square deviation of the N measurements of xᵢ, which characterizes the reliability of the measurements, is defined as

σ = √[ ∑ dᵢ² / (N − 1) ]     (3)

This quantity is a measure of the spread of the obtained values. Finally, the standard deviation of the mean, or the root-mean-square error in the arithmetic mean of N measurements, is given by

α = √[ ∑ dᵢ² / (N(N − 1)) ]     (4)

Equation (4) characterizes the uncertainty in the mean x̄, the best estimate for x; α is a measure of the probable error in the best estimate. In this course, the quantity α should be rounded to one significant figure only. There is an exception to this rule: if the leading digit of the uncertainty is 1 (or arguably 2), it is better to retain two significant figures, since, e.g., for α = 0.14 or 0.24, rounding to 0.1 or 0.2 would be a significant proportional reduction.
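This rounding rule can be captured in a few lines of Python (a sketch, not part of the original handout; the helper name round_uncertainty is ours):

```python
import math

def round_uncertainty(alpha: float) -> float:
    """Round an uncertainty to one significant figure, keeping two
    figures when the leading digit is 1 or 2 (the exception above)."""
    exp = math.floor(math.log10(abs(alpha)))   # decimal exponent of alpha
    leading = int(abs(alpha) / 10 ** exp)      # most significant digit
    digits = 2 if leading in (1, 2) else 1
    return round(alpha, -exp + digits - 1)

# round_uncertainty(0.0343) -> 0.03
# round_uncertainty(0.14)   -> 0.14  (leading digit 1, keep two figures)
```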

Table 1 gives an application of the above definitions. As an example, the length L of a table is measured ten times with a meter stick whose smallest subdivision is one tenth of a meter. The measurements are tabulated in the second column of the table. The arithmetic mean, and therefore the best estimate of the length L of the table, is then given by

L̄ = ∑ Lᵢ / N = 2.556 m

The deviations are given in the third column and the standard deviation is obtained from the fourth column. Finally, the end result is quoted with its root-mean-square error.

Measurement #   Lᵢ (m)   Deviation dᵢ (m)   dᵢ² × 1000
      1          2.52        −0.036             1.296
      2          2.69        +0.134            17.956
      3          2.46        −0.096             9.216
      4          2.58        +0.024             0.576
      5          2.39        −0.166            27.556
      6          2.41        −0.146            21.316
      7          2.62        +0.064             4.096
      8          2.66        +0.104            10.816
      9          2.67        +0.114            12.996
     10          2.56        +0.004             0.016
               L̄ = 2.556                     ∑dᵢ² = 0.106


σ = 0.1 m
α = 0.03 m
L = (2.56 ± 0.03) m

Note that the least significant figure in the result should be in the same decimal place as the uncertainty.
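The numbers in Table 1 can be checked with a short Python sketch implementing equations (1) to (4):

```python
import math

# The ten length measurements of Table 1, in metres
L = [2.52, 2.69, 2.46, 2.58, 2.39, 2.41, 2.62, 2.66, 2.67, 2.56]

N = len(L)
mean = sum(L) / N                          # eq. (1): arithmetic mean
dev = [x - mean for x in L]                # eq. (2): deviations
sum_d2 = sum(d * d for d in dev)
sigma = math.sqrt(sum_d2 / (N - 1))        # eq. (3): standard deviation
alpha = math.sqrt(sum_d2 / (N * (N - 1)))  # eq. (4): error in the mean

# mean = 2.556, sigma ≈ 0.1, alpha ≈ 0.03, so L = (2.56 ± 0.03) m
```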
Occasionally a single measurement from a set of measurements differs widely from the others,
so that the experimenter is tempted to discard the measurement.
The standard deviation can be used to decide whether an observation should be rejected or not. A simple rule to follow in such cases is:

(a) Calculate the standard deviation σ for all the data, including the suspected value.
(b) If any deviation exceeds twice the value of σ, that particular measurement can be rejected. This criterion (a simplified form of Chauvenet's criterion) assumes that no more than 10 measurements are made, which is generally true in this course.
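The rejection rule above can be sketched as a small Python function. The readings below are hypothetical, chosen so that exactly one value fails the 2σ test:

```python
import math

def rejected_indices(data):
    """Indices of measurements whose deviation from the mean
    exceeds twice the standard deviation (steps (a)-(b) above)."""
    N = len(data)
    mean = sum(data) / N
    sigma = math.sqrt(sum((x - mean) ** 2 for x in data) / (N - 1))
    return [i for i, x in enumerate(data) if abs(x - mean) > 2 * sigma]

readings = [10.1, 10.2, 9.9, 10.0, 10.1, 9.8, 10.0, 10.2, 9.9, 12.0]
# rejected_indices(readings) -> [9]  (the suspect value 12.0)
```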

4. THEORY OF ERRORS

If a quantity x is measured a very large number of times, the resulting values x i are distributed at
random, assuming that the experiment has been set up in such a way that systematic errors have
been eliminated. A plot of the measured values, x i, against the number of observations
(frequency) of a particular value x i yields a so-called frequency distribution curve for the
observations.

There are a number of well-known frequency distributions. In the case of random errors in
scientific measurements, the frequency curve is called the normal or Gaussian distribution (fig.
2).

Figure 2. (The normal (Gaussian) distribution curve, with its maximum at x̄.)

This distribution is a bell-shaped curve, centered at x̄, and can be represented by the normalized function

y = [1/(σ√(2π))] exp[−(x − x̄)²/(2σ²)]     (6)

x̄ is the mean of the distribution and σ, which is a measure of the width of the curve, is the standard deviation of the distribution. The curve (Fig. 2) is symmetric about the arithmetic mean x̄ of the measured values xᵢ, and it can be shown that for x = x̄ ± σ, y ≈ 0.607 times the maximum amplitude 1/(σ√(2π)) of the curve at x = x̄. The area under the curve represents the probability that an observation will fall into a given interval. The total area under the normalized curve is equal to one; 68% of the measured values fall within one standard deviation σ of the mean, and 95% fall within two standard deviations of the mean. Furthermore, in a normal distribution half of the observations (50%) are located within x̄ ± 0.6745σ.
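The quoted percentages follow from the normal distribution: the fraction of observations within k standard deviations of the mean is erf(k/√2), available in Python's math module. A minimal check:

```python
import math

def fraction_within(k: float) -> float:
    """Fraction of a normal distribution lying within k standard
    deviations of the mean: erf(k / sqrt(2))."""
    return math.erf(k / math.sqrt(2))

# fraction_within(1)      -> 0.6827  (the 68% quoted above)
# fraction_within(2)      -> 0.9545  (the 95% quoted above)
# fraction_within(0.6745) -> 0.5000  (the 50% "probable error" interval)
```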

In this course you will state your final result as x ± α . Furthermore, when the measured value x
deviates from the accepted value by more than twice α , this is an indication of possible
systematic errors. It becomes imperative to indicate the possible sources of these errors and
justify such deviations.

5. PROPAGATION OF ERRORS

In order to estimate the error in compound quantities, the following procedure is followed. If a number of measured quantities have arithmetic means x̄, ȳ, and z̄ with root-mean-square errors α_x, α_y, and α_z respectively, then the root-mean-square error α_F in any function F of x, y, and z is given by

α_F = √[ (∂F/∂x · α_x)² + (∂F/∂y · α_y)² + (∂F/∂z · α_z)² ]     (8)

Example: Suppose that you want to calculate the density ρ of a solid. You have measured its mass m = 5.0 ± 0.2 kg and its volume V = 2.00 ± 0.08 m³. With ρ = m/V, equation (8) gives

α_ρ = √[ (α_m/V)² + (m·α_V/V²)² ]

α_ρ = √[ (0.2/2)² + (0.08 × 5/4)² ] = 0.14

ρ = (2.50 ± 0.14) kg/m³
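Equation (8) can also be evaluated numerically, estimating the partial derivatives by central finite differences. The sketch below (the function names are ours, for illustration) reproduces the density example:

```python
import math

def propagate(f, values, alphas, h=1e-6):
    """Root-mean-square error of f(x, y, ...) per eq. (8), with the
    partial derivatives estimated by central finite differences."""
    total = 0.0
    for i, (v, a) in enumerate(zip(values, alphas)):
        step = h * max(abs(v), 1.0)
        up = list(values); up[i] = v + step
        dn = list(values); dn[i] = v - step
        dfd = (f(*up) - f(*dn)) / (2 * step)   # numerical partial derivative
        total += (dfd * a) ** 2
    return math.sqrt(total)

density = lambda m, V: m / V
alpha_rho = propagate(density, [5.0, 2.00], [0.2, 0.08])
# alpha_rho ≈ 0.14, matching the analytic result above
```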

6. LEAST-SQUARES FITTING OF A CURVE

Many experiments yield a series of pairs of data values. Usually the xᵢ values are selected and the yᵢ values are measured. A graph is plotted with each pair (xᵢ, yᵢ) representing a point. If the points of the graph are located within a narrow band, the variables x and y are said to be correlated, and a relation exists between x and y.

The method of least squares is used to fit a curve (find a theoretical equation) to a set of experimental data. First, assume that a linear relation exists between y and x:

y = Ax + B     (9)

Substitution of x = xᵢ will in general not give the value of yᵢ. The "errors" will be

eᵢ = y − yᵢ = A xᵢ + B − yᵢ     (10)

To determine the best straight line that fits the N sets of data, A and B have to be chosen so that the sum of the squares of the "errors", ∑eᵢ², is minimized. This means that the simultaneous equations, obtained by equating the partial derivatives of ∑eᵢ² with respect to A and B to zero, should be solved. This condition then leads to the following results

A = [ N ∑(xᵢyᵢ) − ∑xᵢ ∑yᵢ ] / Δ     (11)

and

B = [ ∑xᵢ² ∑yᵢ − ∑xᵢ ∑(xᵢyᵢ) ] / Δ     (12)

where

Δ = N ∑xᵢ² − (∑xᵢ)²     (13)
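Equations (11) to (13) translate directly into code; a minimal Python sketch (the function name linear_fit is ours):

```python
def linear_fit(xs, ys):
    """Least-squares slope A and intercept B, per eqs. (11)-(13)."""
    N = len(xs)
    Sx, Sy = sum(xs), sum(ys)
    Sxx = sum(x * x for x in xs)
    Sxy = sum(x * y for x, y in zip(xs, ys))
    delta = N * Sxx - Sx ** 2          # eq. (13)
    A = (N * Sxy - Sx * Sy) / delta    # eq. (11)
    B = (Sxx * Sy - Sx * Sxy) / delta  # eq. (12)
    return A, B

# For points lying exactly on y = 3x + 1, the fit is exact:
# linear_fit([0, 1, 2, 3], [1, 4, 7, 10]) -> (3.0, 1.0)
```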

In the case that the line goes through the origin (B = 0), it can be shown that

A = ∑(xᵢ yᵢ) / ∑ xᵢ²     (14)
The correlation coefficient r provides an indicator of how good a fit the best straight line is. This
coefficient is defined as


r = ∑(xᵢ − x̄)(yᵢ − ȳ) / √[ ∑(xᵢ − x̄)² ∑(yᵢ − ȳ)² ]     (15)

For r = 0, the values of x and y are independent of one another and there is no linear correlation.
The closer r is to +1 or to –1, the better the linear correlation is.
Finally, the error in A is given by

σ_A² = [ N/(N − 2) ] · ∑eᵢ² / Δ     (16)

and the error in B is given by

σ_B² = [ 1/(N − 2) ] · ∑eᵢ² ∑xᵢ² / Δ     (17)

If the relation between the two variables x and y is not linear, a polynomial of order higher than one has to be fitted to the N data points (see references 1 and 2). The coefficients can then be determined by the least-squares method as in the case of the linear relationship.

However, the following functional relationships between x and y can be reduced to a linear relation:

1. Exponential curve: y = B e^(Ax), so that ln(y) = ln(B) + A x
2. Logarithmic curve: y = B + A ln(x)
3. Power curve: y = B x^A, so that ln(y) = ln(B) + A ln(x)

The comparison of the above curves with a straight line is summarized in the following table:

Curve                          Slope   Intercept   Abscissa   Ordinate
Linear: y = Ax + B               A        B           x          y
Exponential: y = B e^(Ax)        A       ln(B)        x        ln(y)
Logarithmic: y = B + A ln(x)     A        B         ln(x)        y
Power: y = B x^A                 A       ln(B)      ln(x)      ln(y)
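As an illustration of such a reduction, synthetic data following y = 2 e^(0.5x) (an assumed example, not from the handout) can be fitted by applying eqs. (11)-(13) to the pairs (x, ln y):

```python
import math

# Hypothetical data generated from y = 2 e^{0.5 x}
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [2.0 * math.exp(0.5 * x) for x in xs]

# Fit a straight line to (x, ln y) using eqs. (11)-(13)
lny = [math.log(y) for y in ys]
N = len(xs)
Sx, Sy = sum(xs), sum(lny)
Sxx = sum(x * x for x in xs)
Sxy = sum(x * y for x, y in zip(xs, lny))
delta = N * Sxx - Sx ** 2
A = (N * Sxy - Sx * Sy) / delta       # slope -> exponent A
lnB = (Sxx * Sy - Sx * Sxy) / delta   # intercept -> ln(B)
B = math.exp(lnB)
# The fit recovers A = 0.5 and B = 2 (up to floating-point rounding)
```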

An Example

The volume of a sample of an ideal gas is kept constant and a series of values for T and P is measured. Since the volume is constant, the temperature of the gas is a linear function of its pressure, T = AP + B; the constant B is the temperature at which the pressure P would drop to zero. It is called the absolute zero of temperature and has the accepted value

B = −273.15 °C
The following results are obtained:

Pressure (kPa)   Temperature (°C)
     8.666             −20
     9.999              17
    11.332              42
    12.666              94
    13.999             127

Assuming that the points should fit a straight line of the form T = AP + B, one can calculate the best estimate for the constant B and its uncertainty. The slope and the y-intercept are calculated with the following results:

slope A = 27.826 °C/kPa,  y-intercept B = −263.335 °C,  correlation coefficient r = 0.995

Using equation (10), eᵢ = 27.826 xᵢ − 263.335 − yᵢ, the eᵢ values are calculated and summed: ∑eᵢ² = 133.584. Also ∑xᵢ = 56.662 and ∑xᵢ² = 659.893, and Δ is calculated from equation (13): Δ = 88.883. Then σ_A and σ_B follow from equations (16) and (17), with the results σ_A = 1.583 and σ_B = 18.182.

The slope A is reported as A = (27.8 ± 1.6) °C/kPa, and the y-intercept is reported as B = (−2.63 ± 0.18) × 10² °C, which agrees, within its uncertainty, with the accepted value of absolute zero, −273.15 °C.

N.B. All values should be reported as a number between 1 and 10 multiplied by a power of 10.
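The numbers in the example above can be reproduced with a short Python script implementing eqs. (11)-(13) and (15)-(17); small differences in the last digit are rounding effects:

```python
import math

# Pressure (kPa) and temperature (°C) data from the example above
P = [8.666, 9.999, 11.332, 12.666, 13.999]
T = [-20.0, 17.0, 42.0, 94.0, 127.0]

N = len(P)
Sx, Sy = sum(P), sum(T)
Sxx = sum(x * x for x in P)
Sxy = sum(x * y for x, y in zip(P, T))
delta = N * Sxx - Sx ** 2                          # eq. (13)
A = (N * Sxy - Sx * Sy) / delta                    # eq. (11): slope
B = (Sxx * Sy - Sx * Sxy) / delta                  # eq. (12): y-intercept
e2 = sum((A * x + B - y) ** 2 for x, y in zip(P, T))
sigma_A = math.sqrt(N * e2 / ((N - 2) * delta))    # eq. (16)
sigma_B = math.sqrt(e2 * Sxx / ((N - 2) * delta))  # eq. (17)
xbar, ybar = Sx / N, Sy / N
r = (sum((x - xbar) * (y - ybar) for x, y in zip(P, T))
     / math.sqrt(sum((x - xbar) ** 2 for x in P)
                 * sum((y - ybar) ** 2 for y in T)))  # eq. (15)
```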

7. REFERENCES

1. J. Topping, Errors of Observation and their Treatment, Chapman and Hall, London, UK (1972).
2. J.R. Taylor, An Introduction to Error Analysis, Oxford University Press, UK (1982).
3. J. Mandel, The Statistical Analysis of Experimental Data, Dover Publications, Inc., USA (1984).
4. M.J. Norusis, SPSS for Windows, SPSS Inc., USA (1993).
5. P. Fornasini, The Uncertainty in Physical Measurements, Springer, USA (2008).
