Professional Documents
Culture Documents
KS - PA - Analyzing Vectors - Group 1
KS - PA - Analyzing Vectors - Group 1
(KAPITA SELEKTA)
TOPIC:
Analyzing Vector
Group 1
5
Analyzing Vector
6
4.1. Parameter estimation
• The Fisher probability density function (Fisher [1953] )
is given by:
Two examples of Fisher distributions, one with a large degree of scatter (k=
5) and one that is relatively tightly clustered (k=50) are shown in Figure 4.2.
Also shown are the Fisher mean directions and for each data set.
4.2 Watson’s Test For Randomness
His test relies on the calculation of R given by Watson [1956] also showed how to estimate
by:
equation 4.2.
If the value of R exceeds the null hypothesis of total randomness can be rejected at a specified
level of confidence. If R is less than randomness cannot be disproved.
4.3 Comparing Known and
Estimated Directions
For special cases where the two data sets are Fisher distributed with the
same dispersion, criteria have been developed to test the significance of the
difference in data sets by comparing R values for the two data sets
separately and together. The Watson test calculates the statistics F as
follows:
R1, R2, and R are the resultants of If F exceeds the value given in an F
the first, second, and combined data table for 2 and 2(N-2) degrees of
sets, respectively. freedom, the null hypothesis that
N is the total number of data points the two data sets have a common
in both data sets mean can be rejected.
4.4 Combining Vectors and Great
Circles
The data from sample tst1a seem to
hover around a given direction; a
mean direction for the last few
demagnetization steps can be
calculated using Fisher statistics. We
can calculate a best-fit line from the
data for sample tst1b (Figure 4.4b)
using the principal component
method of Kirschvink [1980] as
outlined in Chapter 3. The data from
tst1c track along a great circle path
and can be used to calculate the
pole to the best-fit plane.
4.4 Combining Vectors and Great
Circles
McFadden and McElhinny [1988] outlined a method for estimating the mean direction and
the from sites that mix planes (great circles on an equal area projection) and directed lines:
1. Calculate M directed lines and N great circles using principal component analysis or
Fisher statistics.
2. Assume that the primary direction of magnetization for the samples with great circles lies
somewhere along the great circle path (i.e., within the plane).
3. Assume that the set of M directed lines and N unknown directions are Fisher distributed.
Iteratively search along the great circle paths for directions that max[1]imize the resultant
vector R for all the M + N directions.
4. Having found the set of N directions that lie along their respective great circles, estimate
the mean direction using equation 4.3 and as:
4.5. Inclination only data
in the case of unoriented drill cores. Cores can be drilled and arrive at the surface in short,
unoriented pieces. Specimens taken from such core material will be oriented with respect to the
vertical, but the declination data are unknown. It is often desirable to estimate the true Fisher
inclination of data set having only inclination data, but how to do this is not obvious. Consider the
data in Figure 4.5. The true Fisher mean declination and inclination are shown by the asterisk. If we
had only the inclination data and calculated a gaussian
4.5. Inclination only data
4.6. Is a data set Fisher
distributed?
The first step to determine whether a given data set is Fisher distributed is to calculate
the orientation matrix T of the data and the associated eigenvectors and eigenvalues.
Substituting the direction cosines relating the geographic coordinate system X to the
coordinate system defined by V, the eigenvectors, where X is the “old” and V is the
“new” set of axes in equation 3.1, we can transform the coordinate system for a set of
data from “geographic” coordinates where the vertical axis is the center of the diagram,
to the “data” coordinate system,where the principal eigenvector lies at the center of the
diagram, after transformation into “data” coordinates.
4.6. Is a data set Fisher
distributed?
the probability P of finding a direction within the band:
Plots such as those shown in Figure 4.7 are not as helpful for this purpose as a plot
known as a Quantile-Quantile (Q-Q) plot (see Fisher et al. [1987]).
4.6. Is a data set Fisher
distributed?
In a Q-Q plot, the data are graphed against the value expected from a particular
distribution. Data compatible with the chosen distribution plot along a line. In order to
do this, we proceed as follows (Figure 4.8):
4.6. Is a data set Fisher
distributed?
Figure 4.10 Hypothesis of non-fisherian data sets. Normal and inverse polarity data that are not
distributed symmetrically. Plot of a filled (open) circle in the lower hemisphere (top)
4.7 Non-parametric analysis of
vectors
● A non-parametric method is one that does not require
data to be distributed. Some known distributions can
be characterized by several parameters.
4.7 Non-parametric analysis of
vectors
Figure 4.11 Q-Q plot for the hypothetical data set shown in Figure 4.10 a. normal declination, b. normal
co-clination c. reverse declination, d. Co-inclination reversal. Neither normal nor inverted declination
fits the Fisher distribution.
4.8 Non-Fisherian Directional
Data
Figure 4.12 Bootstrap is applied to normal distributions. a.) 500 data
points are taken from the gaussian distribution with a mean of 10
and a standard deviation of 2. b) plots Q-Q data in intervals of 95%
for mean given by Gauss statistics as +_ 0.17. 10,000 new data sets
(para) are generated by randomly drawing N data points from the
original data set. C) the histogram average of the para-data al set.
95% of the average is in intervals of 10.06 +0.16/-0.16, hence the
bootstrap interval is similar to that calculated by gaussian statistics.
- Randomly draw N data points from the data shown in Figure 4.10. This is a para-data set for a
simple bootstrap. One can also do a “parametric” bootstrap if each data point represents a site mean
of Fisher distributed directions having an estimated k of k. In this case, draw a para-data set by
random selection of data points and by calculation of a substitute data point by sampling from a
Fisher distribution with the same N, k and mean direction.
- Because paleomagnetic data are often bimodal, the data must be split into normal and reversed
polarity modes.
- Calculate a Fisher mean for each para-data set. Alternatively, if a more robust estimate of the
“average” direction is desired, calculate the principal eigenvector V1 which is less sensitive to the
presence of outliers.
- The Kent distribution is the elliptical analogue of the Fisher distribution (Kent [1982] ) and is given as
follows:
4.10. A bootstrap for unit vectors
Where α is the angle between a given direction and the true mean direction (estimated by the
principal eigenvector of the orientation matrix T), and φ is an angle in the plane perpendicular to
the true direction with φ = 0 parallel to the major eigenvector V1 in that plane. K is a
concentration parameter similar to the Fisher k and β is the “ovalness” parameter. c(k, β) is a
complicated function of k and β (Fisher et al. [1987]), When β is zero, the Kent distribution
reduces to a Fisher distribution.
4.11 The Parametric Bootstrap
The bootstrap described here is a “simple” or “naive” bootstrap in
that no distribution is assumed for the data. We must assume,
however, that all the uncertainty inherent in the data is reflected in
the data distribution. If the data set is smaller than about N = 20,
this leads to uncertainty ellipses that are too small
if we are able to assume some parametric form for data from e.g., a
given site, we can perform a superior technique which is known as
the parametric bootstrap
For large data sets (N > 25), the parametric and simple bootstraps
yield very similar confidence ellipses. For smaller data sets, the
parametric ellipses are larger, and are probably more realistic.
4.12 Application to VGPs
the transformation of data from directions to VGPs can cause a
distortion from rotationally symmetric data to elliptically
distributed data. To illustrate what happens if data are distributed
as in our hypothetical normal data set, we calculate VGPs in
Figure
The VGPs are plotted in Figure and the dp,
dm is shown as a dotted line. Because the
declinations are smeared and the dp must
point towards the site (shown as a triangle),
the long axis of the so-called 95% confidence
regions is perpendicular to the actual data
distribution. The 95% confidence ellipse,
calculated using the bootstrap, is shown as a
solid line.
4.13. When are two data sets distinct?
In Figure show the means of two data sets
and each drawn from distributions with a of
10. The mean direction of each lies outside
the confidence region of the other, but the F
test of Watson [1956] gives a value of 2.03,
which passes the test for a common mean
In order to compare the two populations of bootstrapped means, we
convert them to cartesian coordinates. Histograms of the cartesian
coordinates of the bootstrap means are unimodal, which suggests a
common mean. As a more quantitative test, we plot the intervals from
each data set that contain 95% of the respective sets of cartesian
coordinates of the bootstrapped means. These overlap, which
confirms that the two data sets cannot be distinguished at the 95%
confidence level.
4.14 Application to the “reversals test”
The so-called reversals test in
paleomagnetism constitutes a test
for a common mean for two modes,
one of which has been “flipped” to
its antipode
We apply our bootstrap test for common mean to the data shown in
Figure. The histograms of the cartesian coordinates of the
bootstrapped means are shown in Figure. There are two “humps” in
the bootstrap test. However, the confidence intervals for the normal
and reversed antipodes overlap, thereby suggesting that the two
means cannot be distinguished at the 95% level of confidence.
Thus, the data in Figure 4.10 pass the bootstrap reversals test.
4.15 The Bootstraped Fold Test
The last test is particularly useful in paleomagnetism: the fold
test. One of the key components in paleomagnetic studies is to
determine the coordinate system (geographic, tilt adjusted or
somewhere in between) for which the directional data are most
tightly clustered.
According to Graham [1949], the directions of magnetization
of a deformed rock unit are assumed to be most closely parallel
in the orientation in which the magnetization was acquired. If a
rock has retained an original magnetization through a
subsequent folding or tilting event, the magnetic directions will
cluster most tightly after they have been rotated back to their
original positions.
4.15 The Bootstraped Fold Test
In this test, primary problem shown probably is that
paleomagnetic vectors are never perfectly parallel. So, it’s need
a statistical test to determine whether clustering is “significantly”
better in one orientation or another.
The statistical test according to McElhinny done by
calculated calculated before and after adjusting for bedding tilt
the concentration of the data (using Fisher’s precision
parameter, )and that the ratio of the two values should be
compared with those listed in statistical “F” tables.
When the ratio higher than F values, then the “F” value for a
given N were deemed to constitute a significant increase in
concentration after adjusting for tilt, thus representing a positive
fold test.
4.15 The Bootstraped Fold Test
Several problems with the McElhinny [1964] fold test are:
• The geomagnetic field has two preferred states and is not
perfectly dipolar, so the directions observed are not only
scattered but are often of two polarities.
• The magnetic directions may be most tightly clustered
somewhere other than in “geographic” or 100% tilt adjusted
coordinates.Finally, structual “corrections” are not perfectly
known.
• Because of this uncertainty, we might reasonably ask
whether if the data are actually most tightly clustered at, say
90% tilt adjusted (as opposed to 100%), does this constitute
a “failed” fold test.
4.15 The Bootstraped Fold Test