Professional Documents
Culture Documents
Summary of Geostatistical Analysis
Summary of Geostatistical Analysis
Deterministic Methods
• Not good for irregular, highly varied data, & data prone to error or
uncertainty.
Geostatistical Methods
h – Scatterplots
(xi - yi, xj- yj). The vector (distance) between these two
points (i to j) will now be referred as, hij.
• Lag - Multiply lag size x number of lags. This value should be <1/2
the largest distance among all points.
• Lag (bin) size – a limit of the distance (width) defining a bin into
which pairs of locations of approx. equal distance & direction are
placed to reduce the number of combinations.
• Only average distance & semi-variance for all pairs in a bin are plotted
as a single point on the empirical semivariogram.
• An empirical semivariogram value (from the y axis) for each bin for
each direction is plotted as a red dot. X axis is a distance from the center
of the bin to the center of the semivariogram surface).
Stationarity
Step 1 – Given a data set of five locations [(1,5), (3,4), (1,3), (4,5),
(5,1)]and their associated values [(100), (105), (105), (100), (115)],
calculate the distance (x values) between each pair of locations
using the Pythagorean theorem.
Table 1
Increasing the number of points (five in this example) can soon result
in a very large number of paired combinations (Table 1), since each of
the five point locations has an accompanying point to form a vector.
Table 2
5 (5)2 = 25 12.5
5 (5)2 = 25 12.5
0 (0)2 = 0 0
15 (15)2 = 225 112.5
0 (0)2 = 0 0
5 (5)2 = 25 12.5
10 (10)2 = 100 50
5 (5)2 = 25 12.5
10 (10)2 = 100 50
15 (15)2 = 225 122.5
Step 3 – Binning the Empirical Semivariogram. Since a
data set can consist of 100’s to 10,000’s of control points,
the individual vectors (delineated by two ordered
coordinate pairs) and the empirical semivariance values
associated with each pair, will quickly result in a cloud of
points in the semivariogram that will be unintelligible to
interpret.
• Model the residuals & reintroduce the trend in the prediction model.
Si = a location.
• Employed to satisfy normality, stationarity assumptions & to
model local, short-range variations.
• The length of the semi-minor axis (minor range) from the center of
the cloud will reach the sill sooner (plot as a steeper grade) than the
semi-major (major range) axis.
• Directional Binning (Anisotropy)
Properties include –
• Any relationship between the variance & the trend are removed w/
these transformations.
• Use when data are not random or regular spaced, i.e. higher
concentration of points in some regions.
• Cell declustering –
• Voronoi Map –
• All locations w/in the polygon are closer to the sample point than
any other sample point.
• Border polygons are often given more weight since the borders are
composed of truncated rectangular sides.
Normal & General QQPlot
Si = a location.
• Like indicator Kriging, only uses cokriging since there are two
types of random errors (e1(s) & e2(s)) generated from two constants
(m1 &m2).
• Unusual points are set aside, & may require refitting the
autocorrelation model.
• Validation – creates a model for a subset of the data set.
• Error plot – same as the predicted plot, except true values are
subtracted from the predicted values.
• Standardized Error plot – measurement values are subtracted from
the predicted values, then divided by the estimated Kriging standard
errors.
• Generally, the best model has the parameters as per those listed in
the previous summary.