Professional Documents
Culture Documents
Papritz A., Kunsch HR., Webster R. (1993) PDF
Papritz A., Kunsch HR., Webster R. (1993) PDF
8, 1993
Normal cross-variograms cannot be estimated from data in the usual way when there are only a
f e w points where both variables have been measured. But the experimental pseudo cross-variogram
can be computed even where there are no matching sampling points, and this appears as its principal
advantage. The pseudo cross-variogram may be unbounded, though for its existence the intrinsic
hypothesis alone is not a sufficient stationarity condition. In addition the differences between the
two random processes must be second order stationary. Modeling the function by linear coregion-
alization reflects the more restrictive stationarity condition: the pseudo cross-variogram can be
unbounded only if the unbounded correlation structures are the same in all variograms. As an
alternative to using the pseudo cross-variogram a new method is presented that allows estimating
the normal cross variogram from data where only one variable has been measured at a point.
INTRODUCTION
Cokriging is now widely recognized as a means of estimating values of spatially
cross-correlated variables at unsampled points and over larger blocks. It can be
used to estimate all variables in a set, either to improve precision (Joumel and
Huijbregts, 197 8; Wackernagel, 1993 ) or for coherence (Matheron, 1979; Myers,
1983). However, its principal use has been to estimate values of a sparsely
sampled variable from data on it plus those o f one or more variables that have
been sampled more densely, commonly termed "the undersampled case". In
these circumstances it can confer substantial benefits in terms of precision,
efficiency, and cost (Atkinson e t a l . , 1992).
The cokriging equations are usually written in terms of variograms. These
must be estimated from data, and for each experimental normal cross-variogram
there must be numerous places at which values of both variables are known. In
the undersampled case only a subset of the values of the fully sampled variable
1015
P R O P E R T I E S OF T H E P S E U D O C R O S S - V A R I O G R A M
Consider two intrinsic random processes of order zero, Z~ (x) and Z2 (x),
with means/zl and/~2, with variograms "rll(h) and "r22(h), and with a normal
cross-variogram defined by
to the variances of the differences only if both Z1 (x) and Z 2 (x) have constant
and equal means (tz 1 = #a). He therefore proposed the more general definition
for the pseudo cross-variogram "Y~l (h) as half the variance of the differences:
Then Eq. (6) follows directly from Eq. (5') of Myers (1991). In particular, we
obtain from Eq. (6)
1 p 1 p
721 (h) --- UY21 (h) + ~'), 12(h) - "YPl (0) (7)
Hence the usual cross-variogram can be expressed in terms of the pseudo cross-
variogram. The converse is not possible: we cannot obtain the pseudo cross-
variogram from the cross-variogram unless the pseudo cross-variogram is even
and more restrictive stationarity conditions are satisfied. When Z 1(x) and Z 2 (x)
are merely intrinsic of order zero the pseudo cross-variogram does not in gen-
eral exist. Equation (7) suggests that we need in addition that the variance of
Z 2 (x) - Zl (x) is finite and independent of x. On the other hand there are
situations where Z 1(x) and Z 2 (X) are not stationary, but the pseudo cross-vari-
ogram exists nevertheless. The following example shows what can happen.
Let Z~ (x) and e(x) be two uncorrelated intrinsic processes with mean zero.
From these we form Z2 (x) as the following linear combination
P
Z2(x) = Z
j=l
XjZI(x- hj) + e(x) (8)
with fixed coefficients, Xj, and fixed lags, hi. Then Z2(x) is intrinsic with E[Z2 (x)]
= 0,
P P
V22(h) = Z Z ~kj)kk['~ll(h -- hj + hk) - V l l ( h j - hk)l + ")/ce(h) (9)
j=l k=l
and
P
"Y21(h) = Z ~ky[13'll(h - hi) + ½"Yll(h + hy) - "y|l(hy)] (10)
j=l
Note that under these conditions Z 1(x) and Z 2 (x) are in general only intrinsic,
but not stationary. Both Z~ (x) and Zz (x) can fluctuate substantially, but their
fluctuations are mainly parallel so that the differences, Z~ (x) - Z2 (x), fluctuate
much less.
This example can be generalized in two ways. First, we can introduce
dependence between Z1 (x) and e(x). For the existence of 3'z2 and "Y21 it is
sufficient that the cross-variogram "Yet exists. For the existence of 3,P~ we need
Zl (x + h) - Zl (x) and e(x) to be jointly second order stationary for any h.
Pseudo Cross-Variogram 1019
E S T I M A T I N G AND M O D E L I N G
A possible estimator to compute the experimental pseudo cross-variogram
from sample data is
1 U(h)
~'[l(h) - ~] {[Z2(xi + h) - ~ ] - [Zl(xi) - 21]} 2 (14)
2N(h) i= 1
where ~{1 and Z2 denote the arithmetic means of nl and n2 observations. This
estimator is biased. For small lag distances the bias equals approximately the
variance of the difference of the sample means (see Priestley, 1981).
1 n=~
Ij=l~ 3"11(Xj- X i ) - Ln2i n=~1j=l~ 3'22(Xj __ Xi) (15)
n2i
Myers (1991) has suggested a different estimator for the experimental pseudo
1020 Papritz, Kiinsch, and Webster
where #k is the mean of the kth process. If one denotes the sum E Pp= lajpakpS
~ by
b]k then the autovariograms and the normal cross-variogram can be represented
by
S
3'jk(h) = E b~kgS(h) (18)
s=l
Pseudo Cross-Variogram 1021
A sufficient condition for the variogram matrix F(h) to be CNSD is that the
matrices of coefficients [bj~] are positive semi-definite.
The pseudo cross-variogram exists only if for each s either a~p = a~p =
ap, or Yp (x) is second order stationary with covariance function C ( h ) . Note
that the first condition implies b~j = b~2 = b~t = b ~. Without loss of generality
we may assume that the first condition holds for s <_ So the number of unbounded
correlation structures, and the second for s > S o. Then we obtain
So S
and
S
Two remarks must be added. First, Eqs. (19) and (20) once again confirm
the additional stationarity required for the existence of the pseudo cross-vario-
gram. The unbounded components must be the same for all variograms; only
the bounded components may differ. This is one step away from the assumption
of second order stationarity, though we still lack experience to say how useful
this extension is likely to be in practice with extreme undersampling.
Second, an asymmetric pseudo cross-variogram cannot be modeled using
Eq. (20) because the elementary variograms, gS(h), and covariance functions,
CS(h), are even. If the asymmetry consists simply of a "lag effect", i.e., if the
pseudo cross-variogram can be made symmetrical by shifting one variable with
respect to the other by some vector h ~ 0 then the linear model may be
generalized (Journel and Huijbregts, 1978, p. 173) to describe such asymmetry.
Other types of asymmetries, however, cannot be modeled by linear coregion-
alization. Modeling these requires more detailed assumptions about the structure
of the coregionalization. Equations (8) and (11) are an example of such a model.
But unlike multivariate temporal phenomena, asymmetrical cross-correlation
patterns are not likely to occur frequently in space. Thus, the limitations inherent
in the linear model of coregionalization are not seriously restricting.
A special problem arises in modeling the pseudo cross-variogram where
the two variables are sampled at completely different places and there is a nugget
variance, i.e., where for some s gS(0) = 0 and gS(h) = 1 for ih[ > 0. The
experimental pseudo cross-semivariance ~ (0) cannot be computed if no sam-
pling locations coincide. Equation (20) shows, however, that b~ can be deter-
mined only if the pseudo cross-semivariance for [h I = 0 is known. Thus, the
exact value of b ~ remains unknown here, but its allowed bounds are obtained
from Schwarz's inequality: [bill must lie in the interval [0, ~-~1b~2]. Esti-
mating and modeling the cross-covariance function from data from different
1022 Papritz, Kiinseh, and Webster
locations suffers from the same limitation. As with the pseudo cross-variogram,
its nugget constant cannot be determined. The uncertainty about the nugget
constant affects the prediction by cokriging if either the term C21 (0) or 3'~1 (0)
appears in the cokriging system.
We have shown above that using the pseudo cross-variogram does not
completely circumvent the problems that arise from strong undersampling and
unbounded variation. Therefore, the following alternative for estimating and
modeling the normal cross-variogram may be helpful where no or only a few
sampling locations of the different variables match. Assume that Z l (x) is ob-
served at locations Xl,i, i = 1 . . . . . hi, and that Zz(x) is observed at locations
x2, k, k = 1 . . . . . n2, with xl,/ :# x2,k v (i, k). Then form linear combinations
of the observed values such that for each variable the weights sum to zero, and
use the identity:
nl n2 12
+ Z Z X{Xk2"/12(xl,i -- x2,k; 012) (23)
i=1 k=l
The first summation operator denotes summation over all possible sets of weights
Pseudo Cross-Variogram 1023
that can be sensibly chosen. One obvious choice of weights for the first variable
is xi~ = 1, X~ = - 1 , i 4: j and X1~ = 0, v k ~ (i, j). The weights for the
second variable may be chosen similarly: X2k = 1, X~ = - 1, k :g l and X~ =
0, v i :~ (k, l). The sum Ql2 can then be written
nl -- I rtl n2 -- I n2
012 = '~J '~J ~ Z {[ZI(XI,i) -- Z l ( X l , j ) ] [ Z 2 ( X 2 , k ) - Z2(x2,1) ]
i=1 j=i+l k=l l=k+l
Thus, for this choice of weights the four variogram terms must be fitted to the
cloud of (~') (~2) products of differences. This procedure is closely related to
fitting directly the variogram cloud. The autovariograms, %1 (h) and 722 (h), can
be modeled correspondingly. The sum of squared deviations, QI~ and Q22, are
given by
nk- 1 nk
{[Zk(Xk, i) - - Z k ( X k , j ) ] 2 - - "Ykk(Xk, i -- X k , j ; 0kk)} 2 (25)
i=1 j = i + l
from Eq. (24) it can be seen that approximately 108 values of the cross-vafiogram
are needed to compute Q12- Since the models are nonlinear in the scale param-
eters fitting must be iterative, and the number of operations is a multiple of 108.
Currently, we are developing and exploring computationally feasible approxi-
mations for minimizing Q12 and Q, respectively.
Second, robust fitting techniques might be necessary to reduce the influence
of extreme values in the variogram cloud. Despite these difficulties the new
approach is promising because it uses all the information on the fully sampled
variable to establish the cross-correlation with the undersampled variable. Fur-
ther, it circumvents the limitations inherent in attempting to cokrige from the
pseudo cross-variogram or the cross-covariance function.
1024 Papritz, Kiinseh, and Webster
CONCLUSION
The pseudo cross-variogram is undoubtedly attractive for describing cross-
correlation between two variables where measurements on both have not been
or cannot be made at the same places and where there is severe undersampling.
However, its main application in practice is likely to be for cokriging. Although
it may be unbounded in theory the stationarity condition required for its existence
is more restricting than the intrinsic hypothesis. Modeling it by linear ¢oregion-
alization either restricts it to second order stationary processes, or the unbounded
structures of the auto- and the pseudo cross-variogram must be the same. Thus,
it offers some advantage over the cross-covariance function which can be used
only for second order stationary processes.
Another way to overcome the difficulties arising with unbounded variation
and strong undersampling is the proposed method for estimating and modeling
the normal cross-variogram. Further investigations are necessary to decide which
approach is more advantageous.
APPENDIX
and
"Y11(h)
lira - - - 1 (29)
h--* oo "/11 ( h )
Pseudo Cross-Variogram 1025
= 2 ~'/3"~t(0)3"11(h) (30)
the third term of the right-hand side of Eq. (27) becomes
[Cov [Z2 (x + h) - Z1 (x + h), Z t (x + h) - Z 1(X)]{
0 < lim
h~ ~ 3'll (h)
which in turn implies that the slopes of TPl (h) and 3"11(h) must be equal for
large lag distances.
By analogous argument, i.e., by expanding Var [Z2(x + h) - Z2(x) +
7-2 (x) - Z1 (x)], it can be shown that
3'~t (h)
lira - 1 (33)
h ~ o~ 3"22 (h)
REFERENCES
Atkinson, P. M., Webster, R., and Curran, P. J., 1992, Cokriging with Ground-Based Radiometry:
Remote Sensing of Environment, v. 41, p. 45-60.
Clark, I., Basinger, K. L., and Harper, W. V., 1989, MUCK--a Novel Approach to Cokriging.
in B. E. Buxton (Ed.), Proceedings of the Conference on Geostatistical, Sensitivity, and
1026 Papritz, Kiinsch, and Webster
Uncertainty Methods for Ground-Water Flow and Radionuclide Transport Modeling: Battelle
Press, Columbus, p. 473-493.
Delfiner, P., 1976, Linear Estimation of Non Stationary Spatial Phenomena. in M. Guarascio, M.
David, and C. Huijbregts (Eds.), Advanced Geostatistics in the Mining Industry: D. Reidel
Publishing Company, Dordrecht, p. 49-68.
Goulard, M., and Voltz, M., 1992, Linear Coregionalization Model: Tools for Estimation and
Choice of Cross-Variogram Matrix: Math. Geol., v. 24, p. 269-286.
Joumel, A. G., and Huijbregts, C. J., 1978, Mining Geostatistics: Academic Press, London,
600 p.
Matheron, G., 1979, Recherche de Simplification dans un Probl~me de Cokrigeage: Publication
N-628, Centre de G6ostatistique, Ecole des Mines de Paris, Fontainebleau. 19 p.
Myers, D. E., 1982, Matrix Formulation of Co-Kriging: Math. Geol., v. 14, p. 249-257.
Myers, D. E., 1983, Estimation of Linear Combinations and Co-Kriging: Math. Geol., v. 15, p.
633-637.
Myers, D. E., 1991, Pseudo-Cross Variograms, Positive-Definiteness, and Cokriging: Math. Geol.,
v. 23, p. 805-816.
Papritz, A., and Fliihler, H., in press, Temporal Change of Spatially Autocorrelated Soil Properties:
Optimal Estimation by Cokriging: Geoderma.
Priestley, M. B., 1981, Spectral Analysis and Time Series: Academic Press, London, 890 p.
Wackemagel, H., 1988, Geostatistical Techniques for Interpreting Multivariate Spatial Information.
in C. F. Chung, A. G. Fabbri, and R. Sinding-Larsen (Eds.), Quantitative Analysis of Mineral
and Energy Resources: D. Reidel Publishing Company, Dordrecht, p. 393-409.
Wackernagel, H., in press, Cokriging vs. Kriging in Regionalized Multivariate Data Analysis:
Geoderma.