+ All Categories
Home > Documents > Statistical Inferences of Soil Properties and Crop Yields as Spatial Random Functions

Statistical Inferences of Soil Properties and Crop Yields as Spatial Random Functions

Date post: 21-Dec-2016
Category:
Upload: asher
View: 212 times
Download: 0 times
Share this document with a friend
11
Statistical Inferences of Soil Properties and Crop Yields as Spatial Random Functions ESHEL BRESLER* AND ASHER LAUFER ABSTRACT Crop yields are often highly variable due (partly) to spatial het- erogeneity of soil properties. The conditional multivariate normal (CMVN) method is systematically described and applied to 24 spa- tial random functions. These include: 12 soil properties, four soil variables, and two yield components of each of four field crops [wheat (Triticum aestivum L.), vetch (Vicia sativum L.), corn (Zea mays L.), and peanut (Arachis hypogaea L.)]. The statistical parameters char- acterizing the joint probability density function in a two-dimensional field were estimated, from 20 to 60 measurements, by the maximum likelihood procedure. An estimation procedure for three parameters of four covariance models (piecewise linear, exponential, spherical, and Gaussian), constant mean, and linear drift, is described. It was found that the covariance function of all the 24 spatial functions can be represented by the four models and that there is a significant two-dimensional linear drift of the mean. The parameter that rep- resents the nugget effect (i.e. the variance of uncorrelated values of the spatial function) is estimated to be either zero or nonsignificant for the soil properties but has a small significant value for most crop yield components. Soil properties as well as crop yield components are characterized by the existence of a significant correlation scale on the order of 10 to 30 m. Estimations of conditional expectation, conditional covariance, the variance, and the variance of the esti- mated variance were made and demonstrate that the CMVN method can be applied to real field data. The specific covariance model used generally has a small effect on the estimated results. Comparisons with kriged values and kriging variance demonstrate the similarity and differences in the results. P ELD MEASUREMENTS have indicated consistently that soil properties as well as crop yields vary throughout the field space in an irregular manner (e.g., Biggar and Nielsen, 1976; Bresler et al, 1981, 1982; Nielsen et al., 1973; Russo and Bresler, 1981; Trang- mar et al., 1986). Values of these variables at different points in the field are subjected to uncertainty due to a limited number of observation points and because the measurements themselves are error prone. To deal with uncertainty and spatial variability, soil properties and various components of crop yield are regarded as random functions of the space coordinates, or as re- gionalized variables in the geostatistical terminology (Delhomme, 1978; Journel and Huijbregts, 1978; Matheron, 1971, 1973). The actual field is viewed as a given realization of an ensemble of all possible re- alizations, of which only one realization (comprised of the actual measurements) is available for the sta- tistical analysis. Hence, the statistical structure of any soil property or a crop yield component can be inferred only if some kind of statistical assumptions are made. Field mea- surements of a soil property or a crop yield compo- nent can be used for the inference of the probability density function (pdf) and its parameters. The con- ditional geostatistical approach uses the observations themselves to reduce the uncertainty in the values of the random function at any point in the field other than the observation points. The approach investi- gates the stochastic interpolation problem by using the kriging method (Delhomme, 1978; Journel and Huijbregts, 1978), which predicts the best unbiased estimates of the expectations and covariances condi- tion on the observations. The statistical structure (i.e. Dep. of Soil Physics, Inst. of Soils and Water, A.R.O., The Volcani Ctr., P.O. Box. 6, Bet Dagan 50-250, Israel. Contribution 2096-E, 1987 series, from the Inst. of Soils and Water A.R.O., Israel. Re- ceived 27 July 1987. "Corresponding author. Published in Soil Sci. Soc. Am. J. 52:1234-1244 (1988).
Transcript
Page 1: Statistical Inferences of Soil Properties and Crop Yields as Spatial Random Functions

Statistical Inferences of Soil Properties and Crop Yields as Spatial Random FunctionsESHEL BRESLER* AND ASHER LAUFER

ABSTRACTCrop yields are often highly variable due (partly) to spatial het-

erogeneity of soil properties. The conditional multivariate normal(CMVN) method is systematically described and applied to 24 spa-tial random functions. These include: 12 soil properties, four soilvariables, and two yield components of each of four field crops [wheat(Triticum aestivum L.), vetch (Vicia sativum L.), corn (Zea mays L.),and peanut (Arachis hypogaea L.)]. The statistical parameters char-acterizing the joint probability density function in a two-dimensionalfield were estimated, from 20 to 60 measurements, by the maximumlikelihood procedure. An estimation procedure for three parametersof four covariance models (piecewise linear, exponential, spherical,and Gaussian), constant mean, and linear drift, is described. It wasfound that the covariance function of all the 24 spatial functions canbe represented by the four models and that there is a significanttwo-dimensional linear drift of the mean. The parameter that rep-resents the nugget effect (i.e. the variance of uncorrelated values ofthe spatial function) is estimated to be either zero or nonsignificantfor the soil properties but has a small significant value for most cropyield components. Soil properties as well as crop yield componentsare characterized by the existence of a significant correlation scaleon the order of 10 to 30 m. Estimations of conditional expectation,conditional covariance, the variance, and the variance of the esti-mated variance were made and demonstrate that the CMVN methodcan be applied to real field data. The specific covariance model usedgenerally has a small effect on the estimated results. Comparisonswith kriged values and kriging variance demonstrate the similarityand differences in the results.

PELD MEASUREMENTS have indicated consistentlythat soil properties as well as crop yields vary

throughout the field space in an irregular manner (e.g.,Biggar and Nielsen, 1976; Bresler et al, 1981, 1982;Nielsen et al., 1973; Russo and Bresler, 1981; Trang-

mar et al., 1986). Values of these variables at differentpoints in the field are subjected to uncertainty due toa limited number of observation points and becausethe measurements themselves are error prone. To dealwith uncertainty and spatial variability, soil propertiesand various components of crop yield are regarded asrandom functions of the space coordinates, or as re-gionalized variables in the geostatistical terminology(Delhomme, 1978; Journel and Huijbregts, 1978;Matheron, 1971, 1973). The actual field is viewed asa given realization of an ensemble of all possible re-alizations, of which only one realization (comprisedof the actual measurements) is available for the sta-tistical analysis.

Hence, the statistical structure of any soil propertyor a crop yield component can be inferred only if somekind of statistical assumptions are made. Field mea-surements of a soil property or a crop yield compo-nent can be used for the inference of the probabilitydensity function (pdf) and its parameters. The con-ditional geostatistical approach uses the observationsthemselves to reduce the uncertainty in the values ofthe random function at any point in the field otherthan the observation points. The approach investi-gates the stochastic interpolation problem by using thekriging method (Delhomme, 1978; Journel andHuijbregts, 1978), which predicts the best unbiasedestimates of the expectations and covariances condi-tion on the observations. The statistical structure (i.e.Dep. of Soil Physics, Inst. of Soils and Water, A.R.O., The VolcaniCtr., P.O. Box. 6, Bet Dagan 50-250, Israel. Contribution 2096-E,1987 series, from the Inst. of Soils and Water A.R.O., Israel. Re-ceived 27 July 1987. "Corresponding author.

Published in Soil Sci. Soc. Am. J. 52:1234-1244 (1988).

Page 2: Statistical Inferences of Soil Properties and Crop Yields as Spatial Random Functions

BRESLER & LAUFER: STATISTICAL INFERENCES OF SOIL PROPERTIES AND CROP YIELDS 1235

the variogram) which has been inferred by some ar-bitrary averaging procedure, is assumed to be knownwith certainty. The fact that the parameters charac-terizing the pdf are also random variables has beengenerally disregarded. An exception is the work of Ki-tanidis and Vomvoris (1983). Dagan (1982, 1985) sug-gested an alternative approach based on the classicalstatistics of conditional probability assuming multi-variate normal (MVN) pdf (Mood et al., 1974).

Unfortunately the estimated parameters character-izing the MVN pdf are random variables because theymust be inferred from observations of a finite sample,and they are thus subjected to uncertainty. Recogniz-ing this additional source of uncertainty, Feinermanet al. (1986) developed a method to infer the spatialdistribution of the statistical moments of a stationarynormal spatial function, based on a finite set of mea-sured values. Estimates of the expectations, covari-ances, and variances at any arbitrary point in the fieldwere demonstrated. The estimates were conditionedon simulated observations points, however, rather thanon real measurements.

In this paper the new conditional multivariate nor-mal (CMVN) approach (Feinerman et al., 1986) willbe applied to a large set of real field data comprisedof soil properties, soil variables, and various crop yieldcomponents. These observed data were inferred to bedrawn from a population with normal or lognormaldistribution (e.g., Bresler et al., 1981; Russo and Bres-ler, 1981; Stern and Bresler, 1983). The methodologyfor CMVN pdf is systematically described. Subse-quently, the set of statistical parameters characterizingthis pdf is estimated for 24 soil properties and cropyield components. Estimations are carried out withthe aid of the maximum likelihood procedure (Theil,1971) and applied, using four covariance models, forthe measured data representing two-dimensional ran-dom functions. The results are then analyzed and pa-rameter estimations of different covariance models arecompared. Results of estimated conditional MVN ex-pectation, covariance and variance, and the varianceof estimated variance, are illustrated. Numerical val-ues of the inferred expected values and variances, fordifferent covariance models, are compared. Resultsbased on kriging are also compared with CMVN re-sults.

STATISTICAL PRELIMINARIESConsider a soil property or any component of crop yield

as a generic two-dimensional random function, or a region-alized variable. It has a value z(x) with x being the coordi-nate vector of a point in an agricultural field. As in Feiner-man et al. (1986), it is assumed that z = (z,, z2, . . ., ZK) isa vector or random variables, the statistical structure of whichis denned by the joint pdf/(z,, z2, . . . ZK) at an arbitrary setofK points (x,, x2, . . ., \K) within the field domain. It is alsoassumed that z, or its residual vector defined by z' = z —E(z), is stationary in the ordinary sense (i.e., that fa) is in-variant under translation in the field space of the points x,)and that fa) is a nonsingular multivariate (A"-variate) nor-mal pdf/z) - (27r)-^2|fl|-"2exp{-0.5[z -

X [z - E(z)]} . [I]Here fl > 0 is the covariance matrix, |fl| its determinant, Q~'

its inverse, superscript T denotes transpose, and E(z) is theexpected value of z defined by

£(z) = Jz./(z)rfz. [2]The covariance fi is defined for two arbitrary points (x,,

XH), by, x,,) =

= ff2p(xi,xn) [3]where a2 is the variance, p is the autocorrelation of z'(x,)with z'(x,,), and the covariance Q(x,, xn) depends only onthe lag r, r = |xn — x,|. In particular, for r = 0 the variance

= fl(x,, x,) = Q(xn, xn) = «2 [4]is constant. It should be emphasized that the definition ofstationarity can be generalized to consider, in addition tothe case of constant mean (i.e., E(z) = a,), the case whena, is not constant and z has a trend with mean E(z) = «i(x).Thus, when z has a drift its residual z' is stationary in anordinary sense.

Let us subdivide the vector of z of size .AT into z = (Y, y)where Y and y are subvectors of z; Y = (Yt, Y2, . . . , YN)represents the random sample of size N, and y = (y,, y2, . . .. WA/) of size M with M = K — N. The conditional proba-bility density function of y given Y is obtained from

= Ay, Y)/A(Y) = [5]where Xy»Y) is the joint pdf of y and Y, and /z(Y) is themarginal pdf of Y. Since z has a multivariate normal (MVN)pdf (Eq. [1]), the marginal pdf of its subvectors are alsoMVN, characterized by their expectations £(y) and E(Y) andby their covariances J2y and QY (e.g., Press, 1982, p. 71-72).The conditional pdf of y given Y = Y* (Eq. [5]) is alsonormal with the mean vector a linear function of Y* andthe covariance matrix independent of Y*, where Y* is a vec-tor of observed values of the random variable. That is (e.g.Press, 1982, p. 73)£[y(x)|Y = Y*]

+ nyY.Q[y(x)|Y = Y*]

where QyY is defined by

[Y* - EV.(Y*)] [6]flyY. (QY.)-' 0Y.y [7]

[8]INFERRING THE STRUCTURE OF A SPATIAL

RANDOM FUNCTIONThe uncertainty of the random function z(x) can be re-

duced by making the inference of the statistical structureconditional on a subset of given realizations y(x,), F(x2),. . ., Y(xN), i.e. on a vector of measured values of the soilproperty, or crop yield, Y(y,, Y2, . . . , YN). For each arbitrarypoint in x, the expected value of y(\), conditional on themeasurement vector Y, can be inferred in the assumed sta-tionary system from Eq. [6] and [7], using

£[y(x)|Y] = «,(Y)|(x) + £ X/x)7,7-1

where X/x) and £(x) are defined by[9]

[10]

[11]Note that here and hereafter the hatted variables denote es-timates which are therefore random variables. For two ar-bitrary points x,, x,,, the conditional MVN pdf, j(y\,yn\Y), is

Page 3: Statistical Inferences of Soil Properties and Crop Yields as Spatial Random Functions

1236 SOIL SCI. SOC. AM. J., VOL. 52, 1988

bivariate and is characterized, in addition to the conditionalexpectation, Eq. [9], by the two-points covariance (C) con-ditional on the measurement vector Y, and is estimated (seeAppendix I for details) from:C(x,, x,,|Y) = [12]

Here, the estimate ffoLh xlt) is the two-point equivalent ofthe expression

(see Eq. [7], [8], [3], and [4]) and is denned by

x,,) = p(x,, x,,) - . [13]

Hence, for any arbitrary point x in the field, the N valuesof X,(x) (Eq. [10]) to be used in Eq. [9], [11], and [13] areobtained from the solutions of the linear systemN

Tx/x) C(x,, x,) = C(x, x,) [14]

or (using EQ. [3]) from the solution of

i/x) p(x7, x*) = p(x, XA.) [15]j-i

k=l,2,...,Nwhere x, and XA are measurement points.

The best attainable estimate of the estimated conditionalcovariance C is given by the expected value [E(Q] of C inEq. [12], considering the estimators a, and «2 to be deter-ministic (Feinerman et al., 1986), and calculated from

; x,,)] = C(xI)XlI|y)

= a2(Y)7?(x,) . [16]

Equation [16] is a general formula for conditional MVNcovariance by inclusion of the last term, which incorporatesthe uncertainty of the estimate 5,, i.e., a|,. Since the estimateC is a random variable, its variance can be estimated bytaking the variance over C in Eq. [16] as

Var (Q ,, x,,) . [17]

For x, — xn = x, Eq. [16] and [17] predict the estimatedvariance of the error and the estimate of its variance, re-spectively, as

t, x) = «2(Y)n(x, x) +

x).

(x, x) [18]

[19]

Given a set of measured values ¥(7,, Y2, ... , YN) takenfrom field locations x,, x2, ... \N, the aim is to predict, orto estimate at any arbitrary point (x) in the field, the ex-pected value of y, the best attainable estimate of the covar-iance, and also the variance of its variance. The predictionproblem is thus to find

E[y(x)], C = E[C(x)], and <r£<x)conditional on the measured values Y(Ylt ... , YN). Sincethe true values are not known, and the only available in-formation is the set of W measured values Y, this aim isobtained from the estimations given in Eq. [9], [16], or [18],and [17] or [19], based on the measurements vector y.

ESTIMATIONS OF FIELD PARAMETERS BYTHE MAXIMUM LIKELIHOOD METHOD

To infer the conditional moments Eq. [9] and [12], theuncertainty involved in the estimated parameters a (i.e., «i,a2, p) must be taken into account. As these parameters arenot known, their values must be estimated from the onlyavailable information about y, i.e., from the measurementsvector Y = (7,,..., YN). By using the maximum likelihood(ML) method applied tofy), Eq. [1], a set of estimates a(Y)are obtained for the vector of parameters a. For a sufficientlylarge number of observation points and under quite generalconditions it is generally assumed that the estimates a areasymptotically normal, unbiased, and efficient estimates i.e.,they are of minimum variance equal to the lower bound ofthe Cramer-Rao inequality (Theil, 1971, p. 392-395). More-over, the maximum likelihood (ML) procedure provides anestimate of the variance-covariance matrix of the ML esti-mates of a which can serve for statistical inference of theestimated values of a. Hence, parameter estimations madewith the aid of the maximum likelihood (ML) procedure areconsidered to be a very powerful statistical method (Kitan-idis and Vomvoris, 1983). To simplify matters further, theML variance-covariance matrix will be taken here as deter-ministic, although strictly speaking, it is also random, be-cause it is an estimator.

Models for the Statistical StructureMany models may be considered to describe the statistical

structure (first two moments) of soil properties and the re-sultant crop yield. Two models were selected in this studyfor the mean, and four models were studied for the covar-iance of each soil property or crop yield component. For themean, constant mean and linear drift have been examined.For the covariance (Eq. [3]), the four models are: (i) linearcovariance (C); (ii) exponential C; (iii) spherical C, and (iv)Gaussian model for C.

These models are formulated as follows(a) Constant mean a, = 00 [20](b) Linear drift «j = /?„ + /3, x, + 02 x2 [21]

where x, and x2 are the horizontal, two-dimensional spatialcoordinates.(i) Linear covariance: C(r) = a2[l — (a4/a2)][l —

(r/a3)] for r < a3

C(r) = 0. for r > a3 [22](ii) Exponential covariance:

C(r) = a2[l - («4/«2)]exp(- r/«3) [23](iii) Spherical covariance: C(r) = «2[1 — a^/a2]

[1 - (1.5 r/a3) + 0.5(r/«3)3] for r < «3

C(r) = 0. for r > a3 [24](iv) Gaussian covariance: C(r) = a2{exp[ — (r/a3)2]

[1 - («4/«2)]}. [25]Here again, r is the separation distance, al is the expected

value, «2 is the variance, and a3 is the correlation rangerelated to the integral scale. Specifically, «3 is equivalent tothe linear integral scale of model (ii) in which estimatedvalues of «3 represent the spatial correlation (i.e., the dis-tance over which a soil property or a crop yield componenttends to be uncorrelated); while «4 is the value of the whitenoise or the nugget, and is a measure of the nonstructuralvariability, i.e., small scale variance. The parameters ft, i =0, ... , 2 are the linear drift constants.

The ML method has been used to estimate «,-, / = 1, . . . ,4, and ftj, j = 0, ... , 2 (depending on the existence or

Page 4: Statistical Inferences of Soil Properties and Crop Yields as Spatial Random Functions

BRESLER & LAUFER: STATISTICAL INFERENCES OF SOIL PROPERTIES AND CROP YIELDS 1237

nonexistence of a linear trend or drift) for each data set ofany soil property, or its log transformation, and any cropyield component. To derive the maximum likelihood esti-mates of a, and 0,-, the method ofSchweppe (1973) has beenused (similar method was adopted by Kitanidis and Vom-voris, 1983). The procedure involves the minimizing of thenegative log of the MVN (Eq. [1]) for the measured valuesYI, Y2,..., YN. The solution (see Appendix II) was obtainedwith the aid of the MINOS computer program (Saunders,1980), which had been designed to solve general nonlinearminimizing problems. The search algorithm was inaugu-rated as follows: for the parameters /30, 0i, and 02 by a linearregression analysis of the Y data as dependent variables andthe spatial coordinates x{ and x2 as independent variables;for the parameters a2, «3, and «4 from a preliminary vario-gram calculation with a constant number of points to av-erage a given lag r.

DATA SETS OF TWO-DIMENSIONALRANDOM FUNCTIONS

The data to be used for the statistical inference are fieldmeasurements of certain soil properties and a few crop yieldcomponents along with spatial coordinates of the locationsof the observation points in the field from which the mea-surements were taken. Russo and Bresler (1981) provided aset of field-measured data on six different hydraulic prop-erties of a Rhodoxeralf soil at Bet Dagan, Israel. These prop-erties are saturated hydraulic conductivity (Ks), air entry val-ues (hw), sorptivity (5), saturated and residual water contents(6S, and 8n respectively), the constant /J characterizing thesoil water characteristic curve, and the constant f, the powerin the unsaturated conductivity function (both are constantsin the Brooks-Corey equations). Additional soil data whichhave been obtained from the samples taken from the samelocation and at the same time are: soil solution concentra-tion of Na and Ca given as sodium adsorption ratio (SAR),electrical conductivity of the soil solution (EC), and soil tex-ture data (percent clay, percent sand, and percent silt).

Crop yield data were collected from the same sites in thesame Bet Dagan field on two components of peanut yield(Bresler et al., 1981) and sweet corn yield (Stern and Bresler,1983). Additional sets of yield data (Y) and seasonal root-zone averages of water contents (6,, and 8W) were measuredduring the winters of 1980-1981 and 1981-1982 on vetchhay and wheat (Triticum durum Dest). A summary list ofdata sets of 24 spatial random functions is given in Table1. The sample sizes of the data sets are also given in Table1. Note that a sample size smaller than N = 20 does notcontain sufficient information for reasonably accurate esti-mates of the six parameters (if trend is assumed) and evennot for the four parameters (in the constant mean models).On the other hand, data sets sizes > 60 cause computationaldifficulties associated with transforming the covariance mat-rices and inverting them in the ML procedure.

RESULTS OF ESTIMATED PARAMETERSTable 2 summarizes the ML analysis which was per-

formed on each data set (Table 1) using the four Co-variance models with a linear drift (Eq. [21]-[25]). Thistable gives the best estimated values of the vector a,i.e., 0o, £i, 02, «2> «3) and «4, and also the significancelevels of the estimated parameters. For this purposethe Students' t parameters were calculated from /„ =a/a;, and Mests were performed to test the hypothesisof zero a values, i.e., H0:a = 0. The NS superscriptsadjacent to the numbers in Table 2 indicate that theestimated values of these parameters are nonsignifi-cant at the 0.05 level. One asterisk denotes the 0.05level of significance, and two asterisks denote the 0.01

Table 1. A list of variable names, sample sizes and data sourcesof 12 soil properties, four soil variables, and yield componentsof four field crops.

VariableNo. Type Name

Sample———— size SourcesSymbol (JV) of data

1. Soil2.3. Properties4.5.6.78.9.

10.11.12.

13. Soil14.15. Variables16.

17. Crop18. Yield19. Components20.21.22.

23.24.

Saturated conductivityLog scaling parameterPower in unsaturated KPower in retention curveLog in retention curveAir entry valueResidual water contentSaturated water contentSorptivity% clay% silt% sand

Electrical conductivitySodium adsorption ratioAverage 9 for vetchAverage 8 for wheat

Wheat, dry matterWheat, grainVetch, hay balesVetch, dry matterCorn, kernelsCorn, dry matter

Peanuts, dry matterPeanuts, pods

K,rHet0tln/StA*T9r8,s---ECSAR«„

W

Ywi* Wg

YhcYMYckYcf

YpdYpp

303030303030303030303030

30306058

58•5860603030

2020

Russo andBresler,1981

Bresleret aL, 1984

Bresleret al., 1984

New dataNew dataNew dataNew dataNew dataNew dataStern and

Bresler,1983

Bresleret al., 1982

t T = logtlffs/X*)1'2], and e and 0 are Brooks-Corey parameters (for details,see Dagan and Bresler, 1979).

level of significance; all the other nonsuperscripted es-timated parameters in Table 2 are significant at levels> 0.01.

An examination of Table 2 suggests several obser-vations. First, all soil properties and yield componentsare characterized by a nonconstant mean and the ex-istence of a significant linear trend in the data. Thissuggests that the residuals Y' should be considered forthe analysis, and that Eq. [21], in combination withEq. [22] to [25] (with or without a4 = 0), gives muchbetter estimates than Eq. [20] with [22] to [25], al-though the latter may be preferred because it containsless parameters than the former (i.e., 3 or 4, as com-pared with 5 or 6). Second, in general, soil propertiesand yield components are characterized by the exist-ence of a significant (a3 > 0) correlation scale on theorder of 10 to 30 m. Out of the 14 soil parameters andfield properties, a statistically significant nonzero cor-relation scale was calculated in 12 cases, and in onlytwo cases the «3 estimates were not statistically sig-nificant. This does not necessarily indicate a lack ofspatial correlation in these two parameters because thevalues of the nonsignificant correlation lengths («3)were generally comparable with the correlation scaleof the remaining 12 significant cases (excluding theEC case). The latter exceptional case may be indica-tive of the difficulties in determining the range in whichthe autocorrelation exists. All yield components arealso characterized by a significant correlation scale,the value of which is generally smaller than the esti-mated equivalent value for the soil properties (a3 is,in general, between 10 and 30 m for soil variables andbetween 2 and 10 m for various crop yield compo-nents).

Page 5: Statistical Inferences of Soil Properties and Crop Yields as Spatial Random Functions

1238 SOIL SCI. SOC. AM. J., VOL. 52, 1988

Table 2. The ML values of the estimated parameters,

Estimated parameters

Variable

1. Ks (cm/h)

2. T

3. e

4.0

5. In 0

6. hw (cm)

7. 6,

8. 9S

9. S, (cm/Vs)

10. % Clay

11. % Silt

12. % Sand

13. EC (dS/dm)

Model

III

IIIIVVI

III

IIIIVVI

III

IIIIVVI

III

IIIIVVI

III

IIIIVVI

III

IIIIVVI

III

IIIIVVI

III

IIIIVVI

III

IIIIVVI

III

IIIIVVI

III

IIIIVVI

III

IIIIVVI

III

IIIIVVI

00

23.822.622.922.612.5

1.771.741.781.751.085.045.055.055.053.421.521.521.521.520.700.810.830.810.810.09.09.08.99.07.40.0240.0280.0250.0250.070.320.330.320.320.360.0860.0860.0850.0850.0693.854.893.703.83

13.960.350.080.160.253.87

97.095.096.396.082.253.653.653.653.629.2

01

-0.12-0.11-0.12-0.11

-0.0076-0.0073-0.0077-0.0075

-0.020-0.019-0.019-0.019

-0.0097-0.0097-0.0097-0.0097

-0.015-0.015-0.015-0.015

-0.012-0.013-0.013-0.013

0.000570.000510.000570.00058

0.000540.000510.000540.00057

-0.00011*-0.00011*-0.00009-0.00009

0.0930.0740.0930.095

0.0450.0420.0450.045

-0.17-0.12-0.14-0.14

-0.34-0.35-0.34-0.35

02

-0.09-0.08-0.08**-0.08**

-0.0069-0.0064**-0.0069**-0.0067**

-0.015-0.015-0.015-0.015

-0.0074-0.0075-0.0075-0.0075

-0.013-0.014-0.014-0.013

-0.022-0.022-0.021-C.022

0.000550.000530.000520.00050

0.000310.000210.000310.00031

-0.00025-0.00028*-0.00027-0.00027

0.1310.1340.1380.131

0.0280.0390.0340.031

-0.16-0.17-0.17-0.16

-0.18NS-0.18NS-0.18NS-0.18NS

02

7.859.739.26

10.7030.4NS0.0790.0840.0790.0880.170.140.140.140.140.580.0350.0350.0360.0360.170.110.110.110.110.511.441.441.441.441.80.000310.000330.000310.000330.000880.000660.000680.000660.000650.000990.00013**0.00014**0.000140.000140.00022*

13.817.714.014.427.3

3.503.673.323.495.61

34.534.027.928.452.986.076.276.276.2

150.0

03

20.414.532.616.758.2NS17.29.74*

21.5**13.028.820.0NS20.3NS22.9NS21.2NS26.020.0NS25.322.817.348.212.813.315.27.9

26.311.311.79.8

11.94.3

17.211.622.411.943.65.08.65.07.3

18.328.420.051.327.334.6NS23.322.533.715.017.721.613.426.313.015.721.621.031.914.417.15.9NSO.ONS6.0NSO.ONS

18.7**

enO.ONSO.ONSO.ONSO.ONS0.0

O.ONSO.ONSO.ONSO.ONS0.00.123*0.1420.13NS0.07NS0.00.030.040.0330.0360.00.00.110.00.0050.01.441.440.01.440.00.00.00.00.00.00.00.00030.00050.00.00.00.00.00.00.00.00.00.00.00.00.00.00.0

0.00.00.00.0

5.2NS10.5NS2.71.10.0

(continued on next page)

The third general observation is related to the nug-get effect as represented in Table 2 by the estimatedvalue of a4. Generally speaking, most soil propertiesare characterized by the lack of a nugget effect (esti-mated values of a4 are either zero or nonsignificant).Again, out of the 14 soil properties and field variables,

only five soil parameters show nonzero «4 estimatesbut just one or two exhibit nugget effects with signif-icant estimates of «4 values. These cases were gener-ally associated with soil parameters which are difficultto determine and may therefore show a relatively highmeasurement error. Unlike the soil observations, most

Page 6: Statistical Inferences of Soil Properties and Crop Yields as Spatial Random Functions

BRESLER & LAUFER: STATISTICAL INFERENCES OF SOIL PROPERTIES AND CROP YIELDS 1239

Table 2. Continued.

Estimated parameters

Variable Model 0o 0i 02

14. SAR

15. 8W Average 9 wheat

16. 9V Average 9 vetch

17. Wheat dry matter(kg/4 m2)

18. Wheat grain(kg/4 m2)

19. Vetch bales(kg/4 m2)

20. Vetch dry matter(kg/4 m2)

21. Corn kernels(kg/8 m2)

22. Corn dry matter(kg/8 m5)

23. Peanut dry matter(kg/4 m2)

24. Peanut pots(kg/4 m2}

III

IIIIVVI

III

HIIV

III

IIIIV

III

IIIIVVI

III

IIIIVVI

IIIIIIIVVI

III

IIIIVVI

III

IIIIVVI

III

IIIIVVI

IIIIIIIVVI

III

IIIIVVI

11.811.811.811.98.40.0280.0330.0310.0290.0630.0630.0580.0594.594.554.594.555.51.651.661.651.692.069.038.868.898.74

17.572.232.212.22.213.9

20.820.820.822.026.111.311.311.311.311.8

198200200200253213217218218275

-0.051-0.048-0.05-0.05

0.000450.000410.000410.000450.000430.000490.000550.000540.011*0.011**0.011**0.011**

0.0027**0.0028*0.0027*0.0026*

0.0530.0570.0580.060

0.0160.0160.0160.016

0.020NS0.020NS0.019NS0.020NS

-0.003-0.003-0.003-0.003

0.58*0.58*0.58*0.58*

0.65*0.65*0.64*0.64*

-0.037**-0.035**-0.037-0.034*

0.00130.00130.00130.00130.00180.00180.00180.00180.013*0.013*0.013*0.013**

0.00800.00770.00790.0070

0.180.180.180.17

0.0280.0280.0280.028

0.12NS0.1 2NS0.12NS0.1 2NS

0.02NS0.02NS0.02NS0.02NS

0.62NS0.55NS0.56NS0.56NS

0.87*0.60NS0.71*0.70*

3.373.473.373.657.43*0.000310.000380.000320.000410.000470.000500.000470.000510.660.630.660.640.810.0820.0810.0810.0810.11

15.015.014.214.530.10.900.910.910.911.5

30.130.230.230.536.25.45.45.45.45.5

1899189918991518190223502349234923492352

20.0**13.4*23.7**29.6**29.3*13.413.421.910.118.912.321.810.9115.55.6**

17.15.08.7

22.26.8*

27.3*2.86*4.7

10.710.6*14.114.6*14.613.5**8.7

23.7*11.0*5.40.00.00.00.03.7NS9.0NS5.7NS6.03.1NS0.0

12.0NS0

12.3NS4.8NS

11.5**20.00.0

22.0*10.4**11.4*

1.2NSl.ONS0.9NS2.1**0.0000000000.210.0003NS0.11NS0.017NS0.00.0570.049*0.0570.014NS

4.1*5.2*4.4*8.4*

0.67**0.710.710.71

3.44.83.30.00.05.5NS5.4NS3.55.4

0000

0000

10o, 0i, and fa (Eq. [21]) and «2, 03, and 04 (Eq. [22]-model I, Eq. [23]-model II, Eq. [24]-model III, and Eq. [25]-model IV), and ML estimates of 0o (Eq.[20]) 02, a3, and at (Eq. [23]-model VI).

t The numbers without superscripts designate the significant level of the estimated parameters greater than 0.001; * and ** designate significance at thelevels of 0.05 and 0.01, respectively; and NS means nonsignificant estimates.

yield components have a nonzero significant nuggeteffect or small scale variability. This probably resultsfrom genetic or other plant variations rather than frommeasurement errors or other small scale effects of soilproperties on crop yields.

A summary of the data relative to the estimatedparameters of the trend, variance, correlation scale,and the nugget, for each soil property and crop yieldcomponent (Table 2), indicates that the covariancesof the random functions in Bet Dagan are describedin a similar manner by all four models. In general, theestimated parameters (except «4 and a few «3 cases)are highly significant in all the models.

CRITERIA FOR SELECTING THE "BEST"COVARIANCE MODEL

The maximum likelihood (ML) method leads to op-timum parameter estimates for one of the models givenby Eq. [20] to [25], with no indication which of themodels provides the best estimate of the real system.For those cases in which all parameters in all fourmodels are statistically significant at the same level ofsignificance, several objective criteria, based on thelikelihood concept, can be used to rank the alternativemodels. Carrera and Neuman (1986) suggested fourdiscrimination models by which the most appropriateML model can be identified,

Page 7: Statistical Inferences of Soil Properties and Crop Yields as Spatial Random Functions

1240 SOIL SCI. SOC. AM. J., VOL. 52, 1988

Table 3. Model structure discrimination for the data sets of Table Table 3. Continued.1 and for tl

Variable

1. K, (cm/h)

2. 7

3. e

4.0

5. Ing

6. hw (cm)

7. 6r

8. Os

9. S (cm/VS)

10. % Clay

11. % Silt

12. % Sand

he eight 1

Model

III

IIIIVVI

III

IIIIV

VIII

IIIIIIVVI

IIIIIIVVI

III

IIIIVVI

IIIIIIVVI

III

IIIIVVI

III

IIIIVVI

III

IIIIVVI

III

IIIIV

VIII

IIIIIIV

VIIII

IIIIIIV

VIII

nested miL

Eq.[A5, PI]

71.359.7*72.070.276.72.93.93.32.7*6.4

13.2*13.213.213.228.1-7.5-7.5*-7.5-7.5

1.59.2*9.69.39.4

25.948.148.148.0*48.151.2

-80.2-79.8-79.9-80.3*-76.7

~-68.4*-67.2-67.8-66.0-97.7-96.4-97.7*-97.7-94.85

77.179.077.976.4*82.957.6*59.258.358.761.988.689.488.887.9*93.6

adels.TAICEq.[1]

152.6129.4*154.0150.4159.415.817.816.615.4*18.836.5*36.536.536.562.2-5.0-5.0*-5.0-5.0

9.028.4*29.328.728.857.8

106.2106.2106.0*106.2108.4

-150.4-149.6-149.7-150.6*-147.3

--126.8*-124.4-125.6-126.0-183.4-180.8-183.4-183.4-183.8*

164.2168.0165.8162.8*171.8125.2*128.4126.6125.6129.8187.2188.8187.6185.8*193.2

BICEq.[2]

159.6136.4*161.0157.4163.422.824.823.622.4*23.043.4*43.543.543.566.42.02.0*2.02.0

13.235.4*36.335.735.852.1

113.2113.2113.1113.2112.6*

-143.4-142.6-142.8-143.6*-143.3

-119.8-117.4-118.6-121.8*-178.4-175.8-185.4*-185.4-179.8

171.2175.0172.8169.8*176.0132.2*135.4133.6132.6134.0184.2185.8184.6182.8*197.4

0Eq.[3]

148.7125.5*150.1146.3157.111.913.912.711.5*16.532.5*32.632.632.659.9-8.9-8.9*-8.9-8.9

6.724.5*25.424.825.055.5

102.3102.3102.0*102.2106.2

-154.3-153.5-153.8-154.6*-149.6

--130.7*-128.3-129.5-125.9-189.3-186.7-189.3*-189.3-185.9

160.3164.0161.9158.9*169.5121.3*124.5122.7121.7127.5183.3184.9183.7181.9*190.9

dEq.[4]

161.1131.5*159.0158.4172.355.753.553.855.112.2*67.626.5*65.558.855.437.6

-35.0*32.7

-15.05.7

64.229.6*61.763.850.648.4

117.548.1*99.652.0

-71.6-75.5-73.7-71.4

-166.8*

-57.40.6

-54.9-148.3*

--89.8--

-207.9*170.8165.3*167.7167.8176.7139.7137.4138.3139.6129.9*185.8181.6*184.6185.7200.0

(continued)

Variable

13. EC (dS/dm)

14. SAR

15. Average 6wheat

16. Average 8vetch

17. Wheat drymatter(kg/4 m2)

18. Wheat grain(kg/4 m2)

19. Vetch bales(kg/4 m2)

20. Vetch drymatter(kg/4 m2)

21. Corn kernels(kg/8 m2)

22. Corn drymatter(kg/8 m2)

23. Peanut drymatter(kg/4 m2)

24. Peanut pods(kg/4 mj)

'* That is, I. Eq.

Model

III

IIIIVVI

III

IIIIVVI

III

IIIIV

IIIIIIIV

III

IIIIVVI

IIIIII

IVVI

IIIIIIVVI

III

IIIIVVI

III

IIIIVVI

III

IIIIVVI

III

IIIIVVI

III

IIIIVVI

[21] andIV. Eq. [21] and [25]; V.f9Hl nnH T941' flTl r f V T T I U

L A ir* RTP/\1L> 1311̂Eq. Eq. Eq.

[A5, PI] [1] [2]

107.7 225.4 232.4107.6* 225.2* 232.2*107.6 227.2 232.2107.6 227.2 232.2116.0 238.0 242.259.8 129.6 136.659.7 129.4 136.460.6 130.6 137.659.0* 128.0* 135.0*63.5 133.0 137.0

-166.2 -322.4 -311.9-164.2 318.4 -307.9-166.0 -322.0 -311.5-167.1* -324.1* -313.7*-162.1* -314.2* -303.7*-159.2 -308.4 -297.9-160.6 -311.2 -300.7-159.6 10.0 20.5

64.1 138.2 148.763.4 136.8 147.363.7 137.4 147.963.0* 136.0* 146.565.6 137.1 143.4*7.7* 25.4* 35.9*8.2 26.4 36.97.9 25.8 36.38.0 26.0 36.5

22.7 51.4 56.7161.0 332.0 342.5160.7* 331.4* 341.2*161.4 332.8 343.3160.8 331.6 342.1166.9 339.8 346.181.45 172.9 183.481.63 173.2 183.781.4 172.8 182.4*81.3* 172.6* 183.291.3 188.6 194.993.9 197.8 202.6*93.7* 197.4* 204.493.7 197.4 204.493.8 197.6 204.696.8 199.6 203.667.7 145.4 152.467.7 145.4 152.467.7 145.4 152.467.7* 145.4 152.468.2 142.4* 146.6*

101.8 213.6 218.5101.9 213.8 218.7101.9 213.8 218.7101.6* 213.2 218.1103.5 213.0* 216.0*103.9* 217.8 222.8104.0 218.0 223.0104.0 218.0 223.0104.0 218.0 223.0105.7 217.4* 220.4

[22]; II. Eq. [21] and [23]; III

<t>Eq.[3]

221.5221.3*221.3221.3235.7125.7125.5126.7124.1*130.7

-318.3-314.3-318.9-320.0*-300.1*-294.3-297.1

24.1142.3140.9141.2140.1*140.229.5*30.529.930.153.9

336.1335.5*336.9335.7337.4177.1177.2176.8176.6*191.1193.9*199.6199.6199.8200.9147.6147.6147.6147.6143.7*210.2210.4210.4209.8*213.6218.8219.0219.0219.0218.0*

dEq.[4]

215.4214.3*215.2237.5232.0141.7138.2140.9137.2*139.2

-233.7-237.1-237.2*-230.5-229.3*-228.9-228.7

93.5169.9168.1168.2172.8133.4*69.470.068.373.839.7*

343.3341.7*342.8342.7344.8201.1199.5198.9200.5185.0*128.6127.2*-

-

-

-162.1*..

166.7165.9178.4182.4-

179.0*179.5218.7

. Eq. [21] and [24];Eq. [20] and [22]; VI. Eq. [20] and [23];n F9A1 onrl f9^1 The ooforiob-o Aooifmntji 1

VII. Eq.Khp mnHfil

with the lowest discrimination criterion.

(i) AIC(a) = 2L(a) + 2M [26](ii) BIC(a) = 2L(a) + M ln(N) [27]

(iii) *(«) - 2L(a) + CMln[ln(7V)] [28](iv) d(a) = 2L(a) + In (N/2*) + In \F\ [29]

Here, a is the parameter vector as estimated by theML method, L is the minimum value of the negativelog-likelihood for the fitted model, N is the number of

observations points, Mis the dimension of significanta, i.e., the number of independently adjusted signifi-cant (suitable) parameters of the model, C ^ 2 is aconstant (normally C = 2), and \F\ is the determinantof the Fisher information matrix (see Theil,, 1971).The "best" selected model is the one with a minimumcriterion (AIC, BIC, $, or d) value. Obviously, wheneverything but M is the same, the model with thesmallest number of estimated parameters (M) will have

Page 8: Statistical Inferences of Soil Properties and Crop Yields as Spatial Random Functions

BRESLER & LAUFER: STATISTICAL INFERENCES OF SOIL PROPERTIES AND CROP YIELDS 1241

the lowest criterion value and therefore be selectedwith its estimated values of a being used as the bestchosen model. This criterion must be accompanied bythe requirement that these values of a are statisticallysignificant at any desired level.

MODEL STRUCTURE DISCRIMINATIONThe eight possible models given by Eq. [20] to [25]

are denoted as follows: (i) For linear drift, I-Linearcovariance, II-Exponential covariance, Hi-Sphericalcovariance, and IV-Gaussian covariance, (ii) For con-stant mean, V-Linear, VI-Exponential, Vll-Spherical,and VIII-Gaussian. To identify the best model struc-ture among the eight options given by these combi-nations, the criteria in Eq. [26] to [29] have been ap-plied to each model using the data sets of Table 1.Results of structure discrimination of five differentmodels are summarized in Table 3. The table lists, foreach set of data of soil properties or crop yield com-ponents, the minus log-likelihood L of Eq. [A5], thefour discrimination criteria (Eq. [26]-[29]), and theoptional models designated by the Roman numbers.All four options of the linear drift model (I-IV) arelisted in the table but only one—the best constant meanmodel—is given for illustration purposes. It should beemphasized that since, generally, the estimated nuggetparameters («4) are zero or statistically insignificant(see Table 2), the number of independent and signif-icant estimated parameters for the data given in Table2 is either M = 5 (/30, ft, ft,, a2, and «3, for the lineardrift models) or M = 3 (/30, a2, and «3). The preferredmodel among the five listed in Table 3 for each soilproperty or crop yield component, is the one with thelowest discrimination criterion. This model is markedby the asterisks attached to the appropriate numberin the table.

An examination of Table 3 shows that, generally,the constant mean model is not necessarily the pre-ferred, although it contains two less significant param-eters. It is also important to note that generally onlysmall differences exist between the various values ofthe calculated criteria, for the same number of param-eters and the same sample sizes. This may indicatethat one can use either one of models I to IV with thesame level of confidence. A similar conclusion mayalso be drawn for models V to VIII because the sametendency was observed. The fact that the five-param-eter models, although containing two more significantparameters, generally have smaller discrimination cri-teria than the three-parameters models (after elimi-nating the trend from the original data), suggests thesuperiority of the former over the latter, at least forthe Bet Dagan field.

INFERRED EXPECTATIONS, KRIGEDVALUES, VARIANCES, AND VARIANCES OF

ESTIMATED VARIANCESGiven the ML estimates of /? and a based on the

measured values vector Y (Table 2), we can now pre-dict (see Appendix III) the value ofE(y), its variance,and the variance of its variance at any arbitrary pointx in the field. It should be remembered that X, andhence £(x) and »j(x, x) depend on a3 in a nonlinearmanner. It is assumed, for simplicity that «3 is deter-

ministic so that X/ are also deterministic variables.Hence, only a{ (including ft) and a2=a2

y are random.The variance-co variance matrix of a and /8 needed forthe inference procedure (Eq. [All] and [A 12] is takenfrom the inverse of the Fisher information matrix (seeTheil, 1971, Chapter 8), which gives values of al, Sjand the associated covariances.

In the estimation procedure we divided the entirefield domain into a rectangular grid containing K =441 nodal points distributed regularly. Subsequently,at each of the TV locations of coordinate x,, the valuesof X/x) are determined by solving the linear system ofEq. [14] with C(x}, xk) taken from Eq. [22] to [25].Then the predictor E(y(x)^), the estimated varianceof the predicted error a2

E, and the estimate of its var-iance, were computed by Eq. [Al 1], [A12], and [A13],respectively, at each of the A!" nodal points of the grid.Results, using all covariance models, were obtainedfor all the 24 random functions (see Bresler and Lau-fer, 1986), but to economize on space only a few rep-resentative numerical values are presented in Tables4 and 5.

Effect of the Covariance ModelThe contour lines' maps of the estimates of the pre-

dictor of E[y(x)], its variance or error, and the vari-ances of the predictor error, using all the covariancemodels and for the 24 random functions were givenelsewhere (Bresler and Laufer, 1986). Small part of thenumerical values that were used to construct thesecontour maps are demonstrated in Table 4 for onesoil property (Ks) and one crop component (dry matterof wheat). It can be seen in these examples that thespecific covariance model used has a negligible smalleffect on the estimated results of both variances andexpectation. In addition, these models do not affectthe quality of the expectation estimates to serve asinterpolators of the observation points and not theuncertainty (or error) involved in the interpolation andits uncertainty which is related to the conditional sec-ond-order statistics.

Comparison with Kriging MethodThe analogy between the kriging method (Del-

homme, 1 978) and the estimated CMVN method (firstintroduced by Feinerman et al., 1986) may be attainedfrom the following. Since /(x) = y(x) — a,(x) hasstationary increments, one may use the variogramwhich in cases of bounded variance (a2) is given by

r(x,, XH) = ff2 - C(x,, xn) . [30]

Substituting Eq. [30] into [14] yields for X,

= T(x, k = 1, 2, . . . , N [31]For a finite value of [a2[\ — 2jLi X/x)]} and for a verylarge variance of y (i.e., a2

y —> °°), then

H - ;1 -» 0. or, x,. = 1 . [32]j-\ j-\Hence, the conditional expected value E\y(x) Y] as ob-

Page 9: Statistical Inferences of Soil Properties and Crop Yields as Spatial Random Functions

1242 SOIL SCI. SOC. AM. J., VOL. 52, 1988

Table 4. Numerical examples of the estimated expectation (Eq.[All]), estimated variance (Eq. [A12]), and variance of estimatedvariance (Eq. [A13J of two random functions (Ks and dry mat-ter of wheat) in nine locations of coordinate (xj, X2) as inferredusing the four covariance models and kriging.

Expectation(Eq. [All])

*2,m

22

60

105

Model

III

IIIIV

KrigI

IIIIIIV

KrigI

IIIIIIV

Krig

16

19.219.418.918.819.413.614.113.713.414.110.310.210.310.310.3

44

19.919.919.819.819.912.312.112.112.212.15.55.65.35.45.6

76

14.013.913.813.813.99.08.88.99.28.84.54.85.36.34.8

Variance(Eq. [A12])

16

Ks3.95.84.62.35.85.26.35.53.76.32.03.42.40.53.4

Wheat dry22

60

105

III

IIIIV

KrigI

IIIIIIV

KrigI

IIIIIIV

Krig

4.834.784.854.834.785.945.855.875.515.855.905.875.925.925.87

4.864.904.804.994.905.915.885.955.925.886.236.216.256.286.21

5.805.935.805.785.926.196.206.196.216.206.666.696.676.706.69

0.370.410.370.400.400.620.620.640.670.610.830.790.820.750.66

Xi, m

44

2.64.33.20.84.32.14.13.00.74.16.07.26.65.17.0

matter0.430.480.430.560.480.490.550.540.640.550.800.760.800.740.65

76

9.410.19.8

10.28.70.91.41.00.31.48.9

10.69.9

10.58.9

0.810.740.800.740.650.810.770.800.740.660.920.890.910.820.66

Variance of est.var. (Eq. [A13])

16

1.33.11.90.5-

2.24.22.61.4-

0.31.20.50.1

••

0.0070.0080.0060.007

0.0170.0170.0180.017-

0.0210.0190.0210.017

44

0.61.90.90.1-

0.41.80.80.1

-•2.65.12.52.5-

0.0090.0110.0090.013-

0.0120.0140.0140.016-

0.0210.0190.0210.017

-

76

4.87.45.87.7-

0.10.20.10.1-

4.47.65.67.5-

0.0210.0190.0210.017-

0.0210.0190.0210.017-

0.0210.0190.0210.017-

tained by kriging to spatial random functions of un-bounded variance (a2

y —> °°), with a well denned var-iogram, and of stationary increments, is identical tothat of Eq. [9], whereas the conditional covariance inEq. [16] is replaced in kriging by

C(Xl,

x,,) . [33]

Note that in the case of a bounded variance, the re-sults based on the CMVN probability may differsomewhat from those which are based on kriging (Ta-ble 5). Furthermore, since the parameters ab a2, and«3 are estimates and are not known with certainty,then the kriging approach may give different estimatesof the conditional moments Eq. [9] and [33] than thosebased on the CMVN probability approach Eq. [All]and [16]. Note that for better accounting of the un-certainty in the inference of the conditional moments,the estimated parameter «3 should have been consid-ered as random rather than deterministic. This wouldhave resulted in larger differences between the krigingand the MVNC approaches, but would largely com-plicate the calculations of the stochastic, rather thanthe deterministic vector X.

Table 5. Numerical examples of kriged values, estimated expec-tation (Eq. [All]), kriging variance, estimated variance (Eq.[A12]), of wheat grain in 48 spatial points of coordinates (Xi,X2). The exponential covariance model (II) was used for thecalculations.

X2 ,m Model 4 16 28

X,,

36

m

44 56 68 76

Expectation Eq. [All]5

22

50

60

77

105

IIKrig

IIKrigII

KrigII

KrigII

KrigII

Krig

1.631.561.601.541.871.941.881.931.901.911.981.98

1.781.761.821.812.022.122.002.051.992.022.062.06

1.911.971.971.972.082.08-2.102.152.091.982.152.15

Variance5

22

50

60

77

105

IIKrigII

KrigII

KrigII

KrigII

KrigII

Krig

0.0880.0770.0750.0480.0860.0780.0870.0790.0890.0790.1000.084

0.0800.0610.0740.0450.0800.0680.0830.0740.0800.0640.0960.084

0.0790.0600.0790.0660.0790.0690.0800.0710.0730.0410.0930.083

1.982.072.012.022.112.102.172.242.232.362.222.21

1.951.872.011.942.132.112.152.152.232.332.282.27

2.052.012.152.222.222.232.122.002.242.172.372.37

2.142.012.282.352.322.352.322.312.382.372.472.48

2.242.222.322.362.382.412.402.412.452.462.532.54

Eq. [A12]

0.0810.0670.0800.0700.0770.0570.0750.0470.0700.0410.0930.083

0.0710.0190.0760.0540.0780.0600.0780.0650.0760.0560.0930.082

0.0740.0360.0730.0330.0830.0750.0750.0400.0760.0500.0960.083

0.0760.0230.0850.0730.0890.0820.0890.0820.0880.0700.1010.084

0.0930.0770.0930.0810.0940.0840.0940.0840.0960.0820.1060.085

For the comparison between the CMVN method ofstochastic interpolation and the traditional krigingmethod (Journel and Huijbregts, 1978), the same K= 441 grid points were used. The variograms wereassumed to have the analytical exponential form ofEq. [23] substituted into Eq. [30] with the unknownparameters a being estimated by the same ML pro-cedure (Table 2). Results of the contour maps of krigedvalues and the kriging variances of all the 24 data sets(Table 1) were superimposed on the CMVN resultsand were presented in Fig. 6 and 7 of the BARD report(Bresler and Laufer, 1986). The maps showed that most(18 out of 24) of the estimated predictors obtained bythe kriging method were identical to the results of theCMVN procedure. In six cases the results differed butwere close, as is seen in the case demonstrated in Ta-ble 5, in which the expectation contours deviate themost from kriged values. The two methods yield alsoclose, but not necessarily identical, predication vari-ances (see Tables 4 and 5). Generally, the estimatedvariances of the two methods differ in areas more re-mote from the measured points, and also the krigingvariances are smaller than those based on the CMVNdistribution.

It should be noted that the estimated variance ofestimation of the variance (Eq. [A 13], Table 4), cannotbe obtained by the classical kriging approach, becausekriging variance is considered to be deterministic. Thedata illustrated in the last three columns of Table 4may serve as a measure of reliability of the data ofestimated variance results. The variance data may alsoserve as a guide for selecting additional observationpoints in future experiments (see Russo and Bresler,

Page 10: Statistical Inferences of Soil Properties and Crop Yields as Spatial Random Functions

BRESLER & LAUFER: STATISTICAL INFERENCES OF SOIL PROPERTIES AND CROP YIELDS 1243

1982). The estimates a2 and a\2 appear to be a signif-icant factor in economic optimization of the com-mercial yield F(Feinerman et al., 1985), which in turnmay depend, in a linear or nonlinear manner, on thesoil properties, and both are spatial random functions.

SUMMARYIn the present paper the multivariate normal meth-

odology for statistical inference of spatial randomfunctions has been applied to inferred expectations,variances and variances of the inferred variances ofcertain soil properties and crop yield components ina Bet Dagan field. Inferences of the spatial distributionof the statistical moments of the properties, as normalrandom spatial functions, are based on a finite set ofmeasured values. In this inference method the uncer-tainty of estimation of the parameters characterizingthe multivariate normal pdf is incorporated in a sys-tematic manner using classical statistics theory in theprocess of inference, prediction, and conditioning onthe actual measured data. The method has been ap-plied to 12 soil properties (Ks, e, /3, In 0, T, hm 8S, Bnpercent clay, percent sand, percent silt, and 5), foursoil variables (SAR, EC, average 6n and 6W\ and twocomponents of four crops (vetch for hay, wheat, corn,and peanuts). The procedure has been applied to casesin which a linear stochastic trend is presented.

ACKNOWLEDGMENTThis study was supported by Grant US-420-81 from

BARD, the United States-Israel Binational Agricultural Re-search and Development Fund.

APPENDIX/. Estimation of the Conditional Covariance (Eg. [12])

The estimate of the conditional covariance (Eq. [12]) isobtained as follows. Add to and subtract from the estimateC(x,, x,,|Y) = E[[yi - E(yM][yu - E(yn^)\]\Y\ [Al]the expectations E(y{Y) and E(yn\Y) and obtain:C(x,, x,,|Y) = E{[y{ - E(y,\Y)

[A2]Rearrange [A2]C(x,, x,,(Y) = E{[(yt -

E{(A E{(H D + F

» - E(yu |Y]; .

[A3]where

BH = \y, -D = [£(y,,|Y) -£(y,,|Y)] = K = CONSTANTF = [£(y,|Y) - £(j>,|Y)] = K = CONSTANT

Using [9], [1 1] and [12] and because K E[H] + KE[G] =0 + 0 = 0, then,C(x,, = E{(A E{(H D + F G)|Y}

[A4]

II. Estimations of the Parameters a and ft (Eg. [20] to [25])by the Maximum Likelihood (ML) Method

The problem is to select the parameter vectors a and ffsuch that the negative log of the MVN Eq. [1] is minimized.Hence, the problem is to minimize L with the best fitted aand /?, where

L = A/2 ln(2ir) + 0.5 [In \Q + b Y'] [A5]C is the (symmetric) covariance matrix C(r,7) of one of themodels (Eq. [20]-[25]) with r,y = |x, — xj, and Y' the resid-uals and b are vectors defined by

[A6][A7]

Y' = Y - £(Y) = Y - a,b = Y' rC-'.

The minimizing procedure is carried out with the aid ofthe computer code MINOS (Saunders, 1980) as follows:

1. Input: (i) the first guess of the parameters a and /? andtheir uppermost and lowest possible values (first guesses ofP0, 0!, and /32 are obtained from a linear regression analysisof Eq. [21]; first guesses of «2, a}, and «4 are obtained froma preliminary calculation of the variograms); (ii) an indexfor the selected covariance (variogram) model from Eq. [20]to [25]; (iii) number of the data points (the sample size N);(iv) N values of the vector Y and its respective coordinates

2. Calculate: (i)r/, = |x, — xj; (ii) C from the selected model(one from I through VIII); (iii) the determinant |C|; (iv) thevector Y' from [A6] using a, of Eq. [20] for the nondriftcases or of [21] for the drifted cases; (v) the vector b aftercalculating the transpose T of Y' and the inverse of C.

3. Calculate the partial derivatives: (i) for each element ofthe vector /8

[A8]

[A9]

= band (ii) for each element of the vector adL/da = 0.5 Tr [C~l (dC/det)]

- 0.5 b(dC/<3a)(C-')Y'where Tr is the trace (i.e., sum of diagonal terms). Equations[A8] and [A9] transfer scalar values to MINOS in which thebest fitted a and /? are estimated.

4. Using the estimated values of a and ft calculate the Mjkelements of the Fisher Information matrix (Theil, 1971) fromMJk = 0.5 Tr[C-l(dC/dOj)C-l(dC/dOk)

+ (dcuWpC-^daiWk)where 0,, 6k are the best fitted (estimated) parameters a and0.

5. Calculate the estimated variances of the estimated ex-pectations of the parameters a and 0 (i.e., al), and the as-sociated covariances, from the inverse of the Fisher infor-mation matrix.

6. Calculate the Students' t statistics from t, — 6/0,--.1. Print all 6 (i.e., «,, a3, a4, j30, 181, 0->, L, Mu, Q), and all

, andIII. CalculationsThe computation procedure to predict the expectation,

the variance, and the variance of the predicted variance (ofeach soil property and crop component and at any point xin the field domain) involves several steps, as follows:

1. Select the "best" covariance (variogram) model fromone of the models of Table 2 (Eq. [20] through [25]).

2. Input measurements values Y = y,-, j = 1 , 2, . . . , N,the respective space coordinates x,, values of a = J30, /?,, |92,

Page 11: Statistical Inferences of Soil Properties and Crop Yields as Spatial Random Functions

1244 SOIL SCI. SOC. AM. J., VOL. 52, 1988

a2, a3, a4 from Table 2 and values of£«„, */>„ • • - , ̂ , C(P0, &), C(p0, /?,) C(fr, P2) from the var-iance-cpvariance matrix of the ML output (see Appendix II).

3. Divide the field into Nx X Ny = K grid points, whereNx and Ny are number of grid points in the x = xt and y= x2 directions, respectively.

4. Calculate the lag \tj = |x, — xj, ij = 1, 2 , . . . , N, andthe lag rkm = \xk - \m\, k, m = 1, 2, . . . , K, and computethe covariance matrix C(r) from the model selected fromTable 3 and Eq. [22] to [25]), using the best-fitted and sig-nificant parameters a (Table 2).

5. Calculate X,- from Eq. [14] or [15], |(x) from Eq. [11]and Ti(xh xn) from Eq. [13]. Note that since, in general, asignificant linear drift has been inferred (Table 2), then«i(Y)£(x) in Eq. [9] is given by

«,(Y)|(x) = £ ft £,(x); [A10]i-O

where,N

£o(x) = 1 - I X,-;1=1

N N

£,(x) = x, - £x,- 4 £2(x) - x2 - X Xj x{7=1 7=1

where xb x2 are the horizontal space coordinates, and /3, arethe ML estimates for a,(Y) (Eq. [21]).

6. Calculate%(x)|Y] = 3o£(x) + 3,l,(x)

A1

+ tex) + S\<x)^ [AH]7-1

from Eq. [9] and [A 10].7. Calculate

*KX) = 527,(X) + ^2(x) + £2,£2(x) + J2^2(x)

+ 2Ca^^(x)^(x) + IQ^Wtfx) + 2Q2a,|,(x)^(x)

[A12]from Eq. [18] but after replacing the nondrifted term<rl^2(x) by the relevant drifted one.

8. Calculate from Eq. [19]£co«,x) = £;2 = £i>?2(x) . [A13]

The variance of estimated covariance, or variance (Eq. [A13])is related exclusively to the uncertainty of the estimation of

2Note'that in Eq. [All], [A12], and [A13] x, = x,, = x,and that the estimated parameters a3 and a4 are assumed tobe deterministic (nonstochastic) parameters, so that the aboveare appropriate versions. If the significant nugget is sto-chastic such that

C(x,, x,,) = 0:4 5UI + a2 P(\I,XU) [A 14](where {M, is the Kronecker delta) then a2 jj(x, x) in Eq. [12]and [16] are replaced by

N

a2 + «4 - «2 Z X7 P(X> x>) • [A1517-1

Note also that in case of a significant nugget, the values ofoi4 and of C,iatri(x, x) should be added to Eq. [A 12]. It is

assumed, however, that all aji,, Q,.̂ ., /', j = 0, . . . , 2,and a«3 (a} being the scale) are deterministic.

9. Output: K values of Efy(\)\\)], alfx), and CT|X,X)(X)and the associated field coordinates.


Recommended