+ All Categories
Home > Documents > soil_sampling.pdf

soil_sampling.pdf

Date post: 03-Apr-2018
Category:
Upload: gauchex697355
View: 216 times
Download: 0 times
Share this document with a friend

of 50

Transcript
  • 7/29/2019 soil_sampling.pdf

    1/50

    Sampling for soil survey

    D G RossiterDepartment of Earth Systems Analysis

    International Institute for Geo-information Science & Earth Observation (ITC)

    December 28, 2008

    Copyright 2008 ITC.

    All rights reserved. Reproduction and dissemination of the work as a whole (not parts) freely permitted if thisoriginal copyright notice is included. Sale or placement on a web site where payment must be made to access this

    document is strictly prohibited.

    To adapt or translate please contact the author (http://www.itc.nl/personal/rossiter).

    http://www.itc.nl/personal/rossiterhttp://www.itc.nl/personal/rossiter
  • 7/29/2019 soil_sampling.pdf

    2/50

    Soil Sampling 1

    Topic: Sampling for soil survey

    1. Sampling in routine survey

    2. Sampling for detailed survey

    3. Sampling for detailed survey, with prior information

    4. Sampling for environmental correlation

    D G Rossiter

  • 7/29/2019 soil_sampling.pdf

    3/50

    Soil Sampling 2

    1 Soil sampling in routine soil survey

    Routine soil survey follows the Discrete Model of Spatial Variability (DMSV):

    homogeneous soil bodies mapped as polygons

    conceptually-sharp boundaries

    So the main aim of sampling is to characterize the soils in each map unit, i.e.

    legend category

    set of polygons with the same soil type

    D G Rossiter

  • 7/29/2019 soil_sampling.pdf

    4/50

    Soil Sampling 3

    An area-class map

    State Soil Geographic Database, Schuyler County, NY (USA)

    D G Rossiter

  • 7/29/2019 soil_sampling.pdf

    5/50

    Soil Sampling 4

    Sample is a very small proportion of the population

    A soil pit is about 1x2 m surface area; a typical soil borehole (auger hole is

    10 cm diameter, so 0.00157 m2

    So in 1 ha there are 10000/2 = 5000 potential pit sites, or10000/(0.052 ) 1273240 potential bore hole sites!

    Sampling density is usually specified as one field observation per 1 4 cm2 of

    map (regardless of map scale)

    Example: at 1:25 000, 1cm2m = 250 mg 250 mg = 62 500 m2g = 6.25 ha

    So, one observation per 6.2525 ha

    This is a tiny sampling fraction!

    How can we make a map with such a low sampling density?

    D G Rossiter

  • 7/29/2019 soil_sampling.pdf

    6/50

    Soil Sampling 5

    Representative sampling

    Solution: the surveyor uses expert opinion of the soil-landscape model:

    Soils occur in specific positions because of the specific combination ofsoil-forming factors (Jenny equation)

    So, place observations in the most representative (typical, modal, central

    concept) sites, where the soil class is expected to be best-expressed

    * Some observations nearby to get an idea of heterogeneity

    * Maybe some quick observations (not full samples) near boundaries to

    improve their location

    D G Rossiter

  • 7/29/2019 soil_sampling.pdf

    7/50

    Soil Sampling 6

    Block diagram of a soil landscape

    Wysocki, D. A., Schoeneberger, P. J., & LaGarry, H. E. (2005). Soil surveys: a window to the subsurface. Geoderma,

    126(1-2), 167-180

    D G Rossiter

  • 7/29/2019 soil_sampling.pdf

    8/50

    Soil Sampling 7

    Some landscapes to sample

    Dorchester, England (GB)

    D G Rossiter

  • 7/29/2019 soil_sampling.pdf

    9/50

    Soil Sampling 8

    Truxton, Cortland County, NY (USA)

    D G Rossiter

  • 7/29/2019 soil_sampling.pdf

    10/50

    Soil Sampling 9

    Herikhuizerveld, Rheden (NL)

    D G Rossiter

  • 7/29/2019 soil_sampling.pdf

    11/50

    Soil Sampling 10

    Sampling for associations

    In smaller-scale maps (depending on landscape, from 1:50 000 down) we usually

    expect more than one soil type in each map unit.

    The map units are usually associations of related soils (e.g. hillslope catena).

    Then the surveyor observes at the central concept of each component.

    The proportion of components is estimated by landscape analysis.

    D G Rossiter

  • 7/29/2019 soil_sampling.pdf

    12/50

    Soil Sampling 11

    2 Soil sampling in detailed soil survey

    These are usually grid samples, to completely cover an area of interest.

    Example: an area of suspected soil pollution.

    The grid is then interpolated into a raster map, usually by kriging.

    Two-step sampling:

    1. For modelling the variogram

    2. For kriging, once the variogram is known

    Note: the success of kriging depends on a correct variogram model!

    Note: the variogram may be known from similar studies

    D G Rossiter

  • 7/29/2019 soil_sampling.pdf

    13/50

    Soil Sampling 12

    Sampling to model spatial dependence

    Must have several separations to estimate structure

    Especially important are some closely-separated observations, to estimatenugget

    Can use a transect with variable spacing or a 2-D scheme (random directions,

    fixed separations in a hierarchy)

    Webster, R., Welham, S. J., Potts, J. M., & Oliver, M. A. (2006). Estimating the

    spatial scales of regionalized variables by nested sampling, hierarchical analysis

    of variance and residual maximum likelihood. Computers & Geosciences, 32(9),

    1320-1333.

    Lark, R. M. (2002). Optimized spatial sampling of soil for estimation of the

    variogram by maximum likelihood. Geoderma, 105(1-2), 49-80.

    D G Rossiter

  • 7/29/2019 soil_sampling.pdf

    14/50

    Soil Sampling 13

    What sample size to fit a variogram model?

    Stochastic simulation from an assumed random field with a known variogram

    suggests:

    1. < 50 points: not at all reliable

    2. 100 to 150 points: more or less acceptable

    3. > 250 points: almost certaintly reliable

    More points are needed to estimate an anisotropic variogram.

    This is very worrying for many environmental datasets (soil cores, vegetation

    plots, . . . ) especially from short-term fieldwork, where sample sizes of 40 60

    are typical. Should variograms even be attempted on such small samples?

    D G Rossiter

  • 7/29/2019 soil_sampling.pdf

    15/50

    Soil Sampling 14

    How to design the nested sample

    Widest spacing s1 is the station, which are assumed so far away from each

    other as to be spatially independent

    * furthest expected dependence . . .

    * . . . based on the landscape . . .

    * . . . and expected range of process to be modelled

    Closest spacing sn is the shortest distance whose dependence we want to know

    D G Rossiter

  • 7/29/2019 soil_sampling.pdf

    16/50

    Soil Sampling 15

    Geometric series

    A geometric series increases terms by multiplication

    It allows us to cover a wide range of distances (possible ranges) with a fewstages.

    Increase spacing in geometric series:

    s = s1 sn

    Fill in series with further geometric means

    D G Rossiter

  • 7/29/2019 soil_sampling.pdf

    17/50

    Soil Sampling 16

    Geometric series: example

    First series: s1 = 600m (stations), s5 = 6m (closest)

    Intermediate spacing: s3 = 6m 600m = 60m

    Series now {600m, 60m, 6m}

    Fill in with the geometric means

    * s2 = 600m 60m 190m* s4 =

    60m 6m 19m

    Final series {600m, 190m, 60m, 19m, 6m}

    D G Rossiter

  • 7/29/2019 soil_sampling.pdf

    18/50

    Soil Sampling 17

    Locating the sample points

    Objective: cover the landscape, while avoiding systematic or periodic features

    Method: random bearings from centres at each stage

    Stations can be along a transect if desired (no spatial dependence)

    From a centre at stage i (Ei, Ni), to find a point (Ei+1, Ni+1) at the next spacingsi+

    1:

    * = random uniform[0 . . . 2 ]* Ei+1 = Ei + (si+1 sin )* Ni+1 = Ni + (si+1 cos )

    D G Rossiter

  • 7/29/2019 soil_sampling.pdf

    19/50

    Soil Sampling 18

    Number of sample points

    Number of stations selected to cover the area of interest

    At each stage Si, the next stage Si+1 has in principle double the samples

    One is for all the previous centres from stage S1 . . . S i1 and one is for the newcentre from stage Si

    So the total number doubles: half old, half new centres

    D G Rossiter

    l l

  • 7/29/2019 soil_sampling.pdf

    20/50

    Soil Sampling 19

    Unbalanced sampling

    After the first 4 stages, use an unbalanced design

    Only half the centres at Si (i 4) are further sampled at Si+1

    This still covers the area, but only uses half the samples at the shortest ranges

    Number of pairs is still enough estimate short-range dependence

    D G Rossiter

    S il S li 20

  • 7/29/2019 soil_sampling.pdf

    21/50

    Soil Sampling 20

    Number of sample points: example

    Five stages {600m, 190m, 60m, 19m, 6m}

    Nine stations: n1 = 9

    Double at stages 2 . . . 4: n2 = 18, n3 = 36, n4 = 72

    At stage 5, only use half the 72 centres, i.e. 36

    Total at stage 5: 72+ 36 = 108 (would have been 144 with balanced sampling)

    D G Rossiter

    S il S li 21

  • 7/29/2019 soil_sampling.pdf

    22/50

    Soil Sampling 21

    Nested ANOVA : Partition Variability by sampling level

    Linear model:

    zijk...m = +Ai + Bij + Cijk + + Qijk...m + ijk...m

    Link with regional variable theory (semivariances): m stages; d1 shortest

    distance at mth stage; dm largest distance at first stage

    2m = (d1)2m1 + 2m = (d2)

    ...

    21 + . . .+ 2m = (dm)

    F-test from ANOVA table; for stage m+ 1 : F= MSm/MSm+1

    D G Rossiter

    S il S li 22

  • 7/29/2019 soil_sampling.pdf

    23/50

    Soil Sampling 22

    Nested ANOVA : Interpretation

    There is spatial dependence from the closest spacing until the F-ratio is not

    significant.

    Samples from this distance are independent

    To take advantage of spatial interpolation, must sample closer than this

    Can estimate how much of the variation is accounted for at each spacing

    D G Rossiter

    Soil Sampling 23

  • 7/29/2019 soil_sampling.pdf

    24/50

    Soil Sampling 23

    Grid sampling for kriging

    This assumes the Continuous Model of Spatial Variaility (CMSV).

    So the soil property is modelled as a random field and the map is made by

    kriging.

    D G Rossiter

    Soil Sampling 24

  • 7/29/2019 soil_sampling.pdf

    25/50

    Soil Sampling 24

    Kriging prediction Kriging prediction variance

    Note: Prediction variance depends only on the spatial configuration of the

    observations, not on the data value.

    D G Rossiter

    Soil Sampling 25

  • 7/29/2019 soil_sampling.pdf

    26/50

    Soil Sampling 25

    Sampling designs with the CMSV: objectives

    1. Maximize information

    Cover the largest possible area at minimum cost

    Minimize some optimization criterion

    2. Minimize costs

    3. (Incorporate any existing sample see next subtopic)

    D G Rossiter

    Soil Sampling 26

  • 7/29/2019 soil_sampling.pdf

    27/50

    Soil Sampling 26

    What is to be optimized?

    An optimization criterion is some numerical measure of the quality of the

    sampling design. Some possibilities:

    1. Minimize the maximum kriging variance in the area: nowhere is more poorly

    predicted than this maximum

    2. Minimize the average kriging variance over the entire area

    D G Rossiter

    Soil Sampling 27

  • 7/29/2019 soil_sampling.pdf

    28/50

    Soil Sampling 27

    Optimal point configuration (CMSV)

    In a square area to be mapped, given a fixed number of points that can be

    sampled, in the case of bounded spatial dependence:

    Points should in on some regular pattern; otherwise some points duplicate

    information at others (in kriging, will share weights)

    Optimal (for both the minimal maximum and minimal average criteria):

    equilateral triangles (If the triangle is 12, max. distance to a point

    =

    7/4 0.661) Sub-optimal but close: square grid (max. distance =

    2/2 0.707)

    * Grid should be slightly perturbed so samples do not line up exactly; avoids

    unexpected periodic effects

    (Problems: edge effects in small areas; irregular areas.)

    D G Rossiter

    Soil Sampling 28

  • 7/29/2019 soil_sampling.pdf

    29/50

    Soil Sampling 28

    Optimal point configuration in the presence of anisotropy

    Optimal designs are easily adjusted for anisotropy (different range of spatial

    dependence in two orthogonal axes)

    The regular grid may be adapted for affine or geometric anisotropy: stretch it inthe direction of maximum dependence, based on the anisotropy ratio.

    E.g. for a ratio of0.5, squares become rectangles, with the distance in the

    direction with the longest range twice that of the shortest range.

    D G Rossiter

    Soil Sampling 29

  • 7/29/2019 soil_sampling.pdf

    30/50

    Soil Sampling 29

    Computing an optimal grid size

    Reference: McBratney, A. B. & Webster, R. (1981) The design of optimal

    sampling schemes for local estimation and mapping of regionalized variables -

    I and II. Computers and Geosciences, 7(4), 331-334 and 335-365; also inWebster & Oliver.

    Key point: In kriging, the estimation error is based only on the sample

    configuration and the chosen model of spatial dependence, not the actual

    data values

    So, if we know the spatial structure (variogram model), we can compute the

    maximum or average kriging variances before sampling, i.e. before we know

    any data values.

    This is known as OSSFIM from the original articles.

    D G Rossiter

    Soil Sampling 30

  • 7/29/2019 soil_sampling.pdf

    31/50

    Soil Sampling 30

    Error variance

    Recall: The kriging variance at a point is given by:

    2( x0) = bT

    = 2N

    i=1i( xi, x0)

    Ni=1

    Nj=1

    ij( xi, xj)

    This depends only on the sample distribution (what we want to optimise) and

    the spatial structure (modelled by the semivariogram)

    In a block this will be lowered by the within-block variance (B,B)

    D G Rossiter

    Soil Sampling 31

  • 7/29/2019 soil_sampling.pdf

    32/50

    p g

    Reducing kriging error

    Once a regular sampling pattern is decided upon (triangles, rectangles, . . . ), the

    kriging variance is decreased in two ways:

    1. reduce the spacing (finer grid) to reduce semivariances; or

    2. increase the block size of the prediction

    These can be traded off; but usually the largest possible block size is selected,

    based on the mimimum decision area.

    D G Rossiter

    Soil Sampling 32

  • 7/29/2019 soil_sampling.pdf

    33/50

    p g

    Error as a function of increasing grid resolution

    Consider 4 sample points in a square

    To estimate is one prediction point in the middle (furthest from samples highest kriging variance)

    Criterion is minimize the maximum prediction error

    If the variogram is close-range, high nugget, low sill, we need a fine grid to

    take advantage of spatial dependence; high cost

    If the variogram is long-range, low nugget, high sill, a coarse grid will give

    similar results

    D G Rossiter

    Soil Sampling 33

  • 7/29/2019 soil_sampling.pdf

    34/50

    p g

    Kriging variances at centre point

    spacing

    block.size

    20

    40

    60

    80

    100

    120

    100 200 300 400

    0.3

    0.4

    0.5

    0.6

    0.7

    0.8

    0.9

    1

    spacing

    block.size

    20

    40

    60

    80

    100

    120

    100 200 300 400

    0.3

    0.4

    0.5

    0.6

    0.7

    0.8

    0.9

    1

    long range variogram (1200 m) short range variogram (600 m)

    D G Rossiter

    Soil Sampling 34

  • 7/29/2019 soil_sampling.pdf

    35/50

    3 Sampling with prior information

    Problem: how to optimally place a limited number of observations in a study

    area in order to extract the maximum information at minimum cost.

    We consider here the information to be a map over some study area, made byordinary kriging from the sample points; so the assumptions of the CMSV must

    be met.

    Reference:

    van Groenigen, J.-W. (2000). The influence of variogram parameters on optimal

    sampling schemes for mapping by kriging. Geoderma, 97(3-4), 223-236.

    also contained in the PhD thesis:

    van Groenigen, J.-W. Constrained optimisation of spatial sampling Enschede, NL:

    ITC.

    D G Rossiter

    Soil Sampling 35

  • 7/29/2019 soil_sampling.pdf

    36/50

    Problems with the optimal grid

    The optimal grid presented in the previous section is optimal only in restricted

    circumstances. There are many reasons that approach might not apply:

    Edge effects: study area is not infinite

    Irregularly-shaped areas, e.g. a flood plain along a river

    Off-limits or uninteristing areas, e.g. in a soils study: buildings, rock

    outcrops, ditches . . .

    Existing samples, maybe from a preliminary survey; dont duplicate the effort!

    Impossible to compute an optimum analytically (as for the regular grid on an

    infinite plane).

    D G Rossiter

    Soil Sampling 36

  • 7/29/2019 soil_sampling.pdf

    37/50

    Annealing

    Slowly cooling a molten mixture of metals into a stable crystal structure.

    During annealing the temperature is slowly lowered.

    At high temperatures, molecules move around rapidly and long distances

    At low temperatures the system stabilizes.

    Critical factor: speed with which temperature is lowered

    too fast: stabilize in a sub-optimal configuration

    too slow: waste of time

    D G Rossiter

    Soil Sampling 37

  • 7/29/2019 soil_sampling.pdf

    38/50

    Simulated annealing

    This is a numerical analogy to actual annealing:

    Some aspect of a numerical system is perturbed

    The configuration should approach an optimum

    The amount of perturbation is controlled by a temperature

    D G Rossiter

    Soil Sampling 38

  • 7/29/2019 soil_sampling.pdf

    39/50

    Outline of SSA

    1. Decide on an optimality criterion

    2. Place the desired number of sample points anywhere in the study area (grid,random . . . ); compute fitness according to optimality criterion

    3. Repeat (iterate):

    (a) Select a point to move; move it a random distance and direction

    (b) If outside study area, try again

    (c) Compute new fitness

    (d) Ifbetter, accept new plan; if worse also accept with a certain probability

    4. Stop according to some stopping criterion

    D G Rossiter

    Soil Sampling 39

  • 7/29/2019 soil_sampling.pdf

    40/50

    Example of a single step

    Colour ramp is from blue (low kriging variance) to red (high).

    Point at lower right is moved to middle-bottom:

    A large hot area (high kriging variance) is now cooler.

    D G Rossiter

    Soil Sampling 40

  • 7/29/2019 soil_sampling.pdf

    41/50

    Temperature

    The distance to move a point is controlled by the temperature; this is used to

    multiply some distance.

    Tk+1 = Tk (1)

    where k is the step number and < 1 is an empirical factor that reduces the

    temperature; we must also specify an initial temperature T0.

    D G Rossiter

    Soil Sampling 41

  • 7/29/2019 soil_sampling.pdf

    42/50

    Fitness

    Several choices, all based on the kriging variance:

    Mean over the study area (MEAN OK)

    * appropriate when estimating spatial averages to a given precision

    Maximum anywhere in the study area (MAX OK)

    * appropriate when the entire area must be mapped to a given precision, e.g.

    to guarantee there is no health risk in a polluted area.

    D G Rossiter

    Soil Sampling 42

  • 7/29/2019 soil_sampling.pdf

    43/50

    Stopping criterion

    Possiblities:

    fixed number of iterations

    reach a certain (low) temperature

    after a certain number of iterations with no change.

    D G Rossiter

    Soil Sampling 43

  • 7/29/2019 soil_sampling.pdf

    44/50

    Acceptance criterion

    Metropolis criterion: the probability P (S0 S1) of accepting the new scheme is:

    P (S0

    S1)=

    1, if(S1)

    (S0) (2)

    P (S0 S1) = exp(S0)(S1)

    c

    , if(S1) > (S0)

    where S0 is the fitness of the current scheme, S1 is the fitness of the proposed

    new scheme, and c is the temperature. This can also be written:

    p = ef /Tk (3)

    where Tk is the current temperature and f is the change in fitness due to theproposed new scheme.

    Note that this will be positive for a poorer solution, so its complement is used for

    the exponent.

    D G Rossiter

    Soil Sampling 44

  • 7/29/2019 soil_sampling.pdf

    45/50

    A real example

    Industrial area, existing samples; more must be taken to lower the prediction

    variance to a target level everywhere; where to place the new samples?

    Reference: van Groenigen, J. W., Stein, A., & Zuurbier, R. (1997). Optimization of

    environmental sampling using interactive GIS. Soil Technology, 10(2), 83-97D G Rossiter

    Soil Sampling 45

  • 7/29/2019 soil_sampling.pdf

    46/50

    4 Soil sampling for environmental correlation

    We want to make a set of observations of soil properties, from which to build

    regression models from a set of environmental covariables, e.g.

    terrain parameters

    digital imagery

    climate-related layers (elevation, aspect . . . )

    For example, if z is some soil property:

    z = f( zxy

    , CTI, z, . . . )

    So the soil observations must somehow represent this feature-space, as well asgeographic space, efficiently.

    D G Rossiter

    Soil Sampling 46

  • 7/29/2019 soil_sampling.pdf

    47/50

    Regression modelling

    1. Simple (one dominant factor)

    2. Multiple

    3. (Stepwise: automatic selection of predictor set dangerous!)

    4. Standardized principal components: removes multi-colinearity

    (inter-correlated predictors), measurements on different scales

    Generally linear models are used; may linearize some predictors if necessary.

    z = o +n

    i=1

    iqi

    (See standard regression textbooks)

    D G Rossiter

    Soil Sampling 47

    li h

  • 7/29/2019 soil_sampling.pdf

    48/50

    Feature-space sampling schemes

    The aim is to efficiently sample combinations of feature-space predictors.

    But:

    not all combinations are found in nature (e.g. steep slope + high TWI)

    combinations occupy different proportions of the area

    Latin hypercube:

    Minasny, B., & McBratney, A. B. (2006). A conditioned Latin hypercube method

    for sampling in the presence of ancillary information. Computers & Geosciences,

    32(9), 1378-1388.

    D G Rossiter

    Soil Sampling 48

    Mi d li h

  • 7/29/2019 soil_sampling.pdf

    49/50

    Mixed sampling schemes

    Try to optimize sample placement in both feature and geographic space.

    Simulated annealing

    Brus, D. J., & Heuvelink, G. B. M. Optimization of sample patterns for universal

    kriging of environmental variables. Geoderma, 138(1-2), 8695

    D G Rossiter

    Soil Sampling 49

    D i i li l

  • 7/29/2019 soil_sampling.pdf

    50/50

    Designing a sampling plan

    1. Define the study area

    2. Determine the objectives of the sampling

    inferring spatial processes? mapping? decision support?

    3. Define costs (budget) vs. benefits (precision needed)

    4. Decide on any stratification by differential objectives, costs, benefits

    Good luck!

    D G Rossiter