+ All Categories
Home > Documents > Sampling and sampling strategies for environmental analysis(RSD) are within 25% for...

Sampling and sampling strategies for environmental analysis(RSD) are within 25% for...

Date post: 12-Mar-2020
Category:
Upload: others
View: 3 times
Download: 0 times
Share this document with a friend
13
Intern. J. Environ. Anal. Chem. Vol. 92, No. 4, 15 April 2012, 466–478 Sampling and sampling strategies for environmental analysis J. Zhang a and C. Zhang b * a College of Environmental & Resource Science, Zhejiang University, Hangzhou 310028, China; b Department of Environmental Sciences, University of Houston-Clear Lake, Houston, Texas 77058, USA (Received 11 December 2010; final version received 8 April 2011) Sampling errors are generally believed to dominate the errors of analytical measurement during the entire environmental data acquisition process. Unfortunately, environmental sampling errors are hardly quantified and docu- mented even though analytical errors are frequently yet improperly reported to the third decimal point in environmental analysis. There is a significant discrepancy in directly applying traditional sampling theories (such as those developed for the binary particle systems) to trace levels of contaminants in complex environmental matrices with various spatial and temporal heterogene- ities. The purpose of this critical review is to address several key issues in the development of an optimal sampling strategy with a primary goal of sample representativeness while minimizing the total number of samples and sampling frequencies, hence the cost for sampling and analysis. Several biased and statistically based sampling approaches commonly employed in environmental sampling (e.g. judgmental sampling and haphazard sampling vs. statistically based approaches such as simple random, systematic random, and stratified random sampling) are examined with respect to their pros and cons for the acquisition of scientifically reliable and legally defensible data. The effects of sample size, sample frequency and the use of compositing are addressed to illustrate the strategies for a cost reduction as well as an improved representa- tiveness of sampling from spatially and temporally varied environmental systems. The discussions are accompanied with some recent advances and examples in the formulation of sampling strategies for the chemical or biological analysis of air, surface water, drinking water, groundwater, soil, and hazardous waste sites. Keywords: sampling strategy; environmental sampling; sampling frequency; sampling design 1. Introduction Environmental sampling is an integral part of the overall environmental data acquisition processes that include, but are not limited to, sample collection, preservation, preparation, instrumental analysis, field and laboratory quality assurance/quality control (QA/QC), and data assessment. As the first step in the chain, its importance in the overall data quality cannot be overstated [1–3]. It is self explanatory that if a sample is collected improperly, then all our subsequent careful lab work is useless. A poorly collected and *Corresponding author. Email: [email protected] ISSN 0306–7319 print/ISSN 1029–0397 online ß 2012 Taylor & Francis http://dx.doi.org/10.1080/03067319.2011.581371 http://www.tandfonline.com Downloaded by [University of New Hampshire] at 18:57 24 March 2012
Transcript
Page 1: Sampling and sampling strategies for environmental analysis(RSD) are within 25% for physical/chemical and 50% for biological entities, whereas sampling errors owing to natural variation

Intern. J. Environ. Anal. Chem.Vol. 92, No. 4, 15 April 2012, 466–478

Sampling and sampling strategies for environmental analysis

J. Zhanga and C. Zhangb*

aCollege of Environmental & Resource Science, Zhejiang University, Hangzhou 310028, China;bDepartment of Environmental Sciences, University of Houston-Clear Lake,

Houston, Texas 77058, USA

(Received 11 December 2010; final version received 8 April 2011)

Sampling errors are generally believed to dominate the errors of analyticalmeasurement during the entire environmental data acquisition process.Unfortunately, environmental sampling errors are hardly quantified and docu-mented even though analytical errors are frequently yet improperly reported tothe third decimal point in environmental analysis. There is a significantdiscrepancy in directly applying traditional sampling theories (such as thosedeveloped for the binary particle systems) to trace levels of contaminants incomplex environmental matrices with various spatial and temporal heterogene-ities. The purpose of this critical review is to address several key issues in thedevelopment of an optimal sampling strategy with a primary goal of samplerepresentativeness while minimizing the total number of samples and samplingfrequencies, hence the cost for sampling and analysis. Several biased andstatistically based sampling approaches commonly employed in environmentalsampling (e.g. judgmental sampling and haphazard sampling vs. statisticallybased approaches such as simple random, systematic random, and stratifiedrandom sampling) are examined with respect to their pros and cons for theacquisition of scientifically reliable and legally defensible data. The effects ofsample size, sample frequency and the use of compositing are addressed toillustrate the strategies for a cost reduction as well as an improved representa-tiveness of sampling from spatially and temporally varied environmental systems.The discussions are accompanied with some recent advances and examples in theformulation of sampling strategies for the chemical or biological analysis of air,surface water, drinking water, groundwater, soil, and hazardous waste sites.

Keywords: sampling strategy; environmental sampling; sampling frequency;sampling design

1. Introduction

Environmental sampling is an integral part of the overall environmental data acquisitionprocesses that include, but are not limited to, sample collection, preservation, preparation,instrumental analysis, field and laboratory quality assurance/quality control (QA/QC),and data assessment. As the first step in the chain, its importance in the overall dataquality cannot be overstated [1–3]. It is self explanatory that if a sample is collectedimproperly, then all our subsequent careful lab work is useless. A poorly collected and

*Corresponding author. Email: [email protected]

ISSN 0306–7319 print/ISSN 1029–0397 online

� 2012 Taylor & Francis

http://dx.doi.org/10.1080/03067319.2011.581371

http://www.tandfonline.com

Dow

nloa

ded

by [

Uni

vers

ity o

f N

ew H

amps

hire

] at

18:

57 2

4 M

arch

201

2

Page 2: Sampling and sampling strategies for environmental analysis(RSD) are within 25% for physical/chemical and 50% for biological entities, whereas sampling errors owing to natural variation

unrepresentative sample will by no means generate any meaningful and reliable data.

Rather, it may provide misleading information owing to its defect quality.Despite its utmost importance, sampling is a subject analogous to the weather in a

sense that much discussion has been made over the years but very little has been done

about developing more scientific methods of sampling [4,5]. In an editorial note published

over a half century ago, Murphy [4] pointed out the lack of scientific work on sampling in

general, including sampling in metallurgy, pharmaceutical, food, petroleum industries as

well as sampling in industrial hygiene and environmental science. In the field of

environmental sampling, for example, only a few organized efforts were noted thus far,

including the published work as a result of symposiums by the American Chemical

Society (ACS) Committee on Environmental Improvement [6–9]. Research work on

environmental sampling is very limited, as is evident from the number of papers in two

primary ACS journals (Environmental Science and Technology and Analytical Chemistry)

(Figure 1). A search in these two journals revealed that sampling papers account for far

less than 1% of the total papers published, although there seems to be an overall increased

number of papers published in recent years. In a recent study, Ort et al. [10] reported that

only 11% of the 87 surveyed papers on pharmaceutical and personal care products

(PPCPs) in wastewater systems provided justifications for sampling compared to 99%

on analysis.Some general misconceptions probably attribute to the lack of attention paid to

environmental sampling as well as sampling in other industries. Oftentimes one may

incorrectly assume that the quality of data is primarily determined by the nature of

analytical measurement methods and accordingly the expensive state-of-the-art instru-

mentation rather than by sampling and sample preparation techniques [11]. Therefore, it is

not uncommon to see that analytical results are reported to the third decimal point [4],

whereas sampling uncertainties and associated errors are hardly mentioned. To many less

Figure 1. Total numbers of ‘sampling’ papers per year in two primary American Chemical Society(ACS) journals: Environmental Science and Technology and Analytical Chemistry.

International Journal of Environmental Analytical Chemistry 467

Dow

nloa

ded

by [

Uni

vers

ity o

f N

ew H

amps

hire

] at

18:

57 2

4 M

arch

201

2

Page 3: Sampling and sampling strategies for environmental analysis(RSD) are within 25% for physical/chemical and 50% for biological entities, whereas sampling errors owing to natural variation

skilled field crews, sampling can just be perceived as simple as the haphazard collection ofgrab samples, rather than a complicated science that depends on solid statistics and priorknowledge about the variability of the population being sampled.

It is apparent that the science of sampling is an enormous subject that deserves amultidisciplinary effort [5]. This paper will primarily address some important samplingstrategies that are often overlooked in the analytical community who places more focus onlaboratory sample preparation and instrumental analysis. It is beyond the scope of thisreview to cover many other important sampling issues such as new sampling devices (e.g.passive sampling), sample preservation, preparation, and sampling QA/QC [12–19]. Wewill first examine the errors of sampling relative to the errors of laboratory analysis, whichis followed by the introduction of several sampling approaches commonly employed inenvironmental sampling as well as recent advances to formulate an optimal samplingstrategy. Several key issues regarding, for example, the use of stratified random sampling,compositing, the effects of sample size and sampling frequency will be addressed withexamples of sampling and sampling strategies applicable for chemical or biologicalanalysis of air, water, soil, and hazardous waste sites.

2. Sampling errors versus analytical errors

The total error (St) of environmental analysis is the sum of two independent sources oferrors, i.e. sampling error (Ss) and analytical error (Sa) [20]:

S2t ¼ S2

s þ S2a ð1Þ

Both sampling errors and analytical errors in Equation (1) are expressed in terms of theabsolute values of variance or the relative % of the total variance. Note that errorsexpressed otherwise such as relative standard deviation (RSD) are not directly additive asshown in Equation (1). Although it is generally believed that sampling errors dominate theerrors of analytical measurement (90% or higher, [21]), direct experimental measurementsof error contributions from sampling compared to analysis have only been made availablein a very few well-designed studies [22–24]. Lame and Defize [22] determined samplingerrors and analysis errors to be in the ranges of 91.0–98.6% and 0.3–5.5%, respectively,depending on the soil sample size in the range of 0.1 to 6,500 g for the determination ofcyanide contents in soil samples. In measuring oil concentrations in Boston Inner Harborsurface water samples, Ahmed et al. [23] obtained similar results – sampling errors owingto variability of the number of oil-sorbed particles and the weight per particle to be 96%whereas analytical errors to be only 4%. Sampling errors for handling particulatematerials were further divided into two sources, i.e. the fundamental error owing tovariations of chemical compositions among different soil particles (clays, sands, etc.), andthe segregation error owing to variations of locations (lots). The fundamental error waskept minimal by increasing sample size to410 g, whereas segregation error was reduced bymixing soil samples from multiple locations (i.e. compositing). In an extreme case with aheterogeneous distribution of soil contaminants such as explosives compounds present in anugget form in soils, increasing subsample size from 2 to 50 g did not reduce samplingerrors [25].

Mar et al. [26] pointed out that in watershed monitoring, typical measurement errors(RSD) are within 25% for physical/chemical and 50% for biological entities, whereassampling errors owing to natural variation are in the range of 100–400% of observed mean

468 J. Zhang and C. Zhang

Dow

nloa

ded

by [

Uni

vers

ity o

f N

ew H

amps

hire

] at

18:

57 2

4 M

arch

201

2

Page 4: Sampling and sampling strategies for environmental analysis(RSD) are within 25% for physical/chemical and 50% for biological entities, whereas sampling errors owing to natural variation

values for physical, chemical, and biological characteristics. Figure 2 depicts how the totalerrors are correlated with sampling errors and analysis errors based on 1000 iterations ofthe Monte Carlo simulations. It is evident from this simulation that given the predominantnature of sampling errors, the total error is significantly correlated to the sampling errors(R2¼ 0.97) whereas the effect of analytical errors on the total errors is not significant

(R2¼ 0.04). This result has important implications in that a further reduction in analytical

uncertainty (even it is possible) is probably of little importance to cost-effectively improvethe overall data quality [27–29]. In fact, it would be more cost-effective to employ aless precise in-situ measurement technique (such as portable X-ray fluorescence formetal analysis), rather than the more traditional but time-consuming ex-situ instrument(such as acid digestion followed by the analysis using inductively coupled plasmaspectroscopy) [28,30].

Unlike analytical errors which can be easily quantified by spiking and standardaddition, sampling errors are difficult to quantify, because it is impossible to spike theanalyte in the entire sampling population (e.g. a lake). This is a primary reason whyanalytical errors are frequently (yet improperly) reported to the third decimal point, andwhy sampling errors are rarely documented in environmental analyses. Such a discrepancyalso attributes to the lack of a theoretical basis in quantifying sampling errors owing tovarious types of spatial and temporal heterogeneities. Gy’s sampling theories [31,32]addressed sampling errors related to granular materials for geological and metallurgicalapplications. According to Gy, the types of sampling errors (sources) can be grouped intoseven categories, including fundamental error (inherent sample characteristics such asparticle sizes and compositions), segregation error (sampling lots, particle density inducedstratifications), long-range heterogeneity error (spatial variations), periodic heterogeneityerror (temporal and spatial fluctuations), delimitation error (inappropriate samplingdesign and the wrong choice of equipment), extraction error (failure of sampling procedure

Figure 2. Monte Carlo simulation of total error vs. analytical error and sampling error (the resultsas shown are based on a total of 1000 iterations assuming analytical errors and sampling error are inthe range of 0–50% and 100–400%, respectively).

International Journal of Environmental Analytical Chemistry 469

Dow

nloa

ded

by [

Uni

vers

ity o

f N

ew H

amps

hire

] at

18:

57 2

4 M

arch

201

2

Page 5: Sampling and sampling strategies for environmental analysis(RSD) are within 25% for physical/chemical and 50% for biological entities, whereas sampling errors owing to natural variation

to precisely extract the intended increment), and separation error (loss, contamination,and alteration of a sample or subsample). However, such sampling theories developed forparticulate systems cannot be directly applied to complex environmental matrices for tracechemicals. In assessing temporal uncertainties of monitoring trace levels of PPCPs inwastewater, Ort et al. [33] were able to determine errors associated with different samplingmodes and frequencies (continuous vs. discrete) using model simulated data against resultsexperimentally measured at a 5-min sampling interval. Their results indicated thatsampling errors owing to relatively long sampling intervals and inadequate samplingmodes can lead to over-interpretation of measured data and ultimately wrong conclusions,This Monte Carlo simulation approach was also used to compare different samplingstrategies for estimating tributary loads [34]. Model simulations suggested that flowstratified sampling provided the best estimate of tributary load among all samplingstrategies examined. It should be noted that the magnitudes of sampling errors for tracecontaminants cannot be readily quantified unless, for example, in-situ real-time monitor-ing data can be acquired and then data are retrospectively assessed with the aid ofmodelling approaches [10].

The general strategies to reduce sampling errors are either through replicate samplesfor analysis or through an increased sample size (volume or weight) [23,28]. The bestsample size is certainly to use all the samples after homogenization. Practically, this isimpossible for environmental sampling. In determining petroleum in Boston Inner Harborwater samples, Ahmed et al. [23] showed that increasing sample size (e.g. 1 L to 1 gallon) isnot worthwhile, whereas keeping a sample size of 1 L but having four replicates shows animproved precision. Their study indicated that increasing the number of replicates (n)improved overall precision by a factor of n1/2 over that of a single analysis. In manycircumstances, however, increasing both replicates and sample volume is apparently notpreferred. For example, an increased number of replicate analyses may be cost-prohibitivefor certain difficult-to-analyse trace contaminants. An aliquot extracted from an excessiveamount (volume) of a sample will make subsequent sample clean-up harder owing to thematrix effect. In the following sections, we confine our discussions to the development ofan optimal and practical sampling strategy that considers primarily sample representa-tiveness while minimizing the total number of samples hence the total cost of sampling andanalysis.

3. Biased and unbiased sampling approaches

Among a variety of sampling strategies, judgmental sampling and haphazard sampling areboth non-probability based. Judgmental sampling is the subjective selection of samplinglocations based on information on the sampling site, visual inspection, and personalknowledge and experience. While judgmental sampling can reduce sampling number, it isbiased (if done improperly) and does not support any statistical inference. In studying theeutrophic status of a large number of lakes (over 11,000 lakes with size� 1 ha) inthe northeastern US, Peterson et al. [35] revealed that judgmental sampling biased towardthe subjective selection of larger lakes which resulted in an underestimation of eutrophicstatus (i.e., greater depths of Secchi disk transparency of the lake water). Haphazardsampling claims ‘any sampling location will do’ and hence it encourages taking samplesconsciously at convenient locations or times for spatial or temporal sampling, respectively[1,28]. Mistakenly employed by many as a random approach, haphazard sampling is also

470 J. Zhang and C. Zhang

Dow

nloa

ded

by [

Uni

vers

ity o

f N

ew H

amps

hire

] at

18:

57 2

4 M

arch

201

2

Page 6: Sampling and sampling strategies for environmental analysis(RSD) are within 25% for physical/chemical and 50% for biological entities, whereas sampling errors owing to natural variation

biased and legally not defensible. The integration of prior knowledge into a randomsampling procedure is possible through a statistically-based double sampling approachsuch as ranked set and weighted double sampling [36]. Both approaches employ a frugalmeasurement (e.g. visual observation of biological habitants) which is correlated to a morecostly but accurate measurement (e.g. measuring tape) in a subsequent sampling stage. Inranked set sampling, a frugal measurement is used to rank a subset of samples followed bythe actual measurement of one sample from each set until all sample sets are completed. Inweighted double sampling, prior information from a frugal measurement is used tocategorize samples into groups (strata), and the overall mean is calculated from theweighted average of all strata.

Figure 3 shows three common probability-based sampling designs for two-dimensionalspatial sampling as well as one-dimensional temporal sampling [1,37]. With theassumption of insignificant variability within the sampling medium, simple randomsampling is applicable mostly for relatively homogeneous populations. It will allow forstatistical verification of clean-up, but typically result in more samples which is not as cost-effective as other sampling designs in large-scale sampling campaigns [38]. This drawbackcan be overcome by a systematic sampling design either through systematic grid orsystematic random approaches. In mapping spatial patterns of environmental pollution,systematic sampling approach can be used to delineate spatial patterns such as airborneand soil contaminants within a geographical region [e.g. 39, 40]. In monitoring temporalvariations, systematic random sampling gave a better representativeness for atmosphericPCBs with diurnal variations corresponding to daily temperature changes [41]. The gridsizes (e.g. 2 km�2 km in space, 24 hr in time) vary depending on how well the resolution ofa particular spatial or temporal pattern need to be defined. The spatial grid shape istypically square, but in hazardous waste site spatial sampling, Parkhurst [42] illustratedthat triangular grids can give more spatial coverage and result in 23% fewer groundwatersampling wells than required with square grids.

Stratified random sampling is an extensively used sampling approach with a promise ofvarious environmental applications. This sampling approach can significantly reducestandard deviation (hence improved sampling precision) as compared with otherapproaches particularly if the strata are quite different from one another. Scott [43]developed a computerized stratified random site-selection approach to delineate spatialpattern of groundwater quality in relation to land uses and hydrogeological settlings.

Figure 3. Three common probability-based sampling designs for sampling in (a) two-dimensionalspace domain, and (b) one-dimensional time domain: 1. simple random sampling; 2. stratifiedrandom sampling (three strata); 3. systematic grid sampling; 4. systematic random sampling.

International Journal of Environmental Analytical Chemistry 471

Dow

nloa

ded

by [

Uni

vers

ity o

f N

ew H

amps

hire

] at

18:

57 2

4 M

arch

201

2

Page 7: Sampling and sampling strategies for environmental analysis(RSD) are within 25% for physical/chemical and 50% for biological entities, whereas sampling errors owing to natural variation

The reduction of sample number and cost is achieved by collecting more samples inrelatively more heterogeneous strata, or fewer samples in strata where sampling costs arerelatively higher than other strata (e.g. saturated vs. vadoze zone). As shown in Figure 4,the types of strata in environmental analysis vary depending on the differences ingeographical, hydrological, meteorological, demographical, chemical, or biologicalfeatures. For example, six sampling strata in the combinations of three contaminantcategories (heavily oiled, moderately oiled, and not oiled) and two beach lengths (5100m,4100m) were chosen to estimate the total contaminated area and mass remaining inPrince William Sound, Alaska after the Exxon Valdez oil spill [44]. In a subsequent studyon the vertical distribution of shoreline oil residues, six other strata in the combination ofthree regions (Herring Bay, Lower Pass, and Bay of Isles) and two contaminant categories(heavily and moderately oiled) were designated [45]. Another well-designed sampling studyworthy of note was conducted by El-Shamy for the estimation of the population of fishimpinged on the intake screens of power plants [46]. Stratified random sampling usingoptimum allocation, i.e. sampling most intensively during months of peak fish abundanceand variation, resulted in a significantly improved precision as compared to a routinemethod with evenly distributed sampling events of every four days throughout the year(RSD 4% vs. 32% for a representative fish species). It should be noted that, regardless ofvarious uses of stratified random sampling reported to date, sample representativeness andsampling precision are seldom justified in the majority of literature.

4. Reducing sample number and sampling frequency

The preceding section presents ‘where’ and ‘when’ to take samples, an equally importantcomponent of sampling strategy is to estimate ‘how many’ samples and ‘how frequent’ ofthe sampling events. In a broad sense, a sampling frequency also implies a correspondingsample number with respect to temporal sampling – both target at a minimal samplingeffort for the best statistical representation of the population. Unfortunately, there is no

(location; land uses; political boundaries; natural boundaries (roads, rivers))

(absence/presence of species; growth period)

(race; ethnicity; gender; age)

(saturated / unsaturated zones; downgradient / upgradient; soil type, soil depth)

(downwind/upwind)

Types of Strata

Hydrogeological Meteorological

Geographical

Chemical

Biological

Demographical

(diurnal/seasonal variation; density difference)

Figure 4. Stratified random sampling: example types of strata in environmental applications.

472 J. Zhang and C. Zhang

Dow

nloa

ded

by [

Uni

vers

ity o

f N

ew H

amps

hire

] at

18:

57 2

4 M

arch

201

2

Page 8: Sampling and sampling strategies for environmental analysis(RSD) are within 25% for physical/chemical and 50% for biological entities, whereas sampling errors owing to natural variation

universal guideline to achieve such a goal without considering site-specific information.

Determining a minimally required sample number and sample frequency is often a

challenging task for any environmental project. The key to the determination of sample

number or sample frequency is to gather any available data on the spatial and temporal

variabilities of the population for which samples are to be collected. The variabilities in

turn depend on the properties of the contaminants and more importantly the types of

environmental matrices, such as groundwater, surface water (flowing vs. stagnant),

atmosphere, soils, or domestic sewage effluents. An example of such a spatial variability

is when certain contaminants (e.g. explosives compound) exist in the ‘nugget’ form

whereas a soil sample collected a few centimeters away may be totally depleted of this

compound [25].The minimal sample number to be representative of a population at a specified

tolerable error limit can be determined as follows:

n ¼S2t2

e2ð2Þ

where s, t, and e are standard deviation, Student’s t value, and allowable error,

respectively. Equation (2) clearly defines the dependence of sample number on variance

(s2), and the inverse relationship between sample number and the square of the tolerable

error level. Stated in another way, the required sample number (n) is directly proportional

to the square of the ratio of standard deviation to the tolerable error (s/e). As the ratio of

s/e and confidence level increase at the high end, the required sample number increases at a

more significant rate (Figure 5). Equation (2) can also be written with a standard normal

variate Z if the number of samples or replicates is larger [47]. Additionally, in regulatory

compliance monitoring, the allowable error (e) can be substituted by the deviation of

contaminant level from its regulatory threshold (RT- �x), where RT is a regulatory

threshold (e.g., emission standard) and �x is the estimate of mean (actual concentration of

Figure 5. Required number of samples as a function of the ratio of standard error to tolerable error(s/e) at three confidence levels (80%, 90%, and 95%).

International Journal of Environmental Analytical Chemistry 473

Dow

nloa

ded

by [

Uni

vers

ity o

f N

ew H

amps

hire

] at

18:

57 2

4 M

arch

201

2

Page 9: Sampling and sampling strategies for environmental analysis(RSD) are within 25% for physical/chemical and 50% for biological entities, whereas sampling errors owing to natural variation

the contaminant of interest). This implies more samples are required when the estimatedsample mean approaches the regulatory limit. Conversely, fewer samples are needed at thesame confidence level if contaminant concentrations are far below or above the regulatorylimit.

Since the t-value depends on sample number and confidence level, a trial-and-errorcalculation is needed to estimate n [1,6]. Equation (2) also indicates that an optimalsampling design is at the expense of a prior knowledge regarding the natural variability.This becomes challenging because preliminary data on spatial or temporal variabilities aretypically not available. For example, a dataset for a period of two years was considered tobe essential to determine an optimal groundwater sampling frequency [48]. Nevertheless,this need may be justified for long-term monitoring programs with a large number ofmonitoring stations and costly analysis, such as the case in groundwater monitoring inSuperfund remediation and post closure of landfill sites. Efforts have been made to reducethe cost by reducing the sampling frequency based on site-specific scenarios, such as thecost-effective sampling (CES) algorithm considering trend, variability, magnitude statis-tics, periodicity, and autocorrelation [49,50]. The underlying principle for CES algorithmis that sampling frequency should be determined primarily by the rate of change inconcentrations observed in the recent past. Monthly and quarterly sampling frequencyrepresents a good initial choice for the network design of surface water and groundwatermonitoring, respectively [48,51]; however, the adequacy should always be re-evaluated ona site-to-site basis. For example, ground water sampling frequency of the existingmonitoring scheme can be downgraded (e.g. semiannual to annual, or quarterly tosemiannual) when contaminant concentration changes in groundwater wells become lessinsignificant overt time [52].

It should be noted, however, that the efforts to achieve optimal sampling should beexercised with cautions as problems may arise when sampling becomes too infrequent oreven too frequent. For example, the estimates of annual mean herbicide concentrations inMidwestern rivers owing to ‘spring flush’ during spring and early summer runoff eventswere underestimated by quarterly sampling scheme as required by the US EPA formunicipalities using surface water as a source of drinking water [53]. A similar problemwas noted recently by Ort et al. [33] in sampling PPCPs from sewage treatment plants.Their work indicated that an extremely short 5-min interval was needed to enable thepositive detection of the peak concentrations of trace PPCPs as a result of a single toiletflush in a small community. An important implication of this finding is that results reliedon a single ‘lucky’ grab sample may provide misleading information particularly for tracelevels of emerging environmental contaminants such as PPCPs and hormonal substances.Similarly in atmospheric monitoring of particulate matter, current regulatory requirementof a low sampling frequency at one day every six days may have caused misclassifying anonattainment area as being in compliance [54]. In contrast to infrequent sampling, astrategic oversampling (termed ‘sampling out’) was also noted in some drinking watertreatment systems where oversampling of drinking water was intentionally employed toavoid violation of the Total Coliform rule proscribed by the US EPA [55]. This samplingout issue should be avoided from both technical and policy standpoints. The strategicoversampling is different from a conservative sampling campaign commonly seen in themonitoring of industrial effluents, where processes are being monitoring more closely tojustify unnecessary potential emission charges from a regulatory agency.

There is an alternative strategy to mitigate the high cost associated with the analysis ofa large number of environmental samples for the measurement of multiple parameters.

474 J. Zhang and C. Zhang

Dow

nloa

ded

by [

Uni

vers

ity o

f N

ew H

amps

hire

] at

18:

57 2

4 M

arch

201

2

Page 10: Sampling and sampling strategies for environmental analysis(RSD) are within 25% for physical/chemical and 50% for biological entities, whereas sampling errors owing to natural variation

Compositing is such an appealing sampling strategy to reduce sample numbers whensampling cost is low relative to analytical cost and the primary interest is to estimate themean at the expense of variations [1,56,57]. Compositing can be achieved by physicalgrinding of solid samples or mixing with volume-, time-, or flow-proportion mechanismsfor liquid samples. These mixing devices have been made available commercially forwastewater sample collection, and storm water collections when manual collection isdifficult because of a short storm runoff duration [58]. There are two basic environmentalapplications using composite sampling strategy. The binary classification is to detect thepresence or absence of certain chemicals/biological indicators in the environment orbiological tissues [59], such is the case in determining whether a Superfund site has beencleaned up or not, or the detection of certain pathogens in a drinking water supply. Themixing approach is a more widely used compositing approach for the determination of anaverage concentration of chemicals excluding volatile organic compounds (VOCs) sincesamples for VOCs cannot be mixed.

Figure 6 is a simplified schematic showing the cost-savings of using compositesampling if the analytical cost is assumed to be $200 per sample. In the case of binaryclassification, a reduction of sample number from 10 to 7 samples results in a 30% cost-saving. Here we also assume a 10% prevalence rate (detection rate) for a biological orchemical parameter. In Figure 6(b), where composting of every 5 samples is employed toestimate the mean of 10 original samples, an even greater 80% cost-saving can be achieved.The success of composting approach depends on several factors. The modelling work doneby Johnson and Patil [59] indicated that the binary approach is cost-effective only forcontaminants with a low prevalence (detection) rate of 510%. The cost-effectiveness ofcomposting sampling is also strongly dependent on the cost of sampling (Cs) relative to thecost of analysis (Ca). The cost-effectiveness improves from Ca/Cs ¼ 1 to 10, and the changein cost-effectiveness becomes negligible if the cost of analysis further increases beyond thisrange [59]. In a related study, Rohlf et al. [56] provided mathematical algorithms toestimate sample size and the means to optimizing composite sampling protocols. Thesealgorithms were used to find the optimum sampling protocol that stays within a fixed

Figure 6. Composite sampling to reduce sample numbers: (a) to confirm clean up or not (binaryclassification), (b) to estimate the mean. Symbol � denotes absence of contaminants (belowdetection limit);

Ndenotes presence of contaminants (above detection limit). Five samples are

composited for both (a) and (b). In 6(a), if the composite is tested positive, each of the five samples isthen analyzed. The total number of samples analyzed is seven. In 6(b), only two composited samplesare analyzed from initial 10 samples.

International Journal of Environmental Analytical Chemistry 475

Dow

nloa

ded

by [

Uni

vers

ity o

f N

ew H

amps

hire

] at

18:

57 2

4 M

arch

201

2

Page 11: Sampling and sampling strategies for environmental analysis(RSD) are within 25% for physical/chemical and 50% for biological entities, whereas sampling errors owing to natural variation

budget, or to find the least costly sampling protocol that is still able to reliably detect aspecified difference in means. Such algorithms are useful for determining compositesampling protocols when dealing with animal tissue samples, and other media such aswater and sediment.

5. Concluding remarks

Some relevant guidelines for environmental sampling have existed for over a century [60],yet there is still an apparent need to raise the awareness of the importance of samplingduring the entire environmental data acquisition process. It is crucial for us to recognizethat all our careful lab work is wasted if a sample is not collected properly. Beginningprofessionals including college students in the environmental and analytical chemistryfields (and many other professions) should be trained to comprehend ‘there is no analysiswithout sampling’ [61] philosophy. Several hands-on training materials have beenrecommended for educational use in environmental and chemistry curriculum [1, 62–67].Analytical chemists should pay more attention to the precision-limiting step of ‘samplingerrors’ not just the state-of-the-art instruments. It is also crucial that all relevant researchpapers should present a clear justification for essential sampling details to document thesample representativeness and/or sampling errors.

In designing an optimal site-specific sampling strategy with regard to samplinglocation, time, frequency and sample number, one need to thoroughly examine the sourcesof temporal and spatial variability [57]. Several statistically-based sampling approachescapable of achieving cost-effectiveness of environmental sampling, such as stratifiedrandom sampling, systematic sampling, and compositing sampling are currently stillunderutilized. In compliance monitoring, it is also likely that the optimal sampling strategymay not agree with what is required by the current regulatory agency. These issues shouldbe studied on a case-by-case basis since a universal sampling protocol is nonexistent.Sampling is complicated field of science, and much research is needed to revitalize itthrough some fundamental interdisciplinary and coordinated efforts among practitioner,researchers, regulators, and governmental agencies. Future research is needed to advancethis field toward the development of a more scientifically sound theoretical framework aswell as practical protocols for various environmental sampling scenarios.

References

[1] C. Zhang, Fundamentals of Environmental Sampling and Analysis (John Wiley & Sons, New York,

2007).[2] J.P. Maney, Environ. Sci. Technol. 36, 383A (2002).

[3] O.U. Anders, Anal. Chem. 49, 33A (1977).[4] W.J. Murphy, Anal. Chem. 26, 441 (1954).[5] T.R. Dulski, A Manual for the Chemical Analysis of Metals, ASTM International (1996).

[6] ACS Committee on Environmental Improvement, Anal. Chem. 52, 2242 (1980).[7] L.H. Keith, Environ. Sci. Technol. 24, 610 (1990).[8] G.E. Schweitzer and J.A. Santolucito, Environmental Sampling for Hazardous Wastes, ACS

Symposium Series 267, American Chemical Society (1984).[9] H.K. Lawrence, Environmental Sampling and Analysis: A Practical Guide (Lewis Publishers,

New York, 1991).

476 J. Zhang and C. Zhang

Dow

nloa

ded

by [

Uni

vers

ity o

f N

ew H

amps

hire

] at

18:

57 2

4 M

arch

201

2

Page 12: Sampling and sampling strategies for environmental analysis(RSD) are within 25% for physical/chemical and 50% for biological entities, whereas sampling errors owing to natural variation

[10] C. Ort, M.G. Lawrence, J. Rieckermann, and A. Joss, Environ. Sci. Technol. 44, 6024 (2010).[11] R.E. Clement, Anal. Chem. 64, 1076A (1992).[12] Y. Chen and J. Pawliszyn, Anal. Chem. 75, 2004 (2003).[13] G. Quyang, Y. Chen, and J. Pawliszyn, Anal. Chem. 77, 7319 (2005).

[14] L. Augulyte and P.-A. Bergqvist, Environ. Sci. Technol. 41, 5044 (2007).[15] P. Mayer, J. Tolls, J.L.M. Hermens, and D. Mackay, Environ. Sci. Technol. 37, 185A

(2003).[16] I.J. Allan, K. Booij, A. Paschke, B. Vrana, G.A. Mills, and R. Greenwood, Environ. Sci.

Technol. 43, 5383 (2009).[17] M. Shaw and J.F. Mueller, Environ. Sci. Technol. 43, 1443 (2009).[18] R. Lohmann and D. Muir, Environ. Sci. Technol. 44, 860 (2010).

[19] S.J. Hayward, T. Gouin, and F. Wania, Environ. Sci. Technol. 44, 3410 (2010).[20] B. Kratochvil, Fresenius J. Anal. Chem. 337, 808 (1990).[21] D.M. Crumbling, C. Groenjes, B. Lesnik, K. Lynch, J. Shockley, J.V. Ee, R. Howe, L. Keith,

and J. McKenna, Environ. Sci. Technol. 35, 440A (2001).[22] F.P.J. Lame and P.R. Defize, Environ. Sci. Technol. 27, 2035 (1993).

[23] S.M. Ahmed, M.D. Beasley, A.C. Efromston, and R.A. Hites, Anal. Chem. 46, 1858 (1974).[24] T.B. Whitaker, Pure Appl. Chem. 49, 1709 (1977).[25] M.E. Walsh, C.A. Ramsey, and T.F. Jenkins, Chemosphere 49, 1267 (2002).

[26] B.W. Mar, R.R. Horner, J.S. Richey, R.N. Palmer, and D.P. Lettenmaier, Environ. Sci.

Technol. 20, 545 (1986).

[27] B. Kratochvil, D. Wallace, and J.K. Taylor, Anal. Chem. 56, 113R (1984).[28] B. Kratochvil and J.K. Taylor, Anal. Chem. 53, 924A (1981).[29] O.U. Anders and J.I. Kim, J. Radioanal. Chem. 39, 435 (1977).[30] P.D. Taylor, M.H. Ramsey, and P.J. Potts, Environ. Sci. Technol. 38, 6824 (2004).

[31] P. Gy, Sampling for Analytical Purposes (Wiley, New York, 1998).[32] F.F. Pitard, Pierre Gy’s Sampling Theory and Sampling Practice, 2nd ed. (CRC Press, Inc., Boca

Raton, FL, 1993).[33] C. Ort, M.G. Lawrence, J. Reungoat, and J.F. Mueller, Environ. Sci. Technol. 44, 6289 (2010).[34] R.P. Richards and J. Holloway, Water Resources Res. 23, 1939 (1987).

[35] S.A. Peterson, N.S. Urouhart, and E.B. Welch, Environ. Sci. Technol. 33, 1559 (1999).[36] N.A. Mode, L.L. Conquest, and D.A. Marker, Environmetrics 13, 513 (2002).[37] R.O. Gilbert, Statistical Methods for Environmental Pollution Monitoring (van Nostrand

Reinhold, New York, 1987).[38] U.S. EPA, Superfund Program Representative Sampling Guidance, volumes 1-5 (1995).

[39] P.R. Sheppard, R.J. Speakman, C. Farris, and M.L. Witten, Environ. Sci. Technol. 41, 406

(2007).

[40] Y. Z. Wu, J. Chen, J.F. Ji, Q.J. Tian, and X.M. Wu, Environ. Sci. Technol. 39, 873 (2005).[41] J.C. Wallace and R.A. Hites, Environ. Sci. Technol. 30, 444 (1996).[42] D.F. Parkhurst, Environ. Sci. Technol. 18, 521 (1984).

[43] J.C. Scott, U.S. Geological Survey Water-Resources Investigations Report 90-4101 (1990).[44] J.W. Short, M.R. Lindeberg, P.M. Harris, J.M. Maselko, J.J. Pella, and S.D. Rice, Environ. Sci.

Technol. 38, 19 (2004).[45] J.W. Short, J.M. Maselko, M.R. Lindeberg, P.M. Harris, and S.D. Rice, Environ. Sci. Technol.

40, 3723 (2006).[46] F.M. El-Shamy, Environ. Sci. Technol. 13, 315 (1979).[47] G.E. Schweitzer and S.C. Black, Environ. Sci. Technol. 19, 1026 (1985).[48] U.S. EPA, Sampling Frequency for Ground-Water Quality Monitoring, EPA 600 (1989).

[49] M. Ridley and D. MacQueen, presented at the American Society of Civil Engineers, Anchorage,

AK, United States (2005).

[50] A. Szucs and G. Jordan, Water Sci. Technol. 30, 73 (1994).[51] J.C. Loftis and R.C. Ward, Environ. Manag. 4, 521 (1980).

International Journal of Environmental Analytical Chemistry 477

Dow

nloa

ded

by [

Uni

vers

ity o

f N

ew H

amps

hire

] at

18:

57 2

4 M

arch

201

2

Page 13: Sampling and sampling strategies for environmental analysis(RSD) are within 25% for physical/chemical and 50% for biological entities, whereas sampling errors owing to natural variation

[52] V.M. Johnson, R.C. Tuckfield, M.N. Ridley, and R.A. Anderson, Environ. Sci. Technol. 30,355 (1996).

[53] W. Battaglin and L.E. Hay, Environ. Sci. Technol. 30, 889 (1996).[54] J.E. Davidson and P.K. Hopke, Environ. Sci. Technol. 18, 571 (1984).

[55] L.S. Bennear, K.K. Jessoe, and S.M. Olmstead, Environ. Sci. Technol. 43, 5176 (2009).[56] F.J. Rohlf, H.R. Akcakaya, and S.P. Ferraro, Environ. Sci. Technol. 30, 2899 (1996).[57] R.L. Whitman and M.B. Nevers, Environ. Sci. Technol. 38, 4241 (2004).

[58] R.D. Harmel, K.W. King, and R.M. Slade, Appl. Engr. Agri. 19, 667 (2003).[59] G.D. Johnson and G.P. Patil, Environ. Eco. Stat. 8, 91 (2001).[60] L.S. Clesceri, A.E. Greenberg, and A.D. Eaton, Standard Methods for the Examination of Water

and Wastewater, 21st Edition (American Public Health Association, Washington, DC, 2005).[61] R.W. Murray, Anal. Chem. 69, 269A (1997).[62] B. Kratochvil, R.S. Reid, and W.E. Harris, J. Chem. Edu. 57, 518 (1980).

[63] C.F. Bauer, J. Chem. Edu. 62, 253 (1985).[64] R.D. Guy, L. Ramaley, and P.D. Wentzell, J. Chem. Edu. 75, 1028 (1998).[65] J.E. Vitt and R.C. Engstrom, J. Chem. Edu. 76, 99 (1999).[66] M.R. Ross, D.W. Bacon, and W.C. Wolsey, J. Chem. Edu. 77, 1015 (2000).

[67] B.A. Logue and S.L. Youso, J. Chem. Edu. 87, 316 (2010).

478 J. Zhang and C. Zhang

Dow

nloa

ded

by [

Uni

vers

ity o

f N

ew H

amps

hire

] at

18:

57 2

4 M

arch

201

2


Recommended