JEQ Grow Your Career With ASA
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Abstract Freely available
Right arrow Figures Only
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Related articles in JEQ
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Citing Articles
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Demougeot-Renard, H.
Right arrow Articles by Renard, P.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Demougeot-Renard, H.
Right arrow Articles by Renard, P.
GeoRef
Right arrow GeoRef Citation
Agricola
Right arrow Articles by Demougeot-Renard, H.
Right arrow Articles by Renard, P.
Related Collections
Right arrow Geostatistics
Right arrow Uncertainty Analysis
Right arrow Data Management
Right arrow Heavy Metals
Right arrow Soil Pollution
Published in J. Environ. Qual. 33:1694-1702 (2004).
© ASA, CSSA, SSSA
677 S. Segoe Rd., Madison, WI 53711 USA

TECHNICAL REPORTS

Heavy Metals in the Environment

Forecasting the Number of Soil Samples Required to Reduce Remediation Cost Uncertainty

Hélène Demougeot-Renarda,*, Chantal de Fouquetb and Philippe Renarda,c

a Eidgenössische Technische Hochschule Zürich, Institut für Raumplanung und Landschaftsentwicklung, Hönggerberg, CH-8093 Zürich, Switzerland (current address: University of Neuchâtel, Centre for Hydrogeology, 11 rue Emile Argand, CH-2007 Neuchâtel, Switzerland)
b Ecole Nationale Supérieure des Mines de Paris, Centre de Géostatistique, 35 rue Saint Honoré, 77305 Fontainebleau, France
c Université de Neuchâtel, Centre d'Hydrogéologie de Neuchâtel, 11 rue Emile Argand, CH-2007 Neuchâtel, Switzerland

* Corresponding author (helene.demougeot{at}unine.ch).

Received for publication October 7, 2003.

    ABSTRACT
 TOP
 NOTES
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 APPLICATION TO A FORMER...
 CONCLUSIONS
 REFERENCES
 
Sampling scheme design is an important step in the management of polluted sites. It largely controls the accuracy of remediation cost estimates. In practice, however, sampling is seldom designed to comply with a given level of remediation cost uncertainty. In this paper, we present a new technique that allows one to estimate of the number of samples that should be taken at a given stage of investigation to reach a forecasted level of accuracy. The uncertainty is expressed both in terms of volume of polluted soil and overall cost of remediation. This technique provides a flexible tool for decision makers to define the amount of investigation worth conducting from an environmental and financial perspective. The technique is based on nonlinear geostatistics (conditional simulations) to estimate the volume of soil that requires remediation and excavation and on a function allowing estimation of the total cost of remediation (including investigations). The geostatistical estimation accounts for support effect, information effect, and sampling errors. The cost calculation includes mainly investigation, excavation, remediation, and transportation. The application of the technique on a former smelting work site (lead pollution) demonstrates how the tool can be used. In this example, the forecasted volumetric uncertainty decreases rapidly for a relatively small number of samples (20–50) and then reaches a plateau (after 100 samples). The uncertainty related to the total remediation cost decreases while the expected total cost increases. Based on these forecasts, we show how a risk-prone decision maker would probably decide to take 50 additional samples while a risk-averse decision maker would take 100 samples.

Abbreviations: Cc, cleanup cost • Ci, investigation cost • Cr, overall remediation cost • Cu, uncertain cost • S, remediation cutoff • Ve, excavated volume • Vlr, low-risk volume • Vc, cleanup volume • Vs, stored volume • Vu, uncertain volume • {alpha}, inferior probability threshold • ß, superior probability threshold


    INTRODUCTION
 TOP
 NOTES
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 APPLICATION TO A FORMER...
 CONCLUSIONS
 REFERENCES
 
MANAGING INDUSTRIAL polluted sites requires an assessment not only of the volume of polluted soil but also of the corresponding uncertainty. Indeed, uncertain estimates result in environmental and financial risks. On one side, polluted soil may stay in place or be discovered unexpectedly, while on the other side, clean soil may be excavated for no reason. The uncertainty results from the small quantity of information available as compared with the complexity of the spatial patterns of pollutant concentration in the soil. Geostatistical techniques allow modeling of the uncertainty based on available data. As a consequence, it is now becoming more and more frequent to use geostatistical techniques to map soil pollution, estimate the volumes exceeding a given concentration threshold, or improve the process of selective remediation (Flatman and Yfantis, 1984; Garcia and Froidevaux, 1996; von Steiger et al., 1996; Hendriks et al., 1996; Demougeot-Renard and de Fouquet, unpublished data, 2003; Saito and Goovaerts, 2003).

The consulting practice shows, however, that high volumetric and cost uncertainties are still common when designing a remediation scheme. One of the reasons is that at the beginning of a contaminated site investigation, sampling is mainly designed to answer the specific questions raised by the risk assessment: evaluation of the toxicological and ecotoxicological properties of pollutants, or investigation of the risk of migration. Sampling is subsequently used to map pollutant concentrations and assess remediation volumes. Sampling in this phase is thus generally not designed for geostatistical interpolations and additional sampling becomes necessary. It may be questioned, therefore, at a given stage of a site investigation, whether it is possible to use all available information to forecast the optimal number of additional samples to sufficiently reduce the volumetric and cost uncertainty.

Because of its high practical interest, optimal sampling procedure is a topic widely studied in the literature. Within the geostatistical framework, a classical issue is the definition of rules to locate the additional samples when the variogram is known. The most usual approach is to locate subsequent samples where the interpolated concentrations are the most uncertain (i.e., where the kriging standard deviation is high) (Burgess et al., 1981). An improved approach is to identify the points where the probability to have a high concentration is above a threshold (Van Tooren and Mosselman, 1996; Van Groeningen et al., 1997). Johnson (1996) follows the same concepts but within a combined geostatistical–Bayesian approach. Barabàs et al. (2001) used indicator kriging within a transformed space (to follow the geometry of an estuarine river) to delineate contaminated areas and to forecast additional sampling needs. One of the most recent tools to optimize sampling locations is the simulated annealing technique, which allows searching for optimal locations while accounting for constraints such as building locations (Van Groeningen et al., 2000). However, it was soon recognized that optimal sampling should also account for economical factors (Srivastava, 1987). Along this line, Englund and Heravi (1993)(1994) proposed a strategy to minimize remediation costs, including sampling costs and misclassification error costs. Okx and Stein (2000) combine statistical decision trees and indicator kriging to assess the economic validity of an additional sampling stage.

All of these methods present one or several drawbacks. Often, the probabilities of exceeding a concentration cutoff are calculated approximatively, on the basis of ordinary kriging (Burgess et al., 1981; Englund and Heravi, 1993, 1994) or residual kriging (Van Tooren and Mosselman, 1996). Some authors use indicator kriging, but in that case they reduce the pollution phenomenon, often described as a continuous spatial variable, to a discrete phenomenon (Johnson, 1996; Van Groeningen et al., 1997; Okx and Stein, 2000; Barabàs et al., 2001). Another issue is that the sampling strategy is based on estimations, but not on forecasts (Van Tooren and Mosselman, 1996; Van Groeningen et al., 1997). An additional data sampling is advised on the basis of probabilities calculated using real data. Probabilities that may be calculated taking into account additional data that are still not sampled are not forecasted. Furthermore, sampling is usually not optimized with respect to economical criteria (Van Tooren and Mosselman, 1996; Johnson, 1996; Van Groeningen et al., 1997). Finally, most of these approaches aim at reducing the environmental and financial risks due to soil misclassification using probability mapping, while the decision makers think instead in terms of overall amounts of soil requiring remediation.

The aim of this paper is to propose a new methodology to forecast the number of samples required to reach an acceptable uncertainty level in terms of overall volumes and costs. The methodology is applied during an iterative sampling campaign. The calculations are based on conditional simulations, which permit taking nonlinearity into account as well as modeling the specific features related to remediation activities. The basic principle of the calculation is to simulate the addition of new samples within the geostatistical model and then to forecast the evolution of the volume, the cost, and their respective uncertainties. The methodology includes (with great care) the possible effects that may bias the estimations (sampling errors, support effect, information effect) and follows in detail the procedure, which is used in the field to excavate a polluted soil. Finally, the method is illustrated on a former smelting work, which polluted the underlying soils with lead.


    MATERIALS AND METHODS
 TOP
 NOTES
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 APPLICATION TO A FORMER...
 CONCLUSIONS
 REFERENCES
 
We assume that preliminary site investigation and risk assessment have shown that part of the soil requires remediation. We also assume that we are in the phase of site investigation, the aim being to estimate the amount of polluted soil and to estimate a remediation cost. Thus, we are not yet in the remediation phase. It is further assumed that soils with pollutant concentrations greater than the remediation cutoff (S) will be treated either on site or ex situ, since nowadays, these are the most frequently applied remediation techniques. Soils will thus be selected and excavated before being sent to the treatment unit (Fig. 1). We consider that volumes and costs are "estimated" when they are calculated using real data. We consider that volumes and costs are "forecasted" when they are calculated using both real data and "simulated" data of an additional "simulated" sampling stage. We use "calculations" when both estimates and forecasts are considered.



View larger version (36K):
[in this window]
[in a new window]
 
Fig. 1. The different steps of a common remediation works (soil selection, excavation, transport, storage, and treatment), and the corresponding soil volumes (Ve, excavated volume; Vs, stored volume; Vc, cleanup volume).

 
Principle
The methodology is integrated into an iterative sampling campaign. After a sampling stage j, the aim is to model the impact of the future resampling (stage j + 1) on the uncertainty. Five main steps are involved in the calculation:

Step 3 to 5 are repeated for a series of increasing numbers of additional simulated data. It allows forecasting of the evolution of uncertainty with the number of samples. The result is then used as a decision aid tool in the design of the next sampling stage. After the sampling stage j + 1, the volume and cost estimates may be different from the forecasts calculated at stage j. Such differences may occur, especially at the beginning of the investigation when few data are available. Estimations at stage j + 1 are, as a matter of fact, based on the available information on the spatial structure at stage j + 1 while forecasts at stage j + 1 are based only on the available information at stage j. The sampling design procedure can then be applied to a supplementary "simulated" sampling stage j + 2. The iterative sampling is stopped when the volumetric and financial uncertainty estimations are acceptable. The different steps of the procedure are detailed in the following sections.

Calculation of Remediation Volumes and Uncertainties Using Geostatistics (Steps 1 and 4)
Both the estimation and forecast of the volumes are obtained using conditional simulations of pollutant concentrations. A simulation T(x) of a random function [Z(x): x Rn] is defined as a realization of Z(x), randomly selected from the set of all possible realizations (Matheron, 1973; Journel, 1974). The term T(x) follows the same spatial distribution, in particular the same expectation, covariance, and histogram as Z(x). It does not minimize the estimation variance, so it is nonbiased, but not an optimal estimator of Z(x). Because T(x) reproduces the spatial distribution of Z(x), the application of a cutoff S to T(x): [T(x) ≥ S] provides a nonbiased estimator of [Z(x) ≥ S]. Moreover, a conditional simulation T(x|n) honors the experimental data at the n sampling points. A diffusive model is selected here because it permits simulation of continuous variables such as pollutant concentrations. Conditioning data are real data for volumetric estimations, while they include both real and "simulated" data for volumetric forecasts. The cumulative distribution function F(S) of Z(x) defines the probability that Z(x) is greater than the cutoff S: F(S) = P[Z(x) > S|n]. This probability can be estimated using a set of K conditional simulations. At each location x, the probability estimate is calculated as the ratio of the number of simulated values Ti (x|n) that exceed S, to the total number K of simulated values:

[1]
where I[Ti(x|n);S) = 1 when Ti(x|n) > S, and I[Ti(x|n);S) = 0 otherwise. In addition, the remediation volumes can only be correctly assessed if the conditions of soil selection for cleanup are modeled. Three effects have to be accounted for:

  1. Change of support. In Europe, the remediation volume is usually estimated using site investigation data measured in boreholes. The investigation samples are thus rather small (order of magnitude: dm3). They can be considered as points, while soils will be excavated for cleanup in blocks (remediation blocks) of large dimensions (order of magnitude: m3). The remediation volume resulting from applying S on a distribution of point pollutant concentrations differs from the volume resulting from applying S on a distribution of block pollutant concentrations (Rivoirard, 1994).
  2. Information effect (Fig. 2). During the remediation stage, blocks of soil are selected for cleanup using pollutant concentrations measured in composite samples. One composite sample is composed of a set of L small samples collected systematically or randomly in each block. However, there is a difference between the pollutant concentration measured in the composite and the true yet unknown block concentration (Rivoirard, 1994). Applying S on the distribution of concentrations measured in composite samples thus provides a different remediation volume estimate from the volume estimate resulting from applying S on the distribution of the true block concentrations.
  3. Sampling and analytical errors. They depend on the laboratory equipment, field equipment, and the experience of the operators, but they always affect real data.



View larger version (16K):
[in this window]
[in a new window]
 
Fig. 2. Information effect: the true block concentration, always unknown, is estimated by the concentration of a composite of samples collected in the block.
 
To account for these soil selection conditions, we performed conditional simulations in the following manner. A series of K point simulations is generated on a fine grid discretizing the blocks of the remediation grid. To account for the information effect, each block-simulated concentration is subsequently estimated as the mean of L simulated point concentrations:

[2]
where Tv is a block simulation, Tp a point simulation, and L the number of point samples used to make the composite sample. To model the presence of sampling and analysis errors, a variable {epsilon}(x) is added as follows:

[3]

The variable {epsilon}(x) is a relative error chosen with a uniform distribution between –1 and +1, and independent of Tv(x|n). The resulting variable HV(x|n) models realistic block-simulated values, which are used to calculate the probabilities of exceeding S in the blocks. At each block, the probability is the ratio of the number of block-simulated values exceeding S to the total number K of simulations. The remediation volumes are calculated using the resulting set of block conditional simulations and block probabilities (Fig. 1 and Fig. 3), accounting for the two phases that usually lead to a refined volumetric estimation:

  1. At the end of the site investigation, the envelope Ve for the remediation volume is estimated.
  2. During the remediation stage, the remediation volume is more finely investigated. Within Ve, pollutant concentrations are measured systematically in the blocks of the remediation grid (Fig. 1). Due to the remaining uncertainty, Ve still includes nonpolluted blocks. Consequently, Ve is made of both the cleanup volume Vc composed of blocks of soils with concentrations exceeding S, and the stored volume Vs composed of nonpolluted blocks of soils to be excavated to access the polluted blocks.



View larger version (23K):
[in this window]
[in a new window]
 
Fig. 3. Theoretical graph of block concentration as a function of block probabilities of exceeding the remediation cutoff S. It illustrates the meaning of the low-risk volume Vlr (blocks of soils with probabilities less than {alpha}), the uncertain volume Vu (blocks of soils with probabilities greater than {alpha} and less than ß), and the excavated volume Ve (blocks of soils with probabilities greater than ß). The excavated volume is composed of the cleanup volume Vc (blocks of soils with pollutant concentrations greater than S) and the stored volume Vs (blocks of soils with pollutant concentrations less than S).

 
The first remediation volume estimate Ve is defined as the set of blocks with probabilities higher than a maximal "acceptable" probability ß:

[4]

The probability threshold ß corresponds to the maximal acceptable risk of soil misclassification, which is either a risk of including nonpolluted soils in Ve, or a risk of excluding polluted soils of Ve. The major difficulty is to define ß taking into account the risk assessment study, the possible site use following restoration, and the general context in which restoration takes place. Within Ve, the volume of soil Vc(i) where pollutant concentrations exceed S is calculated for each block conditional simulation (index i). The term Vc(i) is calculated as the sum of the unit volumes Vb of the t block-simulated values exceeding S included in Ve:

[5]

Similarly, the volume of soil Vs(i) in Ve where the pollutant concentration falls below S is calculated for each block conditional simulation i. The term Vs(i) is calculated as the sum of the unit volumes Vb of the t block-simulated values falling below S:

[6]

These calculations lead to two distributions of volumes Vc(i) and Vs(i). Depending on the skewness of the distributions, their mean or their median provides an estimate of the volumes Vc and Vs excavated from Ve. Uncertainties of the volumetric calculations are modeled inside and outside Ve. Inside Ve, the volumetric uncertainty is modeled by the dispersion of the distributions Vc(i) and Vs(i). This may be quantified by the interquartile range [Q25%Q75%] or by the coefficient of variation {sigma}/µ. Outside Ve, the volumetric uncertainty can be modeled providing the definition of a second probability threshold {alpha} ({alpha} < ß). Environmental risks may be considered as nonsignificant outside Ve for blocks with a probability smaller than {alpha}:

[7]

These blocks can remain without any remediation or additional monitoring. They make up the low-risk volume Vlr. Environmental and financial risks are still significant in the blocks where probabilities exceed {alpha} and fall below ß:

[8]

Blocks with pollutant concentrations exceeding S may remain untreated while blocks with pollutant concentrations below S may be excavated for remediation, although unnecessary. These blocks make up the uncertain volume Vu, which represents the nonacceptable uncertainty remaining outside Ve.

Calculation of the Overall Remediation Cost and Uncertainty (Steps 2 and 5)
Cost estimates and cost forecasts are calculated using the block simulations and block probabilities. Cleanup costs include both the costs of remediating the polluted soils within Ve and the costs due to the excavation and the storage of nonpolluted soils within Ve. The cost function has been developed to calculate a frequency distribution of cleanup costs Cc(i) from the volumetric estimates Ve, Vc(i), and Vs(i). The cost function (Renard-Demougeot, 2002) is parameterized to fit various pollution scenarios and different commercial and technical proposals. The general cost model is the following:

If VC < Vthreshold:

[9]
If not:

where A, B, C1, C2, C3, D, E, and F are composite unit prices allowing inclusion of planning, cleanup, transport, excavation, storage, filling up, and soil screening. Because in practice, different remediation techniques may be applied simultaneously in Ve depending on the pollutant concentrations and the granulometric classes of the soil, the cleanup volume Vc has to be partitioned in granulometric and concentration classes. The notation is Vc = Vc = Vc, where nc and np are respectively the number of concentration and granulometric classes. The total cleanup cost Cc is the sum of the costs of these different remediation techniques. Furthermore, a technical proposal may provide different cleanup unit prices. Unit prices may be higher for the volume of soil exceeding the remediation volume Vthreshold that has been declared before starting remediation.

Cost uncertainty outside Ve is defined as the cost of unexpected discovery of soils requiring remediation in Vu. A distribution of cost uncertainty Cu(i) is calculated applying the same cost function (Eq. [9]) to the uncertain volume Vu and to the distributions of volumes of polluted soils Vcu(i) and nonpolluted soils Vsu(i) included in Vu.

Sampling optimization requires balancing the increase of volume and cost accuracy, with the increase of investigation cost. Investigation costs Ci are calculated applying a parameterized investigation cost function to the number of samples. Finally, for a given number of samples collected at investigation stage j, an overall remediation cost Cr is defined as the sum of investigation cost Ci, cleanup cost Cc, and cost uncertainty Cu:

[10]

A distribution Cr(i) is obtained from Ci, Cc(i), and Cu(i). Depending on the skewness of the distribution, the mean or the median of this distribution provides an estimate of the overall cost.

Modeling Sampling of Various Numbers of Additional Data (Step 3)
If at stage j, the remaining uncertainties on the volume and cost estimates are too high for decision making, we propose to forecast the uncertainties on volumes and costs that could be estimated if additional samples were collected at stage j + 1 as follows:

Steps 2 to 5 are repeated for a series of successive values of Nj+1. The results are represented in the form of two graphs: a volume forecast graph representing Ve and Vu for stage j + 1 as a function of Nj+1, and a cost forecast graph representing Ci, Cc(i), and Cu(i) for stage j + 1 as a function of Nj+1. These graphs are then used as decision criteria in the design of the following sampling phase.


    APPLICATION TO A FORMER SMELTING WORK
 TOP
 NOTES
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 APPLICATION TO A FORMER...
 CONCLUSIONS
 REFERENCES
 
Site Description
The former smelting work is located in France. The site covers an area of 3 ha. The water table lies at around 2 m below ground. The principal wind directions in this area are west to north-northeast. Historical data have shown that the chimney of the smelting work was responsible for dispersion of nontreated dust and smoke. Seventy five samples were collected from the smelting site and its neighborhood at depths of 0 to 4 m, in six sampling stages (Fig. 4). The sampling support was homogeneous, consisting of soils collected along boreholes on a height of 25 cm. Lead concentrations were measured in all the samples (Row 1 of Table 1). A detailed risk assessment study has shown that soil with Pb concentration >300 mg kg–1 involves a risk for human health via dust inhalation. The 300 mg kg–1 value was thus chosen as the regulatory remediation cutoff S, without specifying at which sampling support S had to be applied. Soil was remediated by soil washing on site. The zone that was excavated for remediation had been delineated without using geostatistics. This zone was sorted according to 212 Pb concentrations measured in blocks of a selective remediation grid. The Pb concentrations were measured on composite samples made of four small samples taken at the corners and one small sample taken at the center of each block (Fig. 2). One block had a 10-m side length and a 0.30-m height. The soil was excavated in three layers and either sent to the washing unit or a storage area.



View larger version (25K):
[in this window]
[in a new window]
 
Fig. 4. Location of the 75 samples, collected in six investigation stages, on the map of the former smelting work.

 

View this table:
[in this window]
[in a new window]
 
Table 1. Statistics of soil Pb concentrations measured on boreholes (homogeneous sampling support) in the former smelting work (unity: mg kg–1).

 
Estimating Volumes and Costs at the Sixth Investigation Stage (Steps 1 and 2)
The remediation volumes and the overall costs were estimated using the 75 real site investigation data points. Because the density of the data on the site is variable, a declustering algorithm was applied. The statistics of the declustered data are presented in Row 2 of Table 1. The Gaussian transformed data have shown an anisotropic spatial structure, which has been modeled with a combination of a nugget effect 0.1 and two anisotropic spherical models of sills 0.5 and 0.65:

[11]
where NE is northeast, NW is northwest, vert is vertical, h is a vector of distance, and Sph is the spherical model. The anisotropy axes of the variogram model are consistent with the main wind directions.

A total of K = 200 conditional simulations of point Pb concentrations were generated on a fine grid, in the framework of a multigaussian model, in a large neighborhood so that the 75 available data points were used for conditioning each simulated value. Each fine grid cell had a 1.43-m side length and a 0.30-m height, so that 49 point-simulated values were included in one block of the remediation grid. The grid was oriented according to the anisotropy axes of the variogram model. The block simulations were calculated using the point simulations, accounting for:

  1. The change of support and the information effect. Block-simulated Pb concentrations were considered as the mean of five point-simulated values taken in each block (Eq. [2]), by analogy with the real block concentrations measured during remediation (Fig. 2).
  2. The sampling and analysis error. It has been shown that large errors have affected the real block Pb concentrations (Renard-Demougeot, 2002). These large errors were modeled by the variable {epsilon}(x) taken as a uniform distribution on the interval [–1; +1], whose variance is large, equal to 0.33 (Eq. [3]).

For each block, the probability that block Pb concentration exceeds 300 mg kg–1 was subsequently calculated as the ratio of the number of block-simulated values exceeding 300 mg kg–1 to the total number of simulated values (K = 200).

Furthermore, we assume that remediation is required for soils with a probability exceeding ß = 0.6. This value ß = 0.6 is considered as a reasonable threshold for that specific site. The volume Ve that requires excavation was thus evaluated at V*e = 10912 m3. We further assume that soil can remain without any remediation or additional monitoring if the probability of Pb concentrations to exceed 300 mg kg–1 falls below {alpha} = 0.2. The uncertain volume was thus evaluated at V*u = 23327 m3. The investigation cost Ci was calculated at investigation Stage 6 with real unit prices: C*i = 68.6 kEuros. A distribution of soil washing costs Cc(i) was calculated with actual market unit prices, using the volumetric estimates. An estimate of Cc is provided by the median of the distribution Cc(i): C*c = 984.8 kEuros. A distribution of uncertain costs Cu(i) was calculated applying the cleanup cost function to Vu. An estimate of Cu is provided by the median of this distribution: C*u = 1295.8 kEuros. In these conditions, the volumetric uncertainty V*u represents 214% of the excavated volume V*e. While the financial uncertainty C*u corresponds to 132% of the cleanup cost estimate C*c.

Forecasting Volumes and Costs at the Seventh Investigation Stage (Steps 3 to 5)
Additional sampling at Stage 7 was "simulated" according to the sampling design procedure described in the "Modeling Sampling of Various Numbers of Additional Data" section, above. Steps 2 to 5 were repeated for a series of numbers of additional Pb concentrations N7: N7 (12, 25, 50, 100, 200). The location of the selected values is a trade-off between standing at the nodes of a regular grid and being within the uncertain volume Vu, made of several patches (Fig. 5). Figure 6 maps the forecasted probabilities for different tested numbers N7. The volume forecast graph (Fig. 7) shows that the excavated volume forecast Ve increases while the uncertain volume forecast Vu decreases as N7 increases. The cost forecast graph (Fig. 8) shows that the cleanup cost forecast Cc increases while the uncertain cost forecast Cu decreases as N7 increases. The investigation cost forecast Ci increases very slightly as N7 increases.



View larger version (41K):
[in this window]
[in a new window]
 
Fig. 5. Maps showing the location (white dots) of the simulated additional samples for the successive values of N7. The points are superimposed on the simplified map of the probabilities where block concentration exceeds 300 mg kg–1 in the superficial layer (estimated at Stage 6 with the 75 real site investigation data points).

 


View larger version (47K):
[in this window]
[in a new window]
 
Fig. 6. Simplified maps of probabilities where block concentrations exceed 300 mg kg–1 in the superficial layer. These maps illustrate how the forecasted uncertain volume reduces when additional data are "simulated." Note that these maps are only an illustration of the uncertainty reduction but are not real forecasts as the additional data are simulated and not actual data.

 


View larger version (24K):
[in this window]
[in a new window]
 
Fig. 7. Volume forecasts graph: excavated volumes Ve and uncertain volumes Vu as a function of the number N7 of additional data whose sampling is "simulated" at Stage 7.

 


View larger version (23K):
[in this window]
[in a new window]
 
Fig. 8. Cost forecasts graph: investigation costs Ci, cleanup costs Cc, and uncertain costs Cu as a function of the number N7 of additional data whose sampling is "simulated" at Stage 7.

 
Decision Making: How to Design Sampling Using the Volume and Cost Forecasts Graphs?
An optimal number of samples that permits having an "acceptable" remediation volume and cost uncertainty level does not exist. Only a "best compromise" can be found. Criteria, goals, constraints, and decision makers' preferences, which are specific to each remediation context, determine this "best compromise." The volume forecast graph provides environmental decision criteria in terms of volumes intended for excavation and remediation, and their uncertainties. The cost forecast graph provides financial decision criteria in terms of investigation costs, cleanup costs, and uncertainties on the cleanup costs, for various planned sampling numbers. To illustrate how the volume and cost forecast graphs can be used to choose a number of additional data to be collected, we assume that decision makers have four different goals and constraints. The first goal consists of minimizing the relative volumetric uncertainty:

[12]

The second goal consists of minimizing the relative financial uncertainty:

[13]

The third goal consists of minimizing the overall remediation cost:

[14]

The last criterion is a common financial constraint, consisting of limiting the relative sampling cost to a given percentage:

[15]

The weights given to these objectives depend on the decision makers' profiles. We consider two decision makers' profiles. We call a risk-averse decision maker a person who is trying to avoid risks. He or she will probably assign heavier weights to Eq. [12] and [13] than to Eq. [14] and [15]. We call a risk-prone decision maker a person who is ready to take risks. He or she will probably assign heavier weights to Eq. [14] and [15] than to Eq. [12] and [13].

The volume forecast graph (Fig. 7) shows that the ratio (Vu/Ve) decreases rapidly as N7 increases, and tends to stabilize for N7 ≥ 100. For N7 = 100, Vu represents 50% of Ve, instead of the 214% estimated at Stage 6. Similarly, the cost forecast graph (Fig. 8) shows that the ratio (Cu/Cc) decreases rapidly as N7 increases, and tends to stabilize for N7 ≥ 100. For N7 = 100, Cu is forecasted at 35% of cleanup cost, instead of the 132% estimated at Stage 6. When N7 > 100, the gain in accuracy on the forecasted volumes and costs is small. The forecasted cost Cr = Ci + Cc + Cu (Table 2) is minimal for N7 = 50. The ratio (Ci/Cc) slightly decreases as N7 increases, up to N7 = 100. This ratio increases when N7 > 100. The minimum ratio, calculated for N7 = 100, is forecasted at 5.8%.


View this table:
[in this window]
[in a new window]
 
Table 2. Forecasted investigation costs Ci, cleanup costs Cc, and uncertain costs Cu, and corresponding ratios according to the number N7 of additional data whose sampling is "simulated" at Stage 7 (unity: x 105 Euro).

 
According to these forecasts, leaving aside any objectives or constraints other than the four defined above, it is reasonable to think that a risk-averse decision maker would choose to collect N7 = 100 additional samples at Stage 7. Similarly, we can think that a risk-prone decision maker would prefer N7 = 50, since, even though the corresponding (Vu/Ve) and (Cu/Cc) ratios are higher than those forecasted with N7 = 100, this number of additional data minimizes Cr and maintains the ratio (Ci/Cc) at an average value (6.4%).


    CONCLUSIONS
 TOP
 NOTES
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 APPLICATION TO A FORMER...
 CONCLUSIONS
 REFERENCES
 
The methodology presented in this paper allows support of the design of the number of additional samples that would be worth taking within a framework of an iterative sampling procedure. The value of a data sample is evaluated in terms of volumes and costs uncertainty reduction. The methodology proposes to locate additional samples in zones where the uncertainty remains important, as it has already been suggested in the literature. As illustrated on a former smelting work, a "best compromise" of additional data number can be chosen at each investigation stage j, using volumes and costs forecasted for stage j + 1. The choice is not unique since it depends on criteria, objectives, and constraints other than those quantified by geostatistics, as well as on the decision makers' preferences.

The advantage of the methodology is its flexibility. The use of conditional simulations, the modeling of soil selection and excavation, the modeling of support effect, information effect, and sample bias, and the parameterized cost function allows it to be close to the reality of remediation works. Indeed, the comparison of volumes and costs estimated by this method has shown to be in good agreement with actual values determined after remediation.

Nevertheless, the method is not helpful when the pollution does not show a spatial structure. In the case of a pure nugget effect, no additional sampling will improve the existing uncertainty. A technical limitation of the method is the choice of a single conditional realization to simulate sampling of additional points. Because only one realization is used, the forecasted volumes and costs represent only samples from a distribution and not the expectation. It means that these values should not be taken as estimations even if the uncertainty estimates are correct. In any case, it is then important to (i) check the consistency of the reference with the probability map and (ii) apply the approach to various simulations taken as references, and to compare the results from multiple realizations. The methodology will then provide intervals for the forecasted volumes and costs, which are much more useful than expectations. Another limitation is that the volumes and costs are forecasted with the variogram model adjusted at stage j. The variogram model that will be adjusted to the real data at stage j + 1 may differ from that model, especially at the beginning of the investigations when only few data are available.


    ACKNOWLEDGMENTS
 
The authors thank Martine Louvrier and Philippe Bégassat of the Agence De l'Environnement et de la Maîtrise de l'Energie (ADEME) for supplying the data and for their technical support, ADEME and Gaz de France for their financial support, and Alfred Stein for reviewing an early version of the article.


    NOTES
 TOP
 NOTES
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 APPLICATION TO A FORMER...
 CONCLUSIONS
 REFERENCES
 
Sponsoring organizations: Agence De l'Environnement et de la Maîtrise de l'Energie (ADEME), 2 Square Lafayette, BP 406, 49004 Angers, Cedex 1, France; Gaz De France (GDF), 361 avenue du président Wilson, 93211 Saint Denis, France.


    REFERENCES
 TOP
 NOTES
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 APPLICATION TO A FORMER...
 CONCLUSIONS
 REFERENCES
 


Related articles in JEQ:

This Issue in Journal of Environmental Quality

JEQ 2004 33: 1589-1599. [Full Text]  




This Article
Right arrow Abstract Freely available
Right arrow Figures Only
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Related articles in JEQ
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Citing Articles
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Demougeot-Renard, H.
Right arrow Articles by Renard, P.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Demougeot-Renard, H.
Right arrow Articles by Renard, P.
GeoRef
Right arrow GeoRef Citation
Agricola
Right arrow Articles by Demougeot-Renard, H.
Right arrow Articles by Renard, P.
Related Collections
Right arrow Geostatistics
Right arrow Uncertainty Analysis
Right arrow Data Management
Right arrow Heavy Metals
Right arrow Soil Pollution


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
The SCI Journals Agronomy Journal Crop Science
Vadose Zone Journal Journal of Plant Registrations
Journal of Natural Resources
and Life Sciences Education
Soil Science Society of America Journal