首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Using auxiliary information to improve the prediction accuracy of soil properties in a physically meaningful and technically efficient manner has been widely recognized in pedometrics. In this paper, we explored a novel technique to effectively integrate sampling data and auxiliary environmental information, including continuous and categorical variables, within the framework of the Bayesian maximum entropy (BME) theory. Soil samples and observed auxiliary variables were combined to generate probability distributions of the predicted soil variable at unsampled points. These probability distributions served as soft data of the BME theory at the unsampled locations, and, together with the hard data (sample points) were used in spatial BME prediction. To gain practical insight, the proposed approach was implemented in a real-world case study involving a dataset of soil total nitrogen (TN) contents in the Shayang County of the Hubei Province (China). Five terrain indices, soil types, and soil texture were used as auxiliary variables to generate soft data. Spatial distribution of soil total nitrogen was predicted by BME, regression kriging (RK) with auxiliary variables, and ordinary kriging (OK). The results of the prediction techniques were compared in terms of the Pearson correlation coefficient (r), mean error (ME), and root mean squared error (RMSE). These results showed that the BME predictions were less biased and more accurate than those of the kriging techniques. In sum, the present work extended the BME approach to implement certain kinds of auxiliary information in a rigorous and efficient manner. Our findings showed that the BME prediction technique involving the transformation of variables into soft data can improve prediction accuracy considerably, compared to other techniques currently in use, like RK and OK.  相似文献   

2.
Compositional Bayesian indicator estimation   总被引:1,自引:1,他引:0  
Indicator kriging is widely used for mapping spatial binary variables and for estimating the global and local spatial distributions of variables in geosciences. For continuous random variables, indicator kriging gives an estimate of the cumulative distribution function, for a given threshold, which is then the estimate of a probability. Like any other kriging procedure, indicator kriging provides an estimation variance that, although not often used in applications, should be taken into account as it assesses the uncertainty of the estimate. An alternative approach to indicator estimation is proposed in this paper. In this alternative approach the complete probability density function of the indicator estimate is evaluated. The procedure is described in a Bayesian framework, using a multivariate Gaussian likelihood and an a priori distribution which are both combined according to Bayes theorem in order to obtain a posterior distribution for the indicator estimate. From this posterior distribution, point estimates, interval estimates and uncertainty measures can be obtained. Among the point estimates, the median of the posterior distribution is the maximum entropy estimate because there is a fifty-fifty chance of the unknown value of the estimate being larger or smaller than the median; that is, there is maximum uncertainty in the choice between two alternatives. Thus in some sense, the latter is an indicator estimator, alternative to the kriging estimator, that includes its own uncertainty. On the other hand, the mode of the posterior distribution estimator, assuming a uniform prior, is coincidental with the simple kriging estimator. Additionally, because the indicator estimate can be considered as a two-part composition which domain of definition is the simplex, the method is extended to compositional Bayesian indicator estimation. Bayesian indicator estimation and compositional Bayesian indicator estimation are illustrated with an environmental case study in which the probability of the content of a geochemical element in soil being over a particular threshold is of interest. The computer codes and its user guides are public domain and freely available.  相似文献   

3.
Due to the fast pace increasing availability and diversity of information sources in environmental sciences, there is a real need of sound statistical mapping techniques for using them jointly inside a unique theoretical framework. As these information sources may vary both with respect to their nature (continuous vs. categorical or qualitative), their spatial density as well as their intrinsic quality (soft vs. hard data), the design of such techniques is a challenging issue. In this paper, an efficient method for combining spatially non-exhaustive categorical and continuous data in a mapping context is proposed, based on the Bayesian maximum entropy paradigm. This approach relies first on the definition of a mixed random field, that can account for a stochastic link between categorical and continuous random fields through the use of a cross-covariance function. When incorporating general knowledge about the first- and second-order moments of these fields, it is shown that, under mild hypotheses, their joint distribution can be expressed as a mixture of conditional Gaussian prior distributions, with parameters estimation that can be obtained from entropy maximization. A posterior distribution that incorporates the various (soft or hard) continuous and categorical data at hand can then be obtained by a straightforward conditionalization step. The use and potential of the method is illustrated by the way of a simulated case study. A comparison with few common geostatistical methods in some limit cases also emphasizes their similarities and differences, both from the theoretical and practical viewpoints. As expected, adding categorical information may significantly improve the spatial prediction of a continuous variable, making this approach powerful and very promising.  相似文献   

4.
Multigaussian kriging technique has many applications in mining, soil science, environmental science and other fields. Particularly, in the local reserve estimation of a mineral deposit, multigaussian kriging is employed to derive panel-wise tonnages by predicting conditional probability of block grades. Additionally, integration of a suitable change of support model is also required to estimate the functions of the variables with larger support than that of the samples. However, under the assumption of strict stationarity, the grade distributions and important recovery functions are estimated by multigaussian kriging using samples within a supposedly spatial homogeneous domain. Conventionally, the underlying random function model is required to be stationary in order to carry out the inference on ore grade distribution and relevant statistics. In reality, conventional stationary model often fails to represent complicated geological structure. Traditionally, the simple stationary model neither considers the obvious changes in local means and variances, nor is it able to replicate spatial continuity of the deposit and hence produces unreliable outcomes. This study deals with the theoretical design of a non-stationary multigaussian kriging model allowing change of support and its application in the mineral reserve estimation scenario. Local multivariate distributions are assumed here to be strictly stationary in the neighborhood of the panels. The local cumulative distribution function and related statistics with respect to the panels are estimated using a distance kernel approach. A rigorous investigation through simulation experiments is performed to analyze the relevance of the developed model followed by a case study on a copper deposit.  相似文献   

5.
Inverse distance interpolation for facies modeling   总被引:1,自引:0,他引:1  
Inverse distance weighted interpolation is a robust and widely used estimation technique. In practical applications, inverse distance interpolation is oftentimes favored over kriging-based techniques when there is a problem of making meaningful estimates of the field spatial structure. Nowadays application of inverse distance interpolation is limited to continuous random variable modeling. There is a need to extend the approach to categorical/discrete random variables. In this paper we propose such an extension using indicator formalism. The applicability of inverse distance interpolation for categorical modeling is then illustrated using Total’s Joslyn Lease facies data.  相似文献   

6.
Estimating and mapping spatial uncertainty of environmental variables is crucial for environmental evaluation and decision making. For a continuous spatial variable, estimation of spatial uncertainty may be conducted in the form of estimating the probability of (not) exceeding a threshold value. In this paper, we introduced a Markov chain geostatistical approach for estimating threshold-exceeding probabilities. The differences of this approach compared to the conventional indicator approach lie with its nonlinear estimators—Markov chain random field models and its incorporation of interclass dependencies through transiograms. We estimated threshold-exceeding probability maps of clay layer thickness through simulation (i.e., using a number of realizations simulated by Markov chain sequential simulation) and interpolation (i.e., direct conditional probability estimation using only the indicator values of sample data), respectively. To evaluate the approach, we also estimated those probability maps using sequential indicator simulation and indicator kriging interpolation. Our results show that (i) the Markov chain approach provides an effective alternative for spatial uncertainty assessment of environmental spatial variables and the probability maps from this approach are more reasonable than those from conventional indicator geostatistics, and (ii) the probability maps estimated through sequential simulation are more realistic than those through interpolation because the latter display some uneven transitions caused by spatial structures of the sample data.  相似文献   

7.
Interpolation techniques for spatial data have been applied frequently in various fields of geosciences. Although most conventional interpolation methods assume that it is sufficient to use first- and second-order statistics to characterize random fields, researchers have now realized that these methods cannot always provide reliable interpolation results, since geological and environmental phenomena tend to be very complex, presenting non-Gaussian distribution and/or non-linear inter-variable relationship. This paper proposes a new approach to the interpolation of spatial data, which can be applied with great flexibility. Suitable cross-variable higher-order spatial statistics are developed to measure the spatial relationship between the random variable at an unsampled location and those in its neighbourhood. Given the computed cross-variable higher-order spatial statistics, the conditional probability density function is approximated via polynomial expansions, which is then utilized to determine the interpolated value at the unsampled location as an expectation. In addition, the uncertainty associated with the interpolation is quantified by constructing prediction intervals of interpolated values. The proposed method is applied to a mineral deposit dataset, and the results demonstrate that it outperforms kriging methods in uncertainty quantification. The introduction of the cross-variable higher-order spatial statistics noticeably improves the quality of the interpolation since it enriches the information that can be extracted from the observed data, and this benefit is substantial when working with data that are sparse or have non-trivial dependence structures.  相似文献   

8.
Application of the BME approach to soil texture mapping   总被引:3,自引:1,他引:3  
In order to derive accurate space/time maps of soil properties, soil scientists need tools that combine the usually scarce hard data sets with the more easily accessible soft data sets. In the field of modern geostatistics, the Bayesian maximum entropy (BME) approach provides new and powerful means for incorporating various forms of physical knowledge (including hard and soft data, soil classification charts, land cover data from satellite pictures, and digital elevation models) into the space/time mapping process. BME produces the complete probability distribution at each estimation point, thus allowing the calculation of elaborate statistics (even when the distribution is not Gaussian). It also offers a more rigorous and systematic method than kriging for integrating uncertain information into space/time mapping. In this work, BME is used to estimate the three textural fractions involved in a texture map. The first case study focuses on the estimation of the clay fraction, whereas the second one considers the three textural fractions (sand, silt and clay) simultaneously. The BME maps obtained are informative (important soil characteristics are identified, natural variations are well reproduced, etc.). Furthermore, in both case studies, the estimates obtained by BME were more accurate than the simple kriging (SK) estimates, thus offering a better picture of soil reality. In the multivariate case, classification error rate analysis in terms of BME performs considerably better than in terms of kriging. Analysis in terms of BME can offer valuable information to be used in sampling design, in optimizing the hard to soft data ratio, etc.  相似文献   

9.
In most groundwater applications, measurements of concentration are limited in number and sparsely distributed within the domain of interest. Therefore, interpolation techniques are needed to obtain most likely values of concentration at locations where no measurements are available. For further processing, for example, in environmental risk analysis, interpolated values should be given with uncertainty bounds, so that a geostatistical framework is preferable. Linear interpolation of steady-state concentration measurements is problematic because the dependence of concentration on the primary uncertain material property, the hydraulic conductivity field, is highly nonlinear, suggesting that the statistical interrelationship between concentration values at different points is also nonlinear. We suggest interpolating steady-state concentration measurements by conditioning an ensemble of the underlying log-conductivity field on the available hydrological data in a conditional Monte Carlo approach. Flow and transport simulations for each conditional conductivity field must meet the measurements within their given uncertainty. The ensemble of transport simulations based on the conditional log-conductivity fields yields conditional statistical distributions of concentration at points between observation points. This method implicitly meets physical bounds of concentration values and non-Gaussianity of their statistical distributions and obeys the nonlinearity of the underlying processes. We validate our method by artificial test cases and compare the results to kriging estimates assuming different conditional statistical distributions of concentration. Assuming a beta distribution in kriging leads to estimates of concentration with zero probability of concentrations below zero or above the maximal possible value; however, the concentrations are not forced to meet the advection-dispersion equation.  相似文献   

10.
It is common in geostatistics to use the variogram to describe the spatial dependence structure and to use kriging as the spatial prediction methodology. Both methods are sensitive to outlying observations and are strongly influenced by the marginal distribution of the underlying random field. Hence, they lead to unreliable results when applied to extreme value or multimodal data. As an alternative to traditional spatial modeling and interpolation we consider the use of copula functions. This paper extends existing copula-based geostatistical models. We show how location dependent covariates e.g. a spatial trend can be accounted for in spatial copula models. Furthermore, we introduce geostatistical copula-based models that are able to deal with random fields having discrete marginal distributions. We propose three different copula-based spatial interpolation methods. By exploiting the relationship between bivariate copulas and indicator covariances, we present indicator kriging and disjunctive kriging. As a second method we present simple kriging of the rank-transformed data. The third method is a plug-in prediction and generalizes the frequently applied trans-Gaussian kriging. Finally, we report on the results obtained for the so-called Helicopter data set which contains extreme radioactivity measurements.  相似文献   

11.
Conditional bias-penalized kriging (CBPK)   总被引:1,自引:1,他引:0  
Simple and ordinary kriging, or SK and OK, respectively, represent the best linear unbiased estimator in the unconditional sense in that they minimize the unconditional (on the unknown truth) error variance and are unbiased in the unconditional mean. However, because the above properties hold only in the unconditional sense, kriging estimates are generally subject to conditional biases that, depending on the application, may be unacceptably large. For example, when used for precipitation estimation using rain gauge data, kriging tends to significantly underestimate large precipitation and, albeit less consequentially, overestimate small precipitation. In this work, we describe an extremely simple extension to SK or OK, referred to herein as conditional bias-penalized kriging (CBPK), which minimizes conditional bias in addition to unconditional error variance. For comparative evaluation of CBPK, we carried out numerical experiments in which normal and lognormal random fields of varying spatial correlation scale and rain gauge network density are synthetically generated, and the kriging estimates are cross-validated. For generalization and potential application in other optimal estimation techniques, we also derive CBPK in the framework of classical optimal linear estimation theory.  相似文献   

12.
In humid, well-vegetated areas, such as in the northeastern US, runoff is most commonly generated from relatively small portions of the landscape becoming completely saturated, however, little is known about the spatial and temporal behavior of these saturated regions. Indicator kriging provides a way to use traditional water table data to quantify probability of saturation to evaluate predicted spatial distributions of runoff generation risk, especially for the new generation of water quality models incorporating saturation excess runoff theory. When spatial measurements of a variable are transformed to binary indicators (i.e., 1 if above a given threshold value and 0 if below) and the resulting indicator semivariogram is modeled, indicator kriging produces the probability of the measured variable to exceed the threshold value. Indicator kriging gives quantified probability of saturation or, consistent with saturation excess runoff theory, runoff generation risk with depth to water table as the variable and the threshold set near the soil surface. The probability of saturation for a 120 m × 180 m hillslope based upon 43 measurements of depth to water table is investigated with indicator semivariograms for six storm events. The indicator semivariograms show high spatial structure in saturated regions with large antecedent rainfall conditions. The temporal structure of the data is used to generate interpolated (soft) data to supplement measured (hard) data. This improved the spatial structure of the indicator semivariograms for lower antecedent rainfall conditions. Probability of saturation was evaluated through indicator kriging incorporating soft data showing, based on this preliminary study, highly connected regions of saturation as expected for the wet season (April through May) in the Catskill Mountain region of New York State. Supplementation of hard data with soft data incorporates physical hydrology of the hillslope to capture significant patterns not available when using hard data alone for indicator kriging. With the need for water quality models incorporating appropriate runoff generation risk estimates on the rise, this manner of data will lay the groundwork for future model evaluation and development.  相似文献   

13.
Categorical data play an important role in a wide variety of spatial applications, while modeling and predicting this type of statistical variable has proved to be complex in many cases. Among other possible approaches, the Bayesian maximum entropy methodology has been developed and advocated for this goal and has been successfully applied in various spatial prediction problems. This approach aims at building a multivariate probability table from bivariate probability functions used as constraints that need to be fulfilled, in order to compute a posterior conditional distribution that accounts for hard or soft information sources. In this paper, our goal is to generalize further the theoretical results in order to account for a much wider type of information source, such as probability inequalities. We first show how the maximum entropy principle can be implemented efficiently using a linear iterative approximation based on a minimum norm criterion, where the minimum norm solution is obtained at each step from simple matrix operations that converges to the requested maximum entropy solution. Based on this result, we show then how the maximum entropy problem can be related to the more general minimum divergence problem, which might involve equality and inequality constraints and which can be solved based on iterated minimum norm solutions. This allows us to account for a much larger panel of information types, where more qualitative information, such as probability inequalities can be used. When combined with a Bayesian data fusion approach, this approach deals with the case of potentially conflicting information that is available. Although the theoretical results presented in this paper can be applied to any study (spatial or non-spatial) involving categorical data in general, the results are illustrated in a spatial context where the goal is to predict at best the occurrence of cultivated land in Ethiopia based on crowdsourced information. The results emphasize the benefit of the methodology, which integrates conflicting information and provides a spatially exhaustive map of these occurrence classes over the whole country.  相似文献   

14.
The random function is a mathematical model commonly used in the assessment of uncertainty associated with a spatially correlated attribute that has been partially sampled. There are multiple algorithms for modeling such random functions, all sharing the requirement of specifying various parameters that have critical influence on the results. The importance of finding ways to compare the methods and setting parameters to obtain results that better model uncertainty has increased as these algorithms have grown in number and complexity. Crossvalidation has been used in spatial statistics, mostly in kriging, for the analysis of mean square errors. An appeal of this approach is its ability to work with the same empirical sample available for running the algorithms. This paper goes beyond checking estimates by formulating a function sensitive to conditional bias. Under ideal conditions, such function turns into a straight line, which can be used as a reference for preparing measures of performance. Applied to kriging, deviations from the ideal line provide sensitivity to the semivariogram lacking in crossvalidation of kriging errors and are more sensitive to conditional bias than analyses of errors. In terms of stochastic simulation, in addition to finding better parameters, the deviations allow comparison of the realizations resulting from the applications of different methods. Examples show improvements of about 30% in the deviations and approximately 10% in the square root of mean square errors between reasonable starting modelling and the solutions according to the new criteria.  相似文献   

15.
Indicator cokriging (Journel 1983) is examined as a tool for real-time estimation of rainfall from rain gage measurements. The approach proposed in this work obviates real-time estimation of real-time statistics of rainfall by using ensemble or climatological statistics exclusively, and reduces computational requirements attendant to indicator cokriging by employing only a few auxiliary cutoffs in estimation of conditional probabilities. Due to unavailability of suitable rain gage measurements, hourly radar rain fall data were used for both indicator covariance estimation and a comparative evaluation. Preliminary results suggest that the indicator cokriging approach is clearly superior to its ordinary kriging counterpart, whereas the indicator kriging approach is not. The improvement is most significant in estimation of light rainfall, but drops off significantly for heavy rainfall. The lack of predictability in spatial estimation of heavy rainfall is borne out in the integral scale of indicator correlation: peaking to its maximum for cutoffs near the median, indicator correlation scale becomes increasingly smaller for larger cutoffs of rainfall depth. A derived-distribution analysis, based on the assumption that radar rainfall is a linear sum of ground-truth and a random error, suggests that, at low cutoffs, indicator correlation scale of ground-truth can significantly differ from that of radar rainfall, and points toward inclusion of rainfall intermittency, for example, within the framework proposed in this work.  相似文献   

16.
Properties and limitations of sequential indicator simulation   总被引:2,自引:0,他引:2  
The sequential indicator algorithm is a widespread geostatistical simulation technique that relies on indicator (co)kriging and is applicable to a wide range of datasets. However, such algorithm comes up against several limitations that are often misunderstood. This work aims at highlighting these limitations, by examining what are the conditions for the realizations to reproduce the input parameters (indicator means and correlograms) and what happens with the other parameters (other two-point or multiple-point statistics). Several types of random functions are contemplated, namely: the mosaic model, random sets, models defined by multiple indicators and isofactorial models. In each case, the conditions for the sequential algorithm to honor the model parameters are sought after. Concurrently, the properties of the multivariate distributions are identified and some conceptual impediments are emphasized. In particular, the prior multiple-point statistics are shown to depend on external factors such as the total number of simulated nodes and the number and locations of the samples. As a consequence, common applications such as a flow simulation or a change of support on the realizations may lead to hazardous interpretations.  相似文献   

17.
Spatial rainfall amounts accumulated over short to medium periods of time, say a few days, tend to have a probabilistic structure with very distinctive features. Some of these that are specially relevant for the purpose of spatial modeling are the presence of mixed sampling distributions, right skewed distributions conditional on rainfall occurrence, and a complex spatial association structure. The goal of this work is to construct a family for the bivariate distributions of spatial rainfall fields that incorporates these distinctive features. It is based on the separate modeling of spatial occurrence of rainfall and the spatial distribution of positive rainfalls. The main properties of the bivariate distributions are derived, and some properties of the random field realizations are illustrated through simulation. Some limitations of the proposed model are also discussed.  相似文献   

18.
A regression model is used to study spatiotemporal distributions of solute content ion concentration data (calcium, chloride and nitrate), which provide important water contamination indicators. The model consists of three random and one deterministic components. The random space/time component is assumed to be homogeneous/stationary and to have a separable covariance. The purely spatial and the purely temporal random components are assumed to have homogenous and stationary increments, respectively. The deterministic component represents the space/time mean function. Inferences of the random components involve maximum likelihood and semi-parametric methods under some restrictions on the data configuration. Computational advantages and modelling limitations of the assumptions underlying the regression model are discussed. The regression model leads to simplifications in the space/time kriging and cokriging systems used to obtain space/time estimates at unobservable locations/instants. The application of the regression model in the study of the solute content ions was done at a global scale that covers the entire region of interest. The variability analysis focuses on the calculation of the spatial direct and cross-variograms and the evaluation of correlations between the three solute content ions. The space/time kriging system is developed in terms of the space direct and cross-variograms, and allows the separate estimation of the regression model components. Maps of these components are then obtained for each one of the three ions. Using the estimates of the purely spatial component, spatial dependencies between the ions are studied. Physical causes and consequences of the space/time variability are discussed, and comparisons are made with previous analyses of the solute content dataset.  相似文献   

19.
This study uses elliptical copulas and transition probabilities for uncertainty modeling of categorical spatial data. It begins by discussing the expressions of the cumulative distribution function and probability density function of two major elliptical copulas: Gaussian copula and t copula. The basic form of spatial copula discriminant function is then derived based on Bayes’ theorem, which consists of three parts: the prior probability, the conditional marginal densities, and the conditional copula density. Finally, three kinds of parameter estimation methods are discussed, including maximum likelihood estimation, inference functions for margins and canonical maximum likelihood (CML). To avoid making assumptions on the form of marginal distributions, the CML approach is adopted in the real-world case study. Results show that the occurrence probability maps generated by these two elliptical copulas are similar to each other. However, the prediction map interpolated by Gaussian copula has a relatively higher classification accuracy than t copula.  相似文献   

20.
A Monte Carlo approach is described for the quantification of uncertainty on travel time estimates. A real (non synthetic) and exhaustive data set of natural genesis is used for reference. Using an approach based on binary indicators, constraint interval data are easily accommodated in the modeling process. It is shown how the incorporation of imprecise data can reduce drastically the uncertainty in the estimates. It is also shown that unrealistic results are obtained when a deterministic modeling is carried out using a kriging estimate of the transmissivity field. Problems related with using sequential indicator simulation for the generation of fields incorporating constraint interval data are discussed. The final results consists of 95% probability intervals of arrival times at selected control planes reflecting the original uncertainty on the transmissivity maps.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号