Laserfiche WebLink
an example, background threshold values (BTVs) and exposure point concentration(EPC)terms <br /> should be estimated by reliable statistics (and not distorted statistics) obtained using data sets <br /> representing the main dominant population under study (e.g., site,background). The low <br /> probability high outlying values contaminate the underlying left-censored or uncensored full data <br /> set from the population under study. The inclusion of outliers in a background data set needs to <br /> be justified before performing other relevant statistical analyses including the estimation of <br /> BTVs. If possible, all interested parties should be involved in decision making about the <br /> disposition(inclusion or exclusion) of outliers in a background data set. Typically, outlying <br /> locations (if any) with elevated concentrations need separate investigation. <br /> It should be noted that the objective is to compute reliable background statistics based upon the <br /> majority of a defensible background data set representing the dominant background population. <br /> In the process of estimating the BTVs, it may not be desirable to accommodate a few low <br /> probability outlying observations (if any) by using a lognormal distribution(Singh, Singh, and <br /> Iaci, 2002). The use of a lognormal distribution often accommodates outliers and multiple <br /> populations, which in turn yields inflated UCLs and background statistics such as UPLs, <br /> percentiles, and UTLs. <br /> The proper identification of multiple outliers is a complex issue based upon robust statistical <br /> methods, and is beyond the scope of ProUCL 4.0. For details of the robust outlier identification <br /> procedures, refer to Barnett and Lewis (1994), and Singh and Noccrino (1995). Amore <br /> complicated problem arises when the collected background data set may represent a potentially <br /> mixture data set including observations from some of the site areas. The occurrence of mixture <br /> samples is quite common in many environmental applications. This is especially true when data <br /> sets are collected from large federal facilities (e.g.,Navy Sites). For such cases,the underlying <br /> data set may consist of samples from the background areas as well as from some other <br /> potentially contaminated site areas. In this situation, first, one has to separate the background <br /> observations from the other site related observations. After the background data set has been <br /> properly extracted from a potentially a mixture sample, one can proceed with the computation of <br /> background statistics as available in ProUCL 4.0. <br /> Appropriate population partitioning techniques (e.g., see Singh, Singh, and Flatman (1994)) can <br /> be used to extract a background data set from a potentially mixture data set. However, the <br /> population partitioning methods are beyond the scope of ProUCL 4.0. It should be noted that <br /> some of those methods will be available in Scout(EPA, 2000) software which is currently under <br /> revision and upgrades. For methods as incorporated in ProUCL, it is assumed that one is dealing <br /> with a sample from a"single"population representing a valid site-related background data set. <br /> Therefore, before using statistical methods to compute the various limits such as UCLs, UTLs, <br /> and UPLs, it is suggested that the user pre-processes the data set to identify potential outliers and <br /> mixture populations (if any). <br /> Outlier Tests <br /> ProUCL 4.0 has a couple of classical outlier test procedures, such as the Dixon test and the <br /> Rosner test. Additionally,ProUCL 4.0 software has exploratory graphical methods including <br />