Laserfiche WebLink
3. F Test to compare two variances (dispersions)—Parametric Test <br /> 4. Wilcoxon-Mann-Whitney (WMW) Test to compare two locations, comparability of two <br /> continuous distributions—Nonparametric Test <br /> 5. Quantile Test to compare the upper tails of two continuous distributions -Nonparametric <br /> Test <br /> 6. Gehan Test to compare two locations -Nonparametric Test <br /> T-tests and F-test assume normality of the data sets under comparison. Some details of these <br /> approaches are described in ProUCL 4.0 Technical Guide. It should be noted that Gehan test, <br /> WMW test and Quantile test are also available for data sets with NDs. Gehan's test is <br /> specifically meant to be used on data sets with multiple detection limits. The Quantile test is a <br /> nonparametric test and is useful to detect a shift in the right tail of the site data distribution. The <br /> Quantile test when used in parallel with the Wilcoxon Mann Whitney (WMW) test provides the <br /> user with stronger evidence to make decisions about the comparability of site and background <br /> distributions, leading to more reliable conclusions whether the site has attained remediation <br /> levels or not. It is suggested that for best results,both WMW test and Quantile tests should be <br /> used on the same data set. <br /> Note on Comparability of Data Sets <br /> The samples collected from the two (or more)populations under comparisons should all be of <br /> the same type obtained using similar analytical methods and apparatus. In other words,the <br /> collected site and background samples should be all discrete or all composite (obtained using the <br /> same number of discrete samples, same design and pattern), and be collected from the same <br /> medium(soil) at similar depths (e.g., all surface samples or all subsurface samples) and time <br /> (e.g., during the same quarter in groundwater applications) using comparable (preferably same) <br /> analytical methods. Some good soil sample collection methods and sampling strategies are <br /> described in EPA, 2003 guidance document. <br /> Note on Influence of Outliers and Use of Lognormal Distribution <br /> Typically, in environmental data sets collected from impacted sites or monitoring wells(MWs), <br /> an outlier represents an observation coming from a potentially contaminated site location. This is <br /> especially true,when the data are collected from a site specific background area. The outlying <br /> observations need to be identified before computing the background statistics (and other <br /> estimates and test statistics) as outliers when present distort all statistics of interest, which in turn <br /> may lead to incorrect remediation and cleanup decisions for the site under investigation. For an <br /> example, inclusion of an outlier may distort the t-test statistic resulting in distorted and incorrect <br /> decision errors (Type 1 or Type 2 errors),which can lead to incorrect conclusion about the <br /> hypotheses testing. The incorrect decisions may adversely affect the human health and the <br /> environment. <br /> The main objective of using a statistical procedure is to model the majority of the data <br /> representing the main dominant population, and not to accommodate a few low probability <br /> outliers that may yield inflated and impractical statistics, results, and incorrect conclusions. For <br />