Laserfiche WebLink
1% 5% 10% 25% 50% 75% 90% 95% 99% <br /> 0 0 0 0 0 0 0 0 0 <br /> Outlier Test <br /> Dixon's extreme value test was performed to test whether the lowest value is a statistical outlier. The test was conducted at <br /> the 5% significance level. <br /> Data should not be excluded from analysis solely on the basis of the results of this or any other statistical test. If any <br /> values are flagged as possible outliers, further investigation is recommended to determine whether there is a plausible <br /> explanation that justifies removing or replacing them. <br /> DIXON'S OUTLIER TEST for TPH <br /> Dixon Test Statistic 0 <br /> Dixon 5% Critical Value 0 <br /> The calculated test statistic does not exceed the critical value, so the test cannot reject the null hypothesis that there are <br /> no outliers in the data, and concludes that the minimum value 0 is not an outlier at the 5% significance level. <br /> Because Dixon's test can be used only when the data without the suspected outlier are approximately normally distributed, <br /> a Shapiro-Wilk test for normality was performed at a 5% significance level. <br /> NORMAL DISTRIBUTION TEST(excluding outliers) <br /> Shapiro-Wilk Test Statistic 2.624e-308 <br /> Shapiro-Wilk 5% Critical Value 1.376e-313 <br /> The calculated Shapiro-Wilk test statistic exceeds the 5% Shapiro-Wilk critical value, so the test cannot reject the <br /> hypothesis that the data are normal and concludes that the data, excluding the minimum value 0, do appear to follow a <br /> normal distribution at the 5% level of significance. <br /> Data Plots for TPH <br /> Graphical displays of the data are shown below. <br /> The Histogram is a plot of the fraction of the n observed data that fall within specified data"bins." A histogram is <br /> generated by dividing the x axis (range of the observed data values) into"bins"and displaying the number of data in each <br /> bin as the height of a bar for the bin. The area of the bar is the fraction of the n data values that lie within the bin. The <br /> sum of the fractions for all bins equals one. A histogram is used to assess how the n data are distributed (spread) over <br /> their range of values. If the histogram is more or less symmetric and bell shaped, then the data may be normally <br /> distributed. <br /> The Box and Whiskers plot is composed of a central box divided by a line, and with two lines extending out from the box, <br /> called the"whiskers". The line through the box is drawn at the median of the n data observed. The two ends of the box <br /> represent the 25th and 75th percentiles of the n data values, which are also called the lower and upper quartiles, <br /> respectively, of the data set. The sample mean (mean of the n data) is shown as a 'Y' sign. The upper whisker extends to <br /> the largest data value that is less than the upper quartile plus 1.5 times the interquartile range(upper quartile minus the <br /> lower quartile). The lower whisker extends to the smallest data value that is greater than the lower quartile minus 1.5 <br /> times the interquartile range. Extreme data values (greater or smaller than the ends of the whiskers) are plotted <br /> individually as blue Xs. A Box and Whiskers plot is used to assess the symmetry of the distribution of the data set. If the <br /> distribution is symmetrical, the box is divided into two equal halves by the median,the whiskers will be the same length, <br /> and the number of extreme data points will be distributed equally on either end of the plot. <br /> The Q-Q plot graphs the quantiles of a set of n data against the quantiles of a specific distribution. We show here only the <br /> Q-Q plot for an assumed normal distribution. The pm quantile of a distribution of data is the data value, x,,, for which a <br /> fraction p of the distribution is less than x,,. If the data plotted on the normal distribution Q-Q plot closely follow a straight <br /> line, even at the ends of the line, then the data may be assumed to be normally distributed. If the data points deviate <br /> substantially from a linear line, then the data are not normally distributed. <br />