|
KOLMOGOROV SMIRNOV TWO SAMPLEName:
where n(i) is the number of points less than Yi This is a step function that increases by 1/N at the value of each data point. We can graph a plot of the empirical distribution function with a cumulative distribution function for a given distribution. The one sample K-S test is based on the maximum distance between these two curves. That is,
where F is the theoretical cumulative distribution function. The two sample K-S test is a variation of this. However, instead of comparing an empirical distribution function to a theoretical distribution function, we compare the two empirical distribution functions. That is,
where E1 and E2 are the empirical distribution functions for the two samples. Note that we compute E1 and E2 at each point in both samples (that is both E1 and E2 are computed at each point in each sample). More formally, the Kolmogorov-Smirnov two sample test statistic can be defined as follows.
The quantile-quantile plot, bihistogram, and Tukey mean-difference plot are graphical alternatives to the two sample K-S test.
<SUBSET/EXCEPT/FOR/qualification> where <y1> is the first response variable; <y2> is the second response variable; and where the <SUBSET/EXCEPT/FOR qualification> is optional.
KOLMOGOROV-SMIRNOV TWO SAMPLE TEST Y1 Y2 SUBSET Y2 > 0
CUTUPP90 - 90% critical value (alpha = 0.10) for the K-S two sample test statistic CUTUPP95 - 95% critical value (alpha = 0.05) for the K-S two sample test statistic CUTUPP99 - 99% critical value (alpha = 0.01) for the K-S two sample test statistic These parameters can be used in subsequent analysis.
"Numerical Recipes in Fortan: The Art of Scientific Computing", Second Edition, Press, Teukolsky, Vetterlling, and Flannery, Cambridge University Press, 1992, pp. 614-622.
READ AUTO83B.DAT Y1 Y2 . DELETE Y2 SUBSET Y2 < 0 KOLMOGOROV-SMIRNOPV TWO SAMPLE TEST Y1 Y2 The following output is generated.
*************************************************
** KOLMOGOROV-SMIRNOPV TWO SAMPLE TEST Y1 Y2 **
*************************************************
KOLMOGOROV-SMIRNOV TWO SAMPLE TEST
NULL HYPOTHESIS H0: TWO SAMPLES COME FROM THE SAME (UNSPECIFIED)
DISTRIBUTION
ALTERNATE HYPOTHESIS HA: TWO SAMPLES COME FROM DIFFERENT DISTRIBUTIONS
SAMPLE:
NUMBER OF OBSERVATIONS FOR SAMPLE 1 = 249
NUMBER OF OBSERVATIONS FOR SAMPLE 2 = 79
TEST:
KOLMOGOROV-SMIRNOV TEST STATISTIC = 1.000000
ALPHA LEVEL CUTOFF CONCLUSION
10% 0.37000 REJECT H0
5% 0.41000 REJECT H0
1% 0.49000 REJECT H0
Date created: 6/5/2001 |