MANN WHITNEY U STATISTIC

Name:

MANN WHITNEY U STATISTIC (LET)

Type:

Analysis Command

Purpose:

Compute the test statistic or alternatively the frequencies and CDF values for the U version of the Mann Whitney rank sum test.

Description:

The Mann Whitney rank sum test statistic is computed by:

Rank the combined samples.
Compute the sum of the ranks for each sample (call these T₁ and T₂).
If the sample sizes are equal, the test statistic is
T = MIN(T₁,T₂)
If the sample sizes are unequal, let T₁ be the rank sum of the smaller sample (of size N₁); the test statistic is
T = MIN(T₁,N₁*(N₁ + N₂ + 1) - T₁)
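The steps above can be sketched in a few lines of Python. This is an illustrative sketch only, not Dataplot code; the function name rank_sum_statistic is hypothetical, and assigning tied values their average rank is an assumption, since the steps above do not discuss ties.

```python
def rank_sum_statistic(y1, y2):
    """Return the rank sum statistic T for two samples (midranks for ties)."""
    # Make y1 the smaller sample, as the unequal-size rule requires.
    if len(y1) > len(y2):
        y1, y2 = y2, y1
    n1, n2 = len(y1), len(y2)

    # Rank the combined samples; tied values share their average rank
    # (an assumption, the description does not cover ties).
    combined = sorted(list(y1) + list(y2))

    def midrank(value):
        first = combined.index(value) + 1   # 1-based rank of first occurrence
        count = combined.count(value)       # number of tied observations
        return first + (count - 1) / 2.0

    t1 = sum(midrank(v) for v in y1)
    t2 = sum(midrank(v) for v in y2)

    if n1 == n2:
        return min(t1, t2)
    return min(t1, n1 * (n1 + n2 + 1) - t1)


# Data from the Program section of this page (Conover, example 2).
t = rank_sum_statistic([1, 2, 3, 5], [4, 6, 7, 8, 9])
print(t)
```

For these data the smaller sample holds ranks 1, 2, 3 and 5, so T₁ = 11 and T = MIN(11, 4*10 - 11) = 11.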

Sufficiently small values of T cause rejection of the null hypothesis that the sample locations are equal. Significance levels have been tabulated for small values of N₁ and N₂. For sufficiently large N₁ and N₂, the following normal approximation is used:

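\( Z = \frac{|\mu - T| - 0.5}{\sigma} \)

where μ and σ are the mean and standard deviation of the rank sum T₁ under the null hypothesis:

\( \mu = \frac{N_1 (N_1 + N_2 + 1)}{2} \)

\( \sigma = \sqrt{\frac{N_1 N_2 (N_1 + N_2 + 1)}{12}} \)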
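As a rough illustration of the normal approximation, the following Python sketch evaluates Z for a given T. It is not Dataplot code; the function name rank_sum_z is hypothetical, and μ and σ are taken to be the usual null-hypothesis mean and standard deviation of a rank sum, written out in the comments.

```python
from math import sqrt


def rank_sum_z(t, n1, n2):
    """Continuity-corrected normal approximation for the rank sum statistic."""
    # Assumed null moments of the rank sum of the sample of size n1:
    #   mu    = n1 * (n1 + n2 + 1) / 2
    #   sigma = sqrt(n1 * n2 * (n1 + n2 + 1) / 12)
    mu = n1 * (n1 + n2 + 1) / 2.0
    sigma = sqrt(n1 * n2 * (n1 + n2 + 1) / 12.0)
    return (abs(mu - t) - 0.5) / sigma


# Arithmetic check only: with n1 = 4 and n2 = 5 the samples are far too
# small for the approximation to be recommended in practice.
z = rank_sum_z(11, 4, 5)
print(round(z, 3))
```

The 0.5 in the numerator is the continuity correction that appears in the formula for Z.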

Some analysts prefer a slightly different formulation of this test, the U statistic:

\( U = N_1 N_2 + 0.5 N_1(N_1 + 1) - T \)

This form of the statistic can be computed with the command (Syntax 1):

LET U = MANN WHITNEY U STATISTIC Y1 Y2
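For comparison, the arithmetic that converts T to U can be sketched in Python. This is only an illustration of the formula for U given above, not a description of how Dataplot implements the command; the function name mann_whitney_u is hypothetical.

```python
def mann_whitney_u(t, n1, n2):
    """Convert the rank sum statistic T to the U form of the statistic."""
    # U = N1*N2 + 0.5*N1*(N1 + 1) - T
    return n1 * n2 + 0.5 * n1 * (n1 + 1) - t


# T = 11 with N1 = 4 and N2 = 5 (the data in the Program section).
u = mann_whitney_u(11, 4, 5)
print(u)
```

With T = 11, N₁ = 4 and N₂ = 5 this gives U = 20 + 10 - 11 = 19, which agrees with the test statistic printed in the Program section.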

Dataplot uses Applied Statistics algorithm 62 (as updated by Alan Miller) to obtain the cumulative frequencies and the corresponding CDF values of the U test statistic.

That is, Syntax 1 is used to compute the value of the test statistic and Syntax 2 is used to obtain the CDF for the test statistic.

Syntax 1:

This syntax returns the value of the U version of the Mann-Whitney statistic.

Syntax 2:

This syntax returns the cumulative frequency table (and the corresponding CDF values) for the U version of the Mann Whitney statistic. Note that the table depends only on the sample sizes of the two variables, not on the data.
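To see why the table depends only on the sample sizes, the null distribution of U can be enumerated directly for small samples. The Python sketch below is a brute-force illustration, not Applied Statistics algorithm 62 and not Dataplot code; the function name u_null_distribution is hypothetical, and it assumes no ties, so that every assignment of N₁ of the N₁ + N₂ ranks to the first sample is equally likely.

```python
from itertools import combinations
from math import comb


def u_null_distribution(n1, n2):
    """Return (u_values, cumulative_counts, cdf) for U under the null hypothesis."""
    n = n1 + n2
    counts = [0] * (n1 * n2 + 1)
    # Each choice of n1 ranks out of 1..n is equally likely (no ties assumed).
    for ranks in combinations(range(1, n + 1), n1):
        # Rank sum minus its minimum possible value runs from 0 to n1*n2.
        u = sum(ranks) - n1 * (n1 + 1) // 2
        counts[u] += 1

    cumulative = []
    running = 0
    for c in counts:
        running += c
        cumulative.append(running)

    total = comb(n, n1)
    cdf = [c / total for c in cumulative]
    return list(range(n1 * n2 + 1)), cumulative, cdf


x, freq, cdf = u_null_distribution(4, 5)
for row in zip(x, freq, cdf):
    print(row)
```

Because the distribution of U is symmetric about N₁N₂/2, measuring U from the smallest or from the largest possible rank sum gives the same frequencies. Enumeration grows combinatorially with the sample sizes, so it is only practical for small N₁ and N₂.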

Examples:

LET N1 = SIZE Y1
LET N2 = SIZE Y2
LET X FREQ CDF = MANN WHITNEY U STATISTIC FREQUENCY N1 N2

Default:

None

Synonyms:

None

Related Commands:

RANK SUM TEST	= Compute a Mann Whitney rank sum test.
T-TEST	= Compute a t-test.
SIGN TEST	= Compute a sign test.
SIGNED RANK TEST	= Compute a signed rank test.
CHI-SQUARED TWO SAMPLE TEST	= Compute a two sample chi-square test.
BIHISTOGRAM	= Generate a bihistogram.
QUANTILE-QUANTILE PLOT	= Generate a quantile-quantile plot.

Reference:

Conover (1999), "Practical Nonparametric Statistics," Third Edition, Wiley, pp. 272-281.

Snedecor and Cochran (1989), "Statistical Methods," Eighth Edition, Iowa State University Press, pp. 142-144.

Applications:

Non-Parametric Analysis, Two Sample Tests

Implementation Date:

2011/5

Program:

 
. Step 1: Read Data (example 2 from pp. 278-279 of Conover)
.
let y1 = data 1 2 3 5
let y2 = data 4 6 7 8 9
.
. Step 2: Compute the U statistic and the frequency/CDF table
.
set write decimals 3
let u = mann whitney u statistic y1 y2
let n1 = size y1
let n2 = size y2
let x freq cdf = mann whitney u statistic frequency n1 n2
print "Test Statistic = ^u"
print x freq cdf

Test Statistic = 19
  
 ---------------------------------------------
               X           FREQ            CDF
 ---------------------------------------------
           0.000          1.000          0.007
           1.000          2.000          0.015
           2.000          4.000          0.031
           3.000          7.000          0.055
           4.000         12.000          0.095
           5.000         18.000          0.142
           6.000         26.000          0.206
           7.000         35.000          0.277
           8.000         46.000          0.365
           9.000         57.000          0.452
          10.000         69.000          0.547
          11.000         80.000          0.634
          12.000         91.000          0.722
          13.000        100.000          0.793
          14.000        108.000          0.857
          15.000        114.000          0.904
          16.000        119.000          0.944
          17.000        122.000          0.968
          18.000        124.000          0.984
          19.000        125.000          0.992
          20.000        126.000          1.000