SED navigation bar go to SED home page go to Dataplot home page go to NIST home page SED Home Page SED Contacts SED Projects SED Products and Publications Search SED Pages
Dataplot Vol 1 Auxiliary Chapter

ANDERSON DARLING K SAMPLE TEST

Name:
    ANDERSON DARLING K SAMPLE TEST
Type:
    Analysis Command
Purpose:
    The k-sample Anderson-Darling test is a nonparametric statistical procedure that tests the hypothesis that the populations from which two or more groups of data were drawn are identical. Each group should be an independent random sample from a population.
Description:
    This test is part of the MIL-HDBK-17 standard. In the terminology of MIL-HDBK-17, data can be either "structured" (i. e., groups) or "unstructured" (i.e., ungrouped). Unstructured data can often be simpler to analyze. Therefore the Anderson-Darling k-sample test is used to determine if a structured data set can in fact be treated as an unstructured data set. Dataplot supports most of the techniques in chapter 8 of the MIL-HDBK-17 as support for the RECIPE analysis.

    The k-sample Anderson-Darling statistic is

      ADK = (n-1)/(n**2*(k-1))*SUM[i=1][k][(1/n(i))*SUM1]
 SUM1  = SUM[j=1][L][h(j)*NUM/DEN]
 NUM   = (n*F(ij) - n(i)*H(j))**2
 DEM   = H(j)(n - H(j)) - n*h(j)/4

    where

    • hj = the number of values in the combined samples equal to zj
    • Hj = the number of values in the combined samples less than zj plus one half the number of of values in the combined samples equal to zj
    • Fij = the number of values in the ith group which are less than zj plus one half the number of values in this group which are equal to zj

    where k is the number of samples (groups), ni is the number of observations in group i, xij is the jth observation in the ith group, and z1, z2 ..., zL are the distinct values in the combined data set ordered from smallest to largest (L is less than n if there are tied observations).

    Chapter 8 of the MIL-HDBK-17 derives the formulas for the critical values of the Anderson-Darling test statistic. These formulas are rather involved and not given here.

    Dataplot uses the ANDYK routine from the MIL-HDBK-17 to compute the Anderson-Darling k sample test.

Syntax:
    ANDERSON DARLING K SAMPLE TEST <y> <groupid> <SUBSET/EXCEPT/FOR qualification>
    where <y> is the response variable;
              <groupid> is group (sample) identifier variable;
    and where the <SUBSET/EXCEPT/FOR qualification> is optional.
Examples:
    ANDERSON DARLING K SAMPLE TEST Y1 GROUP
    ANDERSON DARLING K SAMPLE TEST Y1 GROUP SUBSET GROUP > 2
Default:
    None
Synonyms:
    ANDERSON DARLING K SAMPLE is a synonym for ANDERSON DARLING K SAMPLE TEST.
Related Commands:
    LEVENE TEST = Compute Levene's test.
    BOX PLOT = Generate a box plot.
    RECIPE FIT = Perform a RECIPE analysis.
    GRUBBS TEST = Compute a Grubbs test for outliers.
    ANDERSON DARLING TEST = Anderson-Darling one sample goodness of fit test.
Reference:
    "MIL-HDBK-17 Volume 1: Guidelines for Characterization of Structural Materials", Depeartment of Defense, chapter 8. The URL for MIL-HDBK-17 is http://mil-17.udel.edu/.
Applications:
    Tolerance Studies in Reliability of Materials
Implementation Date:
    1998/6
Program:
    SKIP 25
    READ VANGEL32.DAT Y X B
    ANDERSON DARLING K SAMPLE TEST Y X

    The following output is generated:

    
      
               **********************************************
               **      ANDERSON DARLING K SAMPLE TEST Y X  **
               **********************************************
          
          
                       ANDERSON-DARLING K-SAMPLE TEST FOR COMMON GROUPS
          
         1. STATISTICS
               NUMBER OF OBSERVATIONS                     =       45
               NUMBER OF GROUPS                           =        3
               ANDERSON-DARLING K-SAMPLE TEST STATISTIC   =    155.3625
          
         2. FOR ANDERSON-DARLING K-SAMPLE TEST STATISTIC
               95         % POINT                         =    1.992743
          
         3. CONCLUSION (AT THE 5% LEVEL):
               THE GROUPS ARE SIGNIFICANTLY DIFFERENT.
               THUS: GROUPS SHOULD BE TREATED AS STRUCTURED.
        

Date created: 6/5/2001
Last updated: 4/4/2003
Please email comments on this WWW page to alan.heckert@nist.gov.