SED navigation bar go to SED home page go to Dataplot home page go to NIST home page SED Home Page SED Contacts SED Projects SED Products and Publications Search SED Pages
Dataplot Vol 2 Auxiliary Chapter

BIWEIGHT MIDCOVARIANCE

Name:
    BIWEIGHT MIDCOVARIANCE (LET)
Type:
    Let Subcommand
Purpose:
    Compute the biweight midcovariance between two variables.
Description:
    Mosteller and Tukey (see Reference section below) define two types of robustness:

    1. resistance means that changing a small part, even by a large amount, of the data does not cause a large change in the estimate

    2. robustness of efficiency means that the statistic has high efficiency in a variety of situations rather than in any one situation. Efficiency means that the estimate is close to optimal estimate given that we know what distribution that the data comes from. A useful measure of efficiency is:

        Efficiency = (lowest variance feasible)/ (actual variance)

    Many statistics have one of these properties. However, it can be difficult to find statistics that are both resistant and have robustness of efficiency.

    The standard covariance estimate is the optimal estimator for Gaussian data. However, it is not resistant and it does not have robustness of efficiency. The rank covariance statistic is one example of a robust estimate of correlation.

    The biweight midcovariance estimator is both resistant and robust of efficiency. Mosteller and Tukey recommend using the MAD or interquartile range for exploratory work where moderate efficiency in a variety of situations is adequate. The biweight midcovariance estimator can be considered for situations where high performance is needed.

    The biweight midcovariance estimate is defined as:

      s(bxy)**2 = {n*SUM[a(i)*(x(i)-Mx)*(1 - u(i)**2)**2*
 SUM[b(i)*(y(i)-My)*(1 - v(i)**2)**2}/
 {[SUM[a(i)*(1-u(i)**2)*(1-5*u(i)**2)]*
 [SUM[b(i)*(1-v(i)**2)*(1-5*v(i)**2)]}

    where

      Mx = median X
      My = median Y
      ui = (Xi - Mx)/[9*MAD]
      vi = (Yi - My)/[9*MAD]
      ai = 1 if -1 <= ui <= 1
      bi = 1 if -1 <= vi <= 1
      MAD = median absolute deviation
Syntax:
    LET <par> = BIWEIGHT MIDCOVARIANCE <y1> <y2>
                                <SUBSET/EXCEPT/FOR qualification>
    where <y1> is the first response variable;
                  <y2> is the second response variable;
                  <par> is a parameter where the computed biweight midcovariance is stored;
    and where the <SUBSET/EXCEPT/FOR qualification> is optional.
Examples:
    LET A = BIWEIGHT MIDCOVARIANCE Y1
    LET A = BIWEIGHT MIDCOVARIANCE Y1 SUBSET TAG > 2
Note:
    Support for the biweight midcovariance has been added to the following plots and commands:

      BIWEIGHT MIDCOVARIANCE PLOT
      CROSS TABULATE BIWEIGHT MIDCOVARIANCE PLOT
      BOOTSTRAP BIWEIGHT MIDCOVARIANCE PLOT
      JACKNIFE BIWEIGHT MIDCOVARIANCE PLOT
      BIWEIGHT MIDCOVARIANCE INTERACTION STATISTIC PLOT
Note:
    The COVARIANCE MATRIX command generates pairwise covariance estimates of the columns in a matrix. By default, this command generates the standard covariance estimate. The command

      SET COVARIANCE TYPE <type>

    can be used to specify an alternate covariance measure to compute in the COVARIANCE MATRIX command. The following types are supported:

      DEFAULT - use the standard estimate
      BIWEIGHT - use the biweight midcovariance estimate
      WINSOR - use the Winsorized covariance estimate
      RANK - use the rank covariance estimate
Default:
    None
Synonyms:
    None
Related Commands:
    BIWEIGHT MIDCORRELATION = Compute a biweight midcocorelation estimate between two variable.
    BIWEIGHT MIDVARIANCE = Compute a biweight midvariance estimate of a variable.
    BIWEIGHT SCALE = Compute a biweight scale estimate of a variable.
    BIWEIGHT LOCATION = Compute a biweight location estimate of a variable.
    BIWEIGHT CONFIDENCE LIMITS = Compute a biweight based confidence interval.
    WINSORIZED COVARIANCE = Compute the Winsorized covariance between two variables.
    COVARIANCE = Compute the covariance between two variables.
    RANK COVARIANCE = Compute the rank covariance between two variables.
    STATISTIC PLOT = Generate a statistic versus group plot for a given statistic.
    CROSS TABULATE PLOT = Generate a statistic versus group plot for a given statistic and two group variables.
    BOOTSTRAP PLOT = Generate a bootstrap plot for a given statistic.
    INTERACTION STATISTIC PLOT = Generate an interaction plot for a given statistic.
Reference:
    "Introduction to Robust Estimation and Hypothesis Testing", Rand Wilcox, Academic Press, 1997.

    "Data Analysis and Regression: A Second Course in Statistics", Mosteller and Tukey, Addison-Wesley, 1977, pp. 203-209.

Applications:
    Robust Data Analysis
Implementation Date:
    2002/7
Program 1:
    SKIP 25
    READ IRIS.DAT Y1 Y2 Y3 Y4 X
    LET M = CREATE MATRIX Y1 Y2 Y3 Y4
    SET COVARIANCE TYPE BIWEIGHT
    LET B = COVARIANCE MATRIX Y1 Y2 Y3 Y4
        
Program 2:
     
    SKIP 25
    READ IRIS.DAT Y1 Y2 Y3 Y4 X
    .
    MULTIPLOT CORNER COORDINATES 0 0 100 95
    MULTIPLOT 2 1
    BOOTSTRAP SAMPLES 500
    BOOTSTRAP BIWEIGHT MIDCOVARIANCE PLOT Y1 Y2
    X1LABEL B025 = ^B025, B975=^B975
    HISTOGRAM YPLOT
    END OF MULTIPLOT
    MOVE 50 96
    JUSTIFICATION CENTER
    TEXT BIWEIGHT MIDCOVARIANCE BOOTSTRAP: IRIS DATA
        
    plot generated by sample program

Date created: 7/22/2002
Last updated: 4/4/2003
Please email comments on this WWW page to alan.heckert@nist.gov.