STANDARDIZE

Name:

STANDARDIZE (LET) Type:

Let Subcommand Purpose:

Standardize, i.e., subtract the mean and divide by the standard deviation, a variable. Description:

This command provides additional flexibility in that either one or two group id variables can also be specified. That is, if one group id variable is given, the mean and standard deviation is computed for each group and the data values are standardized by the corresponding group mean and standard deviation. Likewise, if two group variables are specified, then a mean and standard deviaiton are computed for each cell of the cross tabulation and the data values are standardized by the corresponding cell mean and standard deviaition.

You can specify several alternative measures to the mean for the location statistic and several alternative measures to the standard deviaition for the scale statistic. See the Note below for details. In addition, you can choose to standardize only by location (i.e., subtract the mean but do not divide by the standard deviation) or only by scale.

You can also specifically specify a z-score or u-score. A z-score subtracts the mean and divides by the standard deviation (i.e,, it scales to a standard normal distribution). Similarly, the u-score subtracts the minimum and divides by the range. That is, it creates a standard uniform random variable (i.e., the data is scaled to a range between 0 and 1). If a z-score or u-score is explicitly requested, the settings for the SET LOCATION STATISTIC and SET SCALE STATISTIC (see Note below) are ignored.

Syntax 1:

This syntax standardizes (with respect to both location and scale) the variable with no groups.

Syntax 2:

This syntax standardizes the variable (with respect to location only) with no groups.

Syntax 3:

This syntax standardizes the variable (with respect to scale only) with no groups.

Syntax 4:

This syntax specifically computes a z-score.

Syntax 5:

This syntax specifically computes a u-score.

Syntax 6:

This syntax standardizes (with respect to both location and scale) the variable with one group variable.

Syntax 7:

This syntax standardizes the variable (with respect to location only) with one group variable.

Syntax 8:

This syntax standardizes the variable (with respect to scale only) with one group variable.

Syntax 9:

This syntax computes a z-score with one group variable.

Syntax 10:

This syntax computes a u-score with one group variable.

Syntax 11:

This syntax standardizes (with respect to both location and scale) the variable with two group variable.

Syntax 12:

This syntax standardizes the variable (with respect to location only) with two group variable.

Syntax 13:

This syntax standardizes the variable (with respect to scale only) with two group variable.

Syntax 14:

This syntax computes a z-score with two group variables.

Syntax 14:

This syntax computes a u-score with two group variables.

Examples:

SET LOCATION STATISTIC MEDIAN
SET SCALE STATISTIC MAD
LET Y2 = STANDARDIZE Y1 X1 X2

Note:

To set the location measure, enter the command

SET LOCATION STATISTIC <MEAN/MEDIAN/MIDMEAN/TRIMMED MEAN/ WINSORIZED MEAN/MIDRANGE/HARMONIC MEAN/GEOMETRIC MEAN>

To set the scale measure, enter the command

SET SCALE STATISTIC <SD/MAD/AAD/INTERQUARTILE RANGE/ GEOMETRIC SD>

Here, SD is the standard deviation, MAD is the median absolute deviation, and AAD is the average absolute deviaiton.

Note the using the ZSCORE or USCORE syntax overrides the settings specified by these SET commands. That is, ZSCORE always uses the mean and standard deviation and USCORE always uses the minimum and the range.

Default:

The default location statistic is the mean and the default scale statistic is the standard deviation. Synonyms:

IQ RANGE is a synonym for INTERQUARTILE RANGE. Related Commands:

MEAN PLOT	= Generate a mean vs. subset plot.
SD PLOT	= Generate a standard deviation vs. subset plot.
TABULATE	= Compute group statistics (one group variable).
CROSS TABULATE	= Compute group statistics (two group variables).
MEDIAN	= Compute the median.
MIDDMEAN	= Compute the midmean.
TRIMMED MEAN	= Compute the trimmed mean.
SD	= Compute the standard deviation.
AAD	= Compute the average absolute deviation.
MAD	= Compute the median absolute deviation.

Applications:

Data Analysis Implementation Date:

2001/9: Following updates made

Additional location statistics added: MINIMUM, HARMONIC MEAN, GEOMETRIC MEAN, WINSORIZED MEAN, MIDRANGE
Additional scale statistics added: IQ RANGE, GEOMETRIC SD
USCORE option added
SCALE STANDARDIZE option added

Program 1:

Program 2:

Program 3:

Program 4:

Date created: 6/5/2001
Last updated: 4/4/2003
Please email comments on this WWW page to alan.heckert@nist.gov.