|
6. Process or Product Monitoring and Control 6.5. Tutorials 6.5.2. What to do when data are non-normal |
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Often it is possible to transform non-normal data into approximately normal data |
Non-normality is a way of life, since no characteristic
(height, weight, etc.) will have exactly a normal
distribution. One strategy to make non-normal data resemble normal
data is by using a transformation. There is no dearth of transformations
in statistics; the issue is which one to select for the situation at hand.
Unfortunately, the choice of the "best" transformation is generally not
obvious.
This was recognized in 1964 by G.E.P. Box and D.R. Cox. They wrote a paper in which a useful family of power transformations was suggested. These transformations are defined only for positive data values. This should not pose any problem because a constant can always be added if the set of observations contains one or more negative values. |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| The Box-Cox Transformation |
Given the vector of data observations
x = x1, x2,
...xn, one way to select the power
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| The logarithm of the likelihood function |
is the arithmetic mean of the transformed data. |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Confidence bound for
|
In addition, a confidence bound (based on the
likelihood ratio
statistic) can be constructed for
as follows: A set of
values that represent an approximate
100(1- )%
confidence bound for
is formed from those
that satisfy
denotes the maximum likelihood estimator for
and
)
percentile of the chi-square distribution with 1 degree of freedom.
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Example of the Box-Cox scheme | To illustrate the procedure, we used the data from Johnson and Wichern's textbook (Prentice Hall 1988), Example 4.14. The observations are microwave radiation measurements. | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Sample data |
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Table of log-likelihood values for various values of
|
The values of the log-likelihood function obtained by varying
from -2.0 to 2.0 are given below.
This table shows that
The Box-Cox transform is also discussed in Chapter 1 under the
Box Cox Linearity Plot
and the
Box Cox Normality Plot.
The Box-Cox normality plot discussion provides a graphical
method for choosing
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||