Next Page Previous Page Home Tools & Aids Search Handbook
1. Exploratory Data Analysis
1.3. EDA Techniques
1.3.5. Quantitative Techniques
1.3.5.18. Yates Analysis

1.3.5.18.2.

Important Factors

Identify Important Factors The Yates analysis generates a large number of potential models. From this list, we want to select the most appropriate model. This requires balancing the following two goals.
  1. We want the model to include all important factors.
  2. We want the model to be parsimonious. That is, the model should be as simple as possible.
In short, we want our model to include all the important factors and interactions and to omit the unimportant factors and interactions.

Seven criteria are utilized to define important factors. These seven criteria are not all equally important, nor will they yield identical subsets, in which case a consensus subset or a weighted consensus subset must be extracted. In practice, some of these criteria may not apply in all situations.

These criteria will be examined in the context of the Eddy current data set. The Yates Analysis page gave the sample Yates output for these data and the Defining Models and Predictions page listed the potential models from the Yates analysis.

In practice, not all of these criteria will be used with every analysis (and some analysts may have additional criteria). These critierion are given as useful guidelines. Mosts analysts will focus on those criteria that they find most useful.

Criteria for Including Terms in the Model The seven criteria that we can use in determining whether to keep a factor in the model can be summarized as follows.
  1. Effects: Engineering Significance
  2. Effects: Order of Magnitude
  3. Effects: Statistical Significance
  4. Effects: Probability Plots
  5. Averages: Youden Plot
  6. Residual Standard Deviation: Engineering Significance
  7. Residual Standard Deviation: Statistical Significance
The first four criteria focus on effect estimates with three numeric criteria and one graphical criteria. The fifth criteria focuses on averages. The last two criteria focus on the residual standard deviation of the model. We discuss each of these seven criteria in detail in the following sections.

The last section summarizes the conclusions based on all of the criteria.

Effects: Engineering Significance The minimum engineering significant difference is defined as
    |betahati|> delta
where absolute value of betahat is the absolute value of the parameter estimate (i.e., the effect) and delta is the minimum engineering significant difference.

That is, declare a factor as "important" if the effect is greater than some a priori declared engineering difference. This implies that the engineering staff have in fact stated what a minimum effect will be. Oftentimes this is not the case. In the absence of an a priori difference, a good rough rule for the minimum engineering significant delta is to keep only those factors whose effect is greater than, say, 10% of the current production average. In this case, let's say that the average detector has a sensitivity of 2.5 ohms. This would suggest that we would declare all factors whose effect is greater than 10% of 2.5 ohms = 0.25 ohm to be significant (from an engineering point of view).

Based on this minimum engineering significant difference criterion, we conclude that we should keep two terms: X1 and X2.

Effects: Order of Magnitude The order of magnitude criterion is defined as
    absolute value of effect i < 0.10*maximum absolute value effect
That is, exclude any factor that is less than 10% of the maximum effect size. We may or may not keep the other factors. This criterion is neither engineering nor statistical, but it does offer some additional numerical insight. For the current example, the largest effect is from X1 (3.10250 ohms), and so 10% of that is 0.31 ohms, which suggests keeping all factors whose effects exceed 0.31 ohms.

Based on the order-of-magnitude criterion, we thus conclude that we should keep two terms: X1 and X2. A third term, X2*X3 (.29750), is just slightly under the cutoff level, so we may consider keeping it based on the other criterion.

Effects: Statistical Significance Statistical significance is defined as
    |betahat(i)| > 2*sd(betahat(i)) = 2*(2*sigma/sqrt(n))
That is, declare a factor as important if its effect is more than 2 standard deviations away from 0 (0, by definition, meaning "no effect").

The "2" comes from normal theory (more specifically, a value of 1.96 yields a 95% confidence interval). More precise values would come from t-distribution theory.

The difficulty with this is that in order to invoke this criterion we need the standard deviation, sigma, of an observation. This is problematic because

  1. the engineer may not know sigma;
  2. the experiment might not have replication, and so a model-free estimate of sigma is not obtainable;
  3. obtaining an estimate of sigma by assuming the sometimes- employed assumption of ignoring 3-term interactions and higher may be incorrect from an engineering point of view.
For the Eddy current example:
  1. the engineer did not know sigma;
  2. the design (a 23 full factorial) did not have replication;
  3. ignoring 3-term interactions and higher interactions leads to an estimate of sigma based on omitting only a single term: the X1*X2*X3 interaction.
For the current example, if one assumes that the 3-term interaction is nil and hence represents a single drawing from a population centered at zero, then an estimate of the standard deviation of an effect is simply the estimate of the 3-factor interaction (0.1425). In the Dataplot output for our example, this is the effect estimate for the X1*X2*X3 interaction term (the EFFECT column for the row labeled "123"). Two standard deviations is thus 0.2850. For this example, the rule is thus to keep all absolute value of betai > 0.2850.

This results in keeping three terms: X1 (3.10250), X2 (-.86750), and X1*X2 (.29750).

Effects: Probability Plots Probability plots can be used in the following manner.
  1. Normal Probability Plot: Keep a factor as "important" if it is well off the line through zero on a normal probability plot of the effect estimates.

  2. Half-Normal Probability Plot: Keep a factor as "important" if it is well off the line near zero on a half-normal probability plot of the absolute value of effect estimates.
Both of these methods are based on the fact that the least squares estimates of effects for these 2-level orthogonal designs are simply the difference of averages and so the central limit theorem, loosely applied, suggests that (if no factor were important) the effect estimates should have approximately a normal distribution with mean zero and the absolute value of the estimates should have a half-normal distribution.

Since the half-normal probability plot is only concerned with effect magnitudes as opposed to signed effects (which are subject to the vagaries of how the initial factor codings +1 and -1 were assigned), the half-normal probability plot is preferred by some over the normal probability plot.

Normal Probablity Plot of Effects and Half-Normal Probability Plot of Effects The following half-normal plot shows the normal probability plot of the effect estimates and the half-normal probability plot of the absolute value of the estimates for the Eddy current data.

probability plots indicate that model should include main effects for factor 1 and factor 2 with no interaction terms

For the example at hand, both probability plots clearly show two factors displaced off the line, and from the third plot (with factor tags included), we see that those two factors are factor 1 and factor 2. All of the remaining five effects are behaving like random drawings from a normal distribution centered at zero, and so are deemed to be statistically non-significant. In conclusion, this rule keeps two factors: X1 (3.10250) and X2 (-.86750).

Effects: Youden Plot A Youden plot can be used in the following way. Keep a factor as "important" if it is displaced away from the central-tendancy "bunch" in a Youden plot of high and low averages. By definition, a factor is important when its average response for the low (-1) setting is significantly different from its average response for the high (+1) setting. Conversely, if the low and high averages are about the same, then what difference does it make which setting to use and so why would such a factor be considered important? This fact in combination with the intrinsic benefits of the Youden plot for comparing pairs of items leads to the technique of generating a Youden plot of the low and high averages.
Youden Plot of Effect Estimatess The following is the Youden plot of the effect estimatess for the Eddy current data.

Youden plot show factor 1 and factor 2 main effects stand out

For the example at hand, the Youden plot clearly shows a cluster of points near the grand average (2.65875) with two displaced points above (factor 1) and below (factor 2). Based on the Youden plot, we conclude to keep two factors: X1 (3.10250) and X2 (-.86750).

Residual Standard Deviation: Engineering Significance This criterion is defined as
    Residual Standard Deviation > Cutoff
That is, declare a factor as "important" if the cumulative model that includes the factor (and all larger factors) has a residual standard deviation smaller than an a priori engineering-specified minimum residual standard deviation.

This criterion is different from the others in that it is model focused. In practice, this criterion states that starting with the largest effect, we cumulatively keep adding terms to the model and monitor how the residual standard deviation for each progressively more complicated model becomes smaller. At some point, the cumulative model will become complicated enough and comprehensive enough that the resulting residual standard deviation will drop below the pre-specified engineering cutoff for the residual standard deviation. At that point, we stop adding terms and declare all of the model-included terms to be "important" and everything not in the model to be "unimportant".

This approach implies that the engineer has considered what a minimum residual standard deviation should be. In effect, this relates to what the engineer can tolerate for the magnitude of the typical residual (= difference between the raw data and the predicted value from the model). In other words, how good does the engineer want the prediction equation to be. Unfortunately, this engineering specification has not always been formulated and so this criterion can become moot.

In the absence of a prior specified cutoff, a good rough rule for the minimum engineering residual standard deviation is to keep adding terms until the residual standard deviation just dips below, say, 5% of the current production average. For the Eddy current data, let's say that the average detector has a sensitivity of 2.5 ohms. Then this would suggest that we would keep adding terms to the model until the residual standard deviation falls below 5% of 2.5 ohms = 0.125 ohms.

Based on the minimum residual standard deviation criteria, and by scanning the far right column of the Yates table, we would conclude to keep the following terms:

  1. X1
(with a cumulative residual standard deviation = 0.57272)
  1. X2
(with a cumulative residual standard deviation = 0.30429)
  1. X2*X3
(with a cumulative residual standard deviation = 0.26737)
  1. X1*X3
(with a cumulative residual standard deviation = 0.23341)
  1. X3
(with a cumulative residual standard deviation = 0.19121)
  1. X1*X2*X3
(with a cumulative residual standard deviation = 0.18031)
  1. X1*X2
(with a cumulative residual standard deviation = 0.00000)

Note that we must include all terms in order to drive the residual standard deviation below 0.125. Again, the 5% rule is a rough-and-ready rule that has no basis in engineering or statistics, but is simply a "numerics". Ideally, the engineer has a better cutoff for the residual standard deviation that is based on how well he/she wants the equation to peform in practice. If such a number were available, then for this criterion and data set we would select something less than the entire collection of terms.

Residual Standard Deviation: Statistical Significance This criterion is defined as
    Residual Standard Deviation > sigma
where sigma is the standard deviation of an observation under replicated conditions.

That is, declare a term as "important" until the cumulative model that includes the term has a residual standard deviation smaller than sigma. In essence, we are allowing that we cannot demand a model fit any better than what we would obtain if we had replicated data; that is, we cannot demand that the residual standard deviation from any fitted model be any smaller than the (theoretical or actual) replication standard deviation. We can drive the fitted standard deviation down (by adding terms) until it achieves a value close to sigma, but to attempt to drive it down further means that we are, in effect, trying to fit noise.

In practice, this criterion may be difficult to apply because

  1. the engineer may not know sigma;
  2. the experiment might not have replication, and so a model-free estimate of sigma is not obtainable.

For the current case study:

  1. the engineer did not know sigma;
  2. the design (a 23 full factorial) did not have replication. The most common way of having replication in such designs is to have replicated center points at the center of the cube ((X1,X2,X3) = (0,0,0)).

Thus for this current case, this criteria could not be used to yield a subset of "important" factors.

Conclusions In summary, the seven criteria for specifying "important" factors yielded the following for the Eddy current data:

  1. Effects, Engineering Significance:
X1, X2
  1. Effects, Numerically Significant:
X1, X2
  1. Effects, Statistically Significant:
X1, X2, X2*X3
  1. Effects, Probability Plots:
X1, X2
  1. Averages, Youden Plot:
X1, X2
  1. Residual SD, Engineering Significance:
all 7 terms
  1. Residual SD, Statistical Significance:
not applicable

Such conflicting results are common. Arguably, the three most important criteria (listed in order of most important) are:

  1. Effects, Probability Plots:
X1, X2
  1. Effects, Engineering Significance:
X1, X2
  1. Residual SD, Engineering Significance:
all 7 terms

Scanning all of the above, we thus declare the following consensus for the Eddy current data:

  1. Important Factors: X1 and X2
  2. Parsimonious Prediction Equation:

      Yhat = 2.65875 + 1/2 (3.10250*X1 - .86750*X2)

    (with a residual standard deviation of .30429 ohms)

Note that this is the initial model selection. We still need to perform model validation with a residual analysis.
Home Tools & Aids Search Handbook Previous Page Next Page