7. Product and Process Comparisons
7.2. Comparisons based on data from one process
7.2.4. Does the proportion of defectives meet requirements?
7.2.4.1. Confidence intervals
|
|
Confidence intervals using the method of Agresti and Coull
|
The method recommended by
Agresti and Coull (1998)
and also by
Brown, Cai and DasGupta (2001)
(the methodology was originally developed by Wilson in 1927)
is to use the form of the confidence interval that corresponds to the
hypothesis test given in Section 7.2.4.
That is, solve for the two values of p0 (say,
pupper and plower) that result
from setting z =
and solving for p0 = pupper, and
then setting z =
-
and solving for p0 = plower.
(Here, as in Section 7.2.4,
denotes the variate value from the
standard normal
distribution such that the area to the right of the value is
/2.) Although
solving for the two values of p0 might sound
complicated, the appropriate expressions can be obtained by
straightforward but slightly tedious algebra. Such algebraic
manipulation isn't necessary, however, as the appropriate expressions
are given in various sources. Specifically, we have
|
|
Formulas for the confidence intervals
|
|
|
Procedure does not strongly depend on values of p and n
|
This approach can be substantiated on the grounds that it is the
exact algebraic counterpart to the (large-sample) hypothesis test
given in section 7.2.4 and is also supported by the research of
Agresti and Coull. One advantage of this procedure is that its
worth does not strongly depend upon the value of n and/or
p, and indeed was recommended by Agresti and Coull for
virtually all combinations of n and p.
|
|
Another advantage is that the lower limit cannot be negative
|
Another advantage is that the lower limit cannot be negative. That is
not true for the confidence expression most frequently used:
A confidence limit approach that produces a lower limit which is an
impossible value for the parameter for which the interval is
constructed is an inferior approach. This also applies to limits
for the control charts that are discussed in Chapter 6.
|
|
One-sided confidence intervals
|
A one-sided confidence interval can also be constructed simply by
replacing each
by
in the
expression for the lower or upper limit, whichever is desired. The
95% one-sided interval for p for the example in the
preceding section is:
|
|
Example
|
|
|
Conclusion from the example
|
Since the lower bound does not exceed 0.10, in which case it would
exceed the hypothesized value, the null hypothesis that the
proportion defective is at most .10, which was given in the
preceding section, would not be rejected if we used the confidence
interval to test the hypothesis. Of course a confidence interval
has value in its own right and does not have to be used for
hypothesis testing.
|
|
|
Exact Intervals for Small Numbers of Failures
and/or Small Sample Sizes
|
|
Constrution of exact two-sided confidence
intervals based on the binomial distribution
|
If the number of failures is very small or if the sample size
N is very small, symmetical confidence limits that are
approximated using the normal distribution may not be accurate enough
for some applications. An exact method based on the binomial
distribution is shown next. To construct a two-sided confidence
interval at the 100(1 - )%
confidence level for the true proportion defective p where
Nd defects are found in a sample of size N
follow the steps below.
- Solve the equation
for pU to obtain the upper
100(1 - )%
limit for p.
- Next solve the equation
for pL to obtain the lower
100(1 - )%
limit for p.
|
|
Note
|
The interval {pL, pU} is an exact
100(1 - )%
confidence interval for p. However, it is not symmetric about the
observed proportion defective,
.
|
|
Example of calculation of upper limit for binomial confidence
intervals using EXCEL
|
The equations above that determine pL and
pU can easily be solved using functions built into
EXCEL. Take as an example the situation where twenty units are sampled
from a continuous production line and four items are found to be
defective. The proportion defective is estimated to be
= 4/20 = 0.20.
The calculation of a 90% confidence interval for the true
proportion defective, p, is demonstrated using EXCEL spreadsheets.
|
|
Upper confidence limit from EXCEL
|
To solve for pU:
- Open an EXCEL spreadsheet and put the starting value of 0.5 in
the A1 cell.
- Put =BINOMDIST(Nd, N, A1, TRUE)
in B1, where Nd = 4 and N = 20.
- Open the Tools menu and click on GOAL SEEK. The GOAL SEEK box
requires 3 entries./li>
- B1 in the "Set Cell" box
/2
= 0.05 in the "To Value" box
- A1 in the "By Changing Cell" box.
The picture below shows the steps in the procedure.
|