|
2. Measurement Process Characterization 2.3. Calibration 2.3.6. Instrument calibration over a regime 2.3.6.5. Data analysis and model validation |
|
| First step - plot the calibration data | If the model for the calibration curve is not known from theoretical considerations or experience, it is necessary to identify and validate a model for the calibration curve. To begin this process, the calibration data are plotted as a function of known values of the reference standards; this plot should suggest a candidate model for describing the data. A linear model should always be a consideration. If the responses and their known values are in the same units, a plot of differences between responses and known values is more informative than a plot of the data for exposing structure in the data. |
| Warning - regarding statistical software | Once an initial model has been chosen, the coefficients in the model are estimated from the data using a statistical software package. It is impossible to over-emphasize the importance of using reliable and documented software for this analysis. |
| Output required from a software package |
With the exception of non-linear models, the software package will use
the method of least squares
for estimating the coefficients. The software package should also be
capable of performing a 'weighted' fit for situations where
errors of measurement are
non-constant over the calibration interval. The
choice of weights
is usually the responsibility of the user. The software package
should, at the minimum, provide the following information:
|
| Typical analysis of a quadratic fit | The following output is from the statistical software package, Dataplot where load cell measurements are modeled as a quadratic function of known loads. There are 3 repetitions at each load level for a total of 33 measurements. The commands |
| Run software macro |
read loadcell.dat x y quadratic fit y xreturn the following output: |
| F-ratio for judging the adequacy of the model. |
LACK OF FIT F-RATIO = 0.3482 = THE 6.3445% POINT OF THE F DISTRIBUTION WITH 8 AND 22 DEGREES OF FREEDOM |
| Coefficients and their standard deviations and associated t values |
COEFFICIENT ESTIMATES ST. DEV. T VALUE 1 a -0.183980E-04 (0.2450E-04) -0.75 2 b 0.100102 (0.4838E-05) 0.21E+05 3 c 0.703186E-05 (0.2013E-06) 35. RESIDUAL STANDARD DEVIATION = 0.0000376353 RESIDUAL DEGREES OF FREEDOM = 30Note: The T-VALUE for a coefficient in the table above is the estimate of the coefficient divided by its standard deviation. |
| The F-ratio is used to test the goodness of the fit to the data | The F-ratio provides information on the model as a good descriptor of the data. The F-ratio is compared with a critical value from the F-table. An F-ratio smaller than the critical value indicates that all significant structure has been captured by the model. |
| F-ratio < 1 always indicates a good fit |
For the load cell analysis, a plot of the
data suggests a linear fit. However, the linear fit gives a very
large F-ratio. For the quadratic fit, the F-ratio = 0.3482 with
v1 = 8 and v2 = 20 degrees of freedom. The critical value
of F(0.05, 8, 20) = 2.45 indicates that the quadratic function is sufficient
for describing the data. A fact to keep in mind is that an F-ratio < 1
does not need to be checked against a critical value; it always
indicates a good fit to the data.
Note: Dataplot reports a probability associated with the F-ratio (6.334%), where a probability > 95% indicates an F-ratio that is significant at the 5% level. Other software may report in other ways; therefore, it is necessary to check the interpretation for each package. |
| The t-values are used to test the significance of individual coefficients | The t-values can be compared with critical values from a t-table. However, for a test at the 5% significance level, a t-value < 2 is a good indicator of non-significance. The t-value for the intercept term, a, is < 2 indicating that the intercept term is not significantly different from zero. The t-values for the linear and quadratic terms are significant indicating that these coefficients are needed in the model. If the intercept is dropped from the model, the analysis is repeated to obtain new estimates for the coefficients, b and c. |
| Residual standard deviation | The residual standard deviation estimates the standard deviation of a single measurement with the load cell. |
| Further considerations and tests of assumptions |
The residuals (differences between the measurements and their fitted
values) from the fit should also be examined for
outliers and
structure that might
invalidate the calibration curve. They are also a good indicator of
whether basic assumptions of
normality and equal precision for all measurements are valid.
If the initial model proves inappropriate for the data, a strategy for improving the model is followed. |