1.
Exploratory Data Analysis
1.4. EDA Case Studies


Anscombe, F. (1973), Graphs in Statistical Analysis, The American Statistician, pp. 195199. 

Anscombe, F. and Tukey, J. W. (1963), The Examination and Analysis of Residuals, Technometrics, pp. 141160. 

Barnett and Lewis (1994), Outliers in Statistical Data, 3rd. Ed., John Wiley and Sons. 

Birnbaum, Z. W. and Saunders, S. C. (1958), A Statistical Model for LifeLength of Materials, Journal of the American Statistical Association, 53(281), pp. 151160. 

Bloomfield, Peter (1976), Fourier Analysis of Time Series, John Wiley and Sons. 

Box, G. E. P. and Cox, D. R. (1964), An Analysis of Transformations, Journal of the Royal Statistical Society, pp. 211243, discussion pp. 244252. 

Box, G. E. P., Hunter, W. G., and Hunter, J. S. (1978), Statistics for Experimenters: An Introduction to Design, Data Analysis, and Model Building, John Wiley and Sons. 

Box, G. E. P., and Jenkins, G. (1976), Time Series Analysis: Forecasting and Control, HoldenDay. 

Bradley, (1968). DistributionFree Statistical Tests, Chapter 12. 

Brown, M. B. and Forsythe, A. B. (1974), Journal of the American Statistical Association, 69, pp. 364367. 

Bury, Karl (1999). Statistical Distributions in Engineering, Cambridge University Press. 

Chakravarti, Laha, and Roy, (1967). Handbook of Methods of Applied Statistics, Volume I, John Wiley and Sons, pp. 392394. 

Chambers, John, William Cleveland, Beat Kleiner, and Paul Tukey, (1983), Graphical Methods for Data Analysis, Wadsworth. 

Chatfield, C. (1989). The Analysis of Time Series: An Introduction, Fourth Edition, Chapman & Hall, New York, NY. 

Cleveland, William (1985), Elements of Graphing Data, Wadsworth. 

Cleveland, William and Marylyn McGill, Editors (1988), Dynamic Graphics for Statistics, Wadsworth. 

Devaney, Judy (1997), Equation Discovery Through Global SelfReferenced Geometric Intervals and Machine Learning, Ph.d thesis, George Mason University, Fairfax, VA. 

Draper and Smith, (1981). Applied Regression Analysis, 2nd ed., John Wiley and Sons. 

du Toit, Steyn, and Stumpf (1986), Graphical Exploratory Data Analysis, SpringerVerlag. 

Efron and Gong (February 1983), A Leisurely Look at the Bootstrap, the Jackknife, and Cross Validation, The American Statistician. 

Evans, Hastings, and Peacock (2000), Statistical Distributions, 3rd. Ed., John Wiley and Sons. 

Everitt, Brian (1978), Multivariate Techniques for Multivariate Data, NorthHolland. 

Filliben, J. J. (February 1975), The Probability Plot Correlation Coefficient Test for Normality, Technometrics, pp. 111117. 

Fuller Jr., E. R., Frieman, S. W., Quinn, J. B., Quinn, G. D., and Carter, W. C. (1994), Fracture Mechanics Approach to the Design of Glass Aircraft Windows: A Case Study, SPIE Proceedings, Vol. 2286, (Society of PhotoOptical Instrumentation Engineers (SPIE), Bellingham, WA). 

Gill, Lisa (April 1997), Summary Analysis: High Performance Ceramics Experiment to Characterize the Effect of Grinding Parameters on Sintered Reaction Bonded Silicon Nitride, Reaction Bonded Silicon Nitride, and Sintered Silicon Nitride , presented at the NIST  Ceramic Machining Consortium, 10th Program Review Meeting, April 10, 1997. 

Granger and Hatanaka (1964), Spectral Analysis of Economic Time Series, Princeton University Press. 

Grubbs, Frank (1950), Sample Criteria for Testing Outlying Observations, Annals of Mathematical Statistics, 21(1) pp. 2758. 

Grubbs, Frank (February 1969), Procedures for Detecting Outlying Observations in Samples, Technometrics, 11(1), pp. 121. 

Hahn, G. J. and Meeker, W. Q. (1991), Statistical Intervals, John Wiley and Sons. 

Harris, Robert L. (1996), Information Graphics, Management Graphics. 

Hastie, T., Tibshirani, R. and Friedman, J. (2001), The Elements of Statistical Learning: Data Mining, Inference, and Prediction, SpringerVerlag, New York. 

Hawkins, D. M. (1980), Identification of Outliers, Chapman and Hall. 

Boris Iglewicz and David Hoaglin (1993), "Volume 16: How to Detect and Handle Outliers", The ASQC Basic References in Quality Control: Statistical Techniques, Edward F. Mykytka, Ph.D., Editor. 

Jenkins and Watts, (1968), Spectral Analysis and Its Applications, HoldenDay. 

Johnson, Kotz, and Balakrishnan, (1994), Continuous Univariate Distributions, Volumes I and II, 2nd. Ed., John Wiley and Sons. 

Johnson, Kotz, and Kemp, (1992), Univariate Discrete Distributions, 2nd. Ed., John Wiley and Sons. 

Kuo, Way and Pierson, Marcia Martens, Eds. (1993), Quality Through Engineering Design", specifically, the article Filliben, Cetinkunt, Yu, and Dommenz (1993), Exploratory Data Analysis Techniques as Applied to a HighPrecision Turning Machine, Elsevier, New York, pp. 199223. 

Levene, H. (1960). In Contributions to Probability and Statistics: Essays in Honor of Harold Hotelling, I. Olkin et al. eds., Stanford University Press, pp. 278292. 

McNeil, Donald (1977), Interactive Data Analysis, John Wiley and Sons. 

Mendenhall, William and Reinmuth, James (1982), Statistics for Management and Ecomonics, Fourth Edition, Duxbury Press. 

Mosteller, Frederick and Tukey, John (1977), Data Analysis and Regression, AddisonWesley. 

Natrella, Mary (1963), Experimental Statistics, National Bureau of Standards Handbook 91. 

Nelson, Wayne (1982), Applied Life Data Analysis, AddisonWesley. 

Nelson, Wayne and Doganaksoy, Necip (1992), A Computer Program POWNOR for Fitting the PowerNormal and Lognormal Models to Life or Strength Data from Specimens of Various Sizes, NISTIR 4760, U.S. Department of Commerce, National Institute of Standards and Technology. 

Neter, Wasserman, and Kutner (1990), Applied Linear Statistical Models, 3rd ed., Irwin. 

Pepi, John W., (1994), Failsafe Design of an All BK7 Glass Aircraft Window, SPIE Proceedings, Vol. 2286, (Society of PhotoOptical Instrumentation Engineers (SPIE), Bellingham, WA). 

The RAND Corporation (1955), A Million Random Digits with 100,000 Normal Deviates, Free Press. 

Rosner, Bernard (May 1983), Percentage Points for a Generalized ESD ManyOutlier Procedure,Technometrics, 25(2), pp. 165172. 

Scott, David (1992), Multivariate Density Estimation: Theory, Practice, and Visualization , John Wiley and Sons. 

Snedecor, George W. and Cochran, William G. (1989), Statistical Methods, Eighth Edition, Iowa State University Press. 

Stefansky, W. (1972), Rejecting Outliers in Factorial Designs, Technometrics, 14, pp. 469479. 

Stephens, M. A. (1974). EDF Statistics for Goodness of Fit and Some Comparisons, Journal of the American Statistical Association, 69, pp. 730737. 

Stephens, M. A. (1976). Asymptotic Results for GoodnessofFit Statistics with Unknown Parameters, Annals of Statistics, 4, pp. 357369. 

Stephens, M. A. (1977). Goodness of Fit for the Extreme Value Distribution, Biometrika, 64, pp. 583588. 

Stephens, M. A. (1977). Goodness of Fit with Special Reference to Tests for Exponentiality , Technical Report No. 262, Department of Statistics, Stanford University, Stanford, CA. 

Stephens, M. A. (1979). Tests of Fit for the Logistic Distribution Based on the Empirical Distribution Function, Biometrika, 66, pp. 591595. 

Tietjen and Moore (August 1972), Some GrubbsType Statistics for the Detection of Outliers, Technometrics, 14(3), pp. 583597. 

Tufte, Edward (1983), The Visual Display of Quantitative Information, Graphics Press. 

Tukey, John (1977), Exploratory Data Analysis, AddisonWesley. 

Velleman, Paul and Hoaglin, David (1981), The ABC's of EDA: Applications, Basics, and Computing of Exploratory Data Analysis, Duxbury. 

Wilk, M. B. and Gnanadesikan, R. (1968), Probability Plotting Methods for the Analysis of Data, Biometrika, 5(5), pp. 119. 