Education and Training: Data Sets

Data Sets for Selected Short Courses

Data sets for the following short courses can be
viewed from the web.
 Design of Experiments
(Jim Filliben and Ivilesse Aviles)
 Bayesian Analysis
(Blaza Toman)
 ANOVA (Stefan Leigh)
 Regression Models
(Will Guthrie)
 Exploratory Data Analysis
(Jim Filliben)
 Statistical Concepts (Mark Vangel)

Data sets for Design of Experiments Short Course

NOTE: You probably need to download the macros used for
the "10 step analysis" to your "DATAPLOT\MACROS" directory.
These can be downloaded as a single
tar file (WinZip knows how to handle tar files) or as the
individual files:
The following data sets are available for the
Design of Experiments (DEX) course:
 Bike Speeds
(2^{74}) (Box, Hunter, and Hunter)
 Bike Speeds
(2^{74} plus foldover) (Box, Hunter,
and Hunter)
 Chemical Process Yield
(2^{4}) (Box, Hunter, and Hunter)
 Cleanser Stability
(2^{41}) (Box, Hunter, and Hunter)
 Filtration Times
(2^{74}) (Box, Hunter, and Hunter)
 Filtration Times
(2^{73} plus foldover) (Box, Hunter,
and Hunter)
 Reactor Efficiency
(2^{5}) (Box, Hunter, and Hunter)
 Reactor Efficiency
(2^{51}) (Box, Hunter, and Hunter)
 Reactor Efficiency
(2^{52}) (Box, Hunter, and Hunter)
 Reactor Efficiency
(Box, Hunter, and Hunter)
 Boys' Shoe Material
(Box, Hunter, and Hunter) (randomized block design)
 Defective Box Springs
(Box, Hunter, and Hunter)
 Exact Flow Dual Rotor Turbine
Meters (2^{3})
 Exact Flow Dual Rotor Turbine
Meters (2^{4})
 Peak Convolution
Algorithms (2^{72}, 3 response variables)
 Fire Safety (2^{3})
 Optimization of Hot Plate
Gap (2^{62})
 Ceramic Strength
(15 factors, 480 points)
 Sonoluminescent Ligh Intensity
(2^{73})
 Radiocarbon Measurements
of Albuquerque Carbon Monoxide (2^{3} plus
2 replicated corner points)
 Fire Research
(2^{5} with 9 response variables)
 SO2 Permeation Tube Mass
Loss (2^{3})
 SO2 Permeation Tube Mass
Loss (2^{3} plus replicated center points)
 Gas Metal Arc Welding
Spatter (2^{83} plus 3 center points)
 Walt Rossiter Data
(2^{5})
 Defective Lightbulbs
(2^{5})
 Superconducting Chip
Optimization (s^{3})
 Homemade Bread Taste
(2^{4} plus replicated points)
 Summer Intern Funnel
(2^{3})
 Network Processing Time
(2^{51})
 Network Processing Time
(2^{3})

Data sets for Bayesian Analysis Short Course

The following data sets are available for the Bayesian
Analysis course:

Data sets for Regression Short Course

The first few data sets from the class notes are listed
below. The Data Set Name is the name I gave each data set
in the notes. The File Name gives the name of the file
containig the data set and is often the original name of the
data set as well. The column Source lists where I got the
data, not necessarily the original source of the data. Data
sets I made up are listed as "Simulated" in the Source column.
The data sets are ordered chronologically by their first
appearance in the notes. I will try to add the rest of the
data sets soon. If there are data sets you would particularly
like to use that are not listed here please let me know which
ones they are and I will add them first.

Data sets for Analysis of Variance Short Course

The following data sets are available for the Analysis of
Variance (ANOVA) course:
 New Car Interest Rates
(p. 71)
 Cigarette Smokers
(p. 114)
 Rat Feed (p. 127)
 Acidity of Sour Cream
(p. 150)
 ELISA HIV Optical Density
(p. 157)
 Simulated Solid Tumor 
Averaged Measurements
 Simulated Solid
Tumor  Replicated Measurements
 Dental Gold Hardness
 Kenton
Foods Package Design (Excel file)
 New
Car Interest Rate (Excel file)
 Moisture
in Concrete (Excel file)
 Auto
Insurance (Excel file)
 Castle
Bakery (Excel file)
 Sour
Cream (Excel file)
 Dental Gold
Fillings (Excel file)
Note: you probably need to view the Excel files using
Internet Explorer from a Windows platform.

Data sets for Exploratory Data Analysis Short Course

The following data sets are available for the
Exploratory Data Analysis (EDA) course:
 4 "Equivalent" Data Sets
(Anscombe) (p. 3)
 Normal Random Numbers
(p. 14)
 Uniform Random Numbers
(p. 17)
 Random Walk (p. 19)
 Flicker Noise (p. 22)
 Josephson Junction
Cryothermometry (p. 24)
 Beam Deflections (p. 27)
 Wind Speeds (p. 29)
 Filter Transmittance (p. 31)
 Spinning Rotor Pressure Gage
(p. 33)
 Standard Resistor (p. 35)
 Battery Additive (p. 37)
 1969 Draft Lottery
(p. 38)
 NOAA Ozone Study (p. 42)
 ASTM Sulfur Trioxide
(p. 47)
 Boys' Shoe Material (Box,
Hunter, and Hunter) (p. 48)
 Defective Lightbulbs
(p. 52)
 Hospital Death Rates
(p. 60)
 1992 Presidential
Election (p. 62)
 1994 Olympics Women's
Skating (p. 68)
 Electrical Connectors
(p. 70)
 Interlaboratory Stress
Corrosion (p. 72)
 Drill Thrust Force (p. 75)
 Dental Polysac Adhesion
(p. 78)
 Chemical Reaction Yield
(Box, Hunter, and Hunter) (p. 80)

Data sets for Statistical Concepts Short Course

The following data sets are available for the
Statistical Concepts course:
 Speed of Light (Newcomb and
Michaelson)
 Paper Thickness (Youden)
 Normal Cumulative Probability
Table
 Normal Random Numbers
 Random Walk
 1969 Draft Lottery
 Defective Lightbulbs
(Sheesley)
 t Percent Point Function
Table
 Normal Cumulative Probability
Table
 Astronomical Units
(Youden)
 Zinc (Wilson)
 Molded Plastic Strength
(Wilson) >
 Airline Performance

Date created: 2/1/2002
Last updated: 8/28/2003
Please email comments on this WWW page to
sedwww@nist.gov.
