
FLUCTUATION PLOTName:
For the fluctuation plot, a grid is created so that is each combination of categories has a fixed position on the grid. At each grid position, two rectangles are drawn. The first is drawn in a background color and is full size (i.e., the maximum count). A second rectangle is drawn in a foreground color with a height proportional to the count for that particular combination of categories. The background rectangle is drawn to give a sense of scale. If you do not want this background rectangle, then set the color equal to the background color of the plot. Some analysts find the format of the flucuation plot easier to interpret than the mosaic plot. Although the mosaic and fluctuation plots were developed to visualize counts for categorical data, Dataplot can also generate the fluctuation plot for various statistics. For example, you could use it to display mean values for several factor variables. In particular, we have found it useful for displaying binomial probabilities. For displaying the value of a statistic, the minimum value of the statistic over all combinations of categories will be drawn with zero height and the maximum value of the statistic over all categories will be drawn at the full height. Intermediate values will be scaled between the minimum and maximum values. The list of supported statistics can be obtained by entering
By default, the FLUCTUATION PLOT generates a single foreground color. The CONTOUR option (see syntax 4) allows the foreground color to be set based on the value of the statistic relative to a "levels" variable (this borrows from the TABULATION PLOT. For example, you can specify two colors based on whether the statistic is above or below some threshold value. Alternatively, you can also use the CONTOUR option to provide additional guidance on the value of the statistic.
<SUBSET/EXCEPT/FOR qualification> where <x1> ... <xk> is a list of one to six categorical variables; and where the <SUBSET/EXCEPT/FOR qualification> is optional. This syntax is used for the case where you have raw data (i.e., the data has not yet been cross tabulated) and the statistic of interest is the number of observations in each cell.
<SUBSET/EXCEPT/FOR qualification> where <stat> is the one of the supported statistics: <y1> ... <y3> is a list of one to three response variables (depending on how many variables <stat> requires); <x1> ... <xk> is a list of one to six categorical variables; and where the <SUBSET/EXCEPT/FOR qualification> is optional. This syntax is used for the case where you have raw data (i.e., the data has not yet been cross tabulated) and you are computing a statistic that requires one to three response variables.
where <m> is a matrix containing a twoway table; and where the <SUBSET/EXCEPT/FOR qualification> is optional. This syntax is used for the case where the data have already been crosstabulated into a twoway table. Although this is typically used for the COUNTS case, the table can in fact contain values for any statistic that has been previously crosstabulated (including statistics not listed in Syntax 1  Syntax 3 above).
<SUBSET/EXCEPT/FOR qualification> where <stat> is the one of the supported statistics: <y1> ... <y3> is a list of one to three response variables (depending on how many variables <stat> requires); <x1> ... <xk> is a list of one to six categorical variables; <ylevel> is a variable that defines the levels for the value of the statistic; variables; and where the <SUBSET/EXCEPT/FOR qualification> is optional. This syntax can be used to specify different foreground colors based on the value of the statistic.
FLUCTUATION BINOMIAL PROBABILITY PLOT Y X1 X2 FLUCTUATION PLOT M
When there are two or more categorical variables, the division is first performed vertically, then horizontally. This vertical/horizontal subdivision is repeated until all the categorical variables are accommodated. When there are two or more categorical variables, you can change the vertical/horizontal order to horizontal/vertical by entering the command
To restore the default order, enter
SET FLUCTUATION PLOT CEILING <value> Values less than the floor value will be set to the floor value and values greater than the ceiling value are set to the ceiling value. The default is to use the minimum and maximum values of the computed statistic. For the COUNT case, the floor value will be set to 0. For the BINOMIAL PROBABILITY case, the floor and ceiling values will be set to 0 and 1, respectively. After the fluctuation plot is generated, Dataplot will save the internal parameters STATMINI and STATMAXI that contain the minimum and maximum values, respectively, of the computed statistic.
To reset fixed width bars, enter the command
This option does not apply to the case where the statistic being computed is the frequency counts (COUNT). In this case, the height of the bars already indicates the frequency counts.
BINOMIAL RATIO MEAN CONFIDENCE LIMIT MEDIAN CONFIDENCE LIMIT added the following command
If this option is set to ON, there are three rectangles that are drawn:
In the default case, we set the color of the rectangles using the following commands (where the colors are set to your taste)
region fill color g75 black region border color g75 black If the uncertainty option is set to on, we set the color of the three rectangles using the following commands (where again the colors are set to your taste)
char bl bl bl bl circle bl char fill on all char hw 0.5 0.375 all region fill color g75 g75 cyan cyan region border color g75 g75 cyan cyan The first 2 colors specify the background color for the rectangles below and above the statistic value, respectively. Colors 3 and 4 specify the foreground colors for the rectangles below and above the statistic value, respectively. Typically we recommend that the same color be used as in the above example. By default, alpha is set to 0.05 for computing the uncertainty intervals. To use a different value of alpha, enter
By default (= OFF), each factor variable is coded from 1 to NDIST with NDIST denoting the number of levels (i.e., distinct values for that factor variable. When there are more than two factor variables and some of the combinations of levels for the factor variables are missing, it is desirable to suppress this coding. Setting this option to ON will use the original units for the factor variables. You may want to code each of the factor variables. For example, if there are four factor variables, you can do something like
LET X2C = CODED X2 LET X3C = CODED X3 LET X4C = CODED X4 SET FLUCTUATION PLOT CODED ON FLUCTUATION BINOMIAL PROBABILITY PLOT Y X1C X2C X3C X4C
SET FLUCTUATION SORTED OFF SET FLUCTUATION SORTED ROW SET FLUCTUATION SORTED COLUMN ON specifies that both the column and row direction will be sorted, OFF (the default) specifies that neither direction will be sorted, ROW specifies that the vertical direction will be sorted, and COLUMN specifies that the horizontal direction will be sorted. You can specify whether the sort is an ascending (the default) or a descending sort by entering the commands
<ASCENDING/DESCENDING> SET FLUCTUATION PLOT ROW SORT DIRECTION ... <ASCENDING/DESCENDING>
SET FLUCTUATION CONTOUR BINOMIAL PROPORTION LOWER SET FLUCTUATION CONTOUR BINOMIAL PROPORTION UPPER
Friendly (2000), "Visualizing Categorical Data", SAS Institute Inc., p. 90.
2009/9: Added "uncertainty" option for several statistics 2010/7: Added "contour" option . Example from page 61 of Friendly . Data denotes counts. read matrix m 5 29 14 16 15 54 14 10 20 84 17 94 68 119 26 7 end of data . label case asis tic mark label case asis title case asis title offset 2 . x3label title Fluctuation Plot y1label Eye Color x1label Hair Color tic offset units data xlimits 1 4 major xtic mark number 4 minor xtic mark number 0 xtic mark offset 1 1 x1tic mark label format alpha x1tic mark label content Black Brown Red Blond ylimits 1 4 major ytic mark number 4 minor ytic mark number 0 ytic mark offset 1 1 y1tic mark label format alpha y1tic mark label content Green Hazel Blue Brown y1tic mark label justification right . line color g75 black region fill color g75 black region border color g75 black . fluctuation plot m Program 2: skip 25 read alarm.dat inst src expalarm obsalarm let n = size expalarm let correct = 0 for i = 1 1 n let correct = 1 subset expalarm = 0 subset obsalarm = 0 let correct = 1 subset expalarm = 1 subset obsalarm = 1 . label case asis tic mark label case asis title case asis title offset 2 . x3label title Fluctuation Plot of Binomial Probability for Correct Alarm y1label Instrument x1label Source tic offset units data xlimits 1 6 major xtic mark number 6 minor xtic mark number 0 xtic mark offset 1 1 ylimits 1 15 major ytic mark number 15 minor ytic mark number 0 ytic mark offset 1 1 . line color g75 black region fill color g75 black region border color g75 black . set fluctuation plot width proportional fluctuation binomial probability plot correct inst src Program 3: skip 25 read ripken.dat y x1 to x4 . label case asis tic mark label case asis title case asis . x3label title Fluctuation Plot for Cal Ripken Mean Batting Average let string v1 = Low let string v2 = Middle let string v3 = Left:sp()High let string v4 = Low let string v5 = Middle let string v6 = Right:sp()High let igy = group label v1 to v6 let string h1 = Inside let string h2 = Middlecr()Fastball let string h3 = Outside let string h4 = Inside let string h5 = Middlecr()Curveball let string h6 = Right let igx = group label h1 to h6 . tic offset units data xlimits 1 6 major xtic mark number 6 minor xtic mark number 0 xtic mark offset 1 1 x1tic mark label format group label x1tic mark label content igx ylimits 1 6 major ytic mark number 6 minor ytic mark number 0 ytic mark offset 1 1 y1tic mark label format group label y1tic mark label content igy y1tic mark label justification right . line color g75 black region fill color g75 black region border color g75 black . fluctuation mean plot y x2 x1 x4 x3 . move 50 92 just center text (Minimun BA: ^statmini, Maximum BA: ^statmaxi) Program 4: skip 25 read alarm.dat inst src expalarm obsalarm let n = size expalarm let correct = 0 for i = 1 1 n let correct = 1 subset expalarm = 0 subset obsalarm = 0 let correct = 1 subset expalarm = 1 subset obsalarm = 1 . label case asis tic mark label case asis title case asis title offset 2 frame corner coordinates 10 20 80 90 . x3label title Binomial Probability for Correct Alarm y1label Instrument x1label Source tic offset units data xlimits 1 6 major xtic mark number 6 minor xtic mark number 0 xtic mark offset 0.6 0.6 ylimits 1 15 major ytic mark number 15 minor ytic mark number 0 ytic mark offset 1 1 y1label displacement 7 . let p10 = 0.7 let p20 = 1.01 let ylevel = data p10 p20 . line color g75 red dgreen region fill color g75 red dgreen region border color g75 red dgreen . set fluctuation plot width proportional fluctuation binomial probability contour plot correct inst src ylevel . let p1 = 0.20 let p2 = 0.40 let p3 = 0.60 let p4 = 0.80 let p5 = 1.01 let ylevel = data p1 p2 p3 p4 p5 let ncolor = 5 let string color1 = red let string color2 = orange let string color3 = cyan let string color4 = blue let string color5 = dgreen region fill on all region fill color g75 ^color1 ^color2 ^color3 ^color4 ^color5 region border color g75 ^color1 ^color2 ^color3 ^color4 ^color5 line color g75 ^color1 ^color2 ^color3 ^color4 ^color5 . fluctuation binomial probability contour plot correct inst src ylevel . box fill pattern solid box shadow hw 0 0 justification left height 1.7 . let xcoor1 = 81 let xcoor2 = 85 let xcoor3 = xcoor2 + 1 let ycoor1 = 90 let yinc = 4 let ycoor2 = ycoor1  yinc . let kind = ncolor loop for k = 1 1 ncolor box fill color ^color^kind box xcoor1 ycoor1 xcoor2 ycoor2 let ycoor3 = ycoor2 + 1 move xcoor3 ycoor3 let km1 = kind  1 let aval1 = ^p^km1 let aval2 = ^p^kind let aval2 = min(1,aval2) if k < ncolor if k = 1 text ^aval1  ^aval2 else text ^aval1  ^aval2 end of if else text <= ^aval1 end of if let ycoor1 = ycoor2 let ycoor2 = ycoor1  yinc let kind = kind  1 end of loop  
Privacy
Policy/Security Notice
NIST is an agency of the U.S. Commerce Department.
Date created: 1/6/2009 