Ellison - 2001 - Exploratory Data Analysis and Graphical Display
Ellison - 2001 - Exploratory Data Analysis and Graphical Display
AARON M. ELLISON
3.1 Introduction
You have designed your experiment, collected the data, and are now confronted
with a tangled mass of information that must be analyzed, presented, and pub-
lished. Turning this heap of raw spaghetti into an elegant fettucine alfredo will
be immensely easier if you can visualize the message buried in your data. Data
graphics, the visual "display [of] measured quantities by means of the combined
use of points, lines, a coordinate system, numbers, symbols, words, shading, and
color" (Tufte 1983, p. 9) provide the means for this visualization.
Graphics serve two general functions in the context of data analysis. First,
graphics are a tool used to explore patterns in data before the formal statistical
analysis (Exploratory Data Analysis, or EDA, Tukey 1977). Second, graphics
communicate large amounts of information clearly, concisely, and rapidly, and
illuminate complex relationships within data sets.
Graphic EDA yields rough sketches to help guide you to appropriate, often
counterintuitive, formal statistical analyses. In contrast to EDA, presentation
graphics are final illustrations suitable for publication. Presentation graphics of
high quality can leave a lasting impression on readers or audiences, whereas
vague, sloppy, or overdone graphics easily can obscure valuable information and
engender confusion. Ecological researchers should view EDA and sound presen-
tation graphic techniques as essential components of data analysis, presentation,
and publication.
This chapter provides an introduction to graphic EDA, and some guidelines
for clear presentation graphics. More detailed discussions of these and related
37
38 Design and Analysis of Ecological Experiments
topics can be found in texts by Tukey (1977), Tufte (1983, 1990), and Cleveland
(1985). These techniques are illustrated for univariate, bivariate, multivariate, and
classified quantitative (ANOVA) data sets that exemplify the types of data sets
encountered commonly in ecological research. Sample data sets are described
briefly in section 3.3; formal analyses of three of the illustrated data sets can be
found in chapters 14 (univariate data set) and 10 (predator-prey data set), and Pot-
vin (1993, chapter 3, ANOVA data set). You may find some of the graphics types
presented unfamiliar or puzzling, but consider them seriously as alternatives to
the more common bar charts, histograms, and pie charts. The majority of these
graphs can be produced by readily available Windows-based software (Kardia
1998). I used S-Plus (MathSoft, Inc.) and SYSTAT (SPSS, Inc.) to construct the
figures in this chapter.
Guiding Principles. The question or hypothesis guiding the experimental de-
sign also should guide the decision as to which graphics are appropriate for ex-
ploring or illustrating the data set. Sketching a mock graph, without data points,
before beginning the experiment usually will clarify experimental design and al-
ternative outcomes. This procedure also clarifies a priori hypotheses that will
prevent inappropriately considering a posteriori hypotheses (suggested by EDA)
as a priori. Often, the simplest graph, without frills, is the best. However, graphs
do not have to be simple-minded, conveying only a single type of information,
and they need not be assimilated in a single glance. Tufte (1983) and Cleveland
(1985) provide numerous examples of graphs that require detailed inspection be-
fore they reveal their messages. Besides the aesthetic and cognitive interest they
provoke, complex graphs that are information-rich can save publication costs and
time in presentations.
Regardless of the complexity of your illustrations, you should adhere to the
following four guidelines in EDA and production graphics:
BETTER
3. Length
4. Angle/Slope
5. Area
6. Volume
WORSE
Figure 3.1 Ordering of graphic features according to their relative accuracy in representing
quantitative variation (after Cleveland 1985).
charts) not only do not increase the information content available, but often ob-
scure the message (a dramatic, if unfortunate, set of examples can be found in
Benditt 1992). These graphics, common in business presentations and increas-
ingly rife at scientific meetings, violate all of the suggested guidelines. Finally,
more dimensions often are used than are necessary, for example, "areas" and
lines where a point would do. Sirnken and Hastie (1987) discuss exceptions to
Cleveland's graphic hierarchy. In general, when designing graphics, adhere to the
Shaker maxim: form follows function.
High-quality graphical elements can be assembled into effective graphic dis-
plays of data (Cleveland 1985). First, emphasize the data. Lines drawn through
data points should not hide the points themselves. Second, data points should
Exploratory Data Analysis and Graphic Display 41
never lie on axes themselves, as the axes can obscure data points. If, for example,
there are many points that would fall along a zero line, then extend that axis
beyond zero (figure 3.6). Third, reference lines, if needed (which they rarely are),
should be deemphasized relative to the data. This can be accomplished with dif-
ferent line types (variable thicknesses; dotted, dashed, or solid) or shading. Fourth,
overlapping data symbols or data sets should be clearly distinguishable. You can
increase data visibility and minimize overlap by varying symbol size or position,
separating data sets to be compared into multiple plots, or changing from arithme-
tic <to logarithmic scales. Exemplars include the jitter plot, which avoids overlap
of identical values (figure 3.3B) and spreading of responses to categories across
an axis (figure 3.12D). Fifth, the plot must be easily readable following reduction
for publication or when projected as a slide to a seminar audience. Finally, Cleve-
land recommends using a full rectangular plot frame, not the more common bot-
tom axis/left axis only combination seen in many articles. This, together with tick
marks outside the plot frame (1) emphasize the data and (2) help the reader
accurately place individual data points. Tufte (1983) disagrees, as the extra axes
are an excessive use of ink and convey no information. Examples in this chapter
illustrate most of these possibilities. In the final analysis, many of these rules
reflect not only insight into cognitive perception, but also aesthetic judgments by
you, the author.
From this discussion, we could ask, Isn't all this too much trouble? Should
we dispense with graphs altogether in favor of tables? Because of their concise-
ness, graphics are almost always preferable in oral presentations. Graphs illustrate
more clearly relationships among variables and are a quick means of displaying
multivariate information. However, where exact values are important (as in final
publications), tables are more precise. Although the need for precise tables has
been obviated by the increasing availability of digitizing software and on-line
data archives, data presented graphically must be unbiased and uncensored. A
discussion of what data should be provided, in either graphs or tables, follows in
section 3.4.
3.3 Examples
A histogram is an example of a density plot; that is, each bar illustrates the
frequency, or density, of the values occurring in the data set between the lower
bound and the upper bound of each bar. Histograms are commonly confused with
bar charts (see section 3.3.4). The latter are used to illustrate some summary mea-
sure (often the mean, sum, or percentage) of all the values within a given treat-
ment category. Histograms of the Ailanthus data are shown in figure 3.2.
A histogram is not the best method for answering the two questions posed
previously, for three reasons. First, the raw data are hidden. In this example, there
are 75 plants, which have been divided into 12 biomass groups, or bins (figure
3.2A). It is impossible to know, for example, if the third bar (range 12-14 nodes)
contains 10 observations of 12 nodes, 10 observations of 14 nodes, or any other
of the possible combinations of 12-14 nodes in 10 observations. Second, the di-
vision into 12 bins is arbitrary; it was the default of the graphics program. We
could just as easily use 24 or 6 bins, both of which change the apparent shape of
the distribution (figures 3.2B,C) without conveying additional information. Third,
summary statistics cannot be computed from the data illustrated in the histogram.
Thus, a histogram does not enable us to answer key questions about univariate
data. In addition, histograms fall low on Cleveland's hierarchy of graphic primi-
tives. Bars in a histogram use vertical lines, horizontal lines, and shading in con-
cert to present information embodied in the single point indicated by the top of
the bar.
Tukey (1977) introduced the stem-and-leaf diagram as the simplest alternative
to the histogram (figure 3.3A). The main advantage of the stem-and-leaf diagram
is that the raw data are presented in toto. Summary statistics can be derived easily
from or incorporated into the figure. Nevertheless, stem-and-leaf diagrams suffer
visually from one of the same drawbacks as histograms: the number of bins is
arbitrary. Two other alternatives to histograms are jitter plots (figure 3.3B) and
dit plots (figure 3.3C). These two figures preserve the underlying data structure
(all values are presented), do not use arbitrary bins, and can be constructed
quickly without additional preparation (e.g., sorting) of the data set. Both plots
permit rapid assessment of density patterns and are simple to understand.
Stem-and-leaf plots and the density diagrams presented in figure 3.3 can be
used as simple alternatives to histograms. However, these plots do not clearly
convey some of the information that ecologists may want to communicate, and it
is difficult to compare the information in two or more of these plots. I suggest
the box-and-whisker plot (Tukey 1977), often called simply a box plot, as a pre-
sentation alternative to the univariate histogram (figures 3.2 and 3.4A). An advan-
tage of the box plot is that it provides more summary statistical information than
a histogram—it includes medians, quartiles, ranges, and outliers (extreme vari-
ates)—in much less space and with much less ink. Box plot construction is not
dependent on arbitrary bins, so these plots do not exaggerate or distort the data
distribution. By notching the box plot (figure 3.12E), you can easily add confi-
dence intervals so that plots of several distributions can be compared easily.
Wilkinson (1990; Haber and Wilkinson 1982) developed the fuzzy gram (figure
3.4B), another alternative to the histogram. Fuzzygrams are histograms with prob-
ability distributions superimposed on each bar. Consequently, fuzzygrams present
Exploratory Data Analysis and Graphic Display 43
Figure 3.2 Histograms of the number of nodes per plant of 75 surviving Ailanthus altissima
individuals grown in a 5 X 20 plant rectangular array. Each bar represents the frequency
or count (right axis) of observations within the bounds indicated by the ticks on the
*-axis, and the proportion of the total sample (left axis) represented by each bar. The three
plots illustrate the variation in histogram presentation obtained by changing the bin width:
(A) default (bin width = 4); (B) bin width = 2; (C) bin width = 8. At the top of the figure,
a box plot (see figure 3.4 for construction details) illustrates summary statistics and is a
better indication of the true data distribution.
44 Design and Analysis of Ecological Experiments
Figure 3.3 Alternative density plots that convey more information than a histogram. (A)
A stem-and-leaf plot. In this plot, each line is a stem, and each datum on a stem is a leaf.
The label for the stem is the first digit (starting part) of the number, followed by the value
of the leaf. On the first line, the starting part is 0 and the only leaf is 8, indicating a value
of 08 nodes. On the second line, the starting part is 1, and there are four leaves, indicating
four data points: 10, 11, 11, and 11 nodes. The location of the sample median (M) and
upper and lower quartiles (H) are also marked on this plot. (B) A jittered density plot.
Each point is placed along the horizontal scale at the exact location of its value. To keep
points of equal value from overlapping, they are located at random heights above the
x-axis. (C) A dit plot. Each point indicates an individual observation, stacked along the
y-axis at its location along the x-axis. In essence, a dit plot is a stem-and-leaf plot with
symbols substituted for leaves.
Exploratory Data Analysis and Graphic Display 45
not only the data, but also some estimation of how realistically the data represent
the actual population distribution. Such a presentation is particularly useful in
concert with results derived from sensitivity analyses (Ellison and Bedford 1991)
or resampling methods (Efron 1982; chapters 7 and 14). Haber and Wilkinson
(1982) discuss, from a cognitive perspective, the merits of fuzzygrams and other
density plots relative to traditional histograms. Histograms (figure 3.2), stem-and-
leaf plots (figure 3.3A), dit plots (figure 3.3C), and fuzzygrams (figure 3.4B) can
indicate possible bimodality in the data. Bimodal data, observed commonly in
plant ecology, are obscured by box plots and jittered density diagrams.
Probability plots are common features of most statistical packages, and they
provide a visual estimate of whether the data fit a given distribution. The most
common probability plot is the normal probability plot (figure 3.5A). Here, the
observed values are plotted against their expected values if the data are from a
normal distribution; if the data are derived from an approximately normal distri-
bution, the points will fall along a relatively straight diagonal line. There are also
numerical statistical tests for normality (e.g., Sokal and Rohlf 1995; Zar 1996).
If, for biological reasons, the investigator believes the data come from a popula-
tion with a known distribution different from a normal one, it is similarly possible
to construct probability plots for other distribution functions (figure 3.5B).
competitive interactions among trees (e.g., Weller 1987) and forest successional
dynamics (e.g., Horn et al. 1989).
For both exploration and presentation, scatterplots are the most straightforward
way of displaying bivariate data (figure 3.6A). However, because scatterplots are
merely a display, they do not necessarily reveal pattern. Figure 3.6A illustrates
clearly this idea. Three functional response curves (Holling 1966; chapter 10)
could be fit to these data, but it is not clear from the scatterplot itself which curve
would best fit the data. EDA is particularly useful for dealing with these data,
which show high variability and no obvious best relatiTonship between the two
variables.
Recent computer-intensive innovations in smoothing techniques (reviewed by
Efron and Tibshirani 1991) have expanded the palette of smoothers developed by
Tukey (1977). Basically, to construct a smoothed curve through the data, a best-
fit line is constructed through a subset of the data, local to each point along the
jt-axis. This process is repeated for each point, and a smooth line is constructed
by connecting the intersections of each local regression line. The result of this
process, using LOWESS (robust LOcally WEighted regrESSion: Cleveland 1979;
Efron and Tibshirani 1991), is shown for the predator-prey data in figure 3.6B.
In this case, 50% of the data were used to construct each segment of the smoothed
curve. That is, to construct the first segment, the response data from 0 < N0 < 50
were used; to construct the second segment, the response data from 1 < N0 < 51
were used, and so forth. The apparent type III functional response observed in
the smoothed curve is supported by the formal analysis of these data (chapter 10).
The lack of underlying assumptions about the distribution and variance of the
data and the ability to elucidate patterns very noisy data are two advantages of
smoothing over traditional regression techniques. One disadvantage of smoothing
is that relative weighting of data used for each segment must be specified in
advance, usually with little or no rational basis for the decision. Moreover, statis-
tical comparison of different smoothed curves is virtually impossible. Most statis-
tical software packages compute a variety of smoothers (see reviews by Ellison
1992; Kardia 1998).
Smoothers are used appropriately only when there is clear a priori knowledge
of an independent variable and a corresponding dependent variable or variables.
When this is not the case, other exploratory techniques are more appropriate for
examining relationships between variables. In addition, smoothing does not pro-
vide information about potential outliers in the data set. To examine correlations
between variables and to search a posteriori for outliers, influence plots and con-
vex hulls are useful exploratory tools.
A scatterplot of the relationship between tree height and stem diameter (A. M.
Ellison, unpubl. data, 1993) is illustrated in figure 3.7A. The raw data are shown,
and there appears to be an apparent outlier (a 30-m-tall tree with a dbh > 70 cm).
In an influence plot of these data (figure 3.7B), the size of each point becomes
directly proportional to the magnitude of the change its removal would have on
the Pearson correlation coefficient (r) between the two variables. By overlaying
a bivariate 50% confidence ellipse, it becomes obvious that outlying points have
greater influence on r than do points within the ellipse.
Exploratory Data Analysis and Graphic Display 49
Figure 3.7 Scatterplots of tree diameter versus tree height for 41 trees in a mixed hardwood
stand. (A) Raw data. (B) An influence plot, where the size of each point is directly propor-
tional to the magnitude of its influence on r. Shading of the points indicates the direction
of the influence (open circles have a positive influence on r, solid circles a negative influ-
ence). In this case, the putative outlier is shown as a large solid point (influence x 100 =
11). Removal of this point alone, therefore, would increase the value of r from 0.72 to
0.83. A 50% bivariate confidence ellipse is overlain on the figure. (C) An influence plot
of the data following log transformation. (D) Two different 95% confidence ellipses, the
outer constructed based on the variables' standard deviations, and the inner constructed
based on the standard errors of the means of the variables.
52 Design and Analysis of Ecological Experiments
Figure 3.8 A convex hull (A and B, solid line) and a depth-2 peel (B, dotted line) around
the tree size data. The hull is constructed by determining which points are farthest from
the centroid of the data and by joining those points to form a polygon that envelopes the
other points. To peel the hull, all the points that lie on the initial convex hull are deleted,
and a new convex hull is constructed for the remaining points.
Exploratory Data Analysis and Graphic Display 53
useful and informative than always including 0, especially if the extreme value
of either variable is « 0 or » 0. Restricting the values on the axes to just
beyond the extreme values of the data improves clarity and highlights pattern.
Axis breaks do not always help, and changing the relative scaling after an axis
break usually hinders accurate perception of the data and can stymie future digi-
tizers.
Figure 3.9 A scatterplot matrix of the tree size data. This plot illustrates bivariate relation-
ships between all possible combinations of variables in a multivariate dataset. The variable
name in the boxes along the diagonal corresponds to ;t-axis variables of plots below the
diagonal and v-axis variables above the diagonal.
Spread (some measure of variance) versus level (mean, median) plots (Norusis
1990) are a rapid, graphic way to examine the within- and between-treatment
group variances, as well as to provide clues as to appropriate data transformations
to bring heteroscedastic data into line. Norusis (1990), modifying the technique
of Box et al. (1978), suggests plotting the natural log of the interquartile distance
(i.e., the hspread; fig. 3.4A) versus the natural log of the median for each treat-
ment group. An appropriate transformation of the data to remove dependency of
the spread on the level is then given as 1 minus the slope of the linear regression
line fit to the spread versus level plot. Figure 3.10A illustrates a spread versus
level plot for Potvin's Plantago data. Note that the raw data are not homoscedas-
tic; the variance increases with the mean. Following Norusis (1990) and Box et
al. (1978), the slope of the regression line for this plot is 1.71, suggesting that
the data be transformed by raising each observation to the -0.71 power. After
such a transformation, the spread versus level plot (figure 3.1 OB) illustrates that
the strict dependency of spread on level no longer exists, and the data are some-
what more suitable for ANOVA (the variances are no longer correlated with the
mean, although they are still not equalized). Plant size data are often subject to
logarithmic transformations to equalize variances within treatment groups. A log
transformation of these data is almost as good as the negative exponential trans-
formation in equalizing these variances (table 3.1). Box and Cox (1964) and Zar
(1996) provide detailed methods on determining the "best" transformation to be
used on heteroscedastic data. Such transformations may not make biological
sense, but keep in mind that the role of transformation is to bring your data in
line with the assumptions and requirements of the statistical model(s) you are
testing.
Graphic EDA can also be used to examine interaction effects in data. An
example is illustrated in figure 3.11 for Potvin's Helianthus data. In this experi-
ment, Potvin illustrates how position on a greenhouse bench interacts with geno-
type to determine plant mass. The top figure illustrates the relatively small size
of genotype A and the relatively large size of genotype E. Although a scatterplot
matrix might have made this pattern clearer, there is no real reason to plot row x
column, or row x genotype, or column x genotype when the point is to illustrate
the row x column interaction effect on genotype. The lower figure, a contour plot
of the top figure, illustrates the clear "hot spot" in the upper left corner of the
bench. Because interaction effects often involve visualizing data in more than two
dimensions, you can use many of the techniques normally applied to multivariate
data in the exploration of interactions.
Classified quantitative data are presented poorly in the ecological literature.
These problems are illustrated with the data of Ellison et al. (1993) on resource
allocation and morphological responses to light by Nepsera (figure 3.12). The
most common ways of presenting classified quantitative data are bar charts, sepa-
rated or stacked (figures 3.12A,B), and pie charts (figure 3.12C). Separated bar
charts (figure 3.12A), where a single bar represents the results of a single treat-
ment, suffer from the same problems as histograms. The bars themselves use a
lot of ink—horizontal lines, vertical lines, shading of bars of arbitrary width—to
convey information about only a single point at the top of the bar (compare figure
56 Design and Analysis of Ecological Experiments
Figure 3.10 Spread versus level plots of the Plantago data. Values plotted on (A) are
ln(interquartile distance) on the y-axis versus ln(median plant mass) on the x-axis of seven
replicate individuals of each of five genotypes. Genotype number is indicated on the plot.
(B) Spread versus level plot of data following a negative exponential transformation (Nor-
usis 1990). See text and table 3.1 for further explanation.
Exploratory Data Analysis and Graphic Display 57
Variance
Negative
Log exponential
Genotype Mean Untransformed transformation transformation 0>~°'71)
"Variances are shown (1) before transformation, (2) after transformation by natural logarithms, and (3) after trans-
formation by the negative exponential suggested by the spread versus level plot (figure 3.10).
3.12A with 3.12D). Stacked bar charts (figure 3.12B), where treatment groups are
divided into subsets and groups are compared against one another, are virtually
unintelligible and never should be used. In this example, the percent allocation to
leaves, roots, and stems sums to roughly 100% (allowing for error and missing
values). Figure 3.12A (bars side by side) at least clearly illustrates the relative
allocation to each part. It is not so simple, on the other hand, to determine the
relative allocation in figure 3.12B.
Because we use 0 as our reference point, the first guess would be that the
allocation to roots in 2% light is approximately 70% and that to stems is 100%,
when clearly this cannot be true. However, it is difficult to determine visually the
beginning point of any of the stacked segments beyond the lowest one. Although
measures of variance can be placed clearly on side-by-side bar charts, error bars
cannot be placed on stacked bar charts (see section 3.4). Shading, hatching, and
other chartjunk used in bar charts also can interfere with accurate perception of
the data and decrease the data-to-ink ratio. Pies share all of the problems of
stacked bar charts, and none of the advantages of side-by-side bar charts. I can
think of no cases in which a pie chart should be used.
There are several alternatives to bar charts and pie charts. Plots in which the
mean value of the response variable is plotted as a single point, along with some
measure of error, clearly illustrate the same data as in a bar chart with greater
clarity and less ink (figure 3.12D). Sets of box plots better illustrate the underly-
ing data structure and convey more information with less ink and confusion (fig-
ure 3.12E). These box plots have been "notched" (McGill et al. 1978) to show
95% confidence intervals. Polar category plots (with or without error bars; the
latter are shown in figure 3.12F) are the minimalist alternative to bar charts and
are a visually comparable substitute for pie charts. These polar category plots
illustrate the response of eight measured variables to the three light environments
and clearly convey overall differences between treatment groups.
58 Design and Analysis of Ecological Experiments
Figure 3.11 Two ways of visualizing the effect of bench position and genotype on stem
dry weight of Helianthus. The top figure is a three-dimensional scatterplot, with genotype
letter (A-F) as the plotting symbol. The addition of sticks connecting each point to its
position on the x-y plane permits more accurate perception of the true height along the
z-axis of each point. The lower figure is a contour plot, with intensity of shading indicating
the biomass at a particular row x column location on the bench. These contours were
determined by a negative exponential smoothing routine, where the influence of neighbor-
ing values decreases exponentially with distance. Shading density increases with biomass.
Exploratory Data Analysis and Graphic Display 59
Any reported parameter must include a measure of the reliability of that para-
meter, as well as the sample size. For example, sample means, whether reported
graphically or in tables, must be accompanied by the sample size and some esti-
mator of the variance. Error bars on graphs must be correctly identified. Three
kinds of error bars are seen commonly in the ecological literature: standard devia-
tions, standard errors, and n% confidence intervals. Strictly speaking, the first is
the sample standard deviation. The second, more properly referred to as the stan-
dard error of the mean, is an estimate of the accuracy of the estimate of the
mean. We compute it as the standard deviation of a distribution of means of
samples of identical sizes from the underlying population (see Zar 1996, section
6.3 for a complete description). Thus, calling error bars simply standard deviation
bars confounds the two. Measures of error are used to calculate n% confidence
intervals. We can easily compute confidence intervals of normally distributed
data from the standard error of the mean (Sokal and Rohlf 1995). For other distri-
butions, approximations of confidence intervals can be computed using boot-
straps, jackknifes, or other resampling techniques (Efron 1982; chapter 13). All
of these measures require information about sample size, which must be reported
to ensure accurate interpretation of results.
In general, error bars are useful only when they convey information about
confidence intervals. Typically, in the ecological literature, means are plotted
along with error bars illustrating 1 standard error of the mean. For suitably large
n, or for samples from a normal distribution, 1 standard error bar approximates a
68% confidence interval. This conveys little information of interest, since we are
accustomed to thinking in terms of 50%, 90%, 95%, or 99% confidence intervals.
Further, most ecological samples are small, or the underlying data distributions
are unknown. In those cases, error bars representing 1 standard error of the mean
convey no useful information at all. In keeping with the guidelines for graphical
display presented at the beginning of the chapter, I suggest that sample standard
deviations or 95% confidence intervals be the error bars of choice. Two-tiered
error bars (Cleveland 1985) that display both quantities are an excellent compro-
mise. Meta-analysis (chapter 18) requires sample standard deviations, and if re-
ported together with sample size, they permit rapid calculation of confidence
intervals, standard errors, or most other measures of variation. In the end, the
choice of error bar lies with you. It is most important that they be identified
accurately.
If you transformed the data before analysis, your calculated standard deviation
will be symmetrical only with respect to the transformed mean. If you present
the results back-transformed (as is common practice), the error bars may be asym-
metric.
3.5 Conclusion
Ecologists traditionally have used a limited palette of graphic elements and tech-
niques for exploring and presenting data. We must refocus our vision to grasp
60 Design and Analysis of Ecological Experiments
new or unfamiliar graphic elements and techniques that will permit clear commu-
nication of our data. We can now use available computer hardware and software
with expanded EDA and presentation capabilities to display our results accu-
rately, concisely, and in aesthetically pleasing ways (Ellison 1992; Kardia 1998).
We can improve our comprehension and appreciation of data by using many of
the graphic techniques presented in this chapter, just as we can increase our ap-
preciation of the diversity of pasta entrees with a trip to a fine Italian restaurant.