Univariate and Multivariate Skewness and Kurtosis For Measuring Nonnormality: Prevalence, in Uence and Estimation
Univariate and Multivariate Skewness and Kurtosis For Measuring Nonnormality: Prevalence, in Uence and Estimation
net/publication/309225521
CITATIONS READS
259 1,353
3 authors:
Ke-Hai Yuan
University of Notre Dame
176 PUBLICATIONS 7,507 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Meghan K. Cain on 15 June 2018.
Abstract Nonnormality of univariate data has been Almost all commonly used statistical methods in psychol-
extensively examined previously (Blanca et al., Method- ogy and other social sciences are based on the assumption
ology: European Journal of Research Methods for the that the collected data are normally distributed. For exam-
Behavioral and Social Sciences, 9(2), 78–84, 2013; Miceeri, ple, t- and F-distributions for mean comparison, Fisher Z-
Psychological Bulletin, 105(1), 156, 1989). However, less transformation for inferring correlation, and standard errors
is known of the potential nonnormality of multivariate data and confidence intervals in multivariate statistics are all
although multivariate analysis is commonly used in psy- based on the normality assumption (Tabachnick & Fidell,
chological and educational research. Using univariate and 2012). Researchers rely on these methods to accurately
multivariate skewness and kurtosis as measures of nonnor- portray the effects under investigation, but may not be
mality, this study examined 1,567 univariate distriubtions aware that their data do not meet the normality assump-
and 254 multivariate distributions collected from authors of tion behind these tests or what repercussions they face when
articles published in Psychological Science and the Amer- the assumption is violated. From a methodological perspec-
ican Education Research Journal. We found that 74 % of tive, if quantitative researchers know the type and severity
univariate distributions and 68 % multivariate distributions of nonnormality that researchers are facing, they can exam-
deviated from normal distributions. In a simulation study ine the robustness of normal-based methods as well as
using typical values of skewness and kurtosis that we col- develop new methods that are better suited for the analysis
lected, we found that the resulting type I error rates were of nonnormal data. It is thus critical to understand whether
17 % in a t-test and 30 % in a factor analysis under some practical data satisfy the normality assumption and if not,
conditions. Hence, we argue that it is time to routinely report how severe the nonnormality is, what type of nonnormal-
skewness and kurtosis along with other summary statistics ity it is, what the consequences are, and what can be done
such as means and variances. To facilitate future report of about it.
skewness and kurtosis, we provide a tutorial on how to com- To understand normality or nonnormality, we need to
pute univariate and multivariate skewness and kurtosis by first define a measure of it. Micceri (1989) evaluated
SAS, SPSS, R and a newly developed Web application. deviations from normality based on arbitrary cut-offs of
various measures of nonnormality, including asymmetry,
Keywords Nonnormality · Skewness · Kurtosis · Software tail weight, outliers, and modality. He found that all 440
large-sample achievement and psychometric measures dis-
tributions were nonnormal, 90 % of which had sample sizes
larger than 450. More recently, Blanca et al. (2013) evalu-
ated nonnormality using the skewness and kurtosis1 of 693
Meghan K. Cain
[email protected]
1 Withoutspecific mention, the skewness and kurtosis refer to the
1 University of Notre Dame, Notre Dame, IN, 46556, USA sample skewness and kurtosis throughout the paper.
Behav Res (2017) 49:1716–1735 1717
small samples, with sample size ranging from 10 to 30. including a guideline on how to report sample statistics in
The study includes many psychological variables, and the empirical research and some possible solutions for nonnor-
authors found that 94.5 % of distributions were outside the mality.
range of [-0.25, 0.25] on either skewness or kurtosis and
therefore violated the normality assumption. However, nei-
ther Micceri nor Blanca et al. discuss the distribution of Univariate and multivariate skewness and kurtosis
skewness or kurtosis, how to test violations of normality,
or how much effect they can have on the typically used Different formulations for skewness and kurtosis exist in
methods such as t-test and factor analysis. the literature. Joanes and Gill (1998) summarize three com-
Scheffe (1959, p.333) has commented that kurtosis and mon formulations for univariate skewness and kurtosis that
skewness are “the most important indicators of the extent they refer to as g1 and g2 , G1 and G2 , and b1 and b2 . The
to which nonnormality affects the usual inferences made in R package moments (Komsta & Novomestky, 2015), SAS
the analysis of variance.” Skewness and kurtosis are also proc means with vardef=n, Mplus, and STATA report g1 and
an intuitive means to understand normality. If skewness is g2 . Excel, SPSS, SAS proc means with vardef=df, and SAS
different from 0, the distribution deviates from symmetry. proc univariate report G1 and G2 . Minitab reports b1 and b2 ,
If kurtosis is different from 0, the distribution deviates from and the R package e1071 (Meyer et al., 2015) can report all
normality in tail mass and shoulder (DeCarlo, 1997b).2 three. There are also several measures of multivariate skew-
In practice, normality measures such as skewness and ness and kurtosis, though Mardia’s measures (Mardia, 1970)
kurtosis are rarely reported. In order to study nonnormal- are by far the most common. These are currently available
ity, we have contacted and obtained responses from 124 in STATA, or as add-on macros multnorm in SAS or mardia
researchers, among whom only three reported skewness in SPSS (DeCarlo, 1997a).
and kurtosis in their papers. The under-report of normal-
ity measures can be due to several reasons. First, many Univariate skewness and kurtosis
researchers are still not aware of the prevalence and influ-
ence of nonnormality. Second, not every researcher is famil- For the univariate case, we adopt Fisher’s skewness (G1 )
iar with skewness and kurtosis or their interpretation. Third, and kurtosis (G2 ). Specifically, the skewness, G1 , is calcu-
extra work is needed to compute skewness and kurtosis lated as
than the commonly used summary statistics such as means √
and standard deviations. Fourth, researchers might worry n(n − 1) m3
G1 = · 3/2 , (1)
about the consequences of reporting large skewness and n−2 m2
kurtosis.
This paper provides a simple and practical response to the and the kurtosis, G2 , as
continuing under-report of nonnormality measures in pub-
lished literature by elucidating the problem of nonnormality
n−1 m4
and offering feasible recommendations. We begin with an G2 = · (n + 1) −3 +6 , (2)
easy-to-follow introduction to univariate and multivariate (n − 2)(n − 3) m22
skewness and kurtosis, their calculations, and interpreta-
tions. We then report on a review we conducted assessing where mr = ni=1 (xi − x̄)r /n is the rth central moment
the prevalence and severity of univariate and multivariate with x̄ being the sample mean and n the sample size. The
skewness and kurtosis in recent psychology and education sample skewness G1 can take any value between negative
publications. We also show the influence of skewness and infinity and positive infinity. For a symmetric distribution
kurtosis on commonly used statistical tests in our field using such as a normal distribution, the expectation of skewness is
data of typical skewness, kurtosis, and sample size found 0. A non-zero skewness indicates that a distribution “leans”
in our review. In addition, we offer a tutorial on how to one way or the other and has an asymmetric tail. Distribu-
compute the skewness and kurtosis measures we report here tions with positive skewness have a longer right tail in the
through commonly used software including SAS, SPSS, R, positive direction, and those with negative skewness have a
and a Web application. Finally, we offer practical recom- longer left tail in the negative direction.
mendations for our readers to follow in their own research, Figure 1 portrays three distributions with different values
of skewness. The one in the middle is a normal distribution
and its skewness is 0. The one on the left is a lognormal
2 Kurtosis measures can be centered at either 0 or 3, the former is distribution with a positive skewness = 0.95. A commonly
usually referred to as “excess kurtosis”. This is because the normal used example of a distribution with a long positive tail is
distribution has a kurtosis of 3, and therefore an excess kurtosis of 0. the distribution of income where most households make
1718 Behav Res (2017) 49:1716–1735
0.15
Lognormal (skewness=0.95)
Normal (skewness=0)
Skew-normal (skewness= -0.3) tribution is leptokurtic, meaning it has positive kurtosis, the
distribution has a fatter tail than the normal distribution with
the same variance. Generally speaking, if a data set is con-
0.10
able (Anscombe and Glynn, 1983; D’Agostino, 1970). The reason for
3 The inflation adjusted medium household income is $53,657 in 2014
adopting the method discussed here is that the standard errors of skew-
based on census. ness and kurtosis are reported in popular statistical software such as
4 Note that if g = m /m2 − 3 is used to estimate kurtosis it also has a SPSS and SAS, and, therefore, it is a feasible method for evaluating
2 4 2
minimum value of -2. skewness and kurtosis through existing software.
Behav Res (2017) 49:1716–1735 1719
(a) (b)
variance=0.5 kurtosis=3
variance=1 kurtosis=0
0.5
variance=2 kurtosis=−1
0.4
0.4
0.3
density
density
0.3
0.2
0.2
0.1
0.1
0.0
0.0
−4 −2 0 2 4 −4 −2 0 2 4
Fig. 2 Illustration of the relationship between kurtosis and variance. In Fig 2a each population has a kurtosis of 0, and variance varies from 0.5
to 2.0. In Fig 2b each population has a variance of 1, and kurtosis varies from -1 to 3.
1
n
Although Micceri (1989) and Blanca et al. (2013) have
S= (xi − x̄)(xi − x̄) . (9)
n studied univariate nonnormality, we are not aware of any
i=1
study that has investigated multivariate skewness and kur-
Both measures have a p subscript, so they are specific to a tosis with empirical data or has tested the significance of
set of p variables. The expected Mardia’s skewness is 0 for nonnormality. Therefore, we conducted a study to further
a multivariate normal distribution and higher values indi- evaluate the severity of nonnormality of empirical data,
cate a more severe departure from normality. The expected especially in the multivariate case. Focusing on published
Mardia’s kurtosis is p(p + 2) for a multivarite normal research, we contacted 339 researchers with publications
distribution of p variables. As in the univariate case, val- that appeared in Psychological Science from January 2013
ues under this expectation indicate platykurtism and higher to June 2014 and 164 more researchers with publications
values indicate leptokurtism. that appeared in the American Education Research Journal
1720 Behav Res (2017) 49:1716–1735
from January 2010 to June 2014. The two journals were cho- Because these most extreme values may be outliers, we also
sen due to their prestige in their corresponding fields. We report 1st through 99th percentiles of univariate skewness
asked the researchers to provide the univariate and multi- and kurtosis. Percentiles can be interpreted as the percent
variate skewness and kurtosis of continuous variables used of samples with lower skewness or kurtosis than that value.
in their papers. Binary, categorical, and nominal variables There is clearly a large range from the 1st to the 99th
were excluded, but likert items were included because they percentile, especially for kurtosis. The correlation between
are often treated as normal in data analysis. To help the sample size and skewness is r = −0.005, and with kurtosis
researchers compute the skewness and kurtosis, we provided is r = 0.025. These are comparable to what Blanca et al.
a tutorial for different software as we will present later in (2013) have reported in which correlations between sample
this paper. Our data collection ended in November, 2014, by size and skewness and kurtosis were .03 and -.02, respec-
which point we had obtained 1,567 univariate measures and tively. The results in Table 1 include skewness and kurtosis
254 multivariate measures of skewness and kurtosis from when the sample size is smaller and larger than 106, the
194 studies. Some authors submitted univariate results with- median sample size of all collected data. As shown in this
out multivariate results so some variables for which we have table, negative skewness and kurtosis are much more com-
univariate measures are not included in a multivariate mea- mon than previously reported: 38 % of distributions have
sure. The median sample size for these studies was 106, negative skewness and 47 % have negative kurtosis. This
and the sample size ranged from 10 to 200,000. The median could be due to the number of likert measures provided, but
number of variables included in a multivariate measure was because of the anonymous submission option this cannot
3, and ranged from 1 to 36. Since researchers had the option be confirmed. The mean univariate skewness is 0.51, and
to submit skewness and kurtosis anonymously, it is unclear the sample size-weighted mean is 0.47. The mean univari-
how many authors responded to our request or what their ate kurtosis is 4.29, and the sample size-weighted mean is
study characteristics may be. 8.41. Sample size-weighted means are helpful because we
expect sample measures to better-reflect that of the popula-
Univariate skewness and kurtosis tion as sample size increases. To account for this, measures
from large samples are given higher weight than those from
As shown in Table 1, univariate skewness ranged from smaller samples. Therefore, on average, the skewness and
-10.87 to 25.54 and univariate kurtosis from -2.20 to kurtosis are larger than that of a normal distribution. To
1,093.48, far wider than previously reported or tested. further visualize what these distributions look like, Fig. 3
There were 805 distributions with n ≤ 106 and 762 with n > 106. Nonnormality is defined by significant statistics ZG1 or ZG2 , p < .05
Behav Res (2017) 49:1716–1735 1721
shows histograms of 20 randomly selected distributions 1,476. Percentiles of Mardia’s skewness and kurtosis split
from our review. Note that there is no common shape that by median sample size and median number of variables
explains skewness or kurtosis. used in their calculation are presented in Table 2. The
Percentages of univariate distributions with significant correlation between sample size and Mardia’s skewness
skewness or kurtosis by sample size are presented in Table 1. is r = −0.01 and with Mardia’s kurtosis is r = 0.02.
About 66 % of univariate distributions had significant skew- The correlation between the number of variables and Mar-
ness and 54 % had significant kurtosis. Almost 74 % of dis- dia’s skewness is r = 0.58 and with Mardia’s kurtosis is
tributions had either significant skewness or kurtosis and were r = 0.73. After centering Mardia’s kurtosis on p(p + 2),
therefore classified as nonnormal. As expected, it becomes the expected value under normality, the correlation between
easier for tests to become significant with larger sample sizes. kurtosis and the number of variables becomes r = 0.05.
Over 95 % of distributions with sample sizes greater than the The mean multivariate skewness is 32.94, and the sam-
median sample size, 106, were tested as nonnormal. ple size-weighted mean is 28.26. The mean multivariate
Conversely, when the sample size was less than 106 only kurtosis is 78.70, and the sample size-weighted mean is
56 % of distributions were significantly nonnormal. 92.03. Therefore, the average skewness and kurtosis are
greater than that of a multivariate normal distribution. This
Multivariate Skewness and Kurtosis has important ramifications especially for SEM, for which
multiple outcome measures are often used and for which
The 254 collected Mardia’s multivariate skewness ranged multivariate kurtosis can asymptotically affect standard
from 0 to 1,332 and multivariate kurtosis from 1.80 to errors.
1722 Behav Res (2017) 49:1716–1735
∗ is b
There were 136 multivariate distributions with n ≤ 106, 118 with n > 106, 144 with p ≤ 3, and 110 with p > 3. b2,p 2,p centered on p(p +2)
Percentages of multivariate distributions with significant were statistically significant when the sample size was
Mardia’s skewness and kurtosis are presented in Table 3. larger than 106. Similarly, more Mardia’s measures became
About 58 % of multivariate skewness measures and 57 % of significant with more variables.
multivariate kurtosis measures reached significance. Com- To summarize, based on the test of 1,567 univariate
bining these, 68 % of multivariate distributions were signif- and 254 multivariate skewness and kurtosis from real data,
icantly nonnormal. In particular, 94 % of Mardia’s measures we conclude that 74 % of univariate data and 68 % of
Table 3 Percent significant Mardia’s skewness and kurtosis at significance level 0.05
Skewness 34 86 53 65 58
Kurtosis 35 82 47 70 57
Either 46 94 60 79 68
There were 136 multivariate distributions with n ≤ 106, 118 with n > 106, 144 with p ≤ 3, and 110 with p > 3. Nonnormality is defined by
significant statistics z1,p or z2,p , p < .05
Behav Res (2017) 49:1716–1735 1723
multivariate data significantly deviated from a univariate or specified skewness and kurtosis from a Pearson distribution
multivariate normal distribution. In examining only those in R (R Core Team, 2016) using the package PearsonDS
univariate measures included in a multivariate measure, (Becker & Klößner, 2016).7 Then, we obtained the empir-
68 % have significant nonnormality. Therefore, nonnormal- ical type I error rate to reject the null hypothesis that
ity is a severe problem in real data, though multivariate the population mean is equal to 0 using the significance
nonnormality does not appear to be a severe problem above level 0.05 in a two-tailed, a lower-tail, and an upper-tail
and beyond that of univariate normality. However, this one-sample t-test.
relationship requires further study to evaluate. Table 4 displays the empirical type I error rate for each
condition. For brevity, type I error rates of just the low-
est sample size are presented for conditions with skewness
Influences of skewness and kurtosis between -1.17 and 0.94 because these conditions did not
present any problems. To better understand the empirical
In order to clearly show the influence of skewness and kur- type I error rate, we bold those that are outside of the range
tosis, we conducted simulations on the one-sample t-test, [0.025, 0.075]. When the skewness and kurtosis are 0, the
simple regression, one-way ANOVA, and confirmatory fac- generated data are from a normal distribution and the empir-
tor analysis (CFA). Simulation studies are useful because ical type I error rate is close to 0.05 even when the sample
when data is generated from a specific model we know what size is as small as 18 for all three tests. When data devi-
results the statistical tests should show, and so we can evalu- ate from normality, the results show that a two-sided test is
ate how nonnormality affects those results. Note that for all more robust than a one-sided test. The two-sided test only
of these models, the interest is in the normality of the depen- has increased type I error rate for a skewness of 6.32, for
dent variable(s). There are no normality assumptions put on which a sample size of 554 is necessary to dissipate the
the independent variables. effect. A lower tail t-test has even higher type 1 error rates at
this skewness, and an upper tail t-test has an increased type
Influence of univariate skewness and kurtosis I error rate with negative skewness and very low rates with
high positive skewness.
Yuan et al. (2005) show that the properties of mean esti- A simple regression and a one-way ANOVA with three
mates are not affected by either skewness or kurtosis asymp- groups were also tested at all of these conditions. The
totically, but that the standard error of sample variance regression was robust to all conditions, even at the lowest
is a function of kurtosis. If normality is assumed (kur- sample size. Type 1 error rate for the ANOVA gets as low as
tosis = 0), the standard error of sample variance will be 0.022 under the most extreme skewness (6.32) at the lowest
underestimated when kurtosis is positive and overestimated sample size (18). However, all other type I error rates were
when kurtosis is negative. In other words, kurtosis will still within the [0.025,0.075] robustness range. Type I error rates
have an effect on variance estimates at very large sam- can increase if each population is from a different distribu-
ple sizes while mean estimates are only affected in small tion, but as long as each distribution has equal variances the
samples. For example, Yanagihara and Yuan (2005) found departures are still not too severe. Influence on power, how-
that the expectation and variance of the t-statistic depends ever, can be immense (See Levine and Dunlap (1982), for
on skewness, but that the effect lessens as sample size example).
increases.
To concretely demonstrate the influence of univariate Influence of multivariate skewness and kurtosis
skewness and kurtosis, we conducted a simulation study on
a one-sample t-test. In the simulation, we set the skewness In order to show the influence of multivariate skewness
to the 1st, 5th, 25th, 50th, 75th, 95th, and 99th percentiles and kurtosis, we conducted simulation studies on CFAs.
of univariate skewness found in our review of practical First, we focus on a one-factor model with four manifest
data. These were tested in sample sizes of the 5th, 25th, variables. For each manifest variable, the factor loading
50th, 75th, and 95th percentiles of sample size found in our is fixed at 0.8 and the uniqueness factor variance is 0.36.
review. Therefore, these conditions should represent typi- The variance of the factor is set to 1. Note that the expected
cal results found in our field. Because kurtosis has little Mardia’s kurtosis is p(p + 2). When kurtosis = 24 data are
influence on the t-test, it was kept at the 99th percentile,
95.75, throughout all conditions. In total, we considered 35 7 Pearson distribution includes a class of distributions. It is used here
conditions for each test. Under each condition, we gener- because it allows us to vary the skewness and kurtosis while the mean
ated 10,000 sets of data with mean 0, variance 1, and the and variance remain the same.
1724 Behav Res (2017) 49:1716–1735
Tail Tested
Bolded entries are those outside of the range [0.025,0.075] and are therefore considered different from the nominal 0.05
from a multivariate normal distribution, and the centered review of practical data of four manifest variables (30, 60,
kurtosis is 0. Although in our review of practical data about and 100). Sample sizes of 48, 106, 554, and 1489 were used
half of the data sets had centered Mardia’s kurtosis less than to evaluate these conditions. Because skewness does not
0, 21 is the only multivariate kurtosis less than 24 we were influence SEM asymptotically, it was kept at 0 throughout
able to successfully simulate. Hence, we used these two val- all conditions. In total, 20 conditions were considered. 1,000
ues of Mardia’s kurtosis (21 and 24) along with the 75th, data sets were used to evaluate each condition. The authors
95th, and 99th percentiles of Mardia’s kurtosis found in our are currently unaware of any method to simulate data with
Table 5 Type I error rates of the χ 2 test for factor analysis with 4 manifest variables
Sample Size
Bolded entries are those outside of the range [0.025,0.075] and are therefore considered different from the nominal 0.05
Behav Res (2017) 49:1716–1735 1725
a particular multivariate skewness and kurtosis, so instead The same conditions were tested as in the simulation study
we used the R package lavaan (Rosseel, 2012) to simu- above, with the exception of those with a sample size of 48.
late data from a model with certain univariate skewness and The same univariate kurtoses were used to simulate the data,
kurtosis. Appropriate univariate values were found to simu- though they result in different multivariate kurtosis for eight
late multivariate values of a population by trial and error. variables than they do for four. The resulting empirical type
The influence of skewness and kurtosis is evaluated I error rates of these multivariate kurtoses for both of these
through the empirical type I error rate of rejecting the fac- models are given in Table 6.
tor model using the normal-distribution-based chi-squared Once again, type I error is maintained when the distribu-
goodness-of-fit test. This test is significant when the model tion is multivariate normal (kurtosis = 80), but once kurtosis
does not fit the data. Because the true one-factor model reaches 150 all type I errors are above 0.05. As sample size
was fit to the simulated data, one would expect the empir- increases, the problem worsens. In comparison to the results
ical type I error rate to be close to the nominal level 0.05. shown in Table 5, type I errors are worse with an increase
Deviation from it indicates the influence of skewness and in the number of manifest variables. However, holding the
kurtosis. The empirical type I error rates at different levels number of manifest variables constant, an increase in the
of Mardia’s kurtosis are summarized in Table 5. number of factors lowers type I error rate.
The results show that when the data are from a multivari- In summary, if either univariate or multivariate non-
ate normal distribution (kurtosis = 24), the empirical type I normal data are analyzed using normal-distribution-based
error rates were close to the nominal level 0.05. However, methods, it will lead to incorrect statistical inference. Given
when the data deviate from a multivariate normal distribu- the prevalence of nonnormality as we have shown in the
tion to a Mardia’s kurtosis of 60, the empirical type I error previous section, it is very important to quantify the nonnor-
rates are all greater than 0.05. Unsurprisingly, the problem mality. We suggest using skewness and kurtosis to measure
becomes worse with an increase in sample size. For exam- nonnormality and we will show how to obtain both uni-
ple, when the multivariate kurtosis is 100 and the sample variate and multivariate skewness and kurtosis in the next
size is 1489, the normal-distribution-based chi-squared test section.
rejects the correct one-factor model 29.8 % of the time.
Type 1 error rates were also obtained in a one-factor
model with eight manifest variables and a two-factor model Computing univariate and multivariate skewness
with four manifest variables each to investigate the effects and kurtosis
of an increase in the number of manifest variables or num-
ber of factors. Factor loadings were adjusted to maintain In this section, we illustrate how to compute univariate
uniqueness factor variance at 0.36 and total variance at 1. and multivariate skewness and kurtosis in popular statistical
Table 6 Empirical Type I error rates of the χ 2 test for factor analysis with 8 manifest variables
Sample Size
Bolded entries are those outside of the range [0.025,0.075] and are therefore considered different from the nominal 0.05
1726 Behav Res (2017) 49:1716–1735
software including SAS, SPSS, and R as well as a newly 1999-2000, in 3rd grade in the spring of 2002, in 5th grade
developed Web application. As previously mentioned, dif- in the spring of 2004, and in 8th grade in the spring of 2007.
ferent softwares produce different types of univariate skew- The data used here consist of four consecutive mathemati-
ness and kurtosis. Furthermore, most don’t report tests or cal ability measures of 563 children from kindergarten to 1st
multivariate measures. Using our software and macros for grade. To simplify our discussion, we assume that all files
SAS, SPSS, and R produces consistent and full results to be used are in the folder of “C:\nonnormal”, which needs
across software. Some software requires macros that can to be changed accordingly.
be downloaded from our website at http://w.psychstat.org/
nonnormal. Our Web application can be found at http://w. SAS
psychstat.org/kurtosis. All tools provided perform listwise
deletion before assessing nonnormaltiy. As an example, we To use SAS for computing the univariate and multivariate
use a subset of data from the Early Childhood Longitudinal skewness and kurtosis, first download the mardia.sas macro
Study, Kindergarten Class of 1998-99 (ECLS-K) to show file from our website. Our macro was modified from a SAS
the use of different software. The ECLS-K is a longitudi- macro MULTNORM provided by the SAS company. After
nal study with data collected in kindergarten in the fall and saving the sas macro file, the following code can be used to
spring of 1998-99, in 1st grade in the fall and spring of get the skewness and kurtosis for the ECLS-K data.8
In the SAS input, Line 1 through Line 4 read the ECLS- a standard error 0.21 (Line 8), indicating the kurtosis is
K data in the file “eclsk563.txt” into SAS. Line 5 significantly larger than 0.
includes the SAS macro file downloaded from our web- The second part of the output, from Line 10 to Line
site for use within SAS. The sixth line uses the function 23 includes the information on multivariate skewness and
mardia in the macro to calculate skewness and kurtosis. kurtosis. First, the multivariate skewness is 2.26 (Line 16)
The argument “data=” specifies the SAS database to use with a standardized measure of 212.24 (Line 17). The p-
and “var=” specifies the variables to use in calculating the value for a chi-squared test is approximately 0 (Line 18).
skewness and kurtosis. Therefore, the multivariate skewness is significantly larger
The SAS output from the analysis of the ECLS-K data is than 0. Second, the multivariate kurtosis is 25.47 (Line 21)
given below. The first part of the output, from Line 1 to Line with the standardized measure of 2.51 (Line 22). The p-
8, displays the univariate skewness and kurtosis as well as value for a z-test is approximately 0.01 (Line 23). Therefore,
their corresponding standard error. For example, the skew- the multivariate kurtosis is significantly different from that
ness for the ECLS-K data at time 1 is 0.69 with a standard
error 0.10 (Line 5). Based on a z-test, one would conclude
that the skewness is significantly larger than 0. For another 8 The number on the right is used to identify the code only and is not
example, the kurtosis for the data at time 4 is 1.29 with part of the SAS code.
Behav Res (2017) 49:1716–1735 1727
of a multivariate normal distribution with 4 variables (24). distribution and therefore violate the normality assumption
Consequently, the data do not follow a multivariate normal if used in multivariate analysis.
1728 Behav Res (2017) 49:1716–1735
SPSS The code on the first eight lines in the input is used to read
the ECLS-K data into SPSS. These lines are not necessary
DeCarlo (1997b) has developed an SPSS macro to cal- if your data are already imported into SPSS. Line 10 gets
culate multivariate skewness and kurtosis.9 We slightly the SPSS macro into SPSS for use. The function mardia
modified the macro to make the output of univariate skew- calculates univariate and multivariate skewness and kurtosis
ness and kurtosis consistent to other software. To use the for the variables specified by the vars option on Line 11.
SPSS macro, first download the macro file mardia.sps to Note that the folder to the data file and the SPSS macro
your computer from our website. Then, open a script editor file needs to be modified to reflect the actual location of
(File->New->Syntax) within SPSS and include the follow- them.
ing SPSS script.
The SPSS output from the analysis of the ECLS-K data the output includes univariate skewness and kurtosis and
is given below. Similar to the SAS output, the first part of the second part is for the multivariate skewness and kurto-
sis. SPSS obtained the same skewness and kurtosis as SAS
9 The original macro can be downloaded at http://www.columbia.edu/ because the same definition for skewness and kurtosis was
ld208/Mardia.sps. used.
Behav Res (2017) 49:1716–1735 1729
The output from the R analysis is presented below. skewness and kurtosis as SAS and SPSS.
Clearly, it obtains the same univariate and multivariate
Behav Res (2017) 49:1716–1735 1731
kurtosis. The output of the analysis is provided below. The those of Micceri (1989) and Blanca et al. (2013) strongly
output is identical to the R output except for the variable suggest the prevalence of nonnormality in real data.
names for univariate skewness and kurtosis. This is because Our investigation on the influence of skewness and kurto-
by default the variable names are constructed using “V” and sis involved simulation studies on the one-sample t-test and
an integer in R. factor analysis. Through simulation, we concretely showed
that nonnormality, as measured by skewness and kurto-
sis, exerted great influence on statistical tests that bear the
Discussion and recommendations normality assumption. For example, the use of the t-test
incorrectly rejected a null hypothesis 17 % of the time
The primary goals of this study were to assess the preva- and the chi-squared test incorrectly rejected a correct factor
lence of nonnormality in recent psychology and education model 30 % of the time under some conditions. Therefore,
publications and its influence on statistical inference, as nonnormality can cause severe problems. For example, a
well as to provide a software tutorial on how to compute significant result might be simply an artificial effect caused
univariate and multivariate skewness and kurtosis. First, by nonnormality.
nonnormality clearly exists in real data. Based on the test Given the prevalence of nonnormality and its influence
of skewness and kurtosis of data from 1,567 univariate vari- on statistical inference, it is critical to report statistics such
ables, much more than tested in previous reviews, we found as skewness and kurtosis to understand the violation of nor-
that 74 % of either skewness or kurtosis were significantly mality. Thus, we highly recommend that journal editors
different from that of a normal distribution. Furthermore, and reviewers encourage authors to report skewness and
68 % of 254 multivariate data sets had significant Mardia’s kurtosis in their papers. In Table 7, we list the summary
multivariate skewness or kurtosis. Our results together with statistics that are critical to different statistical methods in
Behav Res (2017) 49:1716–1735 1733
and the R package rsem (Yuan & Zhang, 2012) are the Methods for Research in the Natural Sciences Statistics in
only softwares to offer truly robust methods that address Biology: Statistical Methods for Research in the Natural Sci-
ences (VOL 1, pp. 140–146). New York McGraw-Hill Book
all three points of concern (parameter estimates, standard
Company.
errors, and test statistics), and WebSEM and rsem offer them Box, G.E., & Cox, D.R. (1964). An analysis of transformations An
for free. Additionally, WebSEM has a user-friendly inter- analysis of transformations. Journal of the Royal Statistical Soci-
face in which researchers can draw the path diagram they ety. Series B (Methodological), 211–252. Retrieved 2016-07-26,
from. http://www.jstor.org/stable/2984418
wish to fit.
Corder, G.W., & Foreman, D.I. (2014). Nonparametric statistics: A
As shown in Fig. 3, there is no common distribution step-by-step approach: Wiley.
of practical data in psychology and education. With such D’Agostino, R.B. (1970). Transformation to Normality of the Null
diversity in data shapes and research goals, it is impossi- Distribution of g 1 Transformation to Normality of the Null
Distribution of g 1. Biometrika, 57(3), 679. doi:10.2307/2334794
ble to create one universal solution. However, we hope that
DeCarlo, L. (1997a). Mardia’s multivariate skew (b1p) and multi-
through this paper we were able to elucidate the problem variate kurtosis (b2p). Retrieved from. http://www.columbia.edu/
through our review of practical data and simulation and ld208/Mardia.sps
offer some feasible recommendations to researchers in our DeCarlo, L. (1997b). On the meaning and use of kurtosis. On the
meaning and use of kurtosis. Psychological methods, 2(3), 292.
field. It is our hope that researchers begin to take nonnor- doi:10.1037/1082-989X.2.3.292
mality seriously and start to report them along with means Huber, P.J. (1967). The behavior of maximum likelihood estimates
and variances that have already been established in data under nonstandard conditions. In Proceedings of the fifth Berkeley
analysis. We believe that reporting skewness and kurtosis symposium on mathematical statistics and probability (VOL. 1, pp.
221–233.
in conjunction with moving toward robust analysis offer Joanes, D., & Gill, C. (1998). Comparing measures of sample
two high-impact changes that can be made in the litera- skewness and kurtosis Comparing measures of sample skew-
ture at this time. These actions will not only increase the ness and kurtosis. Journal of the Royal Statistical Society:
transparency of data analysis, but will also encourage quan- Series D (The Statistician), 47(1), 183–189. 10.1111/1467-9884.
00122
titative methodologists to develop better techniques to deal Komsta, L., & Novomestky, F. (2015). Moments: Moments, cumu-
with nonnormality, improve statistical practices and con- lants, skewness, kurtosis and related tests moments: Moments,
clusions in empirical analysis, and increase awareness and cumulants, skewness, kurtosis and related tests. Retrieved from
knowledge of the nonormality problem for all researchers in https://CRAN.R-project.org/package=moments (R package ver-
sion 0.14.
our field. Levine, D.W., & Dunlap, W.P. (1982). Power of the F test with skewed
data: Should one transform or not? Power of the f test with skewed
data: Should one transform or not?. Psychological Bulletin, 92(1),
272. doi:10.1037/0033-2909.92.1.272
Acknowledgments This research is supported by a grant from the Mardia, K.V. (1970). Measures of Multivariate Skewness and
U.S. Department of Education (R305D140037). However, the contents Kurtosis with Applications Measures of Multivariate Skew-
of the paper do not necessarily represent the policy of the Department ness and Kurtosis with Applications. Biometrika, 57(3), 519.
of Education, and you should not assume endorsement by the Federal doi:10.2307/2334770
Government. Meyer, D., Dimitriadou, E., Hornik, K., Weingessel, A., & Leisch,
Correspondence concerning this article can be addressed to Meghan F. (2015). e1071: Misc Functions of the Department of Statistics,
Cain ([email protected]), Ke-Hai Yuan ([email protected]), or Zhiyong Probability Theory Group (Formerly: E1071), TU Wien. Retrieved
Zhang ([email protected]), Department of Psychology, University of from https://CRAN.R-project.org/package=e1071 (R package ver-
Notre Dame, 118 Haggar Hall, Notre Dame, IN 46556. sion 1.6-7).
Micceri, T. (1989). The unicorn, the normal curve, and other
improbable creatures. The unicorn, the normal curve, and
other improbable creatures. Psychological Bulletin, 105(1), 156.
References doi:10.1037/0033-2909.105.1.156
Palmer, E.M., Horowitz, T.S., Torralba, A., & Wolfe, J.M. (2011).
Anscombe, F.J., & Glynn, W.J. (1983). Distribution of the Kurto- What are the shapes of response time distributions in visual
sis Statistic b2 for Normal Samples Distribution of the Kur- search?. Journal of Experimental Psychology: Human Percep-
tosis Statistic b2 for Normal Samples. Biometrika, 70(1), 227. tion and Performance, 37(1), 58–71. Retrieved 2016-07-26, from
doi:10.2307/2335960 doi:10.1037/a0020747
Becker, M., & Klößner, S. (2016). PearsonDS: Pearson Distribution R Core Team (2016). R: A Language and Environment for Statistical
System PearsonDS: Pearson Distribution System. Retrieved from Computing R: A Language and Environment for Statistical Com-
https://CRAN.R-project.org/package=PearsonDS (R package ver- puting. Vienna, Austria R Foundation for Statistical Computing.
sion 0.98. https://www.R-project.org/
Blanca, M.J., Arnau, J., López-Montiel, D., Bono, R., & Bendayan, R. Rosseel, Y. (2012). lavaan: An R Package for Structural Equation Mod-
(2013). Skewness and Kurtosis in Real Data Samples Skewness eling lavaan: An R Package for Structural Equation Modeling.
and kurtosis in real data samples. Methodology: European Journal Journal of Statistical Software, 48(2), 1–36. http://www.jstatsoft.
of Research Methods for the Behavioral and Social Sciences, 9(2), org/v48/i02/
78–84. doi:10.1027/1614-2241/a000057 Sakia, R.M. (1992). The Box-Cox Transformation Technique: A
Bliss, C. (1967). Statistical Tests of Skewness and Kurtosis Statistical Review. The Statistician, 41(2), 169. Retrieved 2016-07-26, from
Tests of Skewness and Kurtosis. In Statistics in Biology: Statistical doi:10.2307/2348250
Behav Res (2017) 49:1716–1735 1735
Satorra, A., & Bentler, P. (1988). Scaling corrections for statistics of Mathematical and Statistical Psychology, 58(2), 209–237.
in covariance structure analysis (UCLA Statistics Series 2). Los doi:10.1348/000711005X64060
Angeles: University of California at Los Angeles: Department of Yuan, K.H., & Bentler, P.M. (1998). Normal theory based
Psychology. test statistics in structural equation modelling. British Jour-
Scheffe, H. (1959). The analysis of variance. The analysis of variance. nal of Mathematical and Statistical Psychology, 51(2), 289–
New York: Wiley. 309.
Shohat, J. (1929). Inequalities for Moments of Frequency Functions Yuan, K.H., Bentler, P.M., & Zhang, W. (2005). The Effect
and for Various Statistical Constants. Biometrika, 21(1/4), 361. of Skewness and Kurtosis on Mean and Covariance Struc-
doi:10.2307/2332566 ture Analysis: The Univariate Case and Its Multivariate Impli-
Tabachnick, B.G., & Fidell, L.S. (2012). Using Multivariate Statistics cation. Sociological Methods & Research, 34(2), 240–258.
Using Multivariate Statistics (6 ed). Pearson. doi:10.1177/0049124105280200
Wang, L., Zhang, Z., McArdle, J.J., & Salthouse, T.A. (2008). Yuan, K.H., & Zhang, Z. (2012). Robust structural equation modeling
Investigating Ceiling Effects in Longitudinal Data Anal- with missing data and auxiliary variables. Psychometrika, 77(4),
ysis. Multivariate Behavioral Research, 43(3), 476–496. 803–826. doi:10.1007/s11336-012-9282-4
doi:10.1080/00273170802285941 Zhang, Z., & Yuan, K.H. (2012). WebSEM: Conducting structual
Yanagihara, H., & Yuan, K.H. (2005). Four improved statistics for equation modelling online. Notre Dame, IN. Retrieved from.
contrasting means by correcting and kurtosis. British Journal https://websem.psychstat.org