The Fuzzy Approach to Assessment of ANOVA Results
The Fuzzy Approach to Assessment of ANOVA Results
1 Introduction
In general, uncertainty expresses our lack of knowledge about the future behavior and
about the states of an investigated phenomenon. The oldest solution, provided by
Pascal, uses a probabilistic approach based on the frequency of events; Kolmogorov
however formalized it, basing on an axiomatic approach and Borel's field of sets.
Such an approach assumes implicitly that there is a possibility, at least a potential one,
to replicate the test many times and experimentally determine the associated asymp-
totic frequency. The investigations of Poincare [1] and Hadamard [2] provided at the
end of the 19th century and later in 1963 by Lorenz [3] revealed that a deterministic
system with large sensitivity may lead to chaos, practically indistinguishable from a
probabilistic random system.
* Corresponding author
A different approach to the uncertainty was presented in 1965 by Zadeh [4]. He
proposed to introduce a specific extension of the set algebra: the membership function
with a value varying from 0 to 1. Such an idea has been developed intensively since
the 1960s, in particular, the fuzzy arithmetic defined for real fuzzy numbers by Du-
bois and Prade [5].
In 1968, Zadeh [6] already considered the relation of the probability and fuzzy de-
scriptions of uncertainty, however, it was Buckley [7, 8] who offered in 2005 a con-
sistent approach to fuzzy estimators related to random samples defined as crisp da-
tasets. His concept of a fuzzy estimator is based on the mapping between a source
pair: a significance level and its related confidence interval and the resulting alpha-
cut. Such an approach requires rather difficult analytical transformations and it has
appeared to be rather impractical in a more complicated analysis.
In 2006, Grzegorzewski [9] chose the theory of decision as a starting point and de-
veloped a more general classification that contained three elements: analyzed data,
tested hypotheses and additional assumptions/conditions. The elements may be con-
sidered as fuzzy or non-fuzzy, which leads to many possible combinations. It is very
interesting as a conceptual idea but there is lack of practical instructions for the evalu-
ation of such tests.
Some elements of a fuzzy approach have been adopted by the design of experi-
ments analysis [10], which however, led to a very difficult inference. A similar at-
tempt to use the neural network approximation [11] revealed a large instability of
results affected by a random procedure of neural network identification. This may
also be caused by an unconsidered correlation between variables, which imposes a
selection of the particular pair of triangular norms [12].
In a further analysis the authors adopted Buckley’s approach but his very compli-
cated analytical transformations were replaced with a bootstrap approach [13], which
also led to the obtaining of empirical distributions. The distributions were used to
construct alpha-cuts related to fuzzy estimators.
2 Methods
A : R → [ 0,1] . (1)
{
A[α] = x ∈ R : A( x ) ≥ α ∧ α ∈ (0,1] . } (2)
The value of alpha-cut for α = 0 is specifically defined as a closed support of a mem-
bership function i.e.:
{
A[0] = x ∈ R : A( x ) ≠ 0 . } (3)
Buckley restricted the use of fuzzy numbers to the subtype of ‘triangular shaped fuzzy
numbers’, which means that a membership function is a combination of two monoton-
ic functions: left – monotonically increasing and right – monotonically decreasing.
Buckley used their inverse form to define the alpha-cut Q[α] of the fuzzy number Q
as a closed, bounded interval for 0 ≤ α ≤ 1 i.e.:
where β is used as a symbol of the significance level because the traditional symbol α
collides with the argument of the alpha-cut. The values of β vary from something very
small but different from zero to less than 1 e.g. 0.01 ≤ β < 1. The value of 1 is treated
in a special manner because it results in zero-length interval i.e. in a confidence inter-
val of 0 confidence.
Thus the fuzzy estimator θ of the statistics θ is defined inversely through its al-
pha-cut θ[α] :
SS Error = ∑∑ ( xij − xi ) .
k
2
(10)
i =1 j =1
SS Factor
MS Factor = , (11)
k −1
SS Error
MS Error = . (12)
n−k
Finally, F statistics is evaluated
MS Factor
F= . (13)
MS Error
If the assumptions are met, the F statistics should follow Fisher’s distribution with the
degrees of freedom equal to f1 = k – 1 and f2 = n – k, respectively. The significance
level p-Value is evaluated from an inverse cumulative distribution and the null hy-
pothesis is rejected if p-Value is smaller than the critical significance level α assigned
a priori.
( )
zij = µ + ai + ε ij , ε ∼ N 0, σ 2 , (14)
3 Materials
z = ln( y ) , (15)
where y denotes the obtained eutectic area and z denotes the transformed value.
The source data were divided into 6 groups with 31, 49, 80, 75, 64 and 61 values,
respectively [17]. The technological aim was to check the homogeneity between the
cross-sections.
4 Results
At the beginning, the classic analysis led to ANOVA table with crisp results (Tab.1).
Table 1. ANOVA table for transformed data (source [17]).
Effect SS df MS F p
Trace 3.318 5 0.664 0.411 0.841
Error 572.019 354 1.616 – –
Total 575.338 359 1.603 – –
Such results do not impose the rejection of a homogeneity hypothesis.
Next, the bootstrap procedure resulted in the dataset containing 10.000 records of
bootstrapped F and p values. The dataset size (10.000) was selected for convenience:
an easy selection of exact quantiles and – in reverse – an easy recalculation of the
membership function values associated with confidence intervals.
The obtained values of F and p were sorted, associated with their 1/nth of experi-
mental probability (i.e. 1/10.000 for the mentioned case) and – through the inverse
cumulative probability function – confidence intervals for different values of signifi-
cance level were evaluated.
5 Analysis
The description statistics for the bootstrapped F and p values are presented in Tab.2.
• classic ANOVA F = 0.411 is related to the fuzzy assessment 0.922 (see Fig.2),
• classic ANOVA p = 0.841 is related to the fuzzy assessment 0.922 (see Fig.4).
Firstly, it is worth noting that both assessments are equal, which means that such
fuzzy estimators are consistent in decision making. Secondly, such a relatively large
value means that the hypothesis of means equality may be treated with a high degree
of certainty. Such conclusions are a slightly different in comparison to the statistical
orthodox: ‘not reject’, which does not mean ‘accept’.
6 Conclusion
References
1. Poincaré, J.H.: Sur le problème des trois corps et les équations de la dynamique.
Divergence des séries de M. Lindstedt. Acta Mathematica 13, 1-270 (1890)
2. Hadamard, J.: Les surfaces à courbures opposées et leurs lignes géodesiques.
Journal de Mathématiques Pures et Appliquées 4, 27-73 (1898)
3. Lorenz, E.N.: Deterministic non-periodic flow. Journal of the Atmospheric
Sciences 20, 130-141 (1963)
4. Zadeh, L.A.: Fuzzy sets. Information and Control 8, 338-353 (1965)
5. Dubois, D., Prade, H.: Fuzzy real algebra: some results. Fuzzy Sets and Systems 2,
327-348 (1979)
6. Zadeh, L.A.: Probability Measures of Fuzzy Events. J Math Anal Appl 23, 421-&
(1968)
7. Buckley, J.J.: Fuzzy statistics: hypothesis testing. Soft Computing 9, 512-518
(2005)
8. Buckley, J.J.: Fuzzy probability and statistics. Springer Verlag, Heidelberg (2006)
9. Grzegorzewski, P.: Decision support under uncertainty. Statistical methods for
imprecise data. EXIT, Warszawa (2006)
10. Pietraszek, J.: Fuzzy Regression Compared to Classical Experimental Design in
the Case of Flywheel Assembly. Lect Notes Artif Int 7267, 310-317 (2012)
11. Pietraszek, J., Gadek-Moszczak, A.: The Smooth Bootstrap Approach to the
Distribution of a Shape in the Ferritic Stainless Steel AISI 434L Powders. Solid
State Phenomen 197, 162-167 (2013)
12. Pietraszek, J.: The Modified Sequential-Binary Approach for Fuzzy Operations on
Correlated Assessments. Lect Notes Artif Int 7894, 353-364 (2013)
13. Shao, J., Tu, D.: The Jackknife and Bootstrap. Springer, New York (1995)
14. Davies, L., Gather, U.: Robust Statistics. In: Gentle, J.E., Hardle, W.K., Mori, Y.
(eds.) Handbook of Computational Statistics, vol. 2, pp. 711-749. Springer,
Heidelberg (2012)
15. Fisher, R.A.: Statistical Methods for Research Workers. Oliver and Boyd Press,
Edinburgh (1925)
16. Rutherford, A.: Introducing ANOVA and ANCOVA - A GLM Approach. SAGE
Publications Ltd., London, Thousand Oaks, New Delhi (2001)
17. Szczotok, A., Nawrocki, J., Gądek-Moszczak, A., Kołomycki, M.: The Bootstrap
Analysis of One-Way ANOVA Stability in the Case of the Ceramic Shell Mould
of Airfoil Blade Casting. Solid State Phenomena 235, 24-30 (2015)