IdentifyingRule BasedTSKFuzzyModels
IdentifyingRule BasedTSKFuzzyModels
Manfred Männle
Institute for Computer Design and Fault Tolerance
University of Karlsruhe
D-76128 Karlsruhe, Germany
Phone: +49 721 608-6323, Fax: +49 721 608-3962
Email: [email protected]
ABSTRACT: This article presents a rule-based fuzzy model for the identification of nonlinear MISO (multiple input,
single output) systems. The presented method of fuzzy modeling consists of two parts: (1) structure modeling, i.e., the
determination of the number of rules and input variables involved respectively, and (2) parameter optimization, i.e., the
optimization of the location and form of the curves which describe the fuzzy sets and the optimization of the consequence
parameters. For structure modeling we use a modified TSK-model, which was first proposed by Takagi, Sugeno, and
Kang in [Takagi 85, Sugeno 88]. For parameter optimization of the initial model we propose singular value decomposition
(SVD). To optimize the parameters of further models we propose the use of RPROP [Riedmiller 93, Zell 94]. We applied
RPROP to the modified version of the TSK-model, implemented the algorithm, and tested its performance [Männle 95,
Männle 96]. In this article we focus on the structure modeling part. By means of two examples we show the performance
of RPROP and the input space partitioning performed by the structuring algorithm.
KEYWORDS: Nonlinear system identification, rule-based fuzzy models, input space partitioning, learning algorithms
1 INTRODUCTION
Fuzzy theory finds its application to deal with problems where there is imprecision caused by the absence of sharp criteria
[Zadeh 65]. An example for this is human language processing. Linguistic terms can be defined by fuzzy sets and we can
formulize fuzzy if-then rules [Zadeh 73]. Operators like AND and OR and the implication IF THEN are defined and
so we can somehow calculate with statements given in this form.
One important practical application of fuzzy theory is fuzzy control: The desired control actions are described by human
experts in the form of fuzzy if-then rules. Combined with fuzzification, a fuzzy inference machine, and defuzzification,
a set of fuzzy if-then rules provides a fuzzy controller, which showed itself as a robust and powerful controller in many
applications.
In this paper we describe a modeling approach. We exploit measured input-output data of an unknown process in order
to get an interpretable process description. Therefore, we automatically generate fuzzy if-then rules which describe the
process’ input-output behavior. This is commonly known as fuzzy modeling or fuzzy identification [Zadeh 94].
In reality, the model response is an approximation to the system response which holds for a restricted set of inputs. Such
an approximation can be designed at an arbitrary precision (e.g., using a polynomial covering all data points), but usually
you then have identified the set of inputs and not the underlying process. What we are searching here is an interpretable
model, i.e., a qualitative description of the underlying process. For this task fuzzy modeling has been proven to be a
suitable approach.
There are various kinds of fuzzy modeling. One is to describe the input-output relation of data with a fuzzy relation.
Relation-based approaches are for example described in [Pedrycz 91, Vrba 92, Chen 94]. Some comparisons of relation-
based approches can be found in [Postlethwaite 91, Böhler 94].
Another approach is to divide the input-output space into clusters. Fuzzy rules are generated by projection of these clusters
to the input space universes. Each rule represents one or several clusters which can be interpreted as local models. For
clustering, Sugeno and Yasukawa use the fuzzy c-means method [Sugeno 93], Nakamori and Ryoke use fuzzy c-varieties
[Nakamori 94].
In this work we have to deal with another problem. We want to identify complex systems with a a large number of input
variables. For this, methods yielding a polynomial or even exponential number of rules (depending from the number of
input variables) like relation-based approaches are not applicable. That is why we take a “bottom up” rule-based approach:
the algorithm starts with a one-rule model and adds additional rules at each refining step until the desired accuracy or a
maximum number of rules is reached. The model structure we use here is similar to the TSK-model’s [Zadeh 94], which
was first proposed by Takagi, Sugeno, and Kang in [Takagi 85, Sugeno 88]. Other derivations from TSK can be found for
example in [Yager 93, Tanaka 94, Jang 93, Yen 98].
In Section 2 we describe the modified TSK-model we use for fuzzy modeling. We also explain in detail the algorithm’s
search for the model structure and the parameter optimization of the initial model by singular value decomposition (SVD).
For a description of RPROP, the parameter optimization for the defined models, we refer to [Männle 95, Männle 96]. In
Section 3 we demonstrate the way RPROP works and how the structuring algorithm performs an input space partitioning.
As a first example we choose a step function in order to show the performance of the RPROP parameter optimization. This
type of function is relatively hard to approximate for a model having smooth sigmoidal membership functions, since the
optimization technique has to find a very high steepness of the membership function. The second example demonstrates
how the algorithm and especially the input space partitioning works. For this purpose we choose two Gaussian bell
functions as a more complex example of a two dimensional mapping. Section 4 closes this article by some conclusions
and perspectives.
2 FUZZY IDENTIFICATION
The modeling is performed iteratively. Each iteration consists of two phases (see figure 1): Beginning from a simple
model, a new and more complex structure (containing more fuzzy rules) is chosen and its parameters are optimized by a
learning algorithm based on the given training data. This step is repeated until the model is considered good enough or
the border for a maximum number of fuzzy rules is exceeded.
model
n good enough?
The training data consists of measured or simulated input-output data pairs
with
IR ,
IR. For , the fuzzy model performs a nonlinear mapping
$
!#" % (1)
i.e., it models a MISO system with & input variables. MIMO systems can be identified by applying several MISO models.
The fuzzy model used is derived from the fuzzy model of Takagi, Sugeno, and Kang [Takagi 85, Sugeno 88], which is
often referred to as TSK-model. The whole fuzzy model consists of a set of ' fuzzy rules ' ')( . Each rule consists
of a premise and a consequence. Consequences of Sugeno-like fuzzy models like the TSK model are simply a linear
combination of (possibly but not necessarily) all input variables. Therefore, each consequence defines a linear mapping
or, geometrically speaking, a hyperplane in the input-output space.
The premise of the fuzzy rule '+* !,.-/,0' is a conjunction of 12* fuzzy clauses of the form 4365 *+78!94:;* , i.e., it considers
those 1<* input variables, which are given in the index set =* . Using an index set for each rule is necessary, since a fuzzy
rule needs not to be complete, i.e., needs not to consider all input variables : ?> @ & . As we will see later, each
premise defines a subspace of the input space. Therefore, Sugeno-like models can be seen as piecewise linear models with
fuzzy transitions.
The initial fuzzy rule ' has for = BA the form
9 :;* : F S
55!5#" (4)
The parameter $ :;* describes the location, parameter % :;* the steepness of the > th membership function in rule - . Figure 2
(right) shows an example of two sigmoidal membership functions with $#'& and %)( (solid line) or %+*,( (dashed
line).
The - th rule’s strength - * for the input vector 4 with @ ; 4 /. 10 U U U20/. ; .4365 IR is given by
- * 7 98 9<:;* : and by choosing the product as t-norm we get -)*
?> 9<:;* : (5)
:;:=< :;:=<
For - A
' @B /. 10 U UU
0C. we also define the normalized strength D * 4
as
*
*4 G
*
-
H
D
( where we have D (6)
E
3 4
H F
3
F -
* F -
Figure 2 (left) shows the two consequences and (solid lines) and the model output (dashed line) of the fuzzy model
QP
7C 7 8 9
S & U
G2R K LNMO (
U
7*0
P P
GS 7 C
7 8 9
K LNMO (
)$ ,% T
U*,( for the membership functions 9 and 9 , which are shown in Figure 2 (right).
P P P
with $ T& % ( and
One can see how the model output is formed by the two linear consequences and
QP
with a fuzzy transition between
< V
*W& YX and <
& YX .
Z
2 2
f1 F1
f2 F2
1.5 y 1.5
1 1
0.5 0.5
0 0
-1 -0.5 0 0.5 1 -1 -0.5 0 0.5 1
u u
Figure 2: Fuzzy model containing two rules for a one dimensional input.
Adjusting a model’s parameters based on a set of training data is also called learning. For the given fuzzy models,
we have to optimize all parameters in the rules’ premises and consequences. The problem can be divided into two
subproblems: optimization of the premise parameters, which is a nonlinear optimization problem, and optimization of
the consequence parameters. Optimization of the consequence parameters reduces to a linear optimization problem when
fixing the premises’ parameters [Sugeno 85, Yen 98].
Parameter optimization is done by minimization of the global modeling error
P P
*
(8)
for
where
@ with
@ .
For the premise parameter optimization one could choose any nonlinear optimization technique. We apply RPROP, a
powerful nonlinear optimization technique which was originally developed for artificial neural network training [Ried-
miller 93, Zell 94]. We choose RPROP because it is much faster than simple gradient descent methods and it is easier
to apply than other sophisticated gradient descent methods as for example Levenberg-Marquardt algorithms [Hagan 94].
In this paper we do not explain in detail how to apply RPROP to the given fuzzy model, since this is already done in
[Männle 95, Männle 96].
For optimization of the consequence parameters we first apply singular value decomposition (SVD) which yields a nu-
merically stable solution of a linear least squares optimization problem [Klema 80, Chen 94].
Let the 0
& S U ' matrix be defined as
U
; U
D D D D ( D ( U
..
.
U
U
( ( U
D D D D D (9)
..
.
4
U
< 4
U
( 4
( 4
D D D D D
U
6 P R P P
with D * as in (6) and the consequence parameter vector P P R ()P ( P
( we can write the
model output of all examples simply as U P .
The D H in depend
P on the premise parameters $ and % . These are considered constant during the optimization of P . So,
the error only depends on the parameters P H and we have for the minimal error:
P
H
& (10)
P
*@
PC &
P
P (12)
, is diagonal,
and is orthogonal. Using this we get
P
( diagonal)
( orthogonal
)
(13)
Using the above approach for consequence parameter optimization and RPROP for premise parameter optimization at
each step we did not get good results. The reason is that the linear optimization calculates the best solution for the
consequence parameters in one step. After that, the iteratively adapting RPROP did not change the premise parameters
any more. Apparently, by applying SVD the parameter optimization of the whole fuzzy model kept sticking in a local
minimum. We therefore apply SVD only for the initial linear fuzzy model. For all other models we use RPROP to
iteratively adapt all parameters at the same time.
The determination of the optimal model structure is a combinatorial problem and an efficient algorithm to solve such a
problem is not (yet) available. Therefore, a heuristic search algorithm is used to find a good but not necessarily optimal
structure within a reasonable calculation time.
In this work we use a heuristic search very similar to that described in [Takagi 85, Sugeno 88]. In Figure 3 the heuristic
search algorithm is given in pseudo code. It alternately determines a new model structure , then optimizes this
structure and finally calculates its quality . In between each epoch ' , the best structure of all investigated structures
is saved in ( and becomes the starting point for the next epoch. In every epoch each rule is refined in each input
variable, yielding ' U & possibilities to examine. Thus, at every epoch the input space is once more divided by adding a
new rule.
Figure 4 shows the possibilities of model refinements to examine. The algorithm starts with one rule containing a linear
model. In the first epoch, assuming the refinement in the th variable was the best, we have a model with two rules with
premises containing 43 . Then, the model is further refined by examinating all U & possibilities yielding to a three-rule
model, say 3 * : , where the refinement in : in the second rule was best, and finally the four-rule model 3 * : * H
in the third epoch. As an advantage, the modeling need not start at the one-rule model. One can formulate à priori
knowledge by a set of rules and then start the algorithm improving this initial model.
When refining a model, the subspace covered by one rule is devided to be covered by two rules afterwards. The inital
partition (given by the fuzzy set’s $ ) is performed on the middle of the subspace.
The choice of the P stopping criterion for the optimization process is crucial for the quality of the model. We use the sum
squared error (8) to calculate the model quality. In order to avoid “over-learning”, i.e., the identification of noise in
the training data, we divide the training set into two sets A and B. Then, the RPROP algorithm is performed on A whereas
the quality calculation is based on B (cross validation). Thus, identification of noise in A usually yields a decreasing
quality since the noise in A and B can be considered different and the training procedure is terminated.
' @ /* initial linear model (see sect. 2.1) */
optimize /* e.g., through SVD (see sect. 2.2) */
calculate /* model quality */
while '.' and (not model good enough) do
' B' S
*
(
end if
end for
end for
if (
/* overall best model so far */
(
(
end if
end while
start
3 EXAMPLES
In this paper we focus on the heuristic search algorithm and the iterative way the models are more and more refined at
each epoch. The examples shown here have two aims:
As a first example we chose a step function in order to show the performance of the RPROP parameter optimization. This
type of function is relatively hard to approximate for a model having smooth sigmoidal membership functions, since the
optimization technique has to find a very high steepness of the membership functions.
The second example demonstrates how the structuring algorithm and especially its input space partitioning work. For this
purpose we chose Gaussian bell functions as a more complex example of a two dimensional mapping.
The evaluation of the algorithm’s performance is described briefly, since you can find two examples in [Männle 95,
Männle 96].
3.1 Step Function
Step functions are quite hard to approximate by models having smooth sigmoidal membership functions. Crisp functions
are usually hard to approximate by fuzzy models. This is because the optimization algorithm has to adjust the membership
function to be very steep, i.e., has to find very high values for % .
original r=1
0.9 0.9
0.6 0.6
0.3 0.3
0 0
-0.3 -0.3
0.5 0.5
-0.5 0 -0.5 0
x2 x2
0 0
x1 -0.5 x1 -0.5
0.5 0.5
Figure 5: Original step function (left) and model containing one rule (right).
r=2 r=3
0.9 0.9
0.6 0.6
0.3 0.3
0 0
-0.3 -0.3
0.5 0.5
-0.5 0 -0.5 0
x2 x2
0 0
x1 -0.5 x1 -0.5
0.5 0.5
Figure 6: Models for step function containing two (left) and three rules (right).
1 1
0 0
-0.5 0 0.5 -0.5 0 0.5
1
: if
P P
G S is 0.5 then T*W&
&
0
-0.5 0 0.5
1 1
G : if is 0.5 then V*W&
P
O
is 0.5 X
0 0
-0.5 0 0.5 -0.5 0 0.5
Figure 5 (left) shows the original step function. From figure 5 (right) to figure 6 you can see the development of the fuzzy
model. As expected, a model containing three rules completely describes the original function
In figure 7 you find the resulting fuzzy model. All $ are adjusted to the step which lies between 0 and 0.05. All % are
adjusted to values about 100 yielding the very steep membership functions which can be seen in figure 7.
To find the resulting model, six models had to be examined. In the first epoch two possible refinements (& ) of
the initial rule, in the second epoch & U ' U possibilities. To optimize the models’ parameters RPROP needed
on the average only about 16 iterations for each model, so that computation time of our C++ implementation on a Sun
UltraSparc1 was only a few seconds.
original r=2
1 1
0.5 0.5
0 0
-0.5 -0.5
0.5 0.5
-0.5 0 -0.5 0
x2 x2
0 0
x1 -0.5 x1 -0.5
0.5 0.5
Figure 8: Original Gaussian bell functions (left) and model containing two rules (right).
r=3 r=4
1 1
0.5 0.5
0 0
-0.5 -0.5
0.5 0.5
-0.5 0 -0.5 0
x2 x2
0 0
x1 -0.5 x1 -0.5
0.5 0.5
Figure 9: Models for Gaussian bell functions containing three (left) and four rules (right).
To visualize the way the modeling algorithm works, we chose a two dimensional function as an example. The function to
be approximated consists of two Gaussian bell functions in the two dimensional input space. From this function, 25x25
equidistant and normalized base points were generated. (see figure 8 (left)). For better comprehension of the resulting
models we chose the simplified fuzzy models containing rules with constant consequences.
The figures 8 (right), 9, and 10 show the development of the fuzzy model. The figures 11 to 13 show the found input
space partitioning (grey bars) and the investigated refinements (dashed lines) for the first six epochs. The dashed lines
also indicate the initial location of the new fuzzy sets before optimization. The fuzzy models of the first six epochs are
given in the figures 14 to 18 in the appendix.
r=5 r=6
1 1
0.5 0.5
0 0
-0.5 -0.5
0.5 0.5
-0.5 0 -0.5 0
x2 x2
0 0
x1 -0.5 x1 -0.5
0.5 0.5
Figure 10: Models for Gaussian bell functions containing five (left) and six rules (right).
0.5 0.5
R1
0 0 R2 R1
-0.5 -0.5
-0.5 0 0.5 -0.5 0 0.5
0.5 0.5
R1
0 R2 R1 R3 0 R2 R3
R4
-0.5 -0.5
-0.5 0 0.5 -0.5 0 0.5
0.5 0.5 R6
R1
R1
0 R2 R3 0 R2 R3
R4 R4
-0.5 R5 -0.5 R5
-0.5 0 0.5 -0.5 0 0.5
In figure 8 (right)
$
we can see the two partial models: one rule with $ *W& for “ small” (G S from figure 14) and the
2
other with & & X for “ big” (G R from figure 14) and a fuzzy transition between them. Rule G R is being refined into
two new rules. In figure 12 (left) you can see the resulting input space partitioning.
The model of the estimation inP figure 9 (right) is listed in figure 16. This model can be interpreted easily. For small
$ $
(GS ) and big (G ) we have *W& . The two “hills” in the middle of the -range are formed by the rules G R
and G by their positive consequence values.
Figure 10 shows the last refinements rounding off the bell functions. The final input space partition is shown in figure 13
(right), the complete fuzzy model is given in figure 18. Thus, in this example, the original function is modeled with only
six fuzzy rules.
A comparison with four other nonlinear modeling algorithms taken from [Frank 95] is given in [Männle 96]. In some
cases of the 40 models to identify the presented approach was outperformed by the best of the other algorithms, but it
shows itself as the overall best technique. Especially when approximating the real data it outperformed all other regarded
techniques. In [Männle 96] we also examined Box/Jenkins’ gas furnace [Box 76] as a well known example for a dynamical
system. For this data, a simple model containing only three rules already yields a satisfying approximation.
4 CONCLUSIONS
In this paper we focused on the structure modeling and input partitioning part of our approach to build rule-based fuzzy
models. The initial one-rule model is iteratively refined by adding rules, i.e., further partitioning the input space. Every
rule can be considered as a local model and the imprecision of the fuzzy sets results in smooth transitions between these
local models. We illustrated this by means of a two dimensional example in section 3.2.
We applied RPROP as a powerful optimization technique to the parameter optimization problem. We found it to be robust,
i.e., working fine with its standard parameters, and converging very fast to a good minimum (see section 3.1). This makes
it possible to built also models for high dimensional training sets (e.g., & J& ) that comprise many training examples.
The solution for finding a good model structure must be considered provisionary and it seems worth spending additional
effort in developing a better method.
Computation time grows quadratically when rising & linearly. This is the result of the heuristic search algorithm. A
stronger heuristic can further reduce this factor. For example by excluding input variables which are not involved in
premises during several epochs (potentially indicating that this variable does not contribute to the model and is unim-
portant), some branches of the search tree (figure 4) are cut. This can considerably reduce the search space. However,
practical investigations showed that variables, although seeming unimportant at the beginning, are sometimes needed late
in the modeling procedure to get a good model.
AnotherP promising approach is to iteratively refine the model in those input space regions where the highest approximation
error is made. Note, that one must regard regions since regarding single input-output pairs can cause a too big influence
of possible outliers.
The problem of structure modeling as it appears in this work is well suited for meta algorithms like for example genetic
algorithms. Tanaka et al. [Tanaka 94] investigated the use of a genetic algorithm for fuzzy models with trapezoidal
membership functions and a restricted model structure. When trying this, research effort should be spent to the coding
scheme, since the application of an appropriate coding scheme largely determines the success of such a genetic algorithm.
Finally, one could think of a “top down” approach instead of a “bottom up” method. For a top down approach one chooses
a fine partition containing many rules at the beginning and then deletes the rules found not to be necessary. This is done
for example in [Jang 93] and [Yen 98]. We did not investigate this approach since it has a crucial drawback when applying
it on some of our data sets: We found training data sets of real technical processes to often have an input dimension of 10
or even higher. Even when only choosing a coarse initial partition and dividing
R each input dimension by only four fuzzy
$
sets, the resulting models become infeasible, since you have to deal with Mio initial fuzzy rules. Even if you can
build a model and reduce the number of fuzzy rules there are usually so many rules left that it is very hard to interpret
the model. We found that if you want to build models automatically (including parameter optimization and structure
modeling) for a high dimensional training set you usually have no choice but a bottom up approach.
5 APPENDIX: FUZZY MODELS OF GAUSSIAN BELL FUNCTIONS
then
G2R : if is 0.5 &
& X
0
-0.5 0 0.5
1
GS : if T*W&
P
is 0.5 then
0
-0.5 0 0.5
Figure 14: Model for two Gaussian bell functions, two rules.
0
-0.5 0 0.5
1
: if < is 0.5
P
G S then T*W& X
0
-0.5 0 0.5
1
Figure 15: Model for two Gaussian bell functions, three rules.
1 1
then
G2R : if is 0.5 P
O
is 0.5 & YX
0 0
-0.5 0 0.5 -0.5 0 0.5
1
: if < is 0.5
P
G S then T*W&
0
-0.5 0 0.5
1
G : if P
is 0.5 O
is 0.5 then & (
0 0
-0.5 0 0.5 -0.5 0 0.5
Figure 16: Model for two Gaussian bell functions, four rules.
1 1
G : if is 0.5 P
O
is 0.5 then @ (
0 0
-0.5 0 0.5 -0.5 0 0.5
1 1
G : if is 0.5 P
O
is 0.5 then & &
0 0
-0.5 0 0.5 -0.5 0 0.5
Figure 17: Model for two Gaussian bell functions, five rules.
1 1
GS : if is 0.5
P
then T*W& &
0
-0.5 0 0.5
1
0 0
-0.5 0 0.5 -0.5 0 0.5
Figure 18: Model for two Gaussian bell functions, six rules.
REFERENCES
Böhler, A. 1994. Lösung von Fuzzy-Relationen-Gleichungen. Diplomarbeit, French-German Institute for Automation
and Robotics (IAR), Universität Karlsruhe, Germany.
Box, G.; Jenkins, G. 1976. Time Series Analysis, Forecasting and Control. Holden-Day, 6th edition.
Chen, J.; Lu, J.; Chen, L. 1994. An on-line identification algorithm for fuzzy systems. Fuzzy Sets and Systems, Vol. 64,
No. 1, pp. 63–72.
Frank, I. 1995. Modern Nonlinear Regression Methods. Chemometrics and Intellig. Laboratory Syst., Vol. 27, pp. 1–19.
Hagan, M.; Menhaj, M. 1994. Training Feedforward Networks with the Marquardt Algorithm. IEEE Transactions on
Neural Networks, Vol. 5, pp. 989–993.
Jang, J.-S. R. 1993. ANFIS: Adaptive-Network-Based Fuzzy Inference System. IEEE Transactions on System, Man, and
Cybernetics, Vol. 23, No. 3, pp. 665–685.
Klema, V.; Laub, A. 1980. The singular value decomposition: Its computation and some applications. IEEE Transactions
on Automatic Control, Vol. 25, No. 2, pp. 164-176.
Männle, M. 1995. Modellierung Nichtlinearer Systeme mit Unscharfen Regeln. Diplomarbeit, French-German Institute
for Automation and Robotics (IAR), Universität Karlsruhe, Germany.
Männle, M.; Richard, A.; Dörsam T. 1996. Identification of Rule-Based Fuzzy Models Using the RPROP Optimization
Technique. In Proceedings of Fourth European Congress on Intelligent Techniques and Soft Computing (EUFIT’96), pp.
587–591.
Nakamori, Y.; Ryoke, M. 1994. Identification of fuzzy prediction models through hyperellipsoidal clustering. IEEE
Transactions on Systems, Man, and Cybernetics, Vol. 24, No. 8, pp. 1153–1173.
Pedrycz, W. 1991. Processing in relational structures: Fuzzy relational equations. Fuzzy Sets and Systems, Vol. 40, pp.
77-106.
Postlethwaite, B. 1991. Empirical comparison of methods of fuzzy relational identification. IEEE Proceedings-D, Vol.
139, No. 3, pp. 199–206.
Riedmiller, M.; Braun, H. 1993. A Direct Adaptive Method for Faster Backpropagation: The RPROP Algorithm. In
Proceedings of the IEEE Int. Conf. on Neural Networks (ICNN), pp. 586–591.
Sugeno, M.; Kang, G. 1988. Structure Identification of Fuzzy Model. Fuzzy Sets and Systems, Vol. 26, pp. 15–33.
Sugeno, M.; Yasukawa, T. 1993. A fuzzy-logic-based approach to qualitative modeling. IEEE Transactions on Fuzzy
Systems, Vol. 1, No. 1, pp. 7–31.
Takagi, T.; Sugeno, M. 1985. Fuzzy Identification of Systems and its Application to Modeling and Control. IEEE
Transactions on Systems, Man, and Cybernetics, Vol. 15, pp. 116–132.
Tanaka, M.; Ye, J.; Tanino, T. 1994. Identification of Nonlinear Systems Using Fuzzy Logic and Genetic Algorithms.
SYSID, Vol. 1, pp. 301–306.
Vrba, J. 1992. Peak-pattern concept and and max-min inverse problem in fuzzy control modeling. Fuzzy Sets and
Systems, Vol. 47, No. 1, pp. 1–11.
Yager, R.; Filev, D. 1993. Unified structure and parameter identification of fuzzy models. IEEE Transactions on Systems,
Man, and Cybernetics, Vol. 23, No. 4, pp. 1198–1205.
Yen, J; Wang, L. 1998. Improving the Interpretability of TSK Fuzzy Models by Combining Global Learning and Local
Learning. IEEE Transactions on Fuzzy Systems, Vol. 6, No. 4, pp. 530–537.
Zadeh, L. 1965. Fuzzy Sets. Information and Control, Vol. 8, pp. 338–353.
Zadeh, L. 1973. The Birth and Evolution of Fuzzy Logic. International Journal on General Systems, Vol. 17, pp. 95–105.
Zadeh, L. 1994. The Role of Fuzzy Logic in Modeling, Identification and Control. Modeling, Identification, and Control,
Vol. 15, pp. 191–203.
Zell, A.; Mache, N.; Sommer, T. et al 1994. Stuttgart Neural Network Simulator, Users Manual, Version 3.2. University
of Stuttgart, Germany.