Process Modeling and Optimization Using Focused Attention Neural Networks
Abstract
Neural networks have been shown to be very useful for modeling and optimization of nonlinear and even chaotic processes. However, in using standard neural network approaches to modeling and optimization of processes in the presence of unmeasured disturbances, a dilemma arises between achieving the accurate predictions needed for modeling and computing the correct gains required for optimization. As shown in this paper, the Focused Attention Neural Network (FANN) provides a solution to this dilemma. Unmeasured disturbances are prevalent in process industry plants and frequently have significant effects on process outputs. In such cases, process outputs often cannot be accurately predicted from the independent process input variables alone. To enhance prediction accuracy, a common neural network modeling practice is to include other dependent process output variables as model inputs. The inclusion of such variables almost invariably benefits prediction accuracy, and is benign if the model is used for prediction alone. However, the process gains, necessary for optimization, sensitivity analysis and other process characterizations, are almost always incorrect in such models. We describe a neural network architecture, the FANN, which obtains accuracy in both predictions and gains in the presence of unmeasured disturbances. The FANN architecture uses dependent process variables to perform feed-forward estimation of unmeasured disturbances, and uses these estimates together with the independent variables as model inputs. Process gains are then calculated correctly as a function of the estimated disturbances and the independent variables. Steady-state optimization solutions thus include compensation for unmeasured disturbances. The effectiveness of the FANN architecture is illustrated using a model of a process with two unmeasured disturbances and using a model of the chaotic Belousov–Zhabotinski chemical reaction. © 1998 Elsevier Science Ltd. All rights reserved.

Keywords: Neural networks; Steady-state optimization; Disturbance rejection; Process modeling
purposes such as process understanding, sensitivity analysis, and optimization, accuracy in the process gains (derivatives of outputs with respect to inputs) is also essential. With standard neural network approaches to modeling processes subject to unmeasured disturbances, obtaining accuracy in both the predictions and gains is often not possible. To understand why this is so, consider the following example of a distillation column. Typical process variables in a distillation column are:

- Manipulated variables: reboil steam, reflux.
- Measured disturbance variables: feed flow, column pressure.
- Output (controlled) variables: top and bottom compositions.

In addition, distillation columns are typically instrumented to monitor a number of other dependent variables (which the operators are not interested in controlling directly):

- Dependent (not controlled) variables: overhead temperature, bottom section temperature, reflux temperature.

These process variable categories are shown in Fig. 1.

Unmeasured disturbances such as feed composition, weather, catalyst degradation in reactors, and plant wear can cause outputs to vary despite fixed independent variable settings [14]. If the effects of such disturbances are at all significant, a neural network model containing only the independent process variables as inputs (the manipulated and measured disturbance variables, which contain no information about the disturbances) will be unable to accurately predict the output values. For instance, in the distillation column example, significant unmeasured disturbances would make it impossible to accurately predict the top and bottom compositions as a function of reboil steam, reflux, feed flow, and column pressure alone.

To improve prediction accuracy in such situations, a common neural network modeling strategy is to include dependent variables along with the independent variables as model inputs. In the distillation column example, this would mean adding the temperatures as model inputs in addition to the manipulated variables in order to aid prediction of the top and bottom compositions. Because dependent variables (e.g., the temperatures) reflect the effects of unmeasured disturbances (e.g., feed composition), including dependent variables as inputs to a model ordinarily does improve prediction accuracy of the output. However, because the functional relationship of the independent to the dependent variables is not represented in the model, and because the independent and dependent variables are usually highly correlated, the gains in such models are almost certain to be incorrect. In our example, this means that adding the temperatures to the model inputs will cause the gains of the top and bottom compositions with respect to the manipulated variables to be wrong. Consequently, while models containing dependent variables as inputs may have good ability to predict the output variables, optimization settings and sensitivity analysis are typically inaccurate due to incorrect gains. The Focused Attention Neural Network (FANN) allows steady-state neural network models to obtain accurate predictions and gains in the presence of unmeasured disturbances.
2. The problem

In short, the problem addressed by the FANN architecture is the following dilemma:

1. Information about unmeasured influences which is reflected in dependent process output variables is frequently necessary for neural network process models to have the required prediction accuracy. However,
2. adding dependent variables as model inputs causes the gains for the manipulated variables to be inaccurate.

This dilemma is illustrated in the following example, which is considered in three cases.

2.1. Case 1: no unmeasured disturbances

Assume a noiseless, linear plant¹ with manipulated input u, dependent variable s, and output variable y:

    y = a1 u + s
    s = a2 u
    y = a1 u + a2 u = (a1 + a2) u                    (1)

from which we can compute the process gain of y with respect to u:

    ∂y/∂u = a1 + a2                                  (2)

Since there are no unmeasured disturbances affecting the process, y can be modeled as a function of u only, as shown in Fig. 2. Given sufficient data, the trained neural network model of Fig. 2 will match the process Eq. (1) and yield correct predictions:

    ŷ = (a1 + a2) u                                  (3)

The gain of the model is also correct, matching the process gain Eq. (2):

    ∂ŷ/∂u = a1 + a2                                  (4)

2.2. Case 2: unmeasured disturbances, independent model inputs only

We now alter the process equations of Case 1 by adding a disturbance d to the state s:

    y = a1 u + s
    s = a2 u + d

Eliminating s we have:

    y = a1 u + a2 u + d = (a1 + a2) u + d

which differs from the Case 1 process Eq. (1) by the addition of the disturbance d. The gain equation for this process, on the other hand, is the same as the Case 1 gain Eq. (2):

    ∂y/∂u = a1 + a2                                  (5)

We here consider the same model structure as in Case 1 (Fig. 3). The presence of disturbances makes it impossible to accurately model y as a function of u only. Assuming the disturbance d is zero-mean, the trained neural network model will be identical to the Case 1 model Eq. (3).

¹ A linear plant is chosen for simplicity to illustrate the dilemma. It is shown later how a nonlinear system is modeled.
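Cases 1 and 2 can be checked numerically. The sketch below is not from the paper: an ordinary least-squares fit stands in for the trained neural network (the plant is linear, so this is a fair proxy), and a1 = 2, a2 = 3 are arbitrary illustrative values.

```python
# Numerical check of Cases 1 and 2 on the linear example plant.
# A least-squares fit stands in for the trained neural network.
import numpy as np

rng = np.random.default_rng(0)
a1, a2 = 2.0, 3.0                      # illustrative coefficients
u = rng.uniform(-1.0, 1.0, 1000)      # manipulated input

# Case 1: no disturbance.  y = a1*u + s with s = a2*u, i.e. y = (a1 + a2)*u.
y1 = a1 * u + a2 * u
c1 = np.linalg.lstsq(u[:, None], y1, rcond=None)[0][0]

# Case 2: zero-mean disturbance d added to the state s.
d = rng.normal(0.0, 0.5, u.size)
s = a2 * u + d
y2 = a1 * u + s                       # y = (a1 + a2)*u + d
c2 = np.linalg.lstsq(u[:, None], y2, rcond=None)[0][0]

print(c1)  # ~5.0 = a1 + a2: predictions and gain both correct
print(c2)  # ~5.0: the gain is still correct, but each prediction errs by d
```

Because d is independent of u, the single-input fit still recovers the true slope; only the residuals grow, matching the behavior described for Fig. 3.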
44 J.D. Keeler et al./ISA Transactions 37 (1998) 41–52
Fig. 3. Case 2: unmeasured disturbances exist, and y is modeled as a function of u only. The predictions for y are inaccurate by the amount d. However, the model gain is correct.

Fig. 4. Case 3: unmeasured disturbances exist, and y is modeled as a function of u and s. With this model structure, the predictions for y are accurate, but the gain for u is incorrect.
Table 1
Summary of standard neural network approaches

Summary of the models considered in Section 2, illustrating the dilemma faced by standard neural network approaches when used to model processes subject to unmeasured disturbances.
(*) This case was not considered because a correct model can be obtained with only independent inputs.
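The wrong-gain half of the dilemma (the Case 3 row of Table 1) is easy to reproduce. In this sketch (illustrative, not from the paper; a1 = 2 and a2 = 3 are arbitrary values, and least squares again stands in for the network), adding s as a model input makes the fit essentially exact but reports a gain of a1 for u instead of the true steady-state gain a1 + a2:

```python
# Case 3 sketch: including the dependent variable s as an input yields
# accurate predictions but an incorrect gain for the manipulated input u.
import numpy as np

rng = np.random.default_rng(1)
a1, a2 = 2.0, 3.0
u = rng.uniform(-1.0, 1.0, 1000)
d = rng.normal(0.0, 0.5, u.size)
s = a2 * u + d                 # dependent variable reflects the disturbance
y = a1 * u + s                 # true process: y = (a1 + a2)*u + d

X = np.column_stack([u, s])
coef, *_ = np.linalg.lstsq(X, y, rcond=None)
print(np.round(coef, 6))       # ~[2, 1]: near-perfect fit, but the apparent
                               # gain of y w.r.t. u is a1, not a1 + a2
```

The model attributes the a2*u contribution entirely to s, because u and s are highly correlated; this is precisely the correlation problem described in Section 1.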
process variables, estimating unmeasured disturbances, and thereby providing accurate predictions and gains. This structure is now described.

3.1. The FANN structure

First, we assume that the vector of outputs y is given by a function, in general nonlinear, of the vector of manipulated variables u and the vector of unmeasured disturbances d. Since d is not measured and cannot serve as a model input, a model of y instead takes the dependent variables s together with u as inputs:

    ŷ = J(u, ŝ)                                      (9)

A model of this form was considered in Case 3 of Section 2, which illustrated the fact that such a neural network model results (given sufficient data) in accurate predictions of y, but in incorrect gains. We now describe how the gains may be made correct without sacrificing prediction accuracy.

We first assume that the dependent process variables s are, like the outputs y, functions of u and d:

    s = f(u, d)

As in the case of y, because disturbances d are present, s cannot be predicted from u alone. We proceed by expanding f as follows:

    s = f(u, d) = g(u) + h(d) + l(u, d)              (10)

Writing D for the combined disturbance terms h(d) + l(u, d), Eq. (10) becomes:

    s = g(u) + D                                     (11)

Substituting Eq. (11) into the output relation y = J(u, s) gives:

    y = J(u, s) = J(u, g(u) + D) = Z(u, D)           (12)

Eqs. (11) and (12) now have the desired functional dependencies: the dependent process variables s and y are expressed as functions of the independent variables u and D. We denote a model representing Eq. (11) as:

    ŝ = ĝ(u) + D̂                                     (13)
Fig. 5. The FANN architecture, described by Eqs. (13) and (14). Model 1 and Model 2 are neural network models. This structure properly models the dependent variables s and outputs y as functions of the two types of independent variables, u (manipulated) and D̂ (estimated effects of unmeasured disturbances).
Thus, estimating s simply requires summing the estimates ĝ(u) and D̂. As is discussed below, ĝ(u) is obtained by training a model from u to s. The computation of D̂ is discussed in the next section.

Equations (9) and (13) together define the FANN model structure. Combining Eqs. (9) and (13) in the manner of Eq. (12), the FANN architecture can be expressed as:

    ŷ = Z(u, D̂)                                      (14)

The FANN model is shown in Fig. 5, where both Model 1 and Model 2 are neural network models.

Model 1 is trained to represent ĝ(u) by training a model with s as outputs and u as inputs. This is analogous to the Case 2 model of Section 2, with s as outputs instead of y. Model 1 will be inaccurate in its predictions of s to the degree that unmeasured disturbances are present, but (given sufficient data) its gains for s with respect to u will be correct. This is precisely what is desired. Model 2 is trained with y as outputs and u and s as inputs (in the training mode, ŝ is identical to s, by definition of D̂, described in the next section).

Thus, the FANN modeling structure correctly treats both u and D̂ as independent variables, and represents the functional dependence of s and y upon those independent variables. During optimization, D̂ is held fixed, and u is adjusted to achieve the desired setpoint for y. (This assumes that the unmeasured disturbances are varying slowly.) By fixing D̂, the optimization takes into account the effects of the unmeasured disturbances.

3.2. Computing D̂

The question remains as to how to compute D̂. Rearranging Eq. (11), we see that D can be given by

    D = s - g(u)                                     (15)

To estimate D with a model D̂, we note that the dependent variable s is measured and that g(u) may be modeled by ĝ(u) as described above. Therefore, D̂ may be computed simply as:

    D̂ = s - ĝ(u)                                     (16)

This relationship is shown in Fig. 6.

Estimation of the effects of unmeasured disturbances allows for accurate predictions as well as correct calculation of process gains as a function of the independent variables and the unmeasured disturbances. These factors enable compensation for unmeasured disturbances during optimization.
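On the linear example plant of Section 2, the whole pipeline of Eqs. (13)-(16) can be sketched in a few lines. This is illustrative only: least-squares fits stand in for Models 1 and 2, and a1 = 2, a2 = 3 are arbitrary values.

```python
# FANN pipeline sketch on the linear example plant: train Model 1 (u -> s),
# form D_hat via Eq. (16), train Model 2 ((u, s_hat) -> y), then read off the
# steady-state gain of y with respect to u at fixed D_hat.
import numpy as np

rng = np.random.default_rng(2)
a1, a2 = 2.0, 3.0
u = rng.uniform(-1.0, 1.0, 2000)
d = rng.normal(0.0, 0.5, u.size)
s = a2 * u + d
y = a1 * u + s

# Model 1: g_hat(u).  Its predictions of s are off by d, but its slope is correct.
g = np.linalg.lstsq(u[:, None], s, rcond=None)[0][0]

D_hat = s - g * u                       # Eq. (16): estimated disturbance effect
s_hat = g * u + D_hat                   # Eq. (13): equals s in training mode

# Model 2: y as a function of u and s_hat.
w, *_ = np.linalg.lstsq(np.column_stack([u, s_hat]), y, rcond=None)

# With D_hat held fixed, u acts both directly and through Model 1, so the
# FANN steady-state gain is dy/du = w[0] + w[1] * g.
gain = w[0] + w[1] * g
print(round(gain, 1))                   # ~5.0 = a1 + a2: the correct gain
```

Holding D_hat fixed in the gain calculation is exactly the optimization-time convention described above; a Case 3 fit on the same data would instead report a gain of a1 = 2 for u.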
Table 2
Summary of neural network approaches including FANN

The FANN architecture compared to the models considered in Section 2. Only the FANN architecture obtains both correct predictions and correct gains in the presence of unmeasured disturbances.
(*) These cases were not considered because a correct model can be obtained with only independent inputs.
Fig. 8. Actual versus FANN model estimates of the unmeasured disturbance d1.

Fig. 10. Actual versus FANN model estimates of the optimized values of the dependent variable s1.
Fig. 12. A portion of the time series of the dependent variable, s, and the output variable, y, from the modulated BZ reaction map.
The system displays bursts of chaotic activity as s increases above 3.4.
and y(t), as shown in Fig. 13. First, we trained a Case 2 neural network (Fig. 3) to model the BZ reaction. As expected, due to the unmeasured disturbance d(t), the model was unable to accurately predict the output y(t), and the accuracy of the model was only r² = 0.90. This indicates that the unmeasured disturbance is significant and needs to be estimated.

Next, we trained a Case 3 neural network (Fig. 4) to model the process, which included the dependent variable s(t) as a model input. As expected, the prediction accuracy improved to a high r² = 0.99, but the gain for u(t) was inaccurate (0.0025 the size of the gain for s(t)), and hence the model was ineffective when used to control the BZ reaction. Fig. 14 shows the extremely poor results of one-step-ahead control to a setpoint of p = 0.63 using the Case 3 model.

Lastly, we trained a FANN model on the BZ reaction. The prediction accuracy was identical to that of the Case 3 model (r² = 0.99). As Fig. 14 shows, control by the FANN model is dramatically better than with the Case 3 model (65% smaller RMS error overall). Because the FANN model correctly represents the behavior of the dependent variable s(t) and the output y(t) in response to changes to the manipulated variable, the gain function for u(t) is correct, and the optimized settings maintain the output y(t) at its setpoint despite the presence of fluctuating unmeasured disturbances.
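The one-step-ahead control loop of Fig. 13(b) can be sketched as follows. This is not the BZ map itself: the plant is replaced by the linear example of Section 2 (a1 = 2, a2 = 3 illustrative), and a simple bisection stands in for the nonlinear programming code. The point is only that, with D̂ held fixed, the optimizer moves u so that the FANN model output lands on the setpoint regardless of the current disturbance estimate.

```python
# One-step-ahead setpoint search in the spirit of Fig. 13(b), on the linear
# example plant rather than the BZ reaction (illustrative coefficients).
a1, a2 = 2.0, 3.0
P = 0.63                          # setpoint, as in the BZ experiment

def fann_predict(u, D_hat):
    """FANN model of the linear plant: y_hat = Z(u, D_hat) = (a1 + a2)*u + D_hat."""
    return (a1 + a2) * u + D_hat

def solve_u(D_hat, lo=-10.0, hi=10.0, iters=60):
    """Bisection stand-in for the nonlinear programming step (the model is
    monotonically increasing in u, so bisection suffices here)."""
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if fann_predict(mid, D_hat) < P:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

# A changing disturbance estimate shifts the chosen u, but the model output
# stays on the setpoint:
for D_hat in (-0.4, 0.0, 0.4):
    u_star = solve_u(D_hat)
    print(round(u_star, 3), round(fann_predict(u_star, D_hat), 2))
```

This mirrors the compensation mechanism of Section 3: the disturbance enters only through D̂, so refreshing D̂ from Eq. (16) at each step and re-solving for u rejects slowly varying disturbances.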
Fig. 13. (a) The mathematical structure of the simulation of the BZ plant. The dependent variable s(t) is a function of the manipulated variable u(t) and the unmeasured disturbance d(t). The output at the next time step, y(t+1), depends explicitly only on s(t) and y(t). (b) The FANN model does not receive the unmeasured disturbance d(t) as an input, and only u(t) is modifiable. The system computes optimal values for u(t) using a nonlinear programming algorithm which maintains the output y(t+1) at the setpoint P.
Fig. 14. BZ process dynamics under the one-step-ahead control of the Case 3 neural network model (Fig. 4) and the FANN model (Figs. 5 and 6). For both cases, the setpoint is 0.63, and the control method is a standard non-linear programming code. The process controlled by the Case 3 model differs only slightly from the original uncontrolled process (not shown), even though the manipulated variable (also not shown) takes on its extreme values. This failure is due to the wrong gain for the manipulated variable resulting from the inappropriate model structure. Using the FANN model, in which the gains are correct, results in the controlled output deviating from the setpoint only slightly in the most chaotic region, a vast improvement over both the original dynamics and the Case 3 model control.