1.3.2a Continuous Data Analysis With Analog Computers Using Statistical and Regression Techniques 1964
1.3.2a Continuous Data Analysis With Analog Computers Using Statistical and Regression Techniques 1964
2a
(1)
perience in the use and testing of the analog ap- f = N
proach.
Such computations can be performed Off-Line, On- which is recognized easily as the familiar arith-
Line Open Loop, or On-Line Closed Loop on con- metic average. For data analysis, this statistical
tinuous-signal inputs; digitizing of the analog signal property is important for two reasons: 1) it is fun-
is unnecessary. Noisy signals, unavoidable in many damental to the definition of other statistical param-
pilot or plant operations, whose deviation traces eters' and 2) it applies equaUywell to normal popu-
serve as the basis for subsequent calculations can lation distributions and to those that are not dis-
be "reduced" to more meaningful form by relative- tributed normally.
It would appear desirable to utilize this statistical
property in the analysis of continuous data or for ... t<tl I
o~___T~2()~I_______ [[:>____ I 1T2
f(tldt=f (t)
T_2_-_T_I__~TI~_____o
the measurement of continuous process variables
for purposes of control. In order to do so, it be-
comes necessary to obtain the "estimate" of the
mean as a continuous and changing function of Figure 2. Analog Circuit for Calculation of Estimate of the
time. Specifically, one must be ableJo define and Mean for a Fixed Time Interval
compute the average or mean value, f, of a signal,
f(t) , varying with time over the interval T 1~ t.$T 2' A refinement to this circuit would be to generate
f(t) as a continuously varying function of time as
As an example, assume that a steel mill is pro- shown in Figure 3. The time interval then must be
ducing a continuous metal strip which, ideally, considered as a variable so that the average is
should be uniform but actually is fluctuating in computed continuously from time T l' In Figure 3,
thickness (as shown in Figure 1) because of in- T 1 is considered to be zero computer time and T 2
evitable random disturbances in the process. Over
has been replaced by t since the upper limit of the
integral is a variable.
rn
DESIRED
w OR NOMINAL t
:J:
(.) VALUE
z
~~~~~~~~~~~bu~~~~~~~dr~~ f
I
(n="t L
rt f(t)dt
rn o
w
z
:.::
(.)
:J:
I-
+ f (t)
LENGTH:::: TIME
Figure 3. Analog Circuit for Calculation of a Continuous
Figure 1. Thickness of Steel Plate from a Rolling Mill as a Estimate of the Mean far a Fixed Time Interval
Function of Time
This value can be computed with the simple circuit The resetting necessary with the previous two cir-
of Figure 2. At time Tl the integrator is placed cuits can be aVOided, and a much simpler circuit
in the COMPUTE mode, and at time T2 its output is obtained without worry about overloads and divi-
observed. The integrator then can be reset and sion circuits. The clue to the method is the fact
another average taken. that past values of f(t) become obsolete. Since the
basic signal, f(t) , is continuous, it seems advan- This can be simplified by letting T 1- -00, or
tageous to let past inform.ation become obsolete
!fradually rather than abruptly. This means that
f(t) has to be defined in such a way that recent
values count much more heavily than earlier values -aT at
f (T) =a e 2 e f(t) dt (6)
and the behavior of f(t) in the remote past has very
little effect. This suggests that a weighted average
be used.
The weighted-average-f(t) of a function, f(t), over The minus infinity in the lower limit serves to in-
T 1 ~ t ~ T 2 with weight function <f> (t) is defined by dicate that the average has been generated for such
(1)* where <f> (t)~O in the interval Tl~t.:5T2' Thus a long time that the effect of what happened before
T 1 is negligible. In other words, since the exponen-
tial weighting function, eat, approaches zero as
t--oo, the importance of events prior to T1 is negli-
T2
1; 1
f(t) <f> (t) dt
gible if Tl is suitably chosen.
L f(t) e
at
dt (7)
T2
d f(T)
dT
= al(~e-aT) feat f(t) dt (9)
-00
f (t) a
.h 1
eat f(t) dt
(5)
+e -aT [eaT f(T)]}
aT2 aTl
e - e d f (T) = a[- f (T) + f(T)] = af(T) - af (T) (10)
dT
tThose familiar with linear analysis and, in particular, convolution
integrals, will recognize Equation 8 as the output of a filter whose
*Numbers in parentheses in the body of the text refer to references impulse response ;s ae -at; that is, a first·order filter with time
listed in APPENDIX 1. constant lla.
Equation 10 is implemented by the simple circuit In the circuit of Figure 4 it is obvious that an initial
of Figure 4, which is recognized easily as the cir- condition applied to the integrator will improve the
cuit for a simple filter or first order lag. Note that computed average at the beginning. This value should
represent a good guess as to the nominal or expected
mean value of f(t). One normally would have such an
-f (t)
estimate available. If itis a good estimate, the com-
puted average will be reasonable from the start; if it
is a bad one, it will not make any difference after
about three to five time constants.
ex THE VARIANCE
Figure 4. Analog Circuit for Obtaining the EMP Estimate of A second important statistical parameter is the
the Mean variance which is used to give a basic measure of
the distribution of a population. It is defined as the
square of the standard deviation and is equal to the
the input and output signals have been written in mean- squared deviation of the variable from its
terms of the more familiar notation for time, t, mean. For discrete data, an estimate of the variance
which is not to be confused with the dummy variable is obtained with the summation
of Equation 7.
4) average the square with a second filter At present, the "temperature" reading is inserted
circuit. manually into the charge balance computer (the
mean value of temperature is "guesstimated" by
From these requirements the circuit of Figure 6 the operator). This could be automated easily with
is derived easily. an EMP mean value circuit since the transducer
signal is a continuous electrical signal. The condi-
tion that the reading of the mean value circuit
should not be used at the beginning of the time
history, t<tl, for reasons mentioned previously,
can be automated by using the standard deviation
(variance) as a control criterion, i.e., when 0'2(T)
is greater than a reference value, do not use
f(T) , when 0' 2(T) is less than a reference value,
Figure 6. Analog Circuit for Calculatior of the EMP Estimate use f(T). The reference value chosen will depend
of the Variance on the maximum variance expected during smooth
pour conditions. This can be mechanized readily
Example: The LD Steel Process can be used as an on the analog computer by means of a comparator.
example of the use of the EMP mean and variance
for the control of a process. This is the oxygen With this simple technique a better estimate of the
steel making process wherein it is possible to con- mean temperature could be inserted into the charge
trol bath temperatures without an external fuel computer automatically and economically.
supply by charging the vessel with materials that
are thermally balanced. The charge materials con- AUTOCORRE LA TION
sist of hot metal (iron), scrap, and lime.
The autocorrelation function, defined as an integral
The hot metal temperature can range from 2200°F between fixed limits, is converted easily to a con-
to 2600°F, and, hence, it is necessary to measure tinuous EMP autocorrelation function, ¢ (T), by the
the temperature of the iron to obtain a correct definition
thermal balance.
-00
fIt) fIt -;) e -a (T - t) dt (13)
to
aw
2 2 (19)
(P +a) + w
1/.
Figure 9. Circuit for Fourth-Order Pade Approximation for
Ideal Time Delay of Magnitude r (5) Note that this circuit gives pew) at one value of w.
The parameter, w, can be changed merely by
ting a time delay, T. This circuit is accurate to with- changing the two potentiometers labelled "w"
in 1 degree of phase shift for input frequencies in
f(t) such that the product of the maximum useful
signal frequency, wm , with the time delay, T, shall
not exceed 6.5 radians, i.e.,
TW < 6. 5 radians
m- (14)
F(w) = 1 T
-00
f(t) e
-aCT - t)
e
-jwt
dt (15)
and Power Spectrum
REGRESSION ANALYSIS
r
There are many types of regression---linear,
quadratic, high order, multivariate, etc. These
N
m 2: Xi + Nb (22)
i=l
Figure 11. Circuit for Obtaining the "Least Squares"
N N Regression Parameters!!! and ~
"'"" X.Y. (23)
L..J 1 1 + LXi
i=l i=l that in calculating m and b from the definite in-
tegral we form an "algebraic" loop. This brings
up the question of circuit stability. It can be shown"
Since X and Y will be continuous functions of that once the computation is under way, (t>O), the
time, the discrete summation from 1 to N can loop will be stable unless X is constant. However,
be replaced by a time integral where the total if X is constant it cannot be used as an independent
time, t, is proportional to N. Therefore, Equa- variable in a correlation study. Overloads at the
tions 22 and 23 become very beginning of the computation (a region of no
interest) can be taken care of with feedback limiters
on the division amplifiers, or by using a "steepest
t descent" division circuit (7).
mf. o
Xdt+bt (24)
It should be observed that m and b are defined at
every instant of time. For small values of t
(corresponding to small sample size), the estimates
t of m and b will be relatively insignificant and,
f o
t XY dt
dt+bi Xdt (25)
hence, will be changing rapidly. As the time in-
terval increases, however, the values of m and b
become more significant and actually should reach
"steady state" or non-changing values.
These two equations can be solved simultaneously The technique for linear regression can be extended
to yield m and b as follows: to quadratic or higher order regressions, it then
being necessary to define a new set of equations--
t t such as Equations 22 and 23-- for determining the
b =!a Y dt - m fax dt
(26)
unknown parameters of the regression system.
Once the equations are defined, they can be con-
t verted to continuous integrals and instrumented by
standard analog techniques.
N N
Li=l
Yi LXi
i=l
m + b (28)
N N
N N N
L X.Y.
1 1 LX~ LXi Figure 12. Unsealed Analog Circuit for Calculation of
i=l i=l i=l (29) Continuous EMP Values of Regression
m + b
N N N Parameters I:!! and ~
CONCLUSIONS
Recalling the correspondence between EMP vari-
ables and discrete summations, one can transform The conversion of statistical parameters to EMP
Equations 28 and 29 immediately into continuous variables enables data analysis to be performed
EMP notation which gives continuously through the use of relatively simple
analog circuits. Limits on the size of the integra-
mX + b (30) tion interval normally encountered with continuous
signals have been eliminated; the need to "reset"
(31) the integrators is no longer required since they are
now serving as convolution circuits rather than pure
where m and b are now the EMP estimates of the accumulators. A continuous estimate of statistical
regression parameters. The analog circuit required parameters can be calculated readily.
is shown in Figure 12. The statements made with
regard to the stability of the circuit shown in Fig- The concept of replacing discrete summations with
ure 11 apply also to the algebraic loop found in the the EMP mean can be a valuable one. In addition to
Figure 12 circuit, obvious uses for instrumentation and control, for
both on-line and off-line systems, this technique
It should be observed that time has, in effect, been also can be used in analog simulation studies. For
taken out of the problem by the conversion to EMP example, it is sometimes desirable to calculate
variables. There is no longer any need to "reset" the rms value of a computed variable. This is ac-
the integrators since they are now serving as con- complished quite simply by 1) squaring the in-
volution circuits rather than pure accumulators. It stantaneous value of the variable, 2) taking the
follows that circuits similiar to those shown can be mean of the square of the variable with an EMP
instrumented for continuous higher order and con- circuit, and 3) taking the square root of the mean.
tinuous multi-variable regressions. All that is reo Other uses arise in simulation workwhere Gaussian
quired is more analog computing equipment. noise is used to disturb a particular parameter.
APPENDIX I: REFERENCES
(1) Davenport, W.B., Jr., and W.L. Root: "An Introduction to the Theory of Random Signals
and Noise", McGraw-Hill Book Company, Inc., New York, 1958.
(2) Otterman, Joseph: "The Properties and Methods for Computation of Exponentially
Mapped Past Statistical Variables", IRE TRANS. on Automatic Control, Volume AC-5
Number 1, January 1960, pp. 11-17.
(3) Volk, William: "Applied Statistics for Engineers", McGraw-Hill Book Company, Inc.,
New York, 1958, p. 136.
(4) Slatosky, W.J.: "End Point Temperature Control in LD Steel Making", Journal of
Metals, Volume 12, March 1960, pp. 226-230.
(5) Brenner, M.M., and J.D. Kennedy: "Dead Time Simulation for Electronic Analog Com-
puters", National Simulation Council, December 11, 1957.
(7) Favreau, R.R.: "Dividing Circuit Obtained by Applying Method of Steepest Ascent",
Princeton Computation Center Report 132, Electronic Associates, Inc., Prmceton,
New Jersey.
(8) Hannaner, George: "Algebraic Loops - Some Stability Considerations" Education and
Training Memo #22, Electronic Associates, Inc., Princeton, New Jersey.
EAr ELECTRONIC ASSOCIATES, INC. Long Branch, New Jersey
ADVANCED SYSTEMS ANALYSIS AND COMPUTATION SERVICES/ANALOG COMPUTERS/HYBRID ANALOG·DIGITAL COMPUTATION EQUIPMENT/SIMULATION SYSTEMS/
SCIENTIFIC AND LABORATORY INSTRUMENTS/INDUSTRIAL PROCESS CONTROL SYSTEMS/PHOTOGRAMMETRIC EQUIPMENT/RANGE INSTRUMENTATION SYSTEMS/TEST
AND CHECK·OUT SYSTEMS/MILITARY AND INDUSTRIAL RESEARCH AND DEVELOPMENT SERVICES/FIELD ENGINEERING AND EQUIPMENT MAINTENANCE SERVICES.