Quantitative Validation of Mathematical Models: Kevin Dowding
Quantitative Validation of Mathematical Models: Kevin Dowding
The additional dependencies identified on the left side of Eq. where ε T is the experimental measurement error.
(22) are the physical/material property dependence Experiments intended for validating a model require more
φ p = { k, L } , and boundary conditions x = { q 0, T L } . The than physical measurements to compare with a model. The
relationship of the model to this particular experiment is experiment must provide material/physical properties and
through the values in φ p, x . The values that are ultimately used boundary/initial conditions for input to the model. The material/
in the model will have error relative to the “true” (unknown) physical properties and boundary conditions relate the model to
values for the given experimental apparatus. These errors (the specific) experiment. The values used in a model are either
contribute to the inherent error and affect the model predictions. measured during the validation experiment or taken from
another source. Examples of a source other than the validation
Model values to be studied in this case are differences experiment are previous experiments or expert opinion. Material
between the dependent variable (temperature) of the properties are generally not measured directly in a validation
mathematical equations and the boundary temperature experiment, while boundary conditions are typically measured.
Therefore, in addition to measuring temperature differences,
q̂ 0 L̂
Ŝ ≡ T ( z, φ̂ p, x̂ ) – T L = --------- 1 – --- .
z which we will compare to model values, we simulate measuring
(23)
k̂ L̂ the boundary condition. By measuring a temperature difference,
we only need the heat flux boundary condition for the model.
D̂ 1, i D̂ 2, i D̂ 3, i D̂ 4, i D̂ i ≡ Ŝ Ŝ
Exp Exp
1
4 q̂ 0, i °C °C
--- ∑ D̂ j, i
4j = 1 1 102.84 6 107.19
Exp 2 103.62 7 111.91
°C °C °C °C °C W /m 2
3 90.53 8 107.05
1 99.37 103.57 96.65 101.91 100.37 1439.9
4 113.88 9 107.10
2 101.65 96.70 100.26 93.99 98.15 1450.7 5 105.84 10 107.88
3 101.39 99.04 103.71 98.11 100.56 1267.5
4 96.30 103.17 99.66 101.14 100.07 1594.4 ANALAYSIS
5 93.64 98.07 97.89 96.95 96.63 1481.8 Sensitivity Analysis
6 104.56 99.89 103.68 97.91 101.51 1500.8 The data in Table 1 are available to study the accuracy of the
7 97.65 101.76 99.25 101.44 100.02 1566.8 mathematical model in Eq. (19) through Eq. (21). An obvious
8 99.77 102.67 106.93 101.57 102.73 1498.8 question is: What does a comparison of these data with model
predictions in Table 2 demonstrate about the model? The value
9 102.74 100.17 96.68 101.46 100.26 1499.5 of experimental measurements to gain insight to a model is not
10 99.17 103.83 105.59 98.43 101.76 1510.3 intuitively obvious. Potentially, the experimental data are not
suitable for assessing certain characteristics of the model. By
Model Predictions studying the sensitivity of the model we can learn about the
dependence of the model to predict the experimental data.
The model prediction, Ŝ ( z ) , is the solution of the If model predictions of the experimental data are insensitive
mathematical equations in Eq. (19) through Eq. (21) with to the model, or equivalently parameters in the model, little is
estimated/measured values of properties and boundary learned about the validity of the model by comparing with the
conditions. In this simple example an analytical solution exists validation data. It may be possible to conclude that the model is
as given in Eq. (23). The parameters for the model are physical (fortuitously) valid, when in fact good agreement is achieved
because there is lack of dependence on the model. Likewise, if
∂Ŝ q̂ 0 L̂
k̂ ------ = – Ŝ ( z ) = – --------- 1 – ---
z
(27) k , ( W /m°C )
k̂ L̂
∂k̂
Previous Exper- 14 1 15
iment
120
which represents the relationship between the error k̂ – k of
110 experiment i and q 0ˆ, j – q 0, j of experiment j. In most cases we
100
would not expect the error in model inputs to be related. When
we have a model input that is a constant for each experiment,
90 but unknown, the error due to not knowing the constant affects
the prediction for all experiments. In our example, if the
80
estimated thermal conductivity is too high, it will impact all
model value Ŝ model predictions in a similar manner. Consequently, the error
70
0 2 4 6 8 10 12
in thermal conductivity is correlated ρ kk = 1 in Eq. (33)
Experiment between experiments. There is no reason to believe the errors in
Figure 1. Model prediction with 2σ uncertainty bars other model inputs are correlated,
( ρ kq = ρ q k = ρ q q = 0 ) ( i ≠ j ) . We can simplify Eq. (33) to
0 0 0 0
∂Ŝ i ∂Ŝ i
E [ Ŝ i – S ] = E [ ( k̂ – k ) ] ------- + E [ ( q̂ 0i – q 0, i ) ] -------- . (31)
∂k̂ ∂q̂ 0 2 σ̂ k 2 2 σ̂ q 0
2
k̂, q̂ 0 q̂ 0, k̂ Ŝ i ------- + Ŝ i --------- (i = j)
k̂ q̂ 0
V S ≡ cov ( Ŝ i – S, Ŝ j – S ) = .
(35)
2 σ̂ k 2
2 ∂Ŝ i 2 ∂Ŝ i Ŝ i Ŝ j ------- (i ≠ j)
2
σ̂ S = E [ ( Ŝ i – S ) ] = E [ ( k̂ – k ) ] ------- + E [ ( q̂ 0i – q 0, i ) ] --------
2 2
= k̂
i
∂k̂ ∂q̂ 0
σ̂ k 2 2 σ̂ q0 2
2
----- The values for the covariance matrix of the ten experiments
k̂ i q̂ 0 i
- Ŝ + -------- Ŝ (32)
are listed in Table 4. Note that σ̂ S i are values along the diagonal
of VS . Since the model has (assumed) independent
A plot of the model predictions with uncertainty bars
measurements of heat flux we do not have off-diagonal
calculated as two times the standard deviation due to input
contributions for the heat flux as shown in Eq. (35). The model
uncertainty is shown in Fig. 1. These uncertainty limits
predictions of the 10 experiments are seen to be correlated
demonstrate the range within which we would expect the “true”
value of our model to reside if the “true” values k and q 0 are Cov ( Ŝ 1 – S, Ŝ 2 – S ) ( 7.37 )
2
ρS = ----------------------------------------------- = ------------------------------------- = 0.52 . (36)
bounded by the standard error given in Table 3. Given that we 1, S2
σ̂ S σ̂ S ( 10.24 ) ( 10.28 )
1 2
have used estimates of these values, the applicability of the
range shown to contain the “true” value is conditioned upon the
accuracy of the estimates and the standard error.
Table 4: Covariance matrix of model predictions, VS
Consider predicting multiple experiments, say experiments i
and j. The covariance matrix for model predicts of the two
Exp 1 2 3 4 5 6 7 8 9 10
experiments is
1 10.24 7.37 6.89 7.73 7.45 7.50 7.66 7.49 7.49 7.52
cov ( Ŝ i – S, Ŝ j – S ) = E [ ( Ŝ i – S ) ( Ŝ j – S ) ] = 2 7.37 10.28 6.91 7.75 7.48 7.52 7.69 7.52 7.52 7.55
3 6.89 6.91 9.63 7.25 6.99 7.03 7.18 7.03 7.03 7.05
∂Ŝ i ∂Ŝ j ∂Ŝ i ∂Ŝ j
ρ kk σ̂ k σ̂ k ------- -------- + ρ kq σ̂ k σ̂ q ------- -------- 4 7.73 7.75 7.25 10.82 7.84 7.89 8.06 7.72 7.88 7.91
∂k̂ ∂k̂ 0 0j
∂k̂ ∂q̂ 0
5 7.45 7.48 6.99 7.84 10.40 7.60 7.77 7.60 7.60 7.63
∂Ŝ i ∂Ŝ j ∂Ŝ i ∂Ŝ j 6 7.50 7.52 7.03 7.89 7.60 10.47 7.82 7.65 7.65 7.68
+ ρ q k σ̂ q σ̂ k -------- -------- + ρ q q σ̂ q σ̂ q -------- -------- ,(33)
j ∂q̂
0 ∂k̂
7 7.66 7.69 7.18 8.06 7.77 7.82 10.72 7.81 7.82 7.84
0 0i 0 0 0i 0j
∂qˆ ∂qˆ 0 0
8 7.49 7.52 7.02 7.88 7.60 7.65 7.81 10.46 7.64 7.67
In Eq. (33) a correlation coefficient has been defined 9 7.49 7.52 7.03 7.88 7.60 7.65 7.82 7.64 10.46 7.67
10 7.52 7.55 7.05 7.91 7.63 7.68 7.84 7.67 7.67 10.50
The analysis and text, with minor editing, that follows was
120
and prediction error. These terms cannot be separated
110 individually, because D and S are not known, but they can be
separated statistically by their variances.
100
If we assume that temperature measurement errors are
90 independent of flux measurement errors and of prediction
errors, which is generally plausible and which we know to be
80 model value Ŝ the case in this example, then Eq. (6) leads to the relationship
exp value D̂
70
0 2 4 6 8 10 12 var ( D̂ – Ŝ ) = var ( D̂ – D ) + var ( Ŝ – S ) + var ( D – S ) (37)
Experiment
where var ( . ) denotes the variance of the parenthetic quantity.
Figure 3. Comparison of experiment and model
The square of the standard deviation of residuals gives an
values with 2σ uncertainty bars on each estimate of the left-hand variance. The experimental data and
We thus have the following estimated standard Values of r 2 calculated using different error (covariance)
deviations: σ̂ e = 6.48°C , σ̂ D = 1.51°C , and σ̂ S = 6.31°C matrices are listed in the Table 8. By comparing the value of r 2
The root sum of squares of the two measurement error with the critical value we have statistical evidence as to whether
contributors to the variance of the residuals is equal to the residuals are statistically significant relative to the error in
2 2 the model and experiment. For the total error, V T , or error in the
6.31 + 1.51 = 6.49°C , which is essentially equal to the
standard deviation of the observed prediction errors, which is model, V S , we see that the value is less than the critical value.
6.48°C . Thus, measurement error accounts for all of the Hence we do not have statistical evidence to reject the residuals
observed variability. There is no evidence in these data of as being consistent with these errors. The outcome is different
extraneous extra-model, unmodeled physics contributing to the when we only account for error in the experiment. We would
variability of residual. That is, this analysis leads to the reject the residuals as being consistent with error in the
conclusion that var ( D – S ) = 0 . This finding thus captures the experiment because the value exceeds the critical value.
way that the data and predictions were in fact generated. Discussion
Prediction error in this example is all bias, no variance.
The two metrics studied in this example are aimed at
A finding of bias should lead to an analysis aimed at answering different questions. Error model estimation uses the
determining the source of the bias and a means of eliminating it residuals to quantify the accuracy of the model for predicting
in subsequent predictions. In this case, as is clear from Eq. (23), the experiment [7]. Consistency testing [9] (statistically)