Uncertainty Evaluation in Reservoir Forecasting by Bayes Linear Methodology
Uncertainty Evaluation in Reservoir Forecasting by Bayes Linear Methodology
1 Introduction
For these tasks, Monte Carlo methods are computationally too expensive,
as too many simulation runs are required. As shown in [3, 6, 8], more sophis-
ticated statistical approaches, such as response surface methodology (RSM)
or Bayesian approaches, are more appropriate than Monte Carlo methods.
When s(x) is a smooth function, one can use multiple regression tech-
niques to approximate s(x) from a few simulation runs. In the RSM, a linear
model is used, i.e., a linear combination of q fixed basis functions; usually low
order polynomials. The coefficients of the linear model are calculated using a
standard least squares technique.
RSM was originally introduced in physical experiments, where each obser-
vation of a physical process is subject to measurement error. In contrast, a
simulator is deterministic, i.e., rerunning the code with the same inputs gives
identical observations. In this case, an interpolatory estimator rather than an
approximation is usually preferred. A Bayesian approach yields, unlike RSM,
an interpolatory (posterior) estimator, see the appendix of [3] for details.
Application of a Bayesian approach results in updating a prior distribution
of a statistical model sB by Bayes’ rule,
PPost (sB (x)|sX ) ∝ PPrior (sB (x)) PLikelihood (sX |sB (x)),
E[sBL (x)|sX ] = E[sBL (x)] + Cov[sBL (x), sX ]Var[sX ]−1 (sX − E[sX ]),
Var[sBL (x)|sX ] = Var[sBL (x)] + Cov[sBL (x), sX ]Var[sX ]−1 Cov[sX , sBL (x)].
Prior knowledge about the random process is usually built by expert elicita-
tion [4]. In our case, an initial set of simulator runs is used to support the
elicitation process. This initial data is not analyzed statistically. The data is
rather interpreted by reservoir engineers who provide estimates of the prior
4 D. Busby, C.L. Farmer, A. Iske
mean E[sBL (x)] and variance Var[sBL (x)]. The required selection of the ac-
tive variables x∗ and of the regression functions g in (1), usually low order
polynomials, is done through sensitivity analysis, as described in [10, 14].
We decided to work with the autocovariance function
The design criterion we work with relies on the maximum mean square
error (MMSE). In this case, design points, x∗ , are sequentially added, one
at a time, where the posterior variance Var[sBL (x)|sX ] of the current Bayes
(m)
linear emulator sBL ≡ sBL is maximal among all x ∈ χ. In this way, the
(m+1)
prediction error of the subsequent (posterior) emulator sBL vanishes at x∗ .
A similar design criterion is proposed in [13], but for kriging.
In summary, each step of the sequential design is performed as follows.
(1) Compute an input configuration x∗ which maximizes Var[sBL (x)|sX ];
(2) Run the simulator at the selected configuration x∗ to obtain s(x∗ );
(3) Rebuild the emulator by including the new simulator output s(x∗ ).
As regards a stopping criterion, we chose a customized diagnostic measure
which relies on the prediction error
(m−1)
η(m) = |sBL (xm ) − s(xm )|,
where xm = x∗ denotes the design point which was added at step m, and
(m)
s(xm ) is the simulator response at xm . Note that sBL (xm ) = s(xm ). When
the sequence η(m) of prediction errors stabilizes, i.e., |η(m)−η(m−1)| < TOL
(m)
for some tolerance TOL, we take sBL as an a sufficiently accurate emulator.
The PUNQS test case relies on a synthetic reservoir model taken from the
North Sea Brent reservoir, a real-world oilfield. The PUNQS test case is fre-
quently used as an industrial reservoir engineering model since its use in the
European research project PUNQ [11] is a benchmark test for comparative
inversion studies and for stochastic reservoir modelling.
A top structure map of the PUNQS reservoir field is shown in Figure 1.
The geological model contains 19 × 28 × 5 = 2660 grid blocks, 1761 of which
are active. The reservoir is surrounded by a strong aquifer in the North and
in the West, and it is bounded by a fault to the East and to the South. A
small gas cap is located in the centre of this dome-shaped structure. The
geological model consists of five independent layers, where the porosity distri-
bution in each layer was modelled by geostatistical simulation. Initially, the
field contains six production wells located around the gas-oil contact. Due to
the strong aquifer, no injection wells are required.
As suggested by reservoir engineers, we consider the following seven main
sources of uncertainty: (i) the analytical coefficient of the aquifer strength,
AQU, (ii) the residual gas oil saturation, GOS, (iii) the residual water
oil saturation, WOS, (iv) the vertical permeability multiplier in low qual-
ity sands, VPML, (v) the vertical permeability multiplier in high quality
6 D. Busby, C.L. Farmer, A. Iske
Fig. 1. PUNQS test case. Top structure map of the reservoir field.
(a) (b)
Note that the response surface sBL obtained from the Bayes linear estima-
tor (Figure 2 (a)) is, in comparison with sRSM of RSM (Figure 2 (b)), much
closer to the true response surface P15OPR, and so the Bayes linear esti-
8 D. Busby, C.L. Farmer, A. Iske
Method η1 η2 η∞
BL 3.0 4.6 17.6
RSM 6.2 7.1 16.2
In our second test case, we consider the bottom hole pressure at well
PRO15 after 13 years, response surface P15BHP, as a function of HPMH
and GOS. The design set X was constructed by applying FFD to obtain an
initial set of 7 points, followed by a sequential design for further 2 points,
yielding m = 9 design points in total.
Figure 3 displays the response surface of the Bayes linear emulator, sBL ,
and the response surface obtained by RSM, emulator sRSM , each of which was
constructed by using m = 9 design points. For comparison, Figure 3 displays
9 × 9 grid points of the true response surface. Our numerical results are shown
in Table 2.
(a) (b)
Method m η1 η2 η∞ m η1 η2 η∞
BL 7 3.7 5.3 13.4 9 2.7 4.3 12.6
RSM 7 4.2 5.5 12.4 9 3.6 4.8 11.3
different emulators, sBL and sRSM . Note that Table 2 involves two different
comparisons, one using the initial set of m = 7 design points, the other using
all m = 9 design points. Note that the accuracy of the emulator sBL is, unlike
that of sRSM , significantly improved by the adaptive insertion of only two
design points, x8 and x9 . Moreover, the prediction quality of the Bayes linear
emulator sBL is superior to that of sRSM not only in smooth regions of the true
surface P15BHP, but also in regions where P15BHP is highly nonlinear.
However, the emulator sBL exhibits small overshoots near discontinuities of
P15BHP, which explains the somewhat inferior prediction error η∞ of sBL .
The same comment applies to our first test case, see Table 1.
5 Conclusion
We have shown the utility of Bayes linear methodology, in combination with
sequential adaptive design, for uncertainty evaluation in reservoir forecast-
ing. The resulting Bayes linear estimation has been applied to the PUNQS
10 D. Busby, C.L. Farmer, A. Iske
test case, a rather simple but fairly realistic and frequently used model prob-
lem from reservoir engineering. The performance of the resulting emulator
has been compared with that obtained from the response surface method-
ology (RSM), the basic method of commercial reservoir software, such as
COUGAR [2]. We found that the Bayes linear methodology is superior to
RSM, especially for highly nonlinear responses. For high-dimensional input
data a significant number of more simulator runs need to be included in the
initial sequential design. This is illustrated in our previous paper [1].
Acknowledgement
The work of Daniel Busby and Armin Iske was supported by Schlumberger and
by the European Union through the project FAMOUS, contract no. ENK6-
CT-2002-50528. Chris L. Farmer thanks the Royal Society for support through
an Industry Fellowship at the University of Oxford.
References
1. D. Busby, C.L. Farmer, and A. Iske (2005) Multilevel adaptive design and anal-
ysis of computer experiments. Manuscript.
2. The COUGAR project, http://consortium.ifp.fr/cougar/.
3. P. S. Craig, M. Goldstein, J. C. Rougier, and A. H. Seheult (2001) Bayesian fore-
casting for complex systems using computer simulators. Journal of the American
Statistical Association 96, 717–729.
4. P. S. Craig, M. Goldstein, J. A. Smith, and A. H. Seheult (1998) Constructing
partial prior specifications for models of complex physical systems. The Statis-
tician 47, 37–53.
5. C. Currin, T. Mitchell, M. Morris, and D. Ylvisaker (1988) A Bayesian approach
to the design and analysis of computer experiments. ORNL Technical Report
6498, National Technical Information Service, Springfield, VA 22161.
6. J. P. Dejean and G. Blanc (1999) Managing uncertainties on production predic-
tions using integrated statistical methods. SPE Journal, SPE 56696.
7. M. Goldstein (1998) Bayes linear analysis. Encyclopaedia of Statistical Sciences.
8. M. C. Kennedy and A. O’Hagan (2000) Bayesian calibration of computer mod-
els. Journal of the Royal Statistical Society B. 63, 425–464.
9. R. H. Myers and D. C. Montgomery (2002) Response Surface Methodology:
Process and Product Optimization Using Designed Experiments. Wiley.
10. J. Oakley, A. O’Hagan (2004) Probabilistic sensitivity analysis of complex mod-
els: a Bayesian approach. Journal of the Royal Statistical Society B 63, 425–464.
11. The European project PUNQ (Production Forecasting with UNcertainty
Quantification), http://www.nitg.tno.nl/punq/.
12. R Development Core Team (2004) R Foundation for Statistical Computing.
http://www.R-project.org.
13. J. Sacks, W. J. Welch, T. J. Mitchell, and H. P. Wynn (1989) Design and analysis
of computer experiments. Statistical Science 4(4), 409–435.
14. A. Saltelli, K. Chan, and M. Scott (2000) Sensitivity Analysis, Wiley, New York.