Support Vector Regression and Functional Networks
Support Vector Regression and Functional Networks
net/publication/220606228
Support Vector Regression and Functional Networks for viscosity and Gas/Oil
Ratio Curves Estimation.
CITATIONS READS
17 243
5 authors, including:
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Amar Khoukhi on 11 February 2015.
In oil and gas industry, prior prediction of certain properties is needed ahead of explo-
ration and facility design. Viscosity and gas/oil ratio (GOR) are among those properties
described through curves with their values varying over a specific range of reservoir pres-
sures. However, the usual single point prediction approach could result into curves that
are inconsistent, exhibiting scattered behavior as compared to the real curves. Support
Vector Regressors and Functional Networks are explored in this paper to solve this prob-
lem. Inputs into the developed models include hydrocarbon and non-hydrocarbon crude
oil compositions and other strongly correlating reservoir parameters. Graphical plots
and statistical error measures, including root mean square error and average absolute
percent relative error, have been used to evaluate the performance of the models. A com-
parative study is performed between the two techniques and with the conventional feed
forward artificial neural networks. Most importantly, the predicted curves are consistent
with the shapes of the physical curves of the mentioned oil properties, preserving the
need of such curves for interpolation and ensuring conformity of the predicted curves
with the conventional properties.
1. Introduction
1.1. Motivation
Reservoir fluid properties are very important in petroleum engineering compu-
tations such as material balance calculations, well test analysis, reserve estima-
tion, inflow performance calculations, fluid flow in porous media, evaluation of
new formation for potential development, numerical reservoir simulations, design
269
October 7, 2011 10:10 WSPC/S1469-0268 157-IJCIA
S1469026811003100
2. Related Works
2.1. Viscosity correlations
A good number of empirical correlations have been developed in the literature to
estimate crude oil viscosity at, below and above bubble point pressure (Pb ). Pb is
October 7, 2011 10:10 WSPC/S1469-0268 157-IJCIA
S1469026811003100
the pressure at which the light hydrocarbon components in the oil starts to change
to the gas phase, and hence appear as bubbles in the oil sample. Correlations
based on soft computing techniques have recently been developed to predict this
important PVT property. Graphical correlations1 were developed for predicting vis-
cosity at different reservoir pressures using a data set from U.S.A. oil samples. The
authors correlated the under-saturated oil viscosity with the viscosity at Pb (µob ),
and pressure above bubble point Pb , while using oil gravity and temperature of
range 100–220◦ F to develop the dead oil (at the standard temperature and pres-
sure) viscosity (µod ) correlation. In the same vein, a graphical correlation2 was
presented for predicting oil viscosity at Pb . The correlation was developed as a
function of GOR using viscosity data of 457 crude oil samples from Canada and
U.S.A.
Correlations for µob and µod were developed, with a data set of 2073 oil viscosity
measurements used to develop the µob correlation while 460 dead oil observations
was used to develop µod correlation.3 Another correlation was developed in Ref. 4,
for µod in the temperature range of 50–300◦ F. The variable µod was correlated as
a function of American Petroleum Institute (API) oil gravity scale and tempera-
ture. A large set of PVT measurements was used5 to develop an under-saturated
oil viscosity correlation as a function of µob , Pb and reservoir pressure. Also, a µod
correlation6 based on a modification of Beggs and Robinson’s correlation was pre-
sented. Correlations for µob was used to predict oil viscosity below and above bubble
point were developed in Ref. 7. Total data points of 150, 1503 and 1691 were used
to develop the three correlations. The viscosity data used in the study were from
Saudi crude oil. For µob correlation, gas relative density, solution GOR, relative
temperature and oil relative density, independent variables were used. For viscosity
above and below Pb correlations, the correlating variables used were µob , reservoir
pressure and Pb . Correlations were also developed8 for µod , µob and under-saturated
oil viscosity using light crude oil data of Libya. µod was correlated as a function of
stock tank oil gravity and temperature. The µob was correlated with API oil gravity,
µod and Pb , while the under-saturated oil viscosity was correlated with pressure,
Pb , µob , µod and API oil gravity.
In Ref. 9, new empirical correlations for µod , µob and under-saturated oil vis-
cosity were suggested. The µod was correlated as a function of API oil gravity and
reservoir temperature, µob as a function of dead oil viscosity and solution GOR,
and under-saturated oil viscosity as a function of µob , Pb and reservoir pressure. A
total of 126 laboratory PVT analyses from Texas and Louisiana in U.S. were used
to develop oil viscosity correlations. Other correlations for µob and under-saturated
oil viscosity based on UAE crude oil were presented in Ref. 10. The correlated µob
with solution GOR, reservoir temperature, gas specific gravity and API oil grav-
ity using 57 data points. An under-saturated oil viscosity correlation was devel-
oped as a function of Pb , µob , reservoir pressure and solution GOR using 328 data
points.
Other correlation functions were introduced in Ref. 11 for µod , µob and under-
saturated oil viscosity for Gulf of Mexico crude oils based on 100 PVT laboratory
October 7, 2011 10:10 WSPC/S1469-0268 157-IJCIA
S1469026811003100
reports. The authors correlated µod with temperature, pressure and solution GOR
at Pb , and API oil gravity. The µob was correlated with µod and solution GOR, while
the under saturated oil viscosity was correlated with µob , Pb , reservoir pressure and
solution GOR.
In the same vein, a number of researchers have developed viscosity correlations
using soft computing techniques. A Radial Basis Function Network model12 was
used for predicting oil viscosity using reservoir pressure and temperature, API
oil gravity and gas gravity as the network inputs. A universal neural-network-
based model for estimating PVT properties of crude oil systems was introduced
in Ref. 13. Another ANN model14 was developed using all the data points of a
reservoir to obtain the viscosity curve. A PVT data of 650 reservoir fluids from
around the world was used to develop the viscosity correlation model and also
for some other PVT properties. An ANN correlation model15 was introduced to
predict brine viscosity using temperature and salinity as the network inputs. A
total of 1040 data points were used to build the model. Another neural network
was constructed16 to predict viscosity below Pb for Pakistani crude oil. The cor-
relating parameters were: pressure, reservoir temperature Pb , oil formation volume
factor, solution GOR, gas specific gravity and API gravity. More recently, an oil
viscosity correlation was presented in Ref. 17 for Iranian crude oil using genetic
algorithms. The input parameters were the pressure, temperature, and reservoir
fluid GOR and oil density. An excellent study had been reported recently,18 imple-
menting a support vector regression technique on PVT Correlations for Indian
Crude oil.
In all abovementioned prediction techniques, including empirical correlations
and soft computing models, data points are being predicted and the shapes of the
resulting estimated curves may not be consistent with the experimental ones.
The implementation studies of the presented work were achieved based on three
databases say, data sets A, B and C. Data set A consists of the hydrocarbon and
non-hydrocarbon components, and some other properties of the crude oil. Data set
B consists of the viscosity-pressure measurements to generate viscosity curves for
the corresponding wells in data set A, while data set C consists of GOR pressure
measurements to generate gas/oil ratio curves for corresponding wells in A. These
data were taken from Middle East crude oil reservoirs.
Initially, there were 106 data points in the set A. In preprocessing the data, we
applied two different outlier-detection methods on the data set A before utilizing it.
These are Cook’s distance method and Chauvenet’s criterion.25 The former method
was implemented using STATISTICA software while details on the latter method
can be found in Ref. 25. Only data points that were detected to be outliers by the
two methods have been declared as outliers and removed.
Eventually, seven data points were declared as outliers. After the removal of
the outliers from data set A, it was reduced to 99 data points. As recommended
and is usually done, the predictors were normalized within the interval [0 1] using
formula (1) below. This makes the input data dimensionless and ensures that the
predictors are independent of the measurement units.
(xold − min(xi ))
xnew = i
, (i = 1, 2, . . . , n). (1)
i
(max(xi ) − min(xi ))
The data set A was then divided into training and testing sets. The training set
consists of 70% (approx. 70 data points) while the testing set consists of 30%
(approx. 29 points). The idea of curve prediction using artificial neural network
October 7, 2011 10:10 WSPC/S1469-0268 157-IJCIA
S1469026811003100
technique was first introduced in Ref. 26 using ANN, in Ref. 27 using RBF, and
Refs. 28 and 29 using a hybrid ANN with differential evolution.
4. Problem Statement
Typical viscosity and GOR curves are shown in Figs. 1 and 2. Each curve represents
the variation of viscosity or GOR for the corresponding oil well. The significance
of such a curve is compromised, if single point or multi-point based prediction is
utilized. Equations (2) and (3) can be used to represent any crude oil viscosity curve
and Eq. (4) is used to represent GOR curves.
β
P − Pd
µ = µod + (µob − µod ) for P < Pb , (2)
Pb − Pd
µ = µob + α(P − Pb ) for P ≥ Pb , (3)
τ
P − Pb
Rs = Rsb , (4)
Pb − Pd
where α and β are viscosity curve coefficients and τ is the fitting GOR curve
coefficient. The statistical distribution of the fitting coefficients is shown in Table 1.
From Eqs. (2) and (3), three parameters µob , α and β are needed to generate
the viscosity curve. The first parameter µob is determined from the laboratory
PVT analyses while α and β are to be generated from the curve fitting. Two
2
Experimental
Predicted
1.8
1.6
1.4
Viscosity
1.2
0.8
Fig. 1. Typical result from single or multi-data point prediction for viscosity curve.
October 7, 2011 10:10 WSPC/S1469-0268 157-IJCIA
S1469026811003100
Experimental
600
Predicted
500
400
Gas/Oil Ratio
300
200
100
0
0 200 400 600 800 1000 1200 1400 1600 1800
Pressure
Fig. 2. Typical result from single or multi-data point prediction for GOR curve.
parameters, Rsb and τ are needed to generate GOR curves, Rsb is determined
from PVT laboratory analyses while τ is obtained from curve fitting.
5. Approach
5.1. Support vector regression
Support vector machine modeling schemes and methods are among the most suc-
cessful and effective algorithms in both machine learning and data mining commu-
nities. It has been widely used as a robust tool for classification and regression. An
overview can be found in Refs. 30 and 31. Support Vector Regression (SVR) is a
regression version of Support Vector Machines (SVMs) (Fig. 3). Unlike classifica-
tion problems where the outputs are either 1 and 0 or 1 and −1, the outputs in the
regression problems are real numbers. This makes it a bit difficult to model this type
of information which has infinite possibilities. With the introduction of Vapnik’s
October 7, 2011 10:10 WSPC/S1469-0268 157-IJCIA
S1469026811003100
ε-insensitive loss function, SVM has been extended to solve nonlinear regression
estimation problems, leading to techniques known as SVR. These have been shown
to exhibit excellent performance.32 SVR has been found to be very robust to predict
complex nonlinear relationship problems in many applications, including problems
such as optical character recognition, text categorization, and face detection in
images.33 In the case of regression, a margin of tolerance ∈ is set in approximation
to the SVM which would have already being inferred from the problem. As shown
in Figs. 3 and 4, SVMs map input vectors to a higher dimensional space, where a
maximal separating hyperplane is constructed.34–36
The kernel function is responsible for transforming the data set into hyperplane.
The variables of the kernel must be computed accurately since they determine the
structure of high-dimensional feature space which governs the complexity of the
final solution.
Fig. 3. The original input space mapped to a higher feature space with a separable training set.
Fig. 4. Soft margin loss setting for a linear dimensional SVR (Schölkopf and Smola, 2002).
October 7, 2011 10:10 WSPC/S1469-0268 157-IJCIA
S1469026811003100
After the selection of a kernel, the other highly influential parameters in any
SVR model based on the observation are “C” and “kernel option”. For polynomial
kernel, kerneloption denotes the degree of the kernel polynomial while it denotes
kernel bandwidth for “Gaussian”. “C” is the trade-off between achieving minimal
training error and complexity of the model. The kernel functions and SVR options
used in this study appear in Appendix 2.
W41
X1 X4
f (ΣW4i Xi)
W42 W64
X2 W52 X6
f (ΣW6i Xi)
f (ΣW5i Xi) X5
W65
X3
W53
X4
X1
f1 (X1, X2)
X6
X2 f3 (X4, X5)
X3
f2 (X2, X3) X5
For the case at hand, the different families of functions used for each parameter
appear in Appendix 3. The MDLP was used to optimize the network and select the
best model. It can be noted that in some cases, some functions are zero, this means
that the corresponding input to that node does not really affect the predicting
output at that instance.
1
n
AAPRE = |Ei |. (8)
n i
Experimental
SVR
FN
2
1.5
Viscosity (cP)
0.5
0
0 500 1000 1500 2000 2500
Pressure (psi)
2 Experimental
SVR
1.8 FN
1.6
1.4
Viscosity (cP)
1.2
0.8
0.6
0.4
0.2
0
0 500 1000 1500 2000 2500 3000 3500 4000
Pressure (psi)
2 Experimental
SVR
1.8
FN
1.6
1.4
Viscosity (cP)
1.2
0.8
0.6
0.4
0.2
0
0 500 1000 1500 2000 2500 3000
Pressure (psi)
1.5 Experimental
SVR
FN
1
Viscosity (cP)
0.5
0
0 500 1000 1500 2000 2500 3000 3500 4000
Pressure (psi)
Fig. 10. Viscosity vs. pressure plot for sample well TS2.
Experimental
SVR
2 FN
1.5
Viscosity (cP)
0.5
0
0 500 1000 1500 2000 2500
Pressure (psi)
Fig. 11. Viscosity vs. pressure plot for sample well TR3.
October 7, 2011 10:10 WSPC/S1469-0268 157-IJCIA
S1469026811003100
2.5
Viscosity (cP)
1.5
0.5
0
0 500 1000 1500 2000 2500 3000 3500 4000
Pressure (psi)
Fig. 12. Viscosity vs. pressure plot for sample well TS3.
for the two frameworks are shown in Table 2. For this pair of techniques, the
performance of both frameworks, SVR and FN, are very competitive. While FN
performance is better than that of SVR in the training phase with lower RMSE
and AAPRE, which are 0.06765 and 5.4% respectively, against those of SVR which
are 0.07495 and 6.3953% respectively, SVR performance is very competitive with
that of FN for the testing wells. For the testing phase, FN has lower AAPRE
of 8.5514%, against that of SVR which is 8.5969%, while SVR has lower RMSE,
0.0765, against that of FN which is 0.07941. Also, from Table 2, the RMSE and
APPRE for the FFNN predictions are the highest for both training and testing.
In essence, the results of SVR and FN are very competitive for viscosity curve
prediction, while both clearly outperform FFNN. The predicted curves from the
two SC techniques show good matching with the experimental curves for both
training and testing wells with little deviation in some testing wells. Table 3 shows
a sample of predicted viscosity curve parameters by SVR FN and FFNN models.
October 7, 2011 10:10 WSPC/S1469-0268 157-IJCIA
S1469026811003100
600
500
400
300
200
100
0
0 200 400 600 800 1000 1200 1400 1600 1800
Pressure (psi)
Fig. 13. Gas/oil ratio vs. pressure plot for sample well TR1.
Experimental
800 SVR
FN
700
SolutionGas/Oil Ratio (SCF/STB)
600
500
400
300
200
100
0
0 200 400 600 800 1000 1200 1400 1600 1800
Pressure (psi)
Fig. 14. Gas/oil ratio vs. pressure plot for sample well TS1.
October 7, 2011 10:10 WSPC/S1469-0268 157-IJCIA
S1469026811003100
400
300
200
100
0
0 200 400 600 800 1000
Pressure (psi)
Fig. 15. Gas/oil ratio vs. pressure plot for sample well TR2.
1000 Experimental
SVR
900
FN
Solution Gas/Oil Ratio (SCF/STB)
800
700
600
500
400
300
200
100
0
0 200 400 600 800 1000 1200 1400 1600 1800 2000
Pressure (psi)
Fig. 16. Gas/oil ratio vs. pressure plot for sample well TS2.
October 7, 2011 10:10 WSPC/S1469-0268 157-IJCIA
S1469026811003100
300
250
200
150
100
50
0
0 100 200 300 400 500 600 700 800
Pressure (psi)
Fig. 17. Gas/oil ratio vs. pressure plot for sample well TR3.
Experimental
900 SVR
FN
800
Solution Gas/Oil Ratio (SCF/STB)
700
600
500
400
300
200
100
0
0 200 400 600 800 1000 1200 1400 1600 1800 2000
Pressure (psi)
Fig. 18. Gas/oil ratio vs. pressure plot for sample well TS3.
October 7, 2011 10:10 WSPC/S1469-0268 157-IJCIA
S1469026811003100
more demanding than FN at the training and testing with a significant difference,
whereas the FFNN is the most demanding at training. Furthermore, Tables 8 and 9
show the correlation coefficient as to relate the statistical significance of the results
which also illustrate that SVR performs better than FN for predicting gas/oil Curve
and that SVR and FN are competitive regarding viscosity curve prediction.
7. Conclusion
In this paper, we have presented two advanced computational intelligence tech-
niques to predict crude oil Pressure-Volume-Temperature (PVT) properties that
need to be represented as curves over a specified range of reservoir pressures.
October 7, 2011 10:10 WSPC/S1469-0268 157-IJCIA
S1469026811003100
Instead of the usual single or multi-data points prediction, which could distort
the consistency of the curve’s shape, an efficient approach for predicting such PVT
properties curves has been introduced and implemented. In all predictions, we have
implemented different independent neural network techniques, viz: Support Vector
Regression, Functional Networks and Feedforward Neural Network. The viscosity
and gas/oil ratio curves prediction problems were formulated and implemented
using these approaches. Simulation of these results had been reported and compar-
isons between the three techniques were discussed on both viscosity and solution
GOR curve predictions. Interestingly, the shapes of the predicted viscosity and
solution GOR curves are consistent with the physical law and the experimental
curves. This makes the use of such predicted curve practicable for use, contributing
thereby to enhanced exploration and production processes through cost reduction
and improving human operator conditions.
Acknowledgment
This work was supported by King Fahd University of Petroleum and Minerals
(Grant No. SB100014), and partially by King Abdulaziz City for Science and Tech-
nology under Grant MSTP-KACST-08-OIL82-4, Saudi Arabia.
f5 (x5 ) = 2.63 ×106 log(x5 + 2) − 5.9 ×106 log(x5 + 3) + 3.3144 × 106 log(x5 + 4);
f6 (x6 ) = −2.2 × 107 log(x6 + 2) + 4.7 × 107 log(x6 + 3) − 2.5 × 107 log(x6 + 4);
f7 (x7 ) = 1.65 × 106 log(x7 + 3) − 1.71 × 106 log(x7 + 4);
f8 (x8 ) = 2056.883 log(x8 + 4);
f9 (x9 ) = 1.47 × 109 log(x9 + 2) − 2.9 × 109 log(x9 + 3) + 1.48 × 109 log(x9 + 4);
f10 (x10 ) = −5 × 105 log(x10 + 3) + 5.119 × 105 log(x10 + 4); f11 (x11 ) = 0;
f12 (x12 ) = 84618.57 log(x12 + 2) − 246973 log(x12 + 3) + 169459 log(x12 + 4).
References
1. C. Beal, The viscosity of air, water, natural gas, crude oil and its associated gases at
oil field temperature and pressures, Trans. AIME 165 (1946) 94–115.
2. J. Chew and C. A. Jr. Connally, A viscosity correlation for gas–saturated crude oils,
Trans. AIME 216 (1959) 23–25.
3. H. D. Beggs and J. R. Robinson, Estimating the viscosity of crude oil system, J. Pet.
Tech. 9 (1975) 1140–1149.
4. O. Glaso, Generalized pressure-volume temperature correlations, J. Pet. Tech. 32(5)
(1980) 785–795.
5. M. Vazuquez and H. D. Beggs, Correlation for fluid physical property prediction,
J. Pet. Tech. 32(6) (1980) 968–970.
6. E. O. Egbogah and T. Ng. Jack, An improved temperature-viscosity correlation for
crude oil systems, J. Pet. Sc. & Eng. 5 (1990) 197–200.
7. S. A. Khan, M. A. Al-Marhoun, S. O. Duffuaa and S. A. Abu-Khamsin, Development
of viscosity correlations for crude oils, Fifth SPE Middle East Oil Show, Bahrain,
March (1987), pp. 7–10.
8. R. Labedi, Improved correlations for predicting the viscosity of light crudes, J. Pet.
Sc. & Eng. 8 (1992) 221–234.
9. G. E. Jr. Petrosky and F. F. Farshad, Viscosity, correlations for Gulf of mexico crude
oils, Paper SPE 29468 Presented at Production Operations Symposium (Oklahoma
City, OK, U.S.A.), 2–4 April, 1995.
10. R. A. Almehaideb, Improved PVT correlations for UAE crude oils, SPE Middle East
Oil & Gas Show and Conference (Bahrain, March 1997), pp. 15–18.
11. B. Dindoruk and P. G. Christman, PVT properties and viscosity correlations for Gulf
of Mexico oils, SPE Res. Eng. 7(6) (2004) 427–437.
12. A. M. El-Sharkawy, Modeling the properties of crude oil and gas systems using RBF
network, SPE Asia Pacific Oil & Gas Conference (Perth, Australia, October 1998),
pp. 12–14.
13. Ridha B. Gharbi, Adel M. Elsharkawy and Mansour Karkoub, Universal Neural-
Network-Based Model for estimating the PVT properties of crude oil systems, Energy
Fuels 13(2) (1999) 454–458, DOI: 10.1021/ef980143v.
14. N. Varotsis, V. Gaganis, J. Nighswander and P. Guieze, A novel non-iterative method
for the prediction of the PVT behavior of reservoir fluids, SPE Annual Technical
Conference and Exhibition (Houston, Texas, October 1999), pp. 3–6.
15. E. Osman and M. Al-Marhoun, Artificial neural networks models for predicting PVT
properties of oil field brines, 14th SPE Middle East Oil & Gas Show and Conf.
(Bahrain, 12–15 March, 2005).
October 7, 2011 10:10 WSPC/S1469-0268 157-IJCIA
S1469026811003100