0% found this document useful (0 votes)
5 views32 pages

PETE110 - Spring2019 - 2020 - Data Interpretation

Uploaded by

gorkemerkanli
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
5 views32 pages

PETE110 - Spring2019 - 2020 - Data Interpretation

Uploaded by

gorkemerkanli
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 32

PETE 110

Introduction to
Petroleum Engineering
Spring 2020 - 2021
Dr. Sevtaç Bülbül, Dr. Kıymet Gizem Gül
Ertunç
Data Interpretation

2
Example data
Data Point 1 2 3 4 5 6 7 8 9
Measured 14 28 35 44 51 58 62 67 71

• In general, data is presented in graphical forms.


• The most commonly used graphical representation
methods are bar graphs and line graphs.
Bar Chart
Data Point 1 2 3 4 5 6 7 8 9
Measured 14 28 35 44 51 58 62 67 71
Line Chart
Data Point 1 2 3 4 5 6 7 8 9
Measured 14 28 35 44 51 58 62 67 71
Data Values and Graphical
Representation
• Bulk data is usually represented in sets of pre-
defined ranges.

• Number of measurements per a range of data is


defined as the frequency.

• The total group developed by each range set and


corresponding frequency is so-called frequency
distribution.

• A frequency distribution shows us a summarized


grouping of data divided into mutually exclusive
classes and the number of occurrences in a class.
Frequency and Frequency
Distribution
Data Point 1 2 3 4 5 6 7 8 9
Measured 14 28 35 44 51 58 62 67 71

Range 10-30 30-50 50-70 >70


Frequency 2 2 4 1
Frequency Distribution 0.222 0.222 0.444 0.111
Mean Values
• The mean values of a set of data is important
in order to understand the frequency
distribution of the data.

• There are different methods available for


determining mean values such as:
o Arithmetic Mean
o Geometric Mean
o Harmonic Mean
o Root Mean Square
Arithmetic Mean
• Arithmetic mean is the central tendency of a
collection of numbers taken as the sum of the
numbers divided by the size of the collection

1 n
Xa   Xi
n i 1

• Arithmetic mean is not a robust statistic, meaning


that it is greatly influenced by outliers.
Geometric Mean
• The geometric mean indicates the central tendency
or typical value of a set of numbers.
• A geometric mean is often used when comparing
different items when each item has multiple
properties that have different numeric ranges.

n
Xg  n
X
i 1
i
Harmonic Mean
• The harmonic mean is the reciprocal of the arithmetic
mean of the reciprocals.
• It is appropriate for situations when the average of
rates is desired.

n
Xh  n
1

i 1 X i
Root Mean Square
• The root mean square, also known as the quadratic
mean, is a statistical measure of the magnitude of a
varying quantity.

1 n

2
X rms  Xi
n i 1
Example
Data Point 1 2 3 4 5 6 7 8 9
Measured 14 28 35 44 51 58 62 67 71

1 n
Aritmetic Mean Xa   Xi X a  47.778
n i 1

Geometric Mean Xg  n
X i X g  43.224
i 1

n
Xh 
Harmonic Mean
n
1 X h  37.443
X i 1 i

1 n
 i X rms  51.077
2
Root Mean Square X rms  X
n i 1
Variation Measurements
Mean Deviation
• If the arithmetic mean value of a data set, X1, X2, ...,
Xn, is a:
1 n
Mean Deviation   X i  X a
n i1

Standard Deviation
• If the arithmetic mean value of a data set, X1, X2, ...,
Xn, is a:
n

 X  Xa 
2
i
x  i 1

n
Example
Data Point 1 2 3 4 5 6 7 8 9
Measured 14 28 35 44 51 58 62 67 71

Mean Deviation  15.580

 x  18.060
Linear Correlation Analysis
• In engineering, most of the times, analysis of more than two
variables has to be determined.

• The strength of the linear relation between two variables is


measured by linear correlation coefficient, r:

 n    
n   X iYi     X i   Yi 
r  i   i  i 
2 2
   
n  X 
i
2
   Xi 
 i 
n Y 
i
2
   Yi 
 i 

• If r = +1, there is total positive correlation, if r = -1, there is


total negative correlation, and if r = 0, there is no linear
correlation between the variables.
Example
Porosity, % 12 23 21 17 16 28 32 15 8 24
Permeability, 10.39 13.56 14.12 13.04 12.00 14.00 16.49 12.25 8.49 13.86
md

r = 0.9409
Empirical Correlations
• It is desired to correlate a mathematical relation
among the variables which are major parts of any
scientific data.

• Usually, curves are plotted to see the relation


between the variables. The mathematical relations
introduced to explain the behavior of these curves
are called empirical correlations.

• The most commonly used empirical equation types


are parabolas, hyperboles, power functions,
trigonometric functions, etc.
Empirical Correlations
• As a rule of thumb, the following procedure is applied
for selecting the empirical equation form:
o Data is plotted on a cartesian coordinate system.
o Depending on the shape of the curve or line, an
equation is defined. Equations which require the
least number of constants and the simplest structures
are selected.
o Equation constants are determined using the actual
variables.
o The confidence verifications are conducted.
o If the proposed equation does not successfully
apply, a different structured equation or more
parameter equation is selected and the procedure
is repeated until a successful correlation is obtained.
Examples of two and three
Parameter Equations
Checking the Relevance of the Empirical
Correlation

• In order to check whether the empirical correlation is


compatible with the variables, r2 (R2) is used.

• The closer the R2 to 1.0, the curve fits better to the data

• Mind that overfitting data to the correlation can be


physically meaningless
Determination of the Slope of Lines
Plotted on Different Coordinate Systems
• Cartesian coordinate system

y2  y1
slope 
x2  x1
Example
• Lets determine the slope for the line that is passing from the
following two points on a cartesian plot:

(x1,y1)=(2600, 4000) and (x2, y2)=(9500, 8000)

• Applying the equation given,

8000  4000
slope   0.5797
9500  2600
Determination of the Slope of Lines
Plotted on Different Coordinate Systems
• Semi-logarithmic coordinate system

log10a  log10a 1 1
slope  
x2  x1 x2  x1

In semi-log plots, slope is determined using the points


correspond to a “cycle”. A cycle on a logarithmic scale is
the range between 10a and 10a-1.
Example
• Lets determine the slope for the line that is passing from the
following two points on a semi-log plot:

(x1,y1)=(4400, 10) and (x2, y2)=(8600, 100)

• Applying the equation given,

1
slope   0.000238
8600  4400
Determination of the Slope of Lines Plotted
on Different Coordinate Systems
• Logarithmic-logarithmic coordinate system

If a cycle approach is used;


 y2 
log  
log y2  log y1  y1  slope 
log10a  log10a 1

1

1
slope   log x2  log x1 log x2  log x1 x 
log x2  log x1  x2  log  2 
log    x1 
 x1 
Example
• Lets determine the slope for the line that is passing from the
following two points on a log-log plot:

(x1,y1)=(35, 10) and (x2, y2)=(1100, 100)

• Applying the equation given,

1
slope   0.6679
 1100 
log  
 35 
Interpolation
• Interpolation is used to estimate the data that lies
between known data points.

• One possible method to determine a value within the


range of the available data range is plotting a graph
of the data, and reading the desired value from the
plotted graph.

• Other widely used method is to mathematically


interpolate by assuming some functional relationship
between the known data.

• The most commonly assumed relation is a linear


relation.
Interpolation
• Assume that a value of y is desired to be determined as
a function of x, i.e., (yd, xd) . Let x1 <xd <x2 and (x1,
y1)and (x2,y2) are known. The slope of a straight line can
be determined as:
y2  y1
m
x2  x1
• Since the slope between any two points on a straight
line is constant: y y
m d 1

xd  x1
• Therefore, the desired value, yd, can be determined as
xd  x1
yd  m  xd  x1   y1   y2  y1   y1
x2  x1
Example
• Viscosity of normal octane as a function of
temperature at one atmosphere pressure is
presented as shown in the table:

• It is desired to determine the viscosity of normal


octane at 25˚C. Interpolation would be required to
estimate a value between the known viscosities at
temperatures of 20˚C and 40˚C.
Example

• From the graph, for 25˚C, the viscosity can be read


approximately as 0.505 cp.
• If mathematical approach is followed;

xd  x1 25  20
yd   y2  y1   y1   0.433  0.542   0.542  0.51475
x2  x1 40  20
Thanks for Listening...

32

You might also like