0% found this document useful (0 votes)

9 views

Petroleum Data Managment

Uploaded by

Homayoun Najafi

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

9 views

Petroleum Data Managment

Uploaded by

Homayoun Najafi

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 52

Petroleum Data

Management
Titles
❑Chapter 1: Data in Petroleum Engineering Field
Introduction
• Data is a key company asset
• Large volumes of data need to be integrated for information,
knowledge and innovation
• An essential component of artificial intelligence applications
Chapter 1:
Data in Petroleum Engineering Field
Data Sources in Upstream Oil Industry
BIG Data
distributed computing
Big Data Like Crude!
Big Data

Data Analytics
Chapter 2:
Statistics
Descriptive Statistics
Bivariate Data

Pearson correlation coefficient

Spearman correlation coefficient

Quantile
The word “quantile” comes from the word quantity. In simple terms, a quantile is where a sample is divided into
equal-sized, adjacent, subgroups (that’s why it’s sometimes called a “fractile“). It can also refer to dividing
a probability distribution into areas of equal probability.

Median is 50 percentile or .5 quantile(50 before and after it!)

▪ Quantile of ith data is about i/N+1

▪ Respectively data for q quantile is q(N+1)

❑ Quantile plot can be used to detect symmetry of a distribution; it is sketch of qi vs. xi:
✓ A symmetrical distribution is characterized by an S-shaped quantile plot, where the distance on the
horizontal axis between the median (50th percentile) and any percentile P below the median is equal to
the distance from the median to the (100-Pth percentile). Symmetrical distributions are characterized by
mean¼median¼mode.
✓ If the distribution has positive skewness, that portion of the quantile plot corresponding to q>0.9 will
usually be longer and flatter than the rest of the plot.
✓ Conversely, distributions with negative skewness have a long flat portion on the quantile plot
corresponding to q<0.1.
Vertical Heterogeneity in Permeability Using Q Plot

Dykstra and Parsons used the log-normal distribution of

permeability to define the coefficient of permeability variation

In a normal distribution, the value of k is such that 84.1% of the

permeability values are less than k¯+s and 15.9% of
the k values are less than k¯−s.

For a log-normal permeability distribution, the Dykstra–Parsons

coefficient can be estimated from
Parametric Models
• Uniform Distribution

The uniform distribution is useful as a rough model for representing low states of knowledge when only the
upper and lower bounds are known
Parametric Models
• Triangular Distribution
Parametric Models
• Normal Distribution
CDF: F(x) has no closed-form solution but is often
presented using the complementary error function
solution
Parametric Models
• Lognormal Distribution
Parametric Models
• Poisson Distribution
When events occur as a purely random (Poisson) process, the number of independent events occurring
within a fixed time interval follows a Poisson distribution.

1. Events are independent of each other. The occurrence of one event does not affect the probability
another event will occur.
2. The average rate (events per time period) is constant.
3. Two events cannot occur at the same time.
Parametric Models
• Binomial Distribution
A binomial distribution is the distribution of the number of successes k in a sequence of n independent trials, where
the probability of success p is constant from trial to trial. Each trial with two outcomes (success or failure) is also
called a Bernoulli experiment:
The binomial distribution can be
approximated by the normal distribution if n
is large and p approaches 0.5 such that:
Parametric Models
• Weibull Distribution
The Weibull distribution is a commonly used tool for modeling growth (or decline) in biological, clinical,
population, and natural resource studies. It has also been used to analyze production decline from
unconventional reservoirs.
K<1 decreasing rate with time
K=1 constant rate with time (exponential decline rate)
K>1 increasing rate with time
Parametric Models
• Beta Distribution
Shows some example beta distributions. The beta distribution does not have a mechanistic basis but can be
very useful for fitting empirical data to distributions, because of the flexible mathematical form .This becomes
particularly relevant for the purposes of uncertainty quantification using Monte Carlo simulation,
Central Limit Theorem
• When independent random variables are summed up, their properly normalized sum tends toward a normal
distribution (informally a bell curve) even if the original variables themselves are not normally distributed.
Q-Q plot
• Comparing Two Distributions
• are compared by plotting their corresponding quantiles.
• A Q-Q plot of two identical distributions will be a straight line with unit slope (i.e., x=y).If the Q-Q plot plots
as a straight line with a nonunit slope, then the two distributions have the same shape but their location and
spread may differ.
Normal Score Transformation
• Often, it is useful to transform a sample distribution into the space of
an equivalent normal distribution, where many statistical operations
can be easily performed and visualized.
First Moments Distributions Fitting Method

From Sample
Chapter 3

Prediction
Linear Regression
Estimating Confidence Intervals for the Mean Response and Forecast
Best fit leads to normal distribution for the residuals
Chapter 2:
Classification
Unsupervised Classification

❑K Mean Classification
❑Model Based Classification
❑Hierarchical Classification
❑Forest Random Classification
K-Means Classification

Random centers
Minimize E show a cluster
New center for each class
Again previous steps

Normalization would be needed where

we deal with diverse data types.
Hierarchical Classification
• Agglomerative
• Divisive
HOW DO WE CALCULATE THE SIMILARITY BETWEEN TWO
CLUSTERS???

• MIN sim(C1,C2) = Min Sim(Pi,Pj) such that Pi ∈ C1 & Pj ∈ C2

•MIN approach cannot separate clusters properly if there is

noise between clusters.
MAX: Sim(C1,C2) = Max Sim(Pi,Pj) such that Pi ∈ C1 & Pj ∈ C2

•MAX approach does well in separating clusters if there is noise between clusters.

•Max approach is biased towards globular clusters.

•Max approach tends to break large clusters.
Group Average: sim(C1,C2) = ∑ sim(Pi, Pj)/|C1|*|C2| where, Pi ∈ C1 & Pj ∈ C2

•The group Average approach does well in separating clusters if

there is noise between clusters.

•The group Average approach is biased towards globular

clusters.
Distance between centroids: less popular

•Ward’s Method: This approach of calculating the similarity between two clusters is exactly the
same as Group Average except that Ward’s method calculates the sum of the square of the
distances Pi and PJ.
sim(C1,C2) = ∑ (dist(Pi, Pj))²/|C1|*|C2|

•Ward’s method approach also does well in separating clusters

if there is noise between clusters.

•The group Average approach is biased towards globular

clusters.
Dendrogram
It is determined after
checking dendrogram
Model Based Clustering
Forest Random Clustering
• It is supervised but can be used as unsupervised with trivial data

Property matrix

Random Matrix
and then
Clustering them
Figure 1 Raw Data
Figure 2 K Mean
Figure 3AgglomerativeClustering
Figure 4 GaussianMixture
Figure 5 Forest Random

Statistics, Data Analysis, and Decision Modeling, 5th Edition
100% (5)
Statistics, Data Analysis, and Decision Modeling, 5th Edition
556 pages
MT131 M131 00966597837185 TMA حل واجبات MT131 M131 المهندس أحمد at الجامعة العربية المفتوحة
No ratings yet
MT131 M131 00966597837185 TMA حل واجبات MT131 M131 المهندس أحمد at الجامعة العربية المفتوحة
10 pages
AP Statistics 핵심정리
No ratings yet
AP Statistics 핵심정리
20 pages
Tenko Raykov, George A. Marcoulides-Basic Statistics - An Introduction With R-Rowman & Littlefield Publishers (2012) PDF
No ratings yet
Tenko Raykov, George A. Marcoulides-Basic Statistics - An Introduction With R-Rowman & Littlefield Publishers (2012) PDF
345 pages
The Everyday Life Bible Kindle Joyce Meyer
No ratings yet
The Everyday Life Bible Kindle Joyce Meyer
68 pages
Lab 1.1 Protocol - Easy Lithium Acetate Transformation of Yeast - UPDATED
No ratings yet
Lab 1.1 Protocol - Easy Lithium Acetate Transformation of Yeast - UPDATED
10 pages
Survival Analysis - lecture 5
No ratings yet
Survival Analysis - lecture 5
69 pages
Probability
No ratings yet
Probability
27 pages
5 Random Var PDF
No ratings yet
5 Random Var PDF
74 pages
Maths Roadmap For Machine Learning - Statistics
No ratings yet
Maths Roadmap For Machine Learning - Statistics
5 pages
Business Statistics Flashcards - Quizlet11
No ratings yet
Business Statistics Flashcards - Quizlet11
19 pages
Lecture 1 Introduction
No ratings yet
Lecture 1 Introduction
33 pages
Session on Non-Gaussian Distribution
No ratings yet
Session on Non-Gaussian Distribution
13 pages
Quality Control: Fundamentals of Statistics
No ratings yet
Quality Control: Fundamentals of Statistics
62 pages
Prob Stat Petroleum Resources Assessmenbt
No ratings yet
Prob Stat Petroleum Resources Assessmenbt
147 pages
Statistics Normality
No ratings yet
Statistics Normality
42 pages
Normal Distribution
No ratings yet
Normal Distribution
10 pages
DADM S3 Skewness and Transformations To Achieve Normality
No ratings yet
DADM S3 Skewness and Transformations To Achieve Normality
9 pages
Statisitcs
No ratings yet
Statisitcs
22 pages
Introduction To The Practice of Basic Statistics (Textbook Outline)
100% (14)
Introduction To The Practice of Basic Statistics (Textbook Outline)
65 pages
General Purpose: Poisson Distribution
No ratings yet
General Purpose: Poisson Distribution
11 pages
R22-UNIT2-CH2
No ratings yet
R22-UNIT2-CH2
28 pages
SMDM Faqs Week 2
No ratings yet
SMDM Faqs Week 2
2 pages
CHE331 L08 Descriptive Stats
No ratings yet
CHE331 L08 Descriptive Stats
31 pages
(eBook PDF) Business Statistics: For Contemporary Decision Making, 8th Editioninstant download
100% (2)
(eBook PDF) Business Statistics: For Contemporary Decision Making, 8th Editioninstant download
50 pages
(eBook PDF) Business Statistics: For Contemporary Decision Making, 8th Edition - The ebook in PDF and DOCX formats is ready for download now
100% (1)
(eBook PDF) Business Statistics: For Contemporary Decision Making, 8th Edition - The ebook in PDF and DOCX formats is ready for download now
45 pages
lec08-2025
No ratings yet
lec08-2025
43 pages
ANALYST Sources
No ratings yet
ANALYST Sources
23 pages
Statistics Notes 1702100127
No ratings yet
Statistics Notes 1702100127
22 pages
Understanding Q-Q Plots: Latest News
No ratings yet
Understanding Q-Q Plots: Latest News
4 pages
8 CSC446 546 InputModeling
No ratings yet
8 CSC446 546 InputModeling
44 pages
Week2-1
No ratings yet
Week2-1
24 pages
Business Statistics
No ratings yet
Business Statistics
106 pages
Stats and Maths For Data Analyst
No ratings yet
Stats and Maths For Data Analyst
23 pages
Solutions Manual to accompany Miller & Freund’s Probability and Statistics for Engineers 8th edition 0321640772 - Download Instantly To Experience The Full Content
100% (5)
Solutions Manual to accompany Miller & Freund’s Probability and Statistics for Engineers 8th edition 0321640772 - Download Instantly To Experience The Full Content
51 pages
Normality Test
No ratings yet
Normality Test
103 pages
Statistics For Datacience
100% (1)
Statistics For Datacience
7 pages
4.1.1 Input Modeling
No ratings yet
4.1.1 Input Modeling
63 pages
Grey Minimalist Business Project Presentation
No ratings yet
Grey Minimalist Business Project Presentation
30 pages
Note 02
No ratings yet
Note 02
31 pages
Theoretical Questions in Basic Business Statistics
No ratings yet
Theoretical Questions in Basic Business Statistics
12 pages
Probs-Stats Revision Notes
No ratings yet
Probs-Stats Revision Notes
19 pages
Sampling and sampling distribution with Business Application_v2.docx
No ratings yet
Sampling and sampling distribution with Business Application_v2.docx
11 pages
2.1 - Normal Data
No ratings yet
2.1 - Normal Data
19 pages
Head First Statistics Bullet Points
No ratings yet
Head First Statistics Bullet Points
28 pages
Measures of Relative Position
No ratings yet
Measures of Relative Position
28 pages
CH 9
No ratings yet
CH 9
13 pages
Key of Week1 - Lecture Notes
No ratings yet
Key of Week1 - Lecture Notes
10 pages
Statistics For Data Science: What Is Normal Distribution?
No ratings yet
Statistics For Data Science: What Is Normal Distribution?
13 pages
Normal Distribution For ML
No ratings yet
Normal Distribution For ML
17 pages
What Is Probability
No ratings yet
What Is Probability
8 pages
Theory and Formula
No ratings yet
Theory and Formula
42 pages
Ap Stat 1-7 Notes
No ratings yet
Ap Stat 1-7 Notes
12 pages
Features
No ratings yet
Features
42 pages
Solutions Manual to accompany Miller & Freund’s Probability and Statistics for Engineers 8th edition 0321640772 pdf download
100% (2)
Solutions Manual to accompany Miller & Freund’s Probability and Statistics for Engineers 8th edition 0321640772 pdf download
45 pages
Input Modeling For Simulation
No ratings yet
Input Modeling For Simulation
48 pages
Stat Chapter 5-9
No ratings yet
Stat Chapter 5-9
32 pages
MATH 361 (Autosaved)
No ratings yet
MATH 361 (Autosaved)
17 pages
K Nearest Neighbor Algorithm: Fundamentals and Applications
From Everand
K Nearest Neighbor Algorithm: Fundamentals and Applications
Fouad Sabry
No ratings yet
Multi-dimensional Monte Carlo Integrations Utilizing Mathematica
From Everand
Multi-dimensional Monte Carlo Integrations Utilizing Mathematica
SUJAUL CHOWDHURY
No ratings yet
Learn Statistics Fast: A Simplified Detailed Version for Students
From Everand
Learn Statistics Fast: A Simplified Detailed Version for Students
Hesbon R.M
No ratings yet
Exercises of Statistical Inference
From Everand
Exercises of Statistical Inference
Simone Malacrida
No ratings yet
De-Mystifying Math and Stats for Machine Learning: Mastering the Fundamentals of Mathematics and Statistics for Machine Learning
From Everand
De-Mystifying Math and Stats for Machine Learning: Mastering the Fundamentals of Mathematics and Statistics for Machine Learning
Seaport AI Madhavan
No ratings yet
aliitan2009
No ratings yet
aliitan2009
11 pages
cheng2005
No ratings yet
cheng2005
31 pages
selker2006
No ratings yet
selker2006
8 pages
rogers1991
No ratings yet
rogers1991
23 pages
Stehfest 1970
No ratings yet
Stehfest 1970
8 pages
alemohammad2018
No ratings yet
alemohammad2018
26 pages
Sheghf1
No ratings yet
Sheghf1
5 pages
wang2008
No ratings yet
wang2008
11 pages
Bubble Point Pressure Correlation: A. Lasater
No ratings yet
Bubble Point Pressure Correlation: A. Lasater
3 pages
C 210
No ratings yet
C 210
8 pages
Rajesh 2020
No ratings yet
Rajesh 2020
9 pages
2020-21 Fall 41553 Bernardo-Pinto
No ratings yet
2020-21 Fall 41553 Bernardo-Pinto
49 pages
Aim of Promoting and Improving The Quality of Life Through Ongoing Support and Development of
No ratings yet
Aim of Promoting and Improving The Quality of Life Through Ongoing Support and Development of
5 pages
I You We They He She It: What When Where Who Whose Why How How Long Do Does Vinf?
No ratings yet
I You We They He She It: What When Where Who Whose Why How How Long Do Does Vinf?
8 pages
The Voluptuary-Erotic Poetry
No ratings yet
The Voluptuary-Erotic Poetry
16 pages
Marketing Strategies For Profitabilty of Handicraft Industry....
No ratings yet
Marketing Strategies For Profitabilty of Handicraft Industry....
9 pages
The Amazing Qur'An: Dr. Gary Miller
No ratings yet
The Amazing Qur'An: Dr. Gary Miller
17 pages
Bodega Dreams Thesis Statement
100% (3)
Bodega Dreams Thesis Statement
4 pages
Might and Magic Cheats Sheet
No ratings yet
Might and Magic Cheats Sheet
2 pages
Catalogo Juegos ps3 1
No ratings yet
Catalogo Juegos ps3 1
12 pages
Common Mistakes of Electronic Repair
100% (2)
Common Mistakes of Electronic Repair
10 pages
How Do Shakespeare and Shelly Present The Flaw of Hubris in Macbeth and in Frankenstein
100% (1)
How Do Shakespeare and Shelly Present The Flaw of Hubris in Macbeth and in Frankenstein
4 pages
2600 Legarda St. Sampaloc, Manila
No ratings yet
2600 Legarda St. Sampaloc, Manila
4 pages
Intro
No ratings yet
Intro
8 pages
May Fourth Movement
No ratings yet
May Fourth Movement
10 pages
1.questions Words
No ratings yet
1.questions Words
1 page
PBE Manual 02 Part1 PDF
No ratings yet
PBE Manual 02 Part1 PDF
20 pages
Health Care Process
No ratings yet
Health Care Process
8 pages
Classics in Cultural Criticism I: Britain, Edited by Bernd-Peter Lange
No ratings yet
Classics in Cultural Criticism I: Britain, Edited by Bernd-Peter Lange
36 pages
Test Bank for Respiratory Disease A Case Study Approach to Patient Care, 3rd Edition: Wilkins all chapter instant download
100% (19)
Test Bank for Respiratory Disease A Case Study Approach to Patient Care, 3rd Edition: Wilkins all chapter instant download
45 pages
Future Forms
No ratings yet
Future Forms
2 pages
Principles:: Nature of The Case
100% (1)
Principles:: Nature of The Case
3 pages
Seismic shock
No ratings yet
Seismic shock
7 pages
People V Hassan
No ratings yet
People V Hassan
2 pages
Managing The Teaching and Learning Process: Unit 27 Using Language Appropriately For A Range of Classroom Functions
100% (1)
Managing The Teaching and Learning Process: Unit 27 Using Language Appropriately For A Range of Classroom Functions
26 pages
Tuberculosis
No ratings yet
Tuberculosis
20 pages
Strategic Management and Project Management: Purpose
No ratings yet
Strategic Management and Project Management: Purpose
10 pages
Fotoditazin Instructions For Intravenous PDT
No ratings yet
Fotoditazin Instructions For Intravenous PDT
2 pages
Live Horoscope Discussion of - Arvind J Jain
100% (1)
Live Horoscope Discussion of - Arvind J Jain
105 pages

Petroleum Data Managment

Uploaded by

Petroleum Data Managment

Uploaded by

Petroleum Data

Pearson correlation coefficient

Spearman correlation coefficient

Median is 50 percentile or .5 quantile(50 before and after it!)

▪ Quantile of ith data is about i/N+1

Dykstra and Parsons used the log-normal distribution of

In a normal distribution, the value of k is such that 84.1% of the

For a log-normal permeability distribution, the Dykstra–Parsons

Normalization would be needed where

• MIN sim(C1,C2) = Min Sim(Pi,Pj) such that Pi ∈ C1 & Pj ∈ C2

•MIN approach cannot separate clusters properly if there is

•Max approach is biased towards globular clusters.

•The group Average approach does well in separating clusters if

•The group Average approach is biased towards globular

•Ward’s method approach also does well in separating clusters

•The group Average approach is biased towards globular

You might also like