
Lecture 11: Standard Error, Propagation of Error, Central Limit Theorem in the Real World

5 October 2005
1 Standard Error
Quick point of terminology: last time, when we talked about getting at the sampling distribution of summary statistics, we mostly looked at their means; the law of large numbers, in particular, is about the mean of the sampling distribution. There's also going to be a variance or standard deviation. It's a bit unfortunate, terminologically, but the standard deviation of a sample statistic is called its standard error. The main tool for getting at standard errors is the central limit theorem. Recall that the sample mean $\bar{X}$ has mean $\mu$ and variance $\sigma^2/n$, so it has standard deviation $\sigma/\sqrt{n}$.
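As a quick illustration (this sketch is an addition, not part of the original notes), a short simulation confirms that the standard deviation of the sample mean matches $\sigma/\sqrt{n}$; the Gaussian population, its parameters, and the sample size are arbitrary choices.

\begin{verbatim}
import numpy as np

rng = np.random.default_rng(0)
n, sigma, reps = 100, 2.0, 100_000   # sample size, population SD, replications (all arbitrary)

# Draw many samples of size n and record each sample mean
sample_means = rng.normal(loc=5.0, scale=sigma, size=(reps, n)).mean(axis=1)

print("empirical SD of the sample mean:", sample_means.std(ddof=1))
print("theoretical standard error sigma/sqrt(n):", sigma / np.sqrt(n))
\end{verbatim}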
2 Propagation of Error
In many experimental lab courses, you learn a rather mysterious-looking formula for the error bars of derived or calculated quantities. It says that if you have a quantity z which is a function of measured quantities x and y, i.e., z = h(x, y), then
$$\sigma_z = \sqrt{\left(\frac{\partial h}{\partial x}\right)^2 \sigma_x^2 + \left(\frac{\partial h}{\partial y}\right)^2 \sigma_y^2}$$
where $\sigma_z$ is the standard deviation of z, and similarly for the other variables. (This formula, and everything which follows, extends in the natural way to functions of more than two variables.)
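To make the formula concrete before deriving it, here is a small worked example (the numbers are invented for illustration, not taken from any experiment). Suppose $z = h(x, y) = xy$, with measured values $x = 10$ and $y = 4$, and standard deviations $\sigma_x = 0.2$ and $\sigma_y = 0.1$. Then $\partial h/\partial x = y$ and $\partial h/\partial y = x$, so
$$\sigma_z = \sqrt{y^2 \sigma_x^2 + x^2 \sigma_y^2} = \sqrt{(4)^2(0.2)^2 + (10)^2(0.1)^2} = \sqrt{0.64 + 1.00} \approx 1.28.$$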
We are now in a position to see exactly where this formula comes from, and when it's actually valid.
We assume that each of the input quantities x and y is really a random variable, X, Y, which has some average value ($\mu_x$, $\mu_y$), plus fluctuations around it which represent noise in our apparatus, errors of procedure, gremlins, etc. The value of z we calculate is therefore also a random quantity, Z, because if the fluctuations had come out differently, we'd be plugging different numbers into the function h, and getting a different answer. The question we want to answer is how different that result would, probably, be.
Let's start by Taylor-expanding h, making the expansion around the mean values of the input variables:
$$h(X, Y) \approx h(\mu_X, \mu_Y) + \left(\frac{\partial h}{\partial x}\right)(X - \mu_X) + \left(\frac{\partial h}{\partial y}\right)(Y - \mu_Y) + \text{higher order terms}$$
A Taylor expansion like this is only valid if the neglected higher-order terms, like $\frac{1}{2}\left(\frac{\partial^2 h}{\partial x^2}\right)(X - \mu_X)^2$, are small compared to the included terms, like $\left(\frac{\partial h}{\partial x}\right)(X - \mu_X)$.
So we want
\begin{align*}
\left(\frac{\partial h}{\partial x}\right)(X - \mu_X) &\gg \frac{1}{2}\left(\frac{\partial^2 h}{\partial x^2}\right)(X - \mu_X)^2 \\
\left(\frac{\partial h}{\partial x}\right) &\gg \frac{1}{2}\left(\frac{\partial^2 h}{\partial x^2}\right)(X - \mu_X) \\
\frac{2\left(\frac{\partial h}{\partial x}\right)}{\left(\frac{\partial^2 h}{\partial x^2}\right)} &\gg (X - \mu_X)
\end{align*}
And similarly for Y. We can have this happen either if $X - \mu_X$ is always very small, or if the ratio of the first to the second derivative is always very large, that is, if the function h is smooth.

Assumption 1: Measurement errors are small, where the scale for smallness is set by the ratio of first to second derivatives.
If Assumption 1 holds, and we can use our Taylor expansion, we've re-expressed h as a linear combination of random variables, and we know how to handle linear combinations. First, the mean:
\begin{align*}
E[Z] = E[h(X, Y)] &\approx h(\mu_X, \mu_Y) + E\left[ \left(\frac{\partial h}{\partial x}\right)(X - \mu_X) \right] + E\left[ \left(\frac{\partial h}{\partial y}\right)(Y - \mu_Y) \right] \\
&= h(\mu_X, \mu_Y) + \left(\frac{\partial h}{\partial x}\right) E[X - \mu_X] + \left(\frac{\partial h}{\partial y}\right) E[Y - \mu_Y] \\
&= h(\mu_X, \mu_Y) + \left(\frac{\partial h}{\partial x}\right) (E[X] - \mu_X) + \left(\frac{\partial h}{\partial y}\right) (E[Y] - \mu_Y) \\
&= h(\mu_X, \mu_Y) + \left(\frac{\partial h}{\partial x}\right) (\mu_X - \mu_X) + \left(\frac{\partial h}{\partial y}\right) (\mu_Y - \mu_Y) \\
&= h(\mu_X, \mu_Y)
\end{align*}
Now we compute the variance:
\begin{align*}
\mathrm{Var}(Z) = \mathrm{Var}(h(X, Y)) &\approx \mathrm{Var}\left( h(\mu_X, \mu_Y) + \left(\frac{\partial h}{\partial x}\right)(X - \mu_X) + \left(\frac{\partial h}{\partial y}\right)(Y - \mu_Y) \right) \\
&= \mathrm{Var}\left( \left(\frac{\partial h}{\partial x}\right)(X - \mu_X) + \left(\frac{\partial h}{\partial y}\right)(Y - \mu_Y) \right)
\end{align*}
We can drop $h(\mu_X, \mu_Y)$ because it's just a constant, but now we need to make an additional assumption.

Assumption 2: The measurement errors in the input variables are independent.
\begin{align*}
\mathrm{Var}(Z) &\approx \mathrm{Var}\left( \left(\frac{\partial h}{\partial x}\right)(X - \mu_X) \right) + \mathrm{Var}\left( \left(\frac{\partial h}{\partial y}\right)(Y - \mu_Y) \right) \\
&= \left(\frac{\partial h}{\partial x}\right)^2 \mathrm{Var}(X - \mu_X) + \left(\frac{\partial h}{\partial y}\right)^2 \mathrm{Var}(Y - \mu_Y) \\
&= \left(\frac{\partial h}{\partial x}\right)^2 \mathrm{Var}(X) + \left(\frac{\partial h}{\partial y}\right)^2 \mathrm{Var}(Y) \\
&= \left(\frac{\partial h}{\partial x}\right)^2 \sigma^2_X + \left(\frac{\partial h}{\partial y}\right)^2 \sigma^2_Y
\end{align*}
Taking the square root of $\mathrm{Var}(Z)$ to get the standard deviation gives us the usual formula for propagation of error.
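As a sanity check on this derivation, here is a minimal Monte Carlo sketch (an addition, not part of the lecture; the function h and the noise levels are the same made-up numbers as in the worked example above).

\begin{verbatim}
import numpy as np

rng = np.random.default_rng(1)

# Arbitrary illustrative choices: h(x, y) = x * y, with small independent Gaussian errors
mu_x, mu_y = 10.0, 4.0
sigma_x, sigma_y = 0.2, 0.1

def h(x, y):
    return x * y

# Propagation-of-error formula: partials of x*y, evaluated at the means
dh_dx, dh_dy = mu_y, mu_x
sigma_z_formula = np.sqrt(dh_dx**2 * sigma_x**2 + dh_dy**2 * sigma_y**2)

# Monte Carlo: simulate noisy measurements and look at the spread of Z directly
X = rng.normal(mu_x, sigma_x, size=1_000_000)
Y = rng.normal(mu_y, sigma_y, size=1_000_000)
sigma_z_simulated = h(X, Y).std(ddof=1)

print("formula:  ", sigma_z_formula)     # about 1.28
print("simulated:", sigma_z_simulated)   # close, since the errors are small
\end{verbatim}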
The most important special case for this is when the values of x and y we plug in to the formula are themselves obtained by averaging many measurements, so that the X above is really the sample mean $\bar{X}$, and Y is really $\bar{Y}$. Let's make the following assumptions.

Assumption 3: Measurement errors are independent from one measurement to the next.

Assumption 4: There are many measurements of each variable.

In this case, we can use the central limit theorem to say more about $\bar{X}$ and $\bar{Y}$. The mean values of $\bar{X}$ and $\bar{Y}$ are still the population means, $\mu_X$ and $\mu_Y$. But now the standard deviations we plug in are standard errors, $s_x = \sigma_X/\sqrt{n}$ and $s_y = \sigma_Y/\sqrt{n}$. Also, $\bar{X}$ and $\bar{Y}$ are approximately Gaussian. Since a linear combination of independent Gaussians is Gaussian, Z is also approximately Gaussian.
So we have the following result.

Suppose $Z = h(\bar{X}, \bar{Y})$, where $\bar{X}$ is the sample mean of measured values of X, and likewise for $\bar{Y}$. Then, if Assumptions 1-4 hold, Z is approximately Gaussian, with mean $h(\mu_X, \mu_Y)$ and variance
$$\left(\frac{\partial h}{\partial x}\right)^2 \frac{\sigma^2_X}{n} + \left(\frac{\partial h}{\partial y}\right)^2 \frac{\sigma^2_Y}{n}$$
where n is the number of measurements of each input variable, and $\sigma^2_X$ is the true (population) variance of X.
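The boxed result is also easy to check by simulation (again a sketch with arbitrary, made-up numbers rather than anything from the lecture): average n measurements of each input, form Z, and compare its spread with the variance formula.

\begin{verbatim}
import numpy as np

rng = np.random.default_rng(2)
n, reps = 50, 20_000                      # measurements per variable, repetitions (arbitrary)
mu_x, mu_y, sigma_x, sigma_y = 10.0, 4.0, 2.0, 1.5

# Each repetition: average n noisy measurements of x and of y, then form Z = h(xbar, ybar)
xbar = rng.normal(mu_x, sigma_x, size=(reps, n)).mean(axis=1)
ybar = rng.normal(mu_y, sigma_y, size=(reps, n)).mean(axis=1)
Z = xbar * ybar                           # h(x, y) = x * y again

var_formula = (mu_y**2 * sigma_x**2 + mu_x**2 * sigma_y**2) / n
print("simulated Var(Z):", Z.var(ddof=1))
print("formula   Var(Z):", var_formula)
print("mean of Z:", Z.mean(), "vs h(mu_x, mu_y) =", mu_x * mu_y)
\end{verbatim}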
3 The Law of Large Numbers and Central Limit Theorem in Real Data
The law of large numbers and the central limit theorem I've presented assume independent data-points. While we can create independent data, and a lot of experimental technique, survey design methods, etc., is about ensuring our data are independent, phenomena in the natural world are rarely so cooperative as to be completely independent. Fortunately, the asymptotic laws still generally hold when the data values are not too dependent. Making this precise involves some mathematics way beyond the scope of this course (though I strongly encourage you to take a course in stochastic processes, where you'll learn all about it), but we can convince ourselves of its validity experimentally in many cases.
Here's one particular case: the wind-tunnel data which we first saw back in the second lecture. We'll look at the acceleration measurements. These are weakly correlated (the correlation between successive values is about -0.017, which is small, but definitely not zero). The equivalent of looking at a sample of n independent draws from the distribution is to look at a time average of T successive values from the series: $\frac{1}{T}\sum_{i=1}^{T} a_{k+i-1}$. This time average is going to depend on the starting position, k, as well as on the length of the interval over which we average, T. If the system we're looking at is well-behaved, though, our initial starting point (k) makes less and less difference as we look at longer and longer intervals (T), just as, with independent samples, the sample mean always converges to the population mean. With a dependent system, what we hope is that the time average converges to the space average, which is the mean over the sample space:
$$\lim_{T \to \infty} \frac{1}{T}\sum_{i=1}^{T} a_{k+i-1} = \int a f(a)\, da$$
where f(a) is the system's density in the sample space, the fraction of the time it spends near the point a. If this happens, we say that the system is ergodic. Ergodicity is extremely important for statistics, because it means that any sufficiently long sequence of data is representative of the whole process, and we can use it to make reliable inferences about the system as a whole. It's also extremely important to making statistical mechanics and thermodynamics work. Unfortunately, the math needed to really handle ergodicity is fairly complicated[1], but we can see it demonstrated in our data. After all, if the equation above holds, then, starting from any position k, the time averages should get closer and closer as T gets larger and larger. If we histogram the time averages (Fig. 1), we see that this is indeed the case: they become more and more tightly peaked around a common central value.
[1] Though you might try reading Michael C. Mackey, Time's Arrow: The Origins of Thermodynamic Behavior (Dover Books, 2003).

Figure 1: Distribution of the values of time averages. Filled circles: individual measurements. Open circles: averages of pairs of successive measurements (T = 2). Squares: averages over thirty time-steps (T = 30); diamonds: T = 100; triangles: T = 1000. Note that as we average over longer and longer times, the distribution gets narrower and narrower, while the center does not move. This indicates that time-averages are converging to a common value, independent of when we start observing the acceleration, that is, that the system is ergodic.

If the values $a_t$ are ergodic, then so is any function of $a_t$. In particular, if we look at the indicator function which says whether or not $a_t \in B$ for some set B, this will also converge on a limiting value, which is the probability of B. We saw something like this in lecture 2, but let's look at it again for the acceleration. Here I've chosen the set $B = [0.05, 0.06] \cup [-0.03, -0.02]$, i.e., two intervals on either side of zero. (There's no particular interest to this region; I just chose it to show that this works on pretty much any event you like.) As you can see in Fig. 2, the time-average of the number of measurements falling in B converges to a stable value, no matter when we start making our measurements. (Remember that the sampling rate here is 30 kHz, so 30,000 time-steps is one second.)
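We don't have the wind-tunnel series in these notes, but the same convergence of relative frequencies from different starting points is easy to reproduce with a simulated stand-in. The sketch below (an addition, not from the lecture) uses a weakly autocorrelated AR(1) series in place of the accelerations; all numbers are made up for illustration.

\begin{verbatim}
import numpy as np

rng = np.random.default_rng(3)

# Simulated stand-in for the acceleration series: a weakly correlated AR(1) process
N, phi = 1_000_000, -0.017
noise = rng.normal(scale=0.03, size=N)
a = np.empty(N)
a[0] = noise[0]
for t in range(1, N):
    a[t] = phi * a[t - 1] + noise[t]

# Indicator of the event B = [-0.03, -0.02] union [0.05, 0.06], as in the text
in_B = ((a >= -0.03) & (a <= -0.02)) | ((a >= 0.05) & (a <= 0.06))

# Running relative frequency of B from different starting positions k (cf. Figure 2)
for k in [0, 100_000, 900_000]:
    running_freq = np.cumsum(in_B[k:]) / np.arange(1, N - k + 1)
    print(f"start k = {k:>7}: frequency after 50,000 steps = {running_freq[49_999]:.4f}")
\end{verbatim}

Started anywhere, the running frequencies settle down to nearly the same limiting value, which is the simulated analogue of the behavior shown in Fig. 2.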
At this point, we should be pretty much convinced that the law of large numbers holds in this data: reasonably long samples all look alike, and are all representative of the process as a whole. What about the central limit theorem? More specifically, do the time averages approach a Gaussian distribution?

One way to check this would be to compare the histograms of the time-averages, as in Fig. 1, to Gaussian density functions with the same mean and variance. But then we'd have to assess whether two more-or-less bell-shaped wiggly curves are good matches, and we'd rather do something easier. The something easier is provided by probability plots, which you read about last week.
Remember how probability plots work: along the horizontal axis, I've plotted all the different values seen in the data, in order. Each value falls at a certain quantile of the data: the i-th smallest value is bigger than or equal to a fraction i/n of the sample values. Now, for any distribution, the quantile function Q(p) is the inverse of the cumulative distribution function F(x). Just as F(x) answers "what is the probability that a random value is $\leq x$?", Q(p) answers "what value is at least as large as a fraction p of the samples?" The vertical axis gives Q(F(x)), where F(x) is the CDF of the data, and Q is the quantile function of a theoretical distribution, here the standard Gaussian. If the data really does come from the theoretical distribution, then $Q = F^{-1}$, and we should get a straight line, up to sampling error. If not, we'll get something curved. One wrinkle is that a sample from any Gaussian distribution, plotted against the standard Gaussian, should give a straight line, because all Gaussian distributions can be standardized by a linear transformation. So plotting the data against a standard Gaussian lets us check normality.
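To make the mechanics concrete, here is a small sketch (an addition, using simulated data rather than the accelerations) that builds a Gaussian probability plot directly from sorted sample values and standard-Gaussian quantiles.

\begin{verbatim}
import numpy as np
from scipy.stats import norm
import matplotlib.pyplot as plt

rng = np.random.default_rng(4)
x = rng.normal(loc=0.2, scale=1.5, size=2000)   # toy data; any sample could go here

x_sorted = np.sort(x)                           # horizontal axis: the data values, in order
n = len(x_sorted)
p = (np.arange(1, n + 1) - 0.5) / n             # quantile level of each sorted value
q_theory = norm.ppf(p)                          # standard Gaussian quantile function Q(p)

plt.plot(x_sorted, q_theory, ".")
plt.xlabel("sample values (sorted)")
plt.ylabel("standard Gaussian quantiles")
plt.title("Gaussian probability plot: a straight line indicates normality")
plt.show()
\end{verbatim}

Because the toy data here really are Gaussian, the points fall on a straight line (whose slope and intercept reflect the standardizing linear transformation); data from a non-Gaussian distribution would bend away from a line.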
The next few figures give Gaussian probability plots for the individual acceleration measurements (T = 1), averages over successive pairs of measurements (T = 2), and then over times of length 30, 100 and 1000. What you can see is that the probability plots come closer and closer to straight lines, over more and more of their range, until at T = 1000 we've got something which is really very Gaussian indeed. So it looks like the central limit theorem holds in this real-world, correlated data too.
However, there's an important caveat here. If the CLT worked just like it did in the case of independent data, then the variance of the time-averages should be approximately Var(A)/T. We know that the variance is getting smaller (we can see that in Fig. 1), but is it getting smaller like 1/T?
T      Variance of time averages    Var(A)/T
1      7.50 × 10^-4                 7.50 × 10^-4
2      3.69 × 10^-4                 3.75 × 10^-4
30     2.83 × 10^-5                 2.50 × 10^-5
100    6.21 × 10^-6                 7.50 × 10^-6
1000   2.55 × 10^-7                 7.50 × 10^-7
The variance in the time-averages is actually getting smaller faster than the CLT would predict in independent data.
Figure 2: Convergence of relative frequencies to long-run probabilities for real data. The horizontal axis shows time, in steps of 1/30,000 second. The vertical axis shows the fraction of measurements to date which fall into the region $B = [-0.03, -0.02] \cup [0.05, 0.06]$. Gray horizontal line: long-run average of this fraction (probability). Solid line: time-averages starting from the first measurement. Dashed line: time-averages starting from the 100,000th measurement. Dotted line: time-averages starting from the 900,000th measurement.
Figure 3: Gaussian probability plot of the acceleration values. Here and in the other probability plots, the diagonal line connects values at the first quartile to those at the third quartile; it serves as a rough guide to the eye.
Figure 4: Gaussian probability plot of the means of pairs of successive accelerations
Figure 5: Gaussian probability plot of the means of thirty successive accelerations. The horizontal line through zero is just a graphics bug.
Figure 6: Gaussian probability plot of the means of 100 successive accelerations
Figure 7: Gaussian probability plot of the means of 1,000 successive accelerations
This faster-than-1/T shrinkage is basically because the correlation between $a_t$ and $a_{t+1}$ is negative: if one of them fluctuates above the mean value, the other one is apt to move below it, so they're even more likely to cancel out fluctuations around the mean than independent measurements are. The moral of this story is that while time averages converge, and they tend to have a Gaussian distribution when you look at enough of them, you can't, necessarily, assume that they'll have the same Gaussian distribution as if the measurements were all independent of one another.
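To see the mechanism numerically, here is one last sketch (an addition with made-up parameters, not the wind-tunnel data): for a series whose successive values are negatively correlated, the variance of length-T block averages falls below the Var(a)/T line that independent measurements would give.

\begin{verbatim}
import numpy as np

rng = np.random.default_rng(5)

# AR(1) series with a negative lag-one correlation (illustrative only)
N, phi = 1_000_000, -0.3
noise = rng.normal(size=N)
a = np.empty(N)
a[0] = noise[0]
for t in range(1, N):
    a[t] = phi * a[t - 1] + noise[t]

var_a = a.var()
for T in [2, 30, 100, 1000]:
    # Variance of non-overlapping block averages of length T
    blocks = a[: (N // T) * T].reshape(-1, T).mean(axis=1)
    print(f"T = {T:>5}: Var(time average) = {blocks.var():.3e},  Var(a)/T = {var_a / T:.3e}")
\end{verbatim}

With a negative lag-one correlation the block-average variance comes out below Var(a)/T, just as in the table above; with positively correlated data it would come out above instead.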