0% found this document useful (0 votes)

2 views

ECS3706 study unit 17_reduced size file

The document outlines the objectives and content of an Econometrics I course at NYU, emphasizing the importance of understanding statistical principles for econometric analysis. It covers fundamental concepts such as probability distributions, random variables, and the significance of statistical estimators. The study unit includes practical tasks to reinforce learning and prepare students for examination material related to these statistical concepts.

Uploaded by

Jabulani Pilime

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2 views

ECS3706 study unit 17_reduced size file

Uploaded by

Jabulani Pilime

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 14

STuDY uNIT 17

Statistical principles

ECONOMETRICS IN ACTION

The Department of Economics at New York university (NYu) has evolved into
one of the world’s leading centres for research and teaching in economics.
Professor C flinn of NYu teaches the Econometrics I course. Here are some
of his comments on his course objectives:

• We will begin by reviewing probability and sampling theory. To be a competent

econometrician, one needs to have a solid understanding of basic statistical
theory, some familiarity with data, and a good knowledge of economic theory.
• from my perspective, econometrics is essentially the application of
standard statistical tools to the analysis of conditional relationships between
random variables. What distinguishes econometrics from statistics is the
econometrician’s objective to infer something about behaviour from empirical
relationships between variables.
• In this course, we will attempt to prepare the student for this kind of research
enterprise by carefully covering most or all of the statistical theory [albeit at
a basic level] they will need to do competent applied econometric analysis.

The message is clear. You cannot fully understand econometrics without a

solid grounding in statistics.

STuDY OBJECTIVES
Econometrics makes extensive use of statistical concepts. Some examples:
• We assume that the data used in regression analysis are a random sample drawn
from the population. What exactly is the meaning of “random sample” and of
“population”?
• What are the implications of using sample estimates? The concept of the sampling
distribution of a sample estimator is a fundamental concept you must understand
well.
• Related to the sampling distribution are concepts like unbiased estimators and
minimum variance. What do these mean?

This module requires you to be familiar with statistical concepts. This chapter deals
with the basic statistical concepts required in this regard. This could be particularly
helpful to students who have not previously completed statistics courses. Students
who have previously completed statistics courses may find this chapter a convenient
means to brush up their statistics, and may even learn some new things!

Yes, this study unit is examination material. Within each of the sections below, we
clearly indicate what you must understand.

Open Rubric
STuDY uNIT 17: Statistical principles

The approach of this chapter is different to that of other chapters.

• We first provide the headers of sections as discussed in the textbook.

• We then tell you exactly what you are required to know. Remember, the focus is
on understanding the meaning of statistical concepts.
• There may be examination questions on the material you are required to know.
We may, for example, ask you to derive a standard deviation (given some simple
data), to explain its meaning, to explain what is a sample distribution, or to explain
the meaning of expected value.
• The major part of this study unit consists of a number of tasks which are practical
applications of all the major statistical concepts. The tasks are meant to be
learning exercises. They may assist you in better understanding statistical concepts.
Definitely work through them!
• Although some aspects may be explained in a different way than in the textbook,
the textbook remains your prime source.

17.1 PROBABILITY DISTRIBuTIONS

This section covers topics on probability, mean, variance and standard deviation,
continuous random variables, standardised variables and the normal distribution.

We expect you, in the case of discrete random variables, to understand the meaning of
• a random variable (X) and the probability distribution of X which is denoted by
its probability density function P(X)
• the mean (or expected value) of random variable (X)
• the variance and the standard deviation of random variable (X)

In the case of continuous random variables, you must understand

• why continuous variables arise
• the meaning of the probability distribution (the probability density curve)
• the meaning of mean, variance and standard deviation
• the meaning of standardised variables

In the case of the normal distribution, you must

• understand its meaning and how the central limit theorem can give rise to a
normal distribution
• be able to apply the normal distribution in practice

Tasks 17.1.1–17.1.5 deal with the following major statistical concepts:

• probability density function (uniform) of a discrete random variable and its
expected value
• mean, variance and standard deviation of a discrete random variable which is
not uniformly distributed
• continuous random variables and their probability density functions; standardised
random variables
• expected value and bias
• the normal distribution
• the central limit theorem

ECS3706/1 33
TASK 17.1.1
Consider a normal die with numbers 1 to 6 on its sides. Let X measure the outcome
of a throw of the die.

(a) Explain how the concept of a discrete random variable (X) may be applied
to the throw of the die.

(b) Derive the probability density function P(X). Explain whether P(X) is normally
distributed.

(d) Derive the variance of X and the standard error of X.

5 ANSWERS
(a) The variable (X) can assume six possible outcomes when the die is thrown.
The range of possible outcomes of X is (1, 2, 3, 4, 5, 6). Because these are
a countable number of possible values, X is a discrete variable. Because X
assumes values by random chance, X is also a random variable. Thus X is
a discrete random variable.

(b) The probability P(X) is the probability of obtaining each of these X-values.
Because each number 1 to 6 has an equal chance of occurring, P(X) = 1/6
for all X. Note that ΣP(X) = 1.

Variable X is not normally but uniformly distributed. In the case of the uniform
distribution, P(X) is constant for all values of X. In the case of the normal
distribution, the chart of P(X) versus X is bell shaped. Loosely speaking, this
means that the probability P(X) of realising numbers in the middle range of
X is higher than that of the tail ends.

34
STuDY uNIT 17: Statistical principles

(c) The expected value of X is derived as ∑ X . P(X):

= 1.(1/6) + 2.(1/6) + 3.(1/6) + 4.(1/6) + 5.(1/6) + 6.(1/6)
= (1 + 2 + 3 + 4 + 5 + 6).(1/6)

= 21/6
= 3.5

The meaning of the expected value is the average value of a large number of
throws. Because each throw can yield numbers 1 to 6, where the probability of each
number is 1/6, we can expect the average of a large number of throws to be 3.5.

(d) The variance of X is ∑ (X – μ)2.P(X) where μ is the expected value of

X (μ = 3.5).

X P(X) X- μ (X- μ)2 (X- μ)2.P(X)

1 1/6 -2.5 6.25 1.0417

2 1/6 -1.5 2.25 0.3750

3 1/6 -0.5 0.25 0.0417

4 1/6 0.5 0.25 0.0417

5 1/6 1.5 2.25 0.3750

6 1/6 2.5 6.25 1.0417

Sum 2.9167

The variance of X is 2.9167. The standard error of X is 2.91667 = 1.7078.

TASK 17.1.2

This example deals with a nonuniform probability density function

in contrast
to example 17.1.1 which deals with a uniform one.
Consider a normal die with numbers 1 to 6 on its sides. Let Y
measure the sum of two throws of the die. for example, if two
throws realise a 4 and a 2, then Y = 6.

The outcomes of all possible throw 1 and throw 2 values are displayed in the
table on the right.

The possible outcomes of Y range from a minimum of Y = 2 (1 + 1) to a maximum

of Y = 12 (6 + 6). Each of the 6 x 6 possible outcomes has an equal probability
to occur, that is, 1/36. Note that there are more “7” outcomes for Y than, for
example, 5s, simply because more combinations of throws have the sum of 7.

ECS3706/1 35
Y = throw1 Outcome of throw 2
+ throw 2
1 2 3 4 5 6

1 2 3 4 5 6 7

Outcome of throw 1
2 3 4 5 6 7 8
3 4 5 6 7 8 9
4 5 6 7 8 9 10
5 6 7 8 9 10 11
6 7 8 9 10 11 12

(a) List all possible values of Y as well as their frequency (how many times each
occurs). Which value of Y occurs most?

(b) Determine and draw P(Y), the probability density function of Y. Is Y normally
distributed?

6 ANSWERS

(a) See the table below for the 11 possible Y i values which fall between 2 and
12. The Y-value of 7 occurs most (it is called the mode).

Y Fre- P(Y) Y.P(Y) Y (Y (Y – μ)2.

quency –μ – μ)2 P(Y)
(F/36)
(F)

2 1 0.0278 0.0556 -5 25 0.6944

3 2 0.0556 0.1667 -4 16 0.8889

4 3 0.0833 0.3333 -3 9 0.7500

5 4 0.1111 0.5556 -2 4 0.4444

6 5 0.1389 0.8333 -1 1 0.1389

7 6 0.1667 1.1667 0 0 0.0000

8 5 0.1389 1.1111 1 1 0.1389

9 4 0.1111 1.0000 2 4 0.4444

10 3 0.0833 0.8333 3 9 0.7500

36
STuDY uNIT 17: Statistical principles

11 2 0.0556 0.6111 4 16 0.8889

12 1 0.0278 0.3333 5 25 0.6944

Sum 36 1.0000 μ = 0 110 5.8333

7.0000

Frequency: the number of times the specific Y-value occurs.

(b) P(Y) is proportional to the frequency of Y and P(Y) = Yfrequency /36. ΣP(Y) = 1,
that is, the area under the P(Y) curve is 1, which of course also applies to
the continuous variable case. The nice thing about P(Y)s is that if you want
to derive the probability of getting numbers say 5 to 9, you simply add their
P(Y)s, which is (4 + 5 + 6 + 5 + 4)/36 = 24/36. The probability density func-
tion is displayed below.

Is Y normally distributed? Well, not quite! To be normally distributed, Y must

be a continuous variable and its probability distribution P(Y) must be bell
shaped.

Y is a discrete variable and its probability distribution P(Y) is not bell shaped.

σ 2 = Σ(Y i – μ)2.P(Y i) = 5.8333

σ = √ σ 2 = 2.4152. σ is a measure of the dispersion or variation of Y. In the

case of a normal distribution, about 2/3 of its values fall within the range μ –
σ to μ + σ. Applied to the case of Y, μ – σ = 4.6 and μ + σ = 9.4. Well, Y is a
discrete variable and not normally distributed either, but let’s approximate it
by the range 5-9. The probability of Y falling within this range is indeed 2/3
(sum its P(Y) values: 1/36(4 + 5 + 6 + 5 + 4) = 24/36 = 2/3).

TASK 17.1.3
Explain why there is a need for continuous variables. How do we interpret P(X) for
continuous variables? When is a continuous variable normally distributed? What
is a standardised variable?

ECS3706/1 37
7 ANSWER
In real life the outcomes of random variables are often not countable numbers. Often
the values of random variables are rational numbers which may include decimal
fractions. For example, a continuous random variable, u, may assume the value
of -4.7636 (rounded to 4 decimals). In regression analysis, the error term values
typically include rational numbers which fluctuate around an average value of 0.

Continuous random variables, say variable X, allow for rational numbers. Continu-
ous random variables often occur over an interval, say from -20.8 to +30.2. It is
even possible that we do not even specify their minimum or maximum X-values!
For example, it is possible that –∞≤X≤ +∞ where ∞ indicates infinity, as in the
case of the normal distribution.

But how do we deal with their probability density functions P(X)? The P(X)
curve is defined such that the total area under the curve = 1. We cannot speak
of the probability of obtaining a, say, X = 7 value. The probability of P(X = 7)
would be very small. We instead deal with the probability across a range of X-
values, for example 4 ≤ X ≤ 7.

An example of a discrete distribution that is approxi-mately normally distributed

is provided on the left. This refers to the case where X is the sum of six throws of
the die. In this case, the minimum value of X = 6 (6 x 1) and the maximum value
of X = 36 (6 x 6).

The value of X = 21 occurs most frequently. The sum of the probability of obtaining
values in both tail ends (that is, relative large deviations from the average), say X
≤ 12 plus X ≥ 30 is relatively small.

38
STuDY uNIT 17: Statistical principles

We use standardised Z-values to look up probabilities of the normal distribution

in which case Z = (X – μ)/σ. The probability of say -1 ≤ Z ≤ 1 is represented by the
area under the P(Z) curve from Z = -1 to +1. See the chart at the left. According
to table B7, the area under the curve for Z ≥ +1 is 0.1587, similarly the area below
Z ≤ -1 is 0.1587. Thus the area below the curve for -1 ≥ Z ≤ 1 is 0.6826. Conse-
quently, the probability that a continuous normally distributed random variable will
fall within the range μ – σ and μ + σ is 68.26%.

TASK 17.1.4
(a) Explain what is meant by the expected value of a random discrete variable
(X). Its P(X) and X.P(X) are provided in table 17.1.4.

(b) What is the meaning of bias in the case of a sample distribution (X) used to
measure an unknown population parameter μ?

Table 17.1.4

X P(X) X.P(X)

3 1/216 0.0139

4 3/216 0.0556

5 6/216 0.1389

6 10/216 0.2778

7 15/216 0.4861

8 21/216 0.7778

9 25/216 1.0417

10 27/216 1.2500

11 27/216 1.3750

ECS3706/1 39
12 25/216 1.3889

13 21/216 1.2639

14 15/216 0.9722

15 10/216 0.6944

16 6/216 0.4444

17 3/216 0.2361

18 1/216 0.0833

Sum 1.000 10.5000

8 ANSWERS
The expected value of a random discrete variable (X), E(X) is its weighted average:

∑X.P(X) = 10.5.

Assume that variable X is the sample estimate of a population parameter μ. Also

assume the sample estimates vary from 3 to 18 and that their P(X) is as in table
17.1.4.

If E(X) = μ = 10.5, then the estimator is unbiased. If, say, E(X) = 12, while μ = 10.5,
then the estimator is biased. Bias occurs when the estimator tends to overestimate
or underestimate the true value.

In the case of a random continuous variable Z:

E(Z) = ∫Z.P(Z).dZ where ∫P(Z).dZ = 1.

TASK 17.1.5
Psychologists tell us that the intelligence quotient (IQ) of the population is normally
distributed with average μ = 100 and the standard deviation σ = 15.

(a) Compile a table which indicates which proportion of the population has an IQ
exceeding (or equal to) 100, 110, 120, 130, 140 and 145, respectively. In the
process, also indicate the standardised Z-values. Look up the probabilities
in table B-7 (the normal distribution). Also indicate how many persons of a
population of 10 000 persons fall within each group.

(b) Explain why IQ is normally distributed within a population. Refer to the central
limit theorem.

40
STuDY uNIT 17: Statistical principles

9 ANSWERS
(a)

IQ (X) Number of
persons in
(greater or z = (X – μ)/σ Probability
population
equal to) that Z > z
of 10 000

100 0.00 0.5000 5 000

110 0.67 0.2514 2 514

120 1.33 0.0918 918

130 2.00 0.0228 228

140 2.67 0.0038 38

145 3.00 0.0013 13

(b) The central limit theorem states that if Z is a standardised sum of N independ-
ent and identically distributed random variables, then the probability distribu-
tion of Z approaches the normal distribution. See page 552. IQ is normally
distributed because it reflects the cumulative outcome of a large number of
hereditary and environmental factors. See page 554.

17.2 SAMPLING
This section deals with topics on selection bias, survivor bias, nonresponse bias and
the power of random selection.

Studenmund provides a good overview of some sample selection methods and of

sampling error. Our interest, however, does not lie with the different sample selection
methods. Within econometrics, sampling is important because we use the concept
of the sampling distribution. Thus, you need to focus only on the following aspects:
• the difference between the population and the sample
• the meaning of sampling error
• the meaning of statistical inference

TASK 17.2.1
Explain:

• What is sampling in general?

• Why do sampling concepts arise in econometrics?

10 ANSWERS
Sampling is the process of selecting only some units, for example people, organi-
sations) from a total population of interest. For example, we can select a sample
of, say, 50 students from the population of 250 000 Unisa students. The beauty

ECS3706/1 41
of sampling is that the characteristics of the sample quite often accurately reflect
those of the population. Statistical inference refers to the process of estimating
population parameters (mean, total, ratios, et cetera) from the sample estimates,
and of providing suitable measures of their accuracy.

In econometrics, we use sample data for estimation. An example is the house

price regression of chapter 1 where the sample of 43 houses 1 is a subset of all
houses sold in Southern California during a given time period. In econometrics,
we make the distinction between the population regression function (PRF) and
the sample regression function (SRF). The PRF refers to the true but unknown
regression equation. The PRF is a theoretical construct. It is not something we
would normally estimate because often not all population values are known, or the
population is impractical to measure. In contrast, the SRF is a practical concept.
The SRF is based on data which we observe. In practice, we estimate the SRF.

Given that we use sampling, we can expect that the sample estimates of parameters
will fluctuate round their true population parameters. This is called sampling error.
Parameters refer to statistical measures such as the mean or standard deviation.
In econometrics our interest lies mainly with the coefficients of a regression equa-
tion – which may also be called parameters.

17.3 ESTIMATION
This section deals with sampling distributions, the mean of the sampling distribution,
the standard deviation of the sampling distribution, the t-distribution, confidence
intervals and sampling from finite populations.

We expect you to understand

• the meaning of a sampling distribution, and its expected value and standard
deviation
• the meaning of systematic error (or bias)
• the meaning of the t-distribution

The importance of the sampling distribution

Please refer back to the statement made by Kennedy at the beginning of this study
unit. We sometimes have different estimators which have different sampling distribu-
tions. For example, we will come across the econometric problem of serial correla-
tion (chapter 9) which affects the accuracy of estimates. In this case we then have
the choice of two estimators, normal OLS, and the method of GLS. The choice of
the better estimator then rests upon the characteristics of its sampling distribution.

To determine the best estimator, we ask four questions:

• Is the estimator unbiased?

• What is the size of its standard error?
• Are the estimates of its standard error unbiased?
• What impact does an increased sample size have on these characteristics?

1
The sample in Studenmund only includes real estate transactions of the past four weeks.

42
STuDY uNIT 17: Statistical principles

TASK 17.3.1
This task addresses the sampling distribution of a sample estimator. In this case,
the sample estimator is X , that is, the average of a sample of X-values drawn
from a population of X-values. The question is how will X match the true popula-
tion average.

!
In study unit 4 we will again deal with the sampling !
distribution. In that case, our
interest lies with the sample distribution of b where b is a sample estimate of a
coefficient of a regression equation. In both cases, however, the principle of a
sample distribution is similar.

Explain the meaning of

• the sampling distribution of X

• the expected value of X , as well as bias
• the standard deviation (also called standard error) of X

11 ANSWERS
The easiest way to explain the meaning of a sampling distribution is to use a
simulation approach. The following steps outline this approach:

(1) The first step is to define precisely what characteristic of the population we
wish to measure. Assume that we wish to determine the population average
(or mean) of variable X of the population.
(2) In this case we need to determine whether the sample average ^ X h , based
on a random sample, is a good estimator of the population average (μ).
The goal of the procedure is to determine how well the sample estimator
X performs.
(3) We create a known population by simply generating, say, 50 000 random
values of X.
(4) We then sample repeatedly from this population by random selection of,
say, samples of 20 observations each.
(5) We calculate the sample mean of each sample ^ X h . We record these es-
timates into a histogram.
(6) The distribution of these estimates defines the sampling distribution of X
Because we know the true mean (μ), we can determine how much the sample
estimates of the mean ^ X h deviate from μ.

Your lecturer has applied these steps in practice. First (step 3) observations (X)
for the population were generated which conform to the normal distribution with
the average μ = 100 and a standard deviation of X of σ = 15. This is easily done
by using a PC and MS Excel.

Then a large number of random samples (each of sample size 20) were selected
(step 4). The sample mean of each sample ^ X h was derived and recorded into a
histogram (step 5). The histogram summarises the frequency of values of different
X obtained from all samples.

With respect to the histogram, the Y-axis measures the relative occurrence of the
values of X . The number on the X-axis represents the upper bound, for example,
100.5 represents values of X falling between 99.5 and 100.5.

ECS3706/1 43
Which conclusions can we make based on this sampling distribution?

(1) The first, possibly unexpected, fact is that the outcomes of random sampling
produce a well-behaved distribution! The sample averages appear to cluster
around the true value being estimated and the distribution is symmetric. The
expected value (weighted average) of the sample means of all samples is
equal (at least very close) to the true value of µ = 100. This implies that the
estimator X is unbiased. Bias in the estimator occurs when the expected
value of X is not equal to µ.
(2) Deviations ( X – µ) do occur, which are both positive and negative. However,
in most cases, these deviations are relatively small. Large deviations do
occur, but the probability of this is relatively low.

(3) To further judge the accuracy of a sample value X we need information

regarding the “width” of the sample distribution of X . for example, in the
histogram above almost all observations of X fall within the range μ – 10 to
μ + 10. The SE ^ X h is such a measure (but different from the value of 10)
where SE = standard error (also called standard deviation).
Statistical theory tells us that the SE ^ X h may be derived as follows:
SE ^ Xh v
SE ^ X h = = = 3.354 where N is the sample size.
N 20
The SE ^ X h measures the “width” of the sample distribution of X . Of
course, the “narrower” the sample distribution of X is, the more accurate
its estimates are.

(4) The distribution of deviations X – µ conforms to the normal distribution.

This allows us to make probability statements regarding the extent of devia-
tions X – µ. It is convenient, however, to write this in its standardised form:

X -n
Z= where v is the standard error of X
v N
N

σ is the SE(X) and N is the sample size.

44
STuDY uNIT 3: Learning to use regression analysis

The advantage of this form is that Z is normally distributed with an average

of 0 and a standard error of 1. Because tables of the normal distribution are
published in this form it is then easy to compare values of Z (obtained from
the sample) to that of z (the values published in the table).

The distribution of random sampling estimates of X is highly predictable. We

may assume that a single random sample estimate will conform to this behaviour.

TASK 17.3.2
Explain the meaning of the t-distribution (with respect to sample estimator X ) and
explain which sources of sampling variation it accounts for.

This task provides some background regarding the t-distribution which will again
appear in the next study unit.

12 ANSWER
In task 17.3.1 (4), reference was made to the standardised form of X – µ, that is
X -n
Z= equation A.
v
N
Because we only have sample data, the sample will provide values for X and
N. Both μ and σ are, however, unknown. The first (μ) is not really a problem due
to the nature of hypothesis testing. In the next study unit, you will learn that we
simply replace μ with a fixed value, that is a value of which its “compatibility” with
X is tested. The second, σ = SE(X), remains unknown.

Because the sample consists of N values of X, σ may in fact be estimated. An

unbiased estimator of σ is s, where

s=
/ ^ Xi - X h2 equation B.
N -1
If we replace σ within equation A with its estimate s in equation B, then
X -n
t= equation C.
s
N
Although Z is normally distributed, t is distributed like the t-distribution. The t-
distribution copes with two sources of variation, that is, X and s, which of course
vary from sample to sample.

In the next study unit the t-value is also used to test the coefficients of a regression

! !
equation for statistical significance. You only have one sample, and this sample
provides only one estimate each of b and SE _ b i . It is derived as
!
b - b0
t=
!
SE ^bh
where β 0 is the H 0 value of the coefficient being tested and b is the sample es-
timate of coefficient β.

ECS3706/1 45

Time+Series+Forecasting Monograph
100% (4)
Time+Series+Forecasting Monograph
58 pages
Solution Manual For Introductory Econometrics 6th Edition by Woolridge
0% (3)
Solution Manual For Introductory Econometrics 6th Edition by Woolridge
7 pages
T Barge Analysis
90% (10)
T Barge Analysis
27 pages
Descriptive and Infrential Statistics
No ratings yet
Descriptive and Infrential Statistics
33 pages
Discrete Probability Distributions: Random Variables
No ratings yet
Discrete Probability Distributions: Random Variables
52 pages
Discrete Random Variable
No ratings yet
Discrete Random Variable
41 pages
ECN121 Lecture 2 Notes
No ratings yet
ECN121 Lecture 2 Notes
7 pages
Expected Value
No ratings yet
Expected Value
3 pages
Statistical Methods and Testing of Hypothesis
No ratings yet
Statistical Methods and Testing of Hypothesis
52 pages
Slides-Probability and Random Processes, 4, March 2024
No ratings yet
Slides-Probability and Random Processes, 4, March 2024
116 pages
Review Some Basic Statistical Concepts: Topic
No ratings yet
Review Some Basic Statistical Concepts: Topic
55 pages
Probability Distribution
100% (1)
Probability Distribution
20 pages
Session 7 (CHPT 5 & 6)
No ratings yet
Session 7 (CHPT 5 & 6)
76 pages
Chapter 6
No ratings yet
Chapter 6
11 pages
Chapter 5 Prob
No ratings yet
Chapter 5 Prob
6 pages
Lecture 1-1_Review of Probability
No ratings yet
Lecture 1-1_Review of Probability
36 pages
Lecture - 02
No ratings yet
Lecture - 02
36 pages
Chapter 6, 7, 8
No ratings yet
Chapter 6, 7, 8
25 pages
Fe Engineering Probability Statistics
No ratings yet
Fe Engineering Probability Statistics
9 pages
Formula Sheet
No ratings yet
Formula Sheet
18 pages
Types of Data
No ratings yet
Types of Data
45 pages
Unit 4.3 Random Variables, Discrete and Continuous Probability Distribution
No ratings yet
Unit 4.3 Random Variables, Discrete and Continuous Probability Distribution
7 pages
03 16 ReviewMathStat2
No ratings yet
03 16 ReviewMathStat2
137 pages
Statistical Inference
No ratings yet
Statistical Inference
106 pages
6.mean and Variance of A Distribution
No ratings yet
6.mean and Variance of A Distribution
38 pages
LECT3 Probability Theory
No ratings yet
LECT3 Probability Theory
42 pages
2.3 Expectation of Random Variables
No ratings yet
2.3 Expectation of Random Variables
3 pages
Statistics and Probability Module 1 (1)
No ratings yet
Statistics and Probability Module 1 (1)
58 pages
Stat - G. Assignment
No ratings yet
Stat - G. Assignment
21 pages
Lesson 5 - Probability Distributions
No ratings yet
Lesson 5 - Probability Distributions
8 pages
Stats and Prob Reviewer
No ratings yet
Stats and Prob Reviewer
7 pages
c3 Dist
No ratings yet
c3 Dist
21 pages
Random Variables and Mathematical Expectations - Lecture 13 Notes
No ratings yet
Random Variables and Mathematical Expectations - Lecture 13 Notes
9 pages
Chapter 2 - Lesson 4 Random Variables
No ratings yet
Chapter 2 - Lesson 4 Random Variables
19 pages
Statistic S at Probabili TY: Teacher: Aldwin N. Petronio
No ratings yet
Statistic S at Probabili TY: Teacher: Aldwin N. Petronio
44 pages
Statistics and Probability Second SEMESTER S.Y. 2020 - 2021: Quest
No ratings yet
Statistics and Probability Second SEMESTER S.Y. 2020 - 2021: Quest
6 pages
Probability
No ratings yet
Probability
28 pages
4&5 Basic Probability Concepts and Discrete Probability Distribution
No ratings yet
4&5 Basic Probability Concepts and Discrete Probability Distribution
10 pages
ECMT1020_lecture_notes_01_rv1
No ratings yet
ECMT1020_lecture_notes_01_rv1
6 pages
M131-Lecture Notes No. 4
No ratings yet
M131-Lecture Notes No. 4
58 pages
Probability and Statistics - 2
No ratings yet
Probability and Statistics - 2
72 pages
Statistics Concepts: An Overview of Upper-Division Statistics With R
No ratings yet
Statistics Concepts: An Overview of Upper-Division Statistics With R
69 pages
ECON1005 U5
No ratings yet
ECON1005 U5
32 pages
Random Variables
No ratings yet
Random Variables
44 pages
Chapter 6
No ratings yet
Chapter 6
5 pages
mit18_05_s22_class04-prep-b
No ratings yet
mit18_05_s22_class04-prep-b
7 pages
5 - Jan10 Discrete Random Variable
No ratings yet
5 - Jan10 Discrete Random Variable
24 pages
Probability Distributions in R
No ratings yet
Probability Distributions in R
42 pages
Statistics and Probability2021 - Quarter 3 2
No ratings yet
Statistics and Probability2021 - Quarter 3 2
38 pages
week two note
No ratings yet
week two note
19 pages
STA124 Complete Note (Edward Cares)
No ratings yet
STA124 Complete Note (Edward Cares)
41 pages
c3 Dist
No ratings yet
c3 Dist
21 pages
Unit 1 mth145 Random Variable
No ratings yet
Unit 1 mth145 Random Variable
16 pages
inbound4421484962866478386
No ratings yet
inbound4421484962866478386
68 pages
RVSP Notes
89% (9)
RVSP Notes
123 pages
Statistics and Probability: Philippine College Foundation
No ratings yet
Statistics and Probability: Philippine College Foundation
62 pages
Random Variables and Probability Distributions
No ratings yet
Random Variables and Probability Distributions
15 pages
Probability Distribution From The Old Book.
No ratings yet
Probability Distribution From The Old Book.
13 pages
Math Presentation
73% (15)
Math Presentation
58 pages
R-6 Theory
No ratings yet
R-6 Theory
4 pages
A Concept of Limits
From Everand
A Concept of Limits
Donald W. Hight
4/5 (4)
Fundamentals of Modern Mathematics: A Practical Review
From Everand
Fundamentals of Modern Mathematics: A Practical Review
David B. MacNeil
No ratings yet
Statistical Foundations for Psychology
From Everand
Statistical Foundations for Psychology
James C. Ware
No ratings yet
TRB Syllabus -148-151
No ratings yet
TRB Syllabus -148-151
4 pages
Download ebooks file Microeconometrics Using Stata Cross Sectional and Panel Regression Models 2nd Edition A Colin Cameron Pravin K Trivedi all chapters
100% (10)
Download ebooks file Microeconometrics Using Stata Cross Sectional and Panel Regression Models 2nd Edition A Colin Cameron Pravin K Trivedi all chapters
40 pages
Unit I: Business Research Methods Two Marks Question & Answer
No ratings yet
Unit I: Business Research Methods Two Marks Question & Answer
12 pages
Mandrill Tweet Analysis
No ratings yet
Mandrill Tweet Analysis
9 pages
Experiment 1: Measurement and Error Analysis
No ratings yet
Experiment 1: Measurement and Error Analysis
11 pages
Errors in Measurements
No ratings yet
Errors in Measurements
47 pages
Orca Share Media1673408981696 7018785985563493253
No ratings yet
Orca Share Media1673408981696 7018785985563493253
28 pages
AL-302-Introduction-to-Probability-and-Statistics
No ratings yet
AL-302-Introduction-to-Probability-and-Statistics
2 pages
Financial Well-Being of Auto Drivers in Bangalore
No ratings yet
Financial Well-Being of Auto Drivers in Bangalore
6 pages
Business Statistics Session 17: Simple Correlation and Regression
No ratings yet
Business Statistics Session 17: Simple Correlation and Regression
24 pages
Requirement Area Review: Flexible Study Area For Traffic Impact Assessments
No ratings yet
Requirement Area Review: Flexible Study Area For Traffic Impact Assessments
11 pages
JSPM'S Bhivarabai Sawant Institute of Technology & Research: Mini Project Report On
No ratings yet
JSPM'S Bhivarabai Sawant Institute of Technology & Research: Mini Project Report On
33 pages
Burnham Et Al., 2008
No ratings yet
Burnham Et Al., 2008
383 pages
Parameter-Efficient Transfer Learning For NLP
No ratings yet
Parameter-Efficient Transfer Learning For NLP
10 pages
Practical Research 2 Learning Competenci
No ratings yet
Practical Research 2 Learning Competenci
2 pages
Essentials of Report Writing - Application in Business
75% (4)
Essentials of Report Writing - Application in Business
28 pages
Chapter 14
No ratings yet
Chapter 14
16 pages
Applied Statistics in Business and Economics 4th Edition Doane Solutions Manual 1
100% (72)
Applied Statistics in Business and Economics 4th Edition Doane Solutions Manual 1
28 pages
Cox Ingersoll Ross - Model
No ratings yet
Cox Ingersoll Ross - Model
6 pages
An Introduction to Mathematical Modeling of Infectious Diseases Premium Download
100% (3)
An Introduction to Mathematical Modeling of Infectious Diseases Premium Download
14 pages
Quantitative Criteria For The Selection and Stabilization of Soils For Rammed Earth Wall Construction PDF
No ratings yet
Quantitative Criteria For The Selection and Stabilization of Soils For Rammed Earth Wall Construction PDF
310 pages
Risk Measurement
No ratings yet
Risk Measurement
52 pages
Contoh Template Jurnal
No ratings yet
Contoh Template Jurnal
2 pages
Etextbook 978-1111826925 Business Research Methods All Chapter Instant Download
100% (6)
Etextbook 978-1111826925 Business Research Methods All Chapter Instant Download
53 pages
Model Building Approach
No ratings yet
Model Building Approach
7 pages
10 Cool LEGO Mindstorms Ultimate Builder Projects Amazing Projects You Can Build in Under an Hour 1st Edition Mario Ferrari Giulio Ferrari all chapter instant download
No ratings yet
10 Cool LEGO Mindstorms Ultimate Builder Projects Amazing Projects You Can Build in Under an Hour 1st Edition Mario Ferrari Giulio Ferrari all chapter instant download
67 pages
Troubleshooting Guide For EQA Results - 1WA
100% (1)
Troubleshooting Guide For EQA Results - 1WA
9 pages

ECS3706 study unit 17_reduced size file

Uploaded by

ECS3706 study unit 17_reduced size file

Uploaded by

STuDY uNIT 17

• We will begin by reviewing probability and sampling theory. To be a competent

The message is clear. You cannot fully understand econometrics without a

The approach of this chapter is different to that of other chapters.

• We first provide the headers of sections as discussed in the textbook.

17.1 PROBABILITY DISTRIBuTIONS

In the case of continuous random variables, you must understand

In the case of the normal distribution, you must

Tasks 17.1.1–17.1.5 deal with the following major statistical concepts:

(d) Derive the variance of X and the standard error of X.

(c) The expected value of X is derived as ∑ X . P(X):

(d) The variance of X is ∑ (X – μ)2.P(X) where μ is the expected value of

X P(X) X- μ (X- μ)2 (X- μ)2.P(X)

1 1/6 -2.5 6.25 1.0417

2 1/6 -1.5 2.25 0.3750

3 1/6 -0.5 0.25 0.0417

4 1/6 0.5 0.25 0.0417

5 1/6 1.5 2.25 0.3750

6 1/6 2.5 6.25 1.0417

The variance of X is 2.9167. The standard error of X is 2.91667 = 1.7078.

This example deals with a nonuniform probability density function

The possible outcomes of Y range from a minimum of Y = 2 (1 + 1) to a maximum

Y Fre- P(Y) Y.P(Y) Y (Y (Y – μ)2.

2 1 0.0278 0.0556 -5 25 0.6944

3 2 0.0556 0.1667 -4 16 0.8889

4 3 0.0833 0.3333 -3 9 0.7500

5 4 0.1111 0.5556 -2 4 0.4444

6 5 0.1389 0.8333 -1 1 0.1389

7 6 0.1667 1.1667 0 0 0.0000

8 5 0.1389 1.1111 1 1 0.1389

9 4 0.1111 1.0000 2 4 0.4444

10 3 0.0833 0.8333 3 9 0.7500

11 2 0.0556 0.6111 4 16 0.8889

12 1 0.0278 0.3333 5 25 0.6944

Sum 36 1.0000 μ = 0 110 5.8333

Frequency: the number of times the specific Y-value occurs.

Is Y normally distributed? Well, not quite! To be normally distributed, Y must

σ 2 = Σ(Y i – μ)2.P(Y i) = 5.8333

σ = √ σ 2 = 2.4152. σ is a measure of the dispersion or variation of Y. In the

An example of a discrete distribution that is approxi-mately normally distributed

We use standardised Z-values to look up probabilities of the normal distribution

Sum 1.000 10.5000

Assume that variable X is the sample estimate of a population parameter μ. Also

In the case of a random continuous variable Z:

E(Z) = ∫Z.P(Z).dZ where ∫P(Z).dZ = 1.

100 0.00 0.5000 5 000

110 0.67 0.2514 2 514

120 1.33 0.0918 918

130 2.00 0.0228 228

140 2.67 0.0038 38

145 3.00 0.0013 13

Studenmund provides a good overview of some sample selection methods and of

• What is sampling in general?

In econometrics, we use sample data for estimation. An example is the house

We expect you to understand

The importance of the sampling distribution

To determine the best estimator, we ask four questions:

• Is the estimator unbiased?

Explain the meaning of

• the sampling distribution of X

(3) To further judge the accuracy of a sample value X we need information

(4) The distribution of deviations X – µ conforms to the normal distribution.

σ is the SE(X) and N is the sample size.

The advantage of this form is that Z is normally distributed with an average

The distribution of random sampling estimates of X is highly predictable. We

Because the sample consists of N values of X, σ may in fact be estimated. An

You might also like