AF ECO 4000 cheat sheet

The document provides an overview of linear regression models, including the distinction between experimental and observational data, and the formulation of simple linear regression. It details the components of the regression equation, the criteria for selecting the best fit line, and the properties of estimators, including unbiasedness and efficiency. Additionally, it covers hypothesis testing, the calculation of p-values, and the significance of test statistics in evaluating the null hypothesis.

The Linear Regression Model

(Ideal) Randomized controlled experiment: control all other variables that impact Y except X
Data Types:
Experimental data come from experiments designed to evaluate a treatment or policy or to investigate a causal effect.
Observational Data: data obtained by observing actual behavior outside an experimental setting
- Cross-sectional data are data collected for multiple entities (person, firm, country) at a single time period.
- Time series data are data for a single entity collected at multiple time periods.
- Panel data (also called longitudinal data) are data for multiple entities in which each entity is observed at two or more time periods.

Simple Linear Regression: the functional form for the line is y = a*x + b, or, in regression notation, Yi = β0 + β1*Xi + ui.
Slope and intercept in the simple linear regression model:
- "a" or "β1" is called the slope.
- The slope shows how much Y (the dependent variable) would increase, on average, if X (the independent variable) increased by one unit.
- "b" or "β0" is called the intercept.
- The intercept shows the value of Y (the dependent variable) when X (the independent variable) is set to zero.
- Y = dependent variable; X = independent variable
- β0 + β1*Xi represents the population regression function.
- ui is the regression error (disturbance term).
Criterion to choose the best line: minimize SSR = Σ_{i=1}^{n} (Yi − b0 − b1*Xi)^2

- Total errors are smallest: the fitted line minimizes the total distance of the observations from the line.
- Unique solution: square each error (like y = x^2) and pick the line whose sum of squared errors is smallest. This method is Ordinary Least Squares (OLS).
In Ordinary Least Squares (OLS) regression, the sample regression line always passes through the point (X̄, Ȳ), where X̄ is the mean of the independent variable and Ȳ is the mean of the dependent variable.
Steps
1. For all observations, sum the X values and divide by n: the sample mean (X̄).
2. For each i, take the observed X value, subtract X̄, square it, then sum = Denominator.
3. Sum the Y values and divide by n: Ȳ.
4. For each i, take the observed Y value, subtract Ȳ, square it, then sum (used for the total variation of Y).
5. Compute (Xi − X̄)*(Yi − Ȳ) for each i, then SUM = Numerator.

(BEST β1) β̂1 = Numerator / Denominator
(BEST β0) β̂0 = Ȳ − β̂1 * X̄
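The steps above can be sketched in a few lines of Python; the data values here are made up purely for illustration:

```python
# Minimal sketch of the OLS steps above, computed by hand.
# The sample data are hypothetical.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 4.0, 5.0, 4.0, 6.0]
n = len(x)

x_bar = sum(x) / n                                   # step 1: sample mean of X
y_bar = sum(y) / n                                   # step 3: sample mean of Y

denominator = sum((xi - x_bar) ** 2 for xi in x)     # step 2
numerator = sum((xi - x_bar) * (yi - y_bar)
                for xi, yi in zip(x, y))             # step 5

beta1_hat = numerator / denominator                  # best slope
beta0_hat = y_bar - beta1_hat * x_bar                # best intercept

print(beta1_hat, beta0_hat)
```

Note that the fitted line passes through (X̄, Ȳ), as stated above: β̂0 + β̂1*X̄ = Ȳ by construction.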
Plug-Ins: Definitions
- β1 (slope) = a true population parameter, the slope of the population regression line
- β̂1 = the OLS estimator of β1
- u (error) = the deviation of an observation from the population regression line
- û (residual) = the difference between Y and its predicted value Ŷ (actual − predicted)
- E(Y|X) = the expected value of Y given values of X
- Ŷ = the OLS predicted value of Y for given values of X
The objective function that OLS minimizes: Σ_{i=1}^{n} (Yi − b0 − b1*Xi)^2
Does the OLS sample regression line pass through the point (X̄, Ȳ)? Yes.
Measures of Fit and Prediction Accuracy
- Y = β0 + β1*X
- The total variation of Y splits into the variation that comes from X and the residual variation: TSS = ESS + SSR
- X̄ (mean) = sum of all observations / N; sample variance = Σ(Xi − X̄)^2 / N
- R^2 is unitless; the SER has the same units as the dependent variable.
Regression R^2 (goodness of fit): the fraction of the sample variance of Y explained by X (unit-free). R^2 is always between 0 and 1.
- Yi = Ŷi + ûi. R^2 is defined as the ratio of the sample variance of Ŷ (the predicted values) to the sample variance of Y.
- If the independent variable explains none of the variation in the dependent variable, then R^2 = 0.
- The explained sum of squares (ESS) is the sum of squared deviations of the predicted values, Ŷi, from their average, and the total sum of squares (TSS) is the sum of squared deviations of Yi from its average.

▶ The Standard Error of the Regression (SER) is an estimator of the standard deviation of the regression error ui. The higher the R^2, the better the fit.
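The decomposition TSS = ESS + SSR and the R^2 formula can be checked numerically; the data below are hypothetical:

```python
# Sketch: verify TSS = ESS + SSR and compute R^2 for a hand-fit OLS line.
# Sample data are made up for illustration.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 4.0, 5.0, 4.0, 6.0]
n = len(x)
x_bar, y_bar = sum(x) / n, sum(y) / n

b1 = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y)) \
     / sum((xi - x_bar) ** 2 for xi in x)
b0 = y_bar - b1 * x_bar

y_hat = [b0 + b1 * xi for xi in x]                         # predicted values
tss = sum((yi - y_bar) ** 2 for yi in y)                   # total sum of squares
ess = sum((yh - y_bar) ** 2 for yh in y_hat)               # explained sum of squares
ssr = sum((yi - yh) ** 2 for yi, yh in zip(y, y_hat))      # residual sum of squares

r_squared = ess / tss            # fraction of the variance of Y explained by X
print(tss, ess + ssr, r_squared)
```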

Probability
Random variable: a numerical summary of a random outcome.
- A discrete random variable takes on only a discrete set of values.
- A continuous random variable takes on a continuum of possible values (any number).
Linear transformation: if X = a*Y + b, where a is a scaling factor, then Var(X) = a^2 * Var(Y).
The probability distribution of a discrete random variable is the list of all possible values of the variable and the probability that each value will occur.
The cumulative probability is the probability that the random variable is less than or equal to a particular value.
The cumulative probability distribution is the list of all possible values and the corresponding cumulative probabilities.
The cumulative probability distribution of a continuous random variable is the probability that the random variable is less than or equal to a particular value (similar to the discrete case).

Expected Value (sometimes called the mean or average) is one of the central tendency measures (the others being the mode and the median).
- The expected value of a discrete random variable is computed as a weighted average of the possible outcomes of that random variable, where the weights are the probabilities of those outcomes.
Notation:
- μX (mu): the mean of X
- σX (sigma): the standard deviation of X
- variance = SD^2; SD = square root of the variance
Linear transformation of a random variable (standardization): Y = (X − μX)/σX has expected value 0 and standard deviation 1.
The Variance
- The variance and standard deviation measure the dispersion or the "spread" of a probability distribution.
- The variance of a RV Y, denoted var(Y), is the expected value of the square of the deviation of Y from its mean: var(Y) = E[(Y − μY)^2].
- Large variance = wider distribution.
Example, expected value or mean: μX = E[X] = ΣXi*P(Xi) = 0*0.80 + 1*0.10 + 2*0.06 + 3*0.03 + 4*0.01 = 0.35
Steps for the variance:
1. Find the difference between each value and the mean.
2. Square the difference.
3. Multiply by the respective probability, then sum.
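The expected-value example and the variance steps above, sketched in Python using the same distribution:

```python
import math

# Sketch: expected value and variance of a discrete random variable,
# using the distribution from the example above.
values = [0, 1, 2, 3, 4]
probs  = [0.80, 0.10, 0.06, 0.03, 0.01]   # probabilities must sum to 1

# Expected value: weighted average of outcomes, weights = probabilities
mean = sum(x * p for x, p in zip(values, probs))

# Variance steps: (1) deviation from the mean, (2) square it,
# (3) weight by the probability, then sum
variance = sum((x - mean) ** 2 * p for x, p in zip(values, probs))
sd = math.sqrt(variance)                  # SD = square root of the variance

print(mean, variance, sd)
```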
The Normal Distribution
Denoted N(μ, σ2)
- μ: mean
- σ^2: variance
The area under the curve in any interval corresponds to the probability that the RV takes values in that interval. The total area under the curve is 1.
- The interval (μ − 1.96σ, μ + 1.96σ) contains 95% of the area under the distribution graph.
The Standard Normal Distribution has mean 0 and variance 1, and is usually denoted by Z: Z ~ N(0, 1).
- Its cumulative distribution is usually denoted by Φ. Thus, Pr(Z ≤ c) = Φ(c).
Reading the z table: (1) pick the integer and the first decimal from the first column, and the second decimal from the first row; (2) the number at the intersection of the corresponding row and column is the cumulative probability for the z score of interest.
- "Less than (or equal to)": take the table value Φ(z) directly.
- "Greater than": subtract the table value from 1.
Standardizing: Z = (X − μX)/σX, where X is the random variable, μX its mean, and σX its standard deviation. (For a slope estimator: Z = (β̂1 − β1)/σβ̂1.)

Example: X ~ N(3, 4), so the mean is 3 and the variance is 4. Find P(X < 5):
Z = (5 − 3)/√4 = 2/2 = 1, so P(X < 5) = P(Z < 1) = 0.8413.

Examples with Z ~ N(0, 1):
- P(Z < 1.96) (less than: take the table value) = 0.975
- P(Z > 1.52) (greater than: subtract from 1) = 1 − 0.9357 = 0.0643
- P(0.81 < Z < 1.96) = 0.975 − 0.7910 = 0.184
- P(−1 < Z < 1.96) = 0.975 − (1 − 0.8413) = 0.8163
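The z-table lookups above can be reproduced with the standard normal CDF, Φ(z) = 0.5*(1 + erf(z/√2)); the numbers match the table values:

```python
import math

# Sketch: standard normal CDF, reproducing the z-table examples above.
def phi(z):
    """Standard normal cumulative distribution, Pr(Z <= z)."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

# X ~ N(3, 4): standardize with Z = (X - mean) / SD, SD = sqrt(4) = 2
z = (5.0 - 3.0) / math.sqrt(4.0)          # = 1.0
print(round(phi(z), 4))                   # P(X < 5) = P(Z < 1) ≈ 0.8413

print(round(1.0 - phi(1.52), 4))          # P(Z > 1.52) ≈ 0.0643
print(round(phi(1.96) - phi(0.81), 4))    # P(0.81 < Z < 1.96) ≈ 0.1840
```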

Random Sampling
- When Y1, Y2 . . . Yn are drawn from the same distribution and are independently distributed, they are said to be independently and identically distributed (i.i.d.).
The Sampling Distribution of the OLS Estimators
Because the OLS estimators β̂0 and β̂1 are computed from a randomly drawn sample, the estimators themselves are random variables with a probability distribution (the sampling distribution) that describes the values they could take over different possible random samples.
- β̂0 and β̂1 are unbiased estimators of β0 and β1. Specifically: E(β̂0) = β0 and E(β̂1) = β1.
- The sampling distribution of β̂0 and β̂1 is well approximated by the bivariate normal distribution if the sample is sufficiently large (Central Limit Theorem).
- The law of large numbers states that, under general conditions, β̂1 will be "close" to β1 with very high probability when n is large.
- The larger n is, and/or the larger the variance of Xi, the more precise the OLS estimators will be.

Estimators and Their Properties


β̂1 is an example of an estimator. It is an estimator of the population slope.
X̄ = (Σ Xi) / n
β̂1 = Σ(Xi − X̄)(Yi − Ȳ) / Σ(Xi − X̄)^2
- β̂1 as an estimator of β1 possesses all three characteristics of a "good" estimator.
- β̂1 is unbiased, consistent, and has the lowest variance among all the linear estimators of β1.
- β̂1 is BLUE: the best linear unbiased estimator.
An estimator is a function of a sample of data to be drawn randomly from a population.
An estimate is the numerical value of the estimator when it is actually computed using data from a specific sample.
- An estimator is a random variable because of randomness in selecting the sample, while an estimate is a nonrandom number.
- There may be more than one estimator for the same population parameter.
- A "good" estimator must possess the following characteristics: unbiasedness, consistency, and efficiency.
- An estimator is unbiased if E(β̂1) = β1, where E(β̂1) is the mean of the sampling distribution of β̂1; otherwise β̂1 is biased.
- UNBIASED IF: its average value, over repeated sampling with the same sample size, is equal to the population value.
- Consistent: if the probability that β̂1 is within a small interval of the true value β1 approaches 1 as the sample size increases, then β̂1 is consistent (increasing the sample size increases accuracy).
- Efficient: among unbiased estimators, choose the one with the SMALLER variance.
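Unbiasedness can be illustrated with a small Monte Carlo: over many repeated random samples, the average of the β̂1 estimates should be close to the true slope. The true coefficients and sample sizes below are made up for the sketch:

```python
import random

# Sketch: Monte Carlo illustration that the OLS slope estimator is unbiased.
# True coefficients are hypothetical: beta0 = 1.0, beta1 = 2.0.
random.seed(0)
beta0_true, beta1_true = 1.0, 2.0

def ols_slope(x, y):
    """OLS slope: sum((x - xbar)(y - ybar)) / sum((x - xbar)^2)."""
    n = len(x)
    xb, yb = sum(x) / n, sum(y) / n
    num = sum((xi - xb) * (yi - yb) for xi, yi in zip(x, y))
    den = sum((xi - xb) ** 2 for xi in x)
    return num / den

estimates = []
for _ in range(2000):                              # repeated random samples
    x = [random.uniform(0, 10) for _ in range(50)]
    u = [random.gauss(0, 1) for _ in range(50)]    # regression error ui
    y = [beta0_true + beta1_true * xi + ui for xi, ui in zip(x, u)]
    estimates.append(ols_slope(x, y))

avg = sum(estimates) / len(estimates)
print(avg)   # close to the true slope, 2.0
```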
Hypothesis Testing (about population parameters)
- The hypothesis that we want to test is called the null hypothesis and is denoted H0. For example, suppose we want to test whether β1 (NOT β̂1) = 1.5. Then H0: β1 = 1.5. In general, H0: β1 = β1,h, where β1,h is the hypothesized value.
- The hypothesis that we compare the null hypothesis to is called the alternative hypothesis and is denoted H1 (e.g., H1: β1 ≠ 1.5). The alternative hypothesis holds if the null hypothesis is not true.
- If we have enough evidence against the null hypothesis, then we reject it. If we do not have enough evidence, then we fail to reject it. We NEVER "accept" the null hypothesis.
- The p-value, also called the significance probability, is the probability of drawing a statistic at least as adverse to the null hypothesis as the one actually computed from your sample, assuming the null hypothesis is correct.
How to calculate the p-value:
1. State the null hypothesis.
2. Calculate the test statistic based on the sample data.
3. Standardize the test statistic.
4. Determine the distribution of the standardized test statistic.
5. Calculate the p-value.
6. Make the decision.

t statistic = z statistic (in large samples)
t statistic = (estimator − hypothesized value) / standard error of the estimator
- Small p-value (less than 0.05) -> reject the null.
- Large p-value (greater than 0.05) -> fail to reject the null.
- A larger test statistic (in absolute value) gives a smaller p-value; a smaller test statistic gives a larger p-value.
- A negative test statistic gives the same p-value as its positive counterpart, because the absolute value covers both the negative and positive tails (multiply by 2).

The p-value includes the AREA IN BOTH TAILS (use the absolute value); for a two-tailed test, multiply by 2.
- |t| > c means t < −c OR t > c: multiply the one-tail probability by 2 for the answer.

EXAMPLE: Y = β0 + β1*X, with β̂1 = 5 and SE(β̂1) = 2. H0: β1 = 2, H1: β1 ≠ 2.
t = (β̂1 − 2)/SE(β̂1) = (5 − 2)/2 = 1.5, so P(Z > 1.5) = 1 − 0.9332 = 0.0668. Two-tailed p-value = 0.0668 * 2 = 0.1336.

Standardizing a random variable: Z = (X − μX)/σX, where X is the random variable, μX its mean, and σX its standard deviation.
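The worked example above, as a short Python sketch using the erf-based normal CDF:

```python
import math

# Sketch of the example above: beta1_hat = 5, SE = 2, H0: beta1 = 2.
def phi(z):
    """Standard normal CDF, Pr(Z <= z)."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

beta1_hat, beta1_h0, se = 5.0, 2.0, 2.0
t = (beta1_hat - beta1_h0) / se             # = (5 - 2)/2 = 1.5
p_two_sided = 2.0 * (1.0 - phi(abs(t)))     # area in BOTH tails

print(t, round(p_two_sided, 4))             # ≈ 1.5 and 0.1336
```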

Hypothesis test setup for an example:
- H0: β1 = 1.5
- H1: β1 ≠ 1.5
- SE(β̂1) = 0.008
- β̂1 (the actual estimate) = 1.52

β (beta) is the actual population coefficient; β̂ (beta hat) is the estimator of β. The p-value represents the probability of observing data as extreme as, or more extreme than, the data actually observed, assuming the null hypothesis is true.
