Cross Sectional
Types of data
A cross-sectional data set consists of a sample of
individuals, households, firms, cities, states,
countries, or a variety of other units, taken at a given
point in time.
• Cross-sectional data are widely used in economics
and other social sciences.
• In economics, the analysis of cross-sectional data is
closely aligned with the applied microeconomics
fields, such as labor economics, state and local
public finance, industrial organization, urban
economics, demography, and health economics.
Types of data…
• A time series data set consists of observations
on a variable or several variables over time.
• Examples of time series data include stock prices,
money supply, the consumer price index, gross
domestic product, annual homicide rates, and
automobile sales figures.
• A pooled cross section has both cross-sectional
and time series features.
• For example, suppose that two cross-sectional
household surveys are taken in the United States,
one in 1985 and one in 1990.
Types of data…
• A panel data (or longitudinal data) set consists
of a time series for each cross-sectional
member in the data set.
• The key feature of panel data that distinguishes
them from a pooled cross section is that the
same cross-sectional units (individuals, firms, or
counties in the preceding examples) are followed
over a given time period.
Linear regression model
Linear regression estimates how much Y changes when
X changes by one unit.
1. The simple linear regression model
• It is also called the two-variable linear regression
model or bivariate linear regression model because it
relates the two variables x and y.
• Simple regression = regression with 2 variables.
• The variables y and x have several different names, used
interchangeably: y is called the dependent
variable, the explained variable, the response variable,
the predicted variable, or the regressand.
Linear regression model…
• x is called the independent variable, the
explanatory variable, the control variable, the
predictor variable, or the regressor.
[Figure: scatter of beo against bpop with fitted values]
Linear regression model…
• The relationship between variables Y and X is
described using the equation of the line of best
fit:
• 𝑦 = 𝛼 + 𝛽𝑥
• α is the value of Y when X is equal to
zero (also known as the intercept), and
• β is the slope of the line (also known as
the regression coefficient).
• The regression coefficient β describes the change
in Y that is associated with a unit change in X.
[Figure: scatter with fitted values; income (Y) plotted against education (X)]
Yᵢ = β₀ + β₁Xᵢ + εᵢ
How did we get that line?
• The residual εᵢ = Yᵢ − Ŷᵢ is the vertical distance between an
observed value Yᵢ and the corresponding fitted value Ŷᵢ.
[Figure: fitted line with a residual marked; MWTP of the respondent (Y) against education (X)]
[Figure: Black % in state legislatures (Y) against Black % in state population (X), with fitted values; β̂₀ = 1.31, β̂₁ = 0.359]
Yᵢ = β₀ + β₁Xᵢ + εᵢ
OLS in Stata
• In Stata, use the command regress:
• regress [dependent variable] [independent
variable(s)]
regress y x for simple regression
• In a multivariate setting we type:
regress y x1 x2 x3 …
For example, using the data rayu.dta, suppose we
need to assess the determinants of income.
OLS in Stata…
• Outcome (Y) variable – income of the respondents
• Predictor (X) variables – the gender, age, education,
family size and marital status of the respondents.
• If Income = f(edu):
Income = a + b1 edu
reg income edu
• If income = f(gender, age, edu, familysize, maritalstatus):
Income = a + b1 gender + b2 age + b3 education + b4
familysize + b5 maritalstatus
reg income gender age education familysize
maritalstatus
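Under the hood, regress solves the normal equations (X′X)b = X′y. A minimal pure-Python sketch of that computation (the `ols` helper and the data are illustrative, and standard errors are omitted):

```python
# Multiple regression via the normal equations (X'X) b = X'y.
def ols(X, y):
    # prepend an intercept column of ones
    X = [[1.0] + list(row) for row in X]
    n, k = len(X), len(X[0])
    # build X'X and X'y
    XtX = [[sum(X[i][a] * X[i][b] for i in range(n)) for b in range(k)]
           for a in range(k)]
    Xty = [sum(X[i][a] * y[i] for i in range(n)) for a in range(k)]
    # solve by Gaussian elimination with partial pivoting
    for col in range(k):
        piv = max(range(col, k), key=lambda r: abs(XtX[r][col]))
        XtX[col], XtX[piv] = XtX[piv], XtX[col]
        Xty[col], Xty[piv] = Xty[piv], Xty[col]
        for r in range(col + 1, k):
            f = XtX[r][col] / XtX[col][col]
            for c in range(col, k):
                XtX[r][c] -= f * XtX[col][c]
            Xty[r] -= f * Xty[col]
    # back substitution
    b = [0.0] * k
    for r in range(k - 1, -1, -1):
        b[r] = (Xty[r] - sum(XtX[r][c] * b[c] for c in range(r + 1, k))) / XtX[r][r]
    return b  # [intercept, b1, b2, ...]

# illustrative data generated from y = 1 + 2*x1 + 3*x2 exactly
X = [[0, 0], [1, 0], [0, 1], [1, 1], [2, 3]]
y = [1, 3, 4, 6, 14]
b = ols(X, y)  # recovers approximately [1.0, 2.0, 3.0]
```

Because the data here fit the linear model exactly, the solver recovers the true coefficients; with real survey data like rayu.dta the fit is approximate and the residuals are nonzero.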
Simple linear regression using Stata
• Simple linear regression:
reg INC EDU
Multiple linear regression using Stata
• Multiple linear regression:
regress INC GEN age MRS FS EDU EMS
General interpretation
Basic assumptions in OLS
1. Linearity in parameters
2. Random sampling
3. No perfect collinearity among independent
variables
4. Zero conditional mean:
E(u | x₁, x₂, …, xₙ) = 0
If the above conditions are fulfilled, the OLS estimator
is unbiased, i.e., E(β̂) = β.
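Unbiasedness means the OLS estimate equals the true parameter on average across repeated samples. A simulation sketch in Python (the true values β₀ = 1 and β₁ = 2 are made up for illustration):

```python
import random

# Draw many samples from y = 1 + 2*x + u with E(u | x) = 0,
# estimate the slope by OLS each time, and average the estimates.
random.seed(0)
true_alpha, true_beta = 1.0, 2.0
slopes = []
for _ in range(2000):
    x = [random.uniform(0, 10) for _ in range(50)]
    # error term u is drawn independently of x, so E(u | x) = 0 holds
    y = [true_alpha + true_beta * xi + random.gauss(0, 1) for xi in x]
    mx, my = sum(x) / 50, sum(y) / 50
    b = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y)) \
        / sum((xi - mx) ** 2 for xi in x)
    slopes.append(b)

avg = sum(slopes) / len(slopes)  # close to the true beta of 2.0
```

Each individual estimate misses β, but the average over the 2000 samples is very close to 2, illustrating E(β̂) = β under assumptions 1–4.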
Basic assumptions in OLS…
5. Homoskedasticity
• The error u has the same variance given any values of
the explanatory variables. In other words,
Var(u | x₁, …, xₙ) = σ².
6. Normality
• The population error u is independent of the
explanatory variables and is normally distributed with
zero mean and variance σ²: u ~ Normal(0, σ²)
Linearity
• Linear in the variables vs. linear in the
parameters:
– Y = a + bX + e (linear in both)
– Y = a + bX + cX² + e (linear in parameters only)
– Y = a + b²X + e (linear in variables, not in parameters)
• Regression must be linear in the parameters.
Multicollinearity
• An important assumption of the multiple regression
model is that the independent variables are not perfectly
multicollinear.
• One regressor should not be a linear function of
another.
• The presence of multicollinearity will not lead to biased
coefficients.
• If a variable that you think should be statistically
significant is not, consult the correlation coefficients.
• The Stata command to check for multicollinearity is vif
(variance inflation factor).
• A VIF > 10 or a 1/VIF < 0.10 indicates trouble.
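With only two regressors, the VIF has a closed form: regressing x₁ on x₂ gives R² = r², so VIF = 1 / (1 − r²), where r is their correlation. A pure-Python sketch (the `vif_two` helper and the data are illustrative):

```python
# VIF for a pair of regressors: VIF = 1 / (1 - r^2),
# where r is the sample correlation between x1 and x2.
def vif_two(x1, x2):
    n = len(x1)
    m1, m2 = sum(x1) / n, sum(x2) / n
    cov = sum((a - m1) * (b - m2) for a, b in zip(x1, x2))
    var1 = sum((a - m1) ** 2 for a in x1)
    var2 = sum((b - m2) ** 2 for b in x2)
    r2 = cov * cov / (var1 * var2)
    return 1.0 / (1.0 - r2)

x1 = [1, 2, 3, 4, 5]
x2 = [2.1, 3.9, 6.2, 7.8, 10.1]  # nearly 2 * x1, so highly collinear
v = vif_two(x1, x2)              # well above the rule-of-thumb cutoff of 10
```

Because x₂ is almost an exact linear function of x₁, the VIF here is in the hundreds; Stata's vif command would flag the same pair as troublesome.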
Multicollinearity…