
Debre Markos University
College of Business and Economics
Econometrics II
Abebe Mucheye Kassie (M.Sc.)
Aug 2022

Chapter One
Regression on Dummy Variables
The nature of dummy variables
• In regression analysis the dependent variable is frequently influenced not only by variables that can be readily quantified on some well-defined scale (e.g., income, output, prices, costs, height, and temperature), but also by variables that are essentially qualitative in nature (e.g., sex, race, color, religion, nationality, wars, earthquakes, strikes, political upheavals, and changes in government economic policy).

• One way of "quantifying" such qualitative attributes is to construct artificial variables that take on values of 1 or 0, 0 indicating the absence of an attribute and 1 indicating its presence.

Cont…
• Variables that assume such 0 and 1 values are
called dummy variables.
• Alternative names are indicator variables, binary
variables, categorical variables, and dichotomous
variables.
• Dummy variables can be used in regression models just as
easily as quantitative variables.
• As a matter of fact, a regression model may contain
explanatory variables that are exclusively dummy, or
qualitative, in nature.
Regression on one quantitative variable and one
qualitative variable with two classes, or categories

• Consider the model:

Yi = α1 + α2Di + βXi + ui          (5.03)

where Yi = annual salary of a teacher, Xi = years of teaching experience, and Di = 1 if male, 0 otherwise.

Cont…
• Model (5.03) contains one quantitative variable
(years of teaching experience) and one qualitative
variable (sex) that has two classes (or levels,
classifications, or categories), namely, male and
female.
• What is the meaning of this equation?
Assuming, as usual, that E(ui) = 0, we see that:

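Assuming the specification in (5.03), the implied mean salary functions are:

E(Yi | Xi, Di = 0) = α1 + βXi           (mean salary, female teachers)
E(Yi | Xi, Di = 1) = (α1 + α2) + βXi    (mean salary, male teachers)

• The two regression lines share the same slope β but have different intercepts: α2 tells by how much the mean salary of the category coded 1 differs from that of the base category, for a given number of years of experience.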
Dummy variable trap

• If a qualitative variable has m categories, introduce only m − 1 dummy variables (when the model contains an intercept). Introducing a dummy for every category makes the dummies sum to 1 for each observation, so they are perfectly collinear with the intercept and the model cannot be estimated. This situation is called the dummy variable trap.
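A minimal numerical sketch of the trap in Python (synthetic data; purely illustrative):

import numpy as np

# Sketch of the dummy variable trap with a tiny synthetic design matrix.
male = np.array([1, 0, 1, 1, 0, 0])        # 1 = male, 0 = female
female = 1 - male
ones = np.ones_like(male)

# Intercept + a dummy for EVERY category: perfectly collinear,
# since ones = male + female for every observation.
X_trap = np.column_stack([ones, male, female])
print(np.linalg.matrix_rank(X_trap))       # 2, not 3 -> X'X is singular

# Drop one dummy (the base category) and full rank is restored.
X_ok = np.column_stack([ones, male])
print(np.linalg.matrix_rank(X_ok))         # 2 = number of columns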
The base category
• The group, category, or classification that is assigned the value of 0 is often referred to as:
 The base,
 Benchmark,
 Control,
 Comparison,
 Reference, or
 Omitted category.
It is the base in the sense that comparisons are
made with that category.
Regression on one quantitative variable and
two qualitative variables

• The technique extends directly: each qualitative variable enters the model with its own set of dummies, one fewer than its number of categories, and each qualitative variable has its own base category against which comparisons are made.
Binary Choice Model

• In the models considered so far, dummy variables appeared only among the regressors. In binary choice models the dependent variable itself is qualitative: it takes the value 1 if some event occurs (for example, a household owns a house) and 0 otherwise.

• The object of interest is then the probability that the event occurs given the explanatory variables, P(Yi = 1 | Xi); the linear probability, logit, and probit models that follow are alternative ways of modeling this probability.
Linear probability model (LPM)

• The LPM applies ordinary least squares directly to a binary dependent variable:

Yi = β1 + β2Xi + ui,   where Yi = 1 or 0

• Since E(Yi | Xi) = P(Yi = 1 | Xi), the fitted value is interpreted as the probability that the event occurs given Xi; hence the name linear probability model.

• The LPM is simple to estimate and interpret, but it has well-known drawbacks: the disturbances are non-normal and heteroscedastic, the conventional R² is of limited value, and, most seriously, the fitted probabilities are not constrained to lie between 0 and 1.
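A minimal sketch of an LPM in Python (statsmodels on synthetic data; the variable names and numbers are illustrative, not from these slides):

import numpy as np
import statsmodels.api as sm

# Fit an LPM: OLS with a 0/1 dependent variable, on synthetic data.
rng = np.random.default_rng(0)
x = rng.normal(size=200)
y = (1.5 * x + rng.normal(size=200) > 0).astype(float)   # binary outcome

X = sm.add_constant(x)
lpm = sm.OLS(y, X).fit()
p_hat = lpm.predict(X)                  # fitted values = estimated P(y=1|x)

print(lpm.params)
# The main drawback in action: fitted "probabilities" are not
# constrained to [0, 1] and typically spill outside it here.
print("min:", p_hat.min(), "max:", p_hat.max())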
Logistic regression
► In Econometrics I we dealt with multiple regression with a continuous dependent variable, extending the methods of simple linear regression.

► In many studies, however, the outcome variable of interest is the presence or absence of some condition: whether or not the subject has a particular characteristic, such as a symptom of a certain disease.

► We cannot use ordinary multiple linear regression for such data; instead we can use a similar approach known as multiple linear logistic regression, or simply logistic regression.
Logistic regression:
♣ In general, there are two main uses of logistic regression.

♣ The first is the prediction (estimation) of the probability that an individual will have (develop) the characteristic. For example, logistic regression is often used in epidemiological studies where the result of the analysis is the probability of developing cancer after controlling for other associated risks.

♣ The second is that logistic regression provides knowledge of the relationships, and the strength of association, between an outcome variable (a dependent variable with only two categories) and explanatory (independent) variables that can be categorical or continuous.
Logistic regression
The Model:
► The basic principle of logistic regression is much the same as for ordinary multiple regression.

► The main difference is that instead of developing a model that uses a combination of the values of a group of explanatory variables to predict the value of the dependent variable directly, we predict a transformation of the dependent variable.

► The dependent variable in logistic regression is usually dichotomous; that is, it can take the value 1 with a probability of success p, or the value 0 with a probability of failure 1 − p. This type of variable is called a binomial (or binary) variable.
Logistic regression
 Although not discussed in this pack, logistic regression has also been extended to cases where the dependent variable has more than two categories, known as multinomial logistic regression.

 When the multiple classes of the dependent variable can be ranked, ordinal logistic regression is preferred to multinomial logistic regression.
♣ The logit transformation is written as logit(p), where p is the proportion of individuals with the characteristic.

♣ For example, if p is the probability of an individual having its own house, then 1 − p is the probability that they do not have one.

♣ The ratio p / (1 − p) is called the odds, and thus

logit(p) = ln( p / (1 − p) )

is the log odds. The logit can take any value from minus infinity to plus infinity.

♣ We can fit regression models to the logit which are very similar to the ordinary multiple regression models found for data from a normal distribution.

♣ We assume that relationships are linear on the logistic scale:

ln( p / (1 − p) ) = a + b1X1 + b2X2 + … + bnXn

where X1, …, Xn are the predictor variables and p is the proportion to be predicted. The calculation is computer intensive.

♣ Solving for p gives

p = e^(a + b1X1 + b2X2 + … + bnXn) / (1 + e^(a + b1X1 + b2X2 + … + bnXn))

If Z = a + b1X1 + b2X2 + … + bnXn, the above equation becomes:

p = e^Z / (1 + e^Z)
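To make the transformation concrete (illustrative numbers): if p = 0.8, the odds are 0.8 / 0.2 = 4 and logit(p) = ln 4 ≈ 1.39; if p = 0.2, the odds are 0.25 and logit(p) = ln 0.25 ≈ −1.39. Probabilities above 0.5 map to positive logits, probabilities below 0.5 to negative logits, and p = 0.5 to a logit of 0.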
Significance tests
► Coefficients are tested for significance, for inclusion in or elimination from the model, using several different techniques.

I) Z-test
The significance of each variable can be assessed by treating

Z = b / se(b)

as a standard normal variable, where b is the estimated coefficient and se(b) is its standard error.

► This Z value is then squared, yielding a Wald statistic with a chi-square distribution. However, there are problems with the use of the Wald statistic.

 The likelihood-ratio test is more reliable for small sample sizes than the Wald test.
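For instance (illustrative numbers): if b = 0.62 and se(b) = 0.21, then Z = 0.62 / 0.21 ≈ 2.95 and the Wald statistic is Z² ≈ 8.7, which exceeds the 5% critical value of 3.84 for a chi-square with 1 degree of freedom, so the coefficient would be judged significant at the 5% level.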
Significance tests
II) Likelihood-Ratio Test:

► Logistic regression uses maximum-likelihood estimation to compute the coefficients of the logistic regression equation.

N.B. Multiple regression uses the least-squares method to find the coefficients of the independent variables in the regression equation (it computes the coefficients that minimize the sum of squared residuals over all cases).

► Before proceeding to the likelihood-ratio test, we need to know about the deviance, which is analogous to the residual sum of squares from a linear model.
Deviance
 The deviance of a model is −2 times the log likelihood (−2LL) associated with that model.

 As a model's ability to predict outcomes improves, the deviance falls; poorly fitting models have higher deviance.

 If a model predicts the outcomes perfectly, the deviance is zero.

 This is analogous to the situation in linear regression, where the residual sum of squares falls to 0 if the model predicts the values of the dependent variable perfectly.
 Based on the deviance, it is possible to construct an analogue of r² for logistic regression, commonly referred to as the pseudo r².

 If G1² is the deviance of the model with the variables and G0² is the deviance of the null model (intercept only), the pseudo r² of the model is:

pseudo r² = 1 − G1² / G0²
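Illustrative numbers: if the null model has deviance G0² = 100 and the fitted model has deviance G1² = 80, then pseudo r² = 1 − 80/100 = 0.20.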
 The likelihood ratio test (LRT), which makes use of the deviance, is analogous to the F-test from linear regression.

 In its most basic form, it tests the hypothesis that all the coefficients in a model are equal to 0:

H0: β1 = β2 = … = βk = 0

 The test statistic, the difference in deviance G0² − G1², has a chi-square distribution with k degrees of freedom.
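A minimal sketch of the LRT for a logit model in Python (statsmodels on synthetic data; names and numbers are illustrative):

import numpy as np
import statsmodels.api as sm
from scipy import stats

# Synthetic binary data generated from a known logit model.
rng = np.random.default_rng(1)
n = 500
x1, x2 = rng.normal(size=n), rng.normal(size=n)
z = -0.5 + 1.0 * x1 + 0.5 * x2
y = rng.binomial(1, 1 / (1 + np.exp(-z)))            # P(y=1) = e^z / (1 + e^z)

null = sm.Logit(y, np.ones(n)).fit(disp=0)           # intercept-only model
full = sm.Logit(y, sm.add_constant(np.column_stack([x1, x2]))).fit(disp=0)

G0_sq = -2 * null.llf                                # deviance of null model
G1_sq = -2 * full.llf                                # deviance of fitted model
lrt = G0_sq - G1_sq                                  # tests H0: beta1 = beta2 = 0
p_value = stats.chi2.sf(lrt, df=2)                   # k = 2 restrictions
print(f"LRT = {lrt:.2f}, p = {p_value:.4g}, pseudo r2 = {1 - G1_sq/G0_sq:.3f}")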
Assumptions
► Logistic regression is popular in part because it enables the researcher to overcome many of the restrictive assumptions of OLS regression:

1. Logistic regression does not assume a linear relationship between the dependent and the independent variables.

2. The dependent variable need not be normally distributed.

3. The dependent variable need not be homoscedastic for each level of the independents; that is, there is no homogeneity-of-variance assumption.
However, other assumptions still apply:

1. Meaningful coding. Logistic coefficients will be difficult to interpret if the variables are not coded meaningfully. The convention for binomial logistic regression is to code the class of the dependent variable that is of greatest interest as 1 and the other class as 0.

2. Inclusion of all relevant variables in the regression model.

3. Exclusion of all irrelevant variables.

4. Error terms are assumed to be independent (independent sampling).
5. No multicollinearity:

 To the extent that one independent variable is a linear function of another, the problem of multicollinearity will occur in logistic regression, as it does in OLS regression.

 As the independent variables become more highly correlated with one another, the standard errors of the logit (effect) coefficients become inflated.
8. Large samples: Unlike OLS regression, logistic regression uses maximum-likelihood estimation (MLE) rather than ordinary least squares to derive the parameter estimates.

 MLE relies on large-sample (asymptotic) properties.

♦ In small samples one may get high standard errors.
Hosmer and Lemeshow Test
♣ The Hosmer-Lemeshow goodness-of-fit statistic is used to assess whether the necessary assumptions for the application of multiple logistic regression are fulfilled.

♣ The Hosmer and Lemeshow goodness-of-fit statistic is computed as the Pearson chi-square from the contingency table of observed frequencies and expected frequencies.

♣ A good fit as measured by the Hosmer-Lemeshow test yields a large p-value (much larger than 0.05).

♣ In SPSS, the Hosmer-Lemeshow goodness-of-fit test is obtained from the logistic regression menus:

Analyze → Regression → Binary Logistic → Options → Hosmer-Lemeshow goodness-of-fit
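As background (the standard construction of the test, summarized here for reference): observations are sorted by predicted probability and divided into about ten groups ("deciles of risk"); within each group the observed and expected numbers of successes and failures are compared via a Pearson chi-square, which is referred to a chi-square distribution with g − 2 degrees of freedom, where g is the number of groups.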
Probit model

• Like the logit model, the probit model is designed for a binary dependent variable; the basic difference between the logit and probit models lies in the distribution function used for the probability.

• The logit model uses the logistic CDF, P(Yi = 1) = e^Z / (1 + e^Z), whereas the probit model uses the standard normal CDF, P(Yi = 1) = Φ(Z), where Z = a + b1X1 + … + bnXn.

• The two distributions are very similar in shape, so the models usually lead to similar conclusions in practice; the logistic distribution has slightly fatter tails.
