Wollo University, College of Business and Economics, Department of Economics

Chapter Four
Discrete Choice and Limited Dependent Variable Model
4.1 Introduction
A limited dependent variable (LDV) is broadly defined as a dependent variable whose range
of values is substantively restricted. A binary dependent variable is an example of an LDV:
a binary response (choice) variable takes on only two values, zero and one. For example,
regression models with a yes/no or present/absent type of response are known as
dichotomous or dummy dependent variable regression models, in which the determinants of
an event happening or not happening are identified. They are applicable in a wide variety of
fields and are commonly used with survey or census-type data. Among the methods used to
estimate such models are the linear probability model (LPM), the logit model, and the probit
model. These methods approximate the mathematical relationship between the explanatory
variables and the dummy dependent variable, which records a qualitative outcome. This
chapter therefore discusses binary choice models, multivariate choice models, and censored
and truncated models.

4.2 The Concept of Dummy Variables

Dummy variables are qualitative in nature and are mainly used as proxies for variables that
either cannot be measured quantitatively or that represent values over some continuous
range. Sex, profession and religion, for example, are represented by dummy variables.
Dummy variables therefore indicate the presence or absence of an attribute, which can be
quantified by constructing an artificial variable that takes two values: 1 (for the presence of
the attribute) and 0 (for the absence of the attribute).

Example
Suppose we want to test the relationship between household consumption (C) and income (Y)
over the period 1960-1990. Assume that the relationship between consumption and income
is also affected by dummy variables such as whether the household has children or not,
whether the household head is over 70 years of age or not, and whether there was a war
(as in the 1977-1980 period). The regression model would then be specified as
$$C_t = \alpha + \beta_1 Y_t + \beta_2 D_{1t} + \beta_3 D_{2t} + \beta_4 D_{3t} + u_t$$

Econometrics Lecture Notes; 2016 By Addisu Molla (PhD) 1

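where

$$D_{1t} = \begin{cases} 1 & \text{if the household has children} \\ 0 & \text{otherwise (no children)} \end{cases}$$

$$D_{2t} = \begin{cases} 1 & \text{if the household head is over 70 years of age} \\ 0 & \text{otherwise} \end{cases}$$

$$D_{3t} = \begin{cases} 1 & \text{if war is present } (t = 1977, \ldots, 1980) \\ 0 & \text{if war is not present } (t = 1960, \ldots, 1976 \text{ and } 1981, \ldots, 1990) \end{cases}$$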
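
The construction of the dummies and the regression can be sketched in a few lines of Python. This is a minimal illustration only: the data are simulated, the coefficient values are assumptions made for the sketch, and the variable names (`has_children`, `head_over_70`, `war`) are hypothetical labels for D1, D2 and D3.

```python
import numpy as np

# Hypothetical annual data for 1960-1990; all numbers are simulated
# for illustration and are not taken from the lecture notes.
rng = np.random.default_rng(0)
years = np.arange(1960, 1991)
income = np.linspace(100.0, 400.0, years.size)          # Y_t

# Dummy variables: 1 if the attribute is present, 0 otherwise.
has_children = rng.integers(0, 2, size=years.size)       # D_1t
head_over_70 = rng.integers(0, 2, size=years.size)       # D_2t
war = ((years >= 1977) & (years <= 1980)).astype(int)    # D_3t

# Simulated consumption using assumed coefficients.
true_coefs = np.array([20.0, 0.8, 5.0, -3.0, -10.0])
X = np.column_stack([np.ones(years.size), income, has_children, head_over_70, war])
consumption = X @ true_coefs + rng.normal(0.0, 2.0, size=years.size)

# OLS estimates of the intercept and the four slope coefficients.
coefs, *_ = np.linalg.lstsq(X, consumption, rcond=None)
names = ["const", "income", "children", "over70", "war"]
print({name: round(float(c), 2) for name, c in zip(names, coefs)})
```

The coefficient on each dummy measures the shift in consumption when the attribute is present, holding income constant.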

4.3 Binary Choice Models

a) Linear Probability Model (LPM): The LPM is the simplest of the limited
dependent variable models to use, but it has several limitations. It assumes that the
conditional probability increases linearly with the values of the explanatory variables. As a
result, the estimated probability can lie outside the 0-1 bounds. The fundamental problem
with the LPM is that it assumes that the marginal or incremental effects of the explanatory
variables remain constant throughout, which seems patently unrealistic. In addition, the
error term is not normally distributed, because for given values of the explanatory variables
it can take only two values.
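
A small numerical sketch makes the out-of-bounds problem concrete. The toy data below are an assumption made for illustration: the outcome switches from 0 to 1 at x = 5, and the LPM is fitted by OLS.

```python
import numpy as np

# Toy data (assumed for illustration): y switches from 0 to 1 at x = 5.
x = np.arange(0, 11, dtype=float)
y = (x >= 5).astype(float)

# Linear probability model: regress the 0/1 outcome on x by OLS.
X = np.column_stack([np.ones_like(x), x])
(intercept, slope), *_ = np.linalg.lstsq(X, y, rcond=None)
fitted = intercept + slope * x

# The marginal effect is the same constant slope at every x, and the
# fitted "probabilities" leave the 0-1 range at both ends of the data.
print(f"slope = {slope:.3f}")
print(f"fitted at x=0: {fitted[0]:.3f}, fitted at x=10: {fitted[-1]:.3f}")
```

In this sketch the fitted value is negative at the smallest x and exceeds one at the largest x, which cannot be interpreted as a probability.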

Thus, given the limitations of the LPM, an appropriate model is needed in which the
relationship between the probability that an event will occur and the explanatory variables is
non-linear. The most common probability models that fill these gaps in the LPM are the
logit and probit models, both of which are built on an S-shaped cumulative distribution
function (CDF). The logit model is based on the logistic CDF, whereas the probit model is
based on the normal CDF. Both models guarantee that the estimated probabilities lie in the
0-1 range and that they are non-linearly related to the explanatory variables. The logistic and
probit formulations are quite comparable; the chief difference is that the logistic distribution
has slightly flatter tails, that is, the normal curve approaches the axes more quickly than the
logistic curve. The choice between the two is therefore one of mathematical convenience, a
matter of choosing between the two cumulative distribution functions.
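
The two CDFs can be compared numerically. The sketch below uses only the Python standard library; rescaling the logistic to unit variance is a choice made here so that the two curves are comparable, not something prescribed by the notes.

```python
import math

def logistic_cdf(z: float) -> float:
    """Logistic CDF, the basis of the logit model."""
    return 1.0 / (1.0 + math.exp(-z))

def normal_cdf(z: float) -> float:
    """Standard normal CDF, the basis of the probit model."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

# Rescale the logistic so that it has unit variance like the standard normal
# (the standard logistic distribution has standard deviation pi / sqrt(3)).
scale = math.pi / math.sqrt(3.0)

for z in (-3.0, -2.0, -1.0, 0.0, 1.0, 2.0, 3.0):
    print(f"z={z:+.1f}  logistic={logistic_cdf(scale * z):.4f}  normal={normal_cdf(z):.4f}")
```

Both functions stay strictly between 0 and 1. In the far tails (for example at z = -3 and z = 3) the logistic CDF is further from 0 and 1 than the normal CDF, which is the flatter-tail property described above.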

b) Binary Logit model

The binary logit model is a non-linear model with a non-metric dependent variable that
forms only two groups (yes/no, for example) and with metric and non-metric independent
variables.
$$Z_i = B_0 + B_1 x_{1i} + B_2 x_{2i} + \ldots + B_n x_{ni} + U_i$$

Example 1: Suppose we want to analyze how business constraints affect MSE operators' perception
of the growth potential of enterprise income, by asking the operators about the income situation
of their enterprises (that is, whether it increased, remained the same or declined). To measure
the respondents' perception of enterprise income, the dummy variable ($Z_i$) is constructed to
equal one if an enterprise experiences growth in income and zero otherwise. The model therefore
gives the probability that an enterprise will experience growth in income given the constraints
and control variables. Thus, the logit model of the growth potential of income (incgrow) given
constraints (const) and control variables (contrv) can be specified as:
$$\text{incgrow}_i = \beta_0 + \beta_1 \text{const}_i + \beta_2 \text{contrv}_i + u_i$$

Note that there are different forms of binary logit and hence the interpretations are also
different: probabilities, odds, and logits. Let’s now assume a continuous X. The logit model
has three equivalent forms:
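
In standard notation, the three forms can be written as follows. This is a sketch assuming a single continuous X, with $P_i$ (a symbol introduced here, not in the notes) denoting the probability that the outcome equals one and $Z_i = B_0 + B_1 X_i$ the systematic part of the index used above.

```latex
\begin{align*}
\text{Probability:}\quad & P_i = \frac{e^{Z_i}}{1 + e^{Z_i}} = \frac{1}{1 + e^{-Z_i}} \\
\text{Odds:}\quad & \frac{P_i}{1 - P_i} = e^{Z_i} \\
\text{Logit (log-odds):}\quad & \ln\!\left(\frac{P_i}{1 - P_i}\right) = Z_i = B_0 + B_1 X_i
\end{align*}
```

In the logit form, $B_1$ is the change in the log-odds per unit change in X. In the odds form, a unit change in X multiplies the odds by $e^{B_1}$. In the probability form, the effect of X on $P_i$ is not constant but depends on the level of X.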

Example 2: To examine the effects of social capital on access to credit, the logit model is
employed. The underlying equation for the binary logit model, which examines the likelihood
of a household having access to credit, is

$$Y_i^* = \beta' X_i + U_i$$

where $Y_i^*$ is an unobservable latent variable for having access to credit, $X_i$ is a vector of
explanatory variables, $\beta'$ is a vector of parameters to be estimated, $U_i$ is the error term, and
the subscript $i$ indexes the households.

To examine whether or not a household has access to credit, the dummy variable for access to
credit is constructed to equal one if a household has access to credit and zero otherwise.
The observed binary outcome is assumed to be determined by the latent variable, as in the
usual logit model:

$$Y_i = \begin{cases} 1 & \text{if the household has access to credit } (Y_i^* > 0) \\ 0 & \text{otherwise } (Y_i^* \le 0) \end{cases}$$
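
The latent-variable mechanism and its estimation can be sketched as follows. Everything in this block is an assumption made for illustration: the data are simulated, the "true" parameter values are arbitrary, and the estimator is a plain Newton-Raphson maximization of the logit log-likelihood rather than any routine named in the notes.

```python
import numpy as np

rng = np.random.default_rng(42)
n = 5000

# Simulated data (illustrative only): a constant plus one explanatory variable.
x = rng.normal(size=n)
X = np.column_stack([np.ones(n), x])
beta_true = np.array([-0.5, 1.0])            # assumed "true" parameters

# Latent variable Y* = beta'X + U with logistic errors; only its sign is observed.
y_star = X @ beta_true + rng.logistic(size=n)
y = (y_star > 0).astype(float)               # Y = 1 if Y* > 0, else 0

# Maximum likelihood by Newton-Raphson on the logit log-likelihood.
beta = np.zeros(2)
for _ in range(25):
    p = 1.0 / (1.0 + np.exp(-(X @ beta)))    # P(Y = 1 | X)
    gradient = X.T @ (y - p)
    hessian = -(X * (p * (1.0 - p))[:, None]).T @ X
    step = np.linalg.solve(hessian, gradient)
    beta = beta - step
    if np.max(np.abs(step)) < 1e-10:
        break

print("estimated beta:", beta.round(3))
```

With a sample of this size the estimates should lie close to the assumed parameters, even though the latent variable itself is never used in the estimation.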

c) Binary Probit Model

All procedures described above for the binary logit model apply analogously to the binary
probit model (only the odds interpretation does not carry over). Since the logistic and normal
distributions are very similar, the results are in most situations identical for all practical
purposes. Probit coefficients can be converted to approximate logit coefficients by a scaling
factor (multiply the probit coefficients by 1.6-1.8). Only in the tails may the results differ.
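
One common way to rationalize the 1.6-1.8 range, offered here as a sketch rather than as the derivation used in the notes, is to compare the standard logistic and standard normal distributions in two ways: by their standard deviations and by their densities at zero. The probit coefficient in the last lines is a hypothetical value.

```python
import math

# Ratio of standard deviations: standard logistic (pi / sqrt(3)) vs standard normal (1).
sd_ratio = math.pi / math.sqrt(3.0)

# Ratio of densities at zero: standard normal pdf (1 / sqrt(2*pi)) vs logistic pdf (1/4).
# Equating the marginal effects of the two models at an index of zero gives this factor.
density_ratio = (1.0 / math.sqrt(2.0 * math.pi)) / 0.25

print(f"sd-based factor:      {sd_ratio:.3f}")
print(f"density-based factor: {density_ratio:.3f}")

# Hypothetical probit coefficient, converted to an approximate logit coefficient.
probit_coef = 0.50
low, high = probit_coef * density_ratio, probit_coef * sd_ratio
print(f"approximate logit coefficient: {low:.3f} to {high:.3f}")
```

The two ratios come out at roughly 1.6 and 1.8, which brackets the scaling factor quoted above.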

An example: absolute poverty status of sample households

This reflects the analysis of the determinants of absolute poverty by classifying households
as poor or non-poor.

Probit and logit both have an S-shaped probability function. As X increases, the probability
that Y = 1 increases (for a positive coefficient) but never steps outside the 0-1 interval. That
is, it approaches zero at slower and slower rates as X gets small, and it approaches one at
slower and slower rates as X gets large.

Graphically, the logit distribution has flatter tails; that is, it approaches the axes more slowly.
This is the main difference between the two.

What do the CDFs look like in a graphical representation?

[Figure: Shape of the logit and probit]

4.4 Multivariate Choice Models

a) Multinomial logit model: When the dependent variable is classified into more than two
groups, the multinomial logit model is used. In other words, this model extends the logit
model with binary outcomes to the case where the response has more than two outcomes.
Multinomial logit analysis can thus estimate the effect of the explanatory variables on
multiple categories of the dependent variable.

An example: To analyze how business constraints affect MSE operators' perception of the growth
potential of enterprise income, the three responses of the operators can be used as they are, that
is, whether income increased, remained the same or declined.

As another example, consider analyzing the effects of social capital on the source of off-farm
income: off-farm employment, trade, gift and remittance, and welfare programs. To interpret
the results of this model, we need to take one category as the reference and interpret the
remaining three categories relative to the reference category. The off-farm income equation,
which shows the interaction between the off-farm income sources and social capital,
controlling for other explanatory variables, can be written as:
$$Y_{ij} = X_i \beta_j + e_{ij}$$

where $Y_{ij}$ is a four-category response variable (off-farm income source); $X_i$ is a set of
explanatory variables; $\beta_j$ are the parameters to be estimated; and $e_{ij}$ is the disturbance term.
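
The role of the reference category can be illustrated with a short sketch of how multinomial logit choice probabilities are computed. The function name `mnl_probabilities`, the coefficient values and the use of off-farm employment as the reference category are all assumptions made for illustration.

```python
import numpy as np

def mnl_probabilities(x: np.ndarray, betas: np.ndarray) -> np.ndarray:
    """Multinomial logit choice probabilities for one observation.

    x     : explanatory variables, shape (k,)
    betas : one coefficient row per non-reference category, shape (J-1, k)
    The reference category has its coefficients fixed at zero.
    """
    utilities = np.concatenate([[0.0], betas @ x])   # reference category first
    utilities -= utilities.max()                     # numerical stability
    expu = np.exp(utilities)
    return expu / expu.sum()

# Hypothetical values: a constant plus a social-capital index, four income sources.
x_i = np.array([1.0, 0.6])
betas = np.array([[0.2, 0.5],     # trade vs. reference
                  [-0.4, 1.0],    # gift and remittance vs. reference
                  [-1.0, 0.3]])   # welfare programs vs. reference

probs = mnl_probabilities(x_i, betas)
print(probs.round(3), probs.sum())
```

Each coefficient row describes a category relative to the reference: the ratio of a category's probability to the reference probability equals the exponential of that category's index, and the four probabilities sum to one.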

b) Ordered probit model: This is a model with multiple categories, as in the multinomial
model, but the categories have a natural order. Models for ordinal dependent variables can
be formulated as threshold models with a latent dependent variable.

An Example: Extreme poverty status of sample households

This reflects the analysis of the determinants of extreme poverty by classifying households
as extreme (hard-core) poor, poor, or non-poor. This can be generalized by letting the
underlying response model be described as:

$$Y_i = \beta' x_i + u_i \quad (i = 1, 2, \ldots, n)$$

where $Y$ is the underlying response variable (extreme poverty status), $x$ is a set of
explanatory variables (demographic and socio-economic variables), and $u$ is the residual.
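
The threshold formulation can be sketched as follows: the observed category depends on which pair of cut points the underlying response falls between, so each category probability is a difference of two normal CDF values. The index value and the two cut points below are hypothetical, and the function name `ordered_probit_probs` is introduced only for this illustration.

```python
import math

def normal_cdf(z: float) -> float:
    """Standard normal CDF."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

def ordered_probit_probs(index: float, cutpoints: list[float]) -> list[float]:
    """Category probabilities Phi(c_j - index) - Phi(c_{j-1} - index).

    The lowest category starts at minus infinity (CDF value 0) and the
    highest category ends at plus infinity (CDF value 1).
    """
    cdf_values = [0.0] + [normal_cdf(c - index) for c in cutpoints] + [1.0]
    return [hi - lo for lo, hi in zip(cdf_values[:-1], cdf_values[1:])]

# Hypothetical values: index beta'x = 0.3 and two thresholds give three ordered categories.
probs = ordered_probit_probs(0.3, cutpoints=[-0.5, 1.0])
print([round(p, 3) for p in probs])
```

Two thresholds are enough for the three ordered poverty categories; raising the index shifts probability mass from the lowest category towards the highest.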

4.5 Censored or Truncated Models

Tobit Regression: Censored and truncated
Censoring occurs when some observations on the dependent variable report not the true
value but a cut point. Truncation means that observations beyond a cut point are missing
completely. OLS estimates with censored or truncated data are biased.

In case (a), the data are censored at a: for the censored observations, one knows only that the
true value is a or less. The regression line would be less steep (dashed line). Truncation
means that cases below a are completely missing. Truncation also biases OLS estimates.
Case (b) is incidental truncation, or sample selection: due to a non-random selection
mechanism, information on Y is missing for
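some cases. This also biases OLS estimates. Special estimation methods therefore exist for
such data. Censored data are analyzed with the tobit model:

$$Y_i^* = X_i \beta + \varepsilon_i$$

Where:

$Y^*$ is the latent, uncensored dependent variable

$\beta$ is the effect of the explanatory variables on the latent, uncensored variable

What we observe is

$$Y_i = \begin{cases} Y_i^* & \text{if } Y_i^* > a \\ a & \text{if } Y_i^* \le a \end{cases}$$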

In censored regression, the dependent variable may contain some zero values. With these
zero values for the dependent variable, using ordinary least squares (OLS) to estimate the
model would lead to biased and inconsistent results. Proper estimation of the model requires
the use of a censored tobit regression, which is given as:
$$Y_i = X_i \beta + \varepsilon_i \quad (i = 1, \ldots, n)$$

where $Y_i$ is the dependent variable with some zero values; $X_i$ refers to the explanatory
variables; $\beta$ is a vector of parameters; and $\varepsilon_i$ is the error term.

Note: The differences between truncated and censored models are:

In a truncated model, we do not know how many observations are truncated. For instance, in
a telephone survey, those who have no phones are truncated. There is not enough
information to obtain a correct regression.
In a censored model, we know how many observations are censored. For instance, in a
survey, those who have no cars are censored, but we know their number.
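
The distinction, and the resulting bias of OLS, can be illustrated with simulated data. This is a sketch under stated assumptions: the latent outcome has an assumed slope of 1.0, the cut point is zero, and the helper `ols_slope` is a hypothetical name. It shows the OLS bias only and does not implement the tobit estimator itself.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 5000

# Simulated latent outcome (illustrative): Y* = 1.0 * x + error.
x = rng.uniform(-2.0, 2.0, size=n)
y_star = 1.0 * x + rng.normal(size=n)

def ols_slope(x: np.ndarray, y: np.ndarray) -> float:
    """Slope from a simple OLS regression of y on x with an intercept."""
    return float(np.polyfit(x, y, 1)[0])

# Censored sample: all n cases are kept, but values at or below 0 are recorded as 0.
y_censored = np.maximum(y_star, 0.0)
n_censored = int((y_star <= 0).sum())        # known to the analyst

# Truncated sample: cases with Y* <= 0 are missing entirely.
keep = y_star > 0
x_trunc, y_trunc = x[keep], y_star[keep]

print("true slope: 1.0")
print("OLS on latent data:   ", round(ols_slope(x, y_star), 3))
print("OLS on censored data: ", round(ols_slope(x, y_censored), 3), f"({n_censored} censored)")
print("OLS on truncated data:", round(ols_slope(x_trunc, y_trunc), 3), f"({x_trunc.size} of {n} kept)")
```

In the censored sample the number of affected cases is known and their explanatory variables are still observed; in the truncated sample those cases are simply absent. In both, the OLS slope is pulled towards zero relative to the slope of the latent relationship.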

An example: To examine the effects of social capital on income diversification, that is, to
analyze the implications of households' engagement in social institutions for the
diversification of their income sources, the censored tobit model can be used. Since not all
households earn income from sources other than their main source, the dependent variable
may take zero values for some households. For this reason, the censored tobit analysis is
used, which is given as:

$$Y_i = X_i \beta + \varepsilon_i \quad (i = 1, \ldots, n)$$

where $Y_i$ is the share of income from other sources in the total income of the household,
which is censored at zero; $X_i$ is a vector of determinants of income diversification, including
variables related to social capital and household characteristics; $\beta$ is a vector of parameters;
and $\varepsilon_i$ is the error term.
