Chapter 2 (Econometrics)
Chapter 2 (Econometrics)
• In simple terms, it tells how the mean or average response of varies with .
• where and are unknown but fixed parameters known as the regression coefficients; and are also
known as intercept and slope coefficients, respectively.
• Some alternative expressions used in the literature are linear population regression model or simply
linear population regression.
• In the sequel, the terms regression, regression equation, and regression model will be used
synonymously.
The Meaning of the Term Linear
• Linearity in the Variables
The first and perhaps more “natural” meaning of linearity is that the
conditional expectation of Y is a linear function of , such as, for example, Eq.
Geometrically, the regression curve in this case is a straight line.
• The second interpretation of linearity is that the conditional expectation of , is a linear function of the parameters,
the ; it may or may not be linear in the variable .
• To see this, let us suppose X takes the value 3. Therefore, , which is obviously linear in and .
• All the models shown in Figure 2.3 are thus linear regression models, that is, models linear in the parameters.
• The preceding model is an example of a nonlinear (in the parameter) regression model.
Linearity in the Parameters
• The term “linear” regression will always mean a regression that is linear in
the parameters; the (that is, the parameters) are raised to the first power
only.
5. Poor proxy variables: Although the classical regression model assumes that the variables Y
and X are measured accurately, in practice the data may be plagued by errors of measurement.
• But since data on these variables are not directly observable, in practice we use proxy
variables, such as current consumption (Y) and current income (X), which can be observable.
The Significance of the Stochastic
Disturbance Term
6. Principle of parsimony: If we can explain the behavior of Y “substantially” with
two or three explanatory variables and if our theory is not strong enough to
suggest what other variables might be included, why introduce more variables?
Let represent all other variables.
• For all these reasons, the stochastic disturbances assume an extremely critical
role in regression analysis
The Sample Regression Function (SRF)
• We can express the SRF in Equation 2.6.1 in its stochastic form as follows:
• where, in addition to the symbols already defined, denotes the (sample) residual term.
• It is introduced in the SRF for the same reasons as was introduced in the PRF.
The Sample Regression Function (SRF)
• For , we have one (sample)
observation, .