CH - 05 - Further Issues - TQT
CH - 05 - Further Issues - TQT
5.5. Heteroskedasticity
5.1. Changing the scale of variables
(Changing the unit of measurement)
Change in the measurement unit of the dependent variable 𝑌 Intercept 𝜷𝟎 Slope coefficient 𝛽1
Y is divided by the constant 𝐶: 𝑌/𝐶 𝜷𝟎 /𝐂 𝜷𝟏 /𝐂
Y is multiplied by the constant 𝐶: 𝑌∗𝐶 𝜷𝟎 ∗ 𝐂 𝜷𝟎 ∗ 𝐂
𝑤𝑎𝑔𝑒
ෟ = 608 + 69.6edu wage is measured in 1.000 VND
𝑤𝑎𝑔𝑒
ෟ = 0.608 + 0.0696𝑒𝑑𝑢 wage is measured in 1.000.000 VND: Wage/1000
Change in the measurement unit of the independent variable 𝐗 Intercept 𝜷𝟎 Slope coefficient 𝛽1
X is divided by the constant 𝑪: 𝐗/𝐶 𝜷𝟎 𝛽1 ∗ 𝐶
X is multiplied by the constant 𝑪: 𝐗∗𝐶 𝜷𝟎 𝛽0 /𝐶
𝑤𝑎𝑔𝑒
ෟ = 608 + 70edu education is measured in years.
𝑤𝑎𝑔𝑒
ෟ = 608 + 𝟓. 𝟖𝑒𝑑𝑢 education is measured in months. Edu*12
Changing the unit of measurement
(Cont)
It is easier to compare the effect when looking at the standardized coefficients (beta).
Income has the largest standardized effect, while household size has the smallest standardized effect.
One standard deviation increase in income raises consumption by 0.716 standard deviation.
Beta coefficient
The value of beta coefficients (standardized coefficient) in Table 4 indicates that
forest land plays a larger role in household income than other land types.
5.2. More on functional form
Some notes:
The log of variables with units like years should be avoided.
Additionally, variables expressed in percentage points shouldn't be taken log
Do not take log if the variables contain zero or negative values
It is challenging to predict y by exponentiating the predicted value of log(y).
5.2. More on functional form
Quadratic Function: y = b0 + b1x + b2x2 + u
This functional form captures decreasing or increasing marginal effects
We must not interpret b1 alone as measuring the change in Y with respect to X; we
need to take into account b2 and the value of X as well, since:
∆𝑦ො ≈ 𝛽1 + 2𝛽2 𝑥 ∆𝑥, 𝑠𝑜 ∆𝑦/∆𝑥
ො ≈ 𝛽1 + 2𝛽2 𝑥
If X increases by one unit, Y changes by ( 𝛽1 + 2𝛽2 𝑥) units
Wage = b0+b1𝑒𝑥𝑝𝑒𝑟 + b2𝑒𝑥𝑝𝑒𝑟 2 + u
= 5367.546 + 225.8147𝑒𝑥𝑝𝑒𝑟 − 4.910445𝑒𝑥𝑝𝑒𝑟 2
𝑊𝑎𝑔𝑒
∗ 𝛽1
the turning point is 𝑋 = | |= 225.8147 / 2 ∗ 4.910445 = 22.993303 ≈23 (years)
2𝛽2
Note: If 𝑥 = 0, then 𝛽መ1 can be interpreted as the approximate slope in going from 0 to 1. After that 2𝛽መ2 𝑥
must be accounted for.
5.2. More on functional form
With 𝛽መ1 >0 and 𝛽መ2 <0 , the slope is initially positive, but it diminishes when X increases. X* is
the maximum of the function, and after X*, the slope is negative.
Y= b0 + b1x -b2x2 + u
𝛽1 >0 and 𝛽2 <0: diminishing effect of X on Y: Inverted U-shape or a parabolic shape
5.2. More on functional form
With 𝛽መ1 <0 and 𝛽መ2 >0 , the slope is initially negative, but it increases (less negative) when X
increases. X* is the minimum of the function, and after X*, the slope is positive.
Y= b0 -b1x +b2x2 + u
Ho is rejected in favour of H1
There is an inverse U shape: diminishing effect
5.3. More on goodness-of-fit and selection of regressors
Goodness of Fit: The adjusted R2: 𝑅ത 2
❑ Note that the R2 has a limitation that it will always increase when adding
more explanatory variables to the model.
(𝑅𝑆𝑆/(𝑛 − 𝑘 − 1))
𝑅ത 2 = 1 − = 𝑎𝑑𝑗𝑢𝑠𝑡𝑒𝑑 𝑅2
𝑇𝑆𝑆Τ(𝑛 − 1)
❑ 𝑅ത 2 increases if, and only if, the t-statistic of a newly added variable is
greater than one in absolute value.
Exponentiating the predicted values of log(y) will underestimate the expected value
of y
𝑦ො = exp(𝑙𝑜𝑔𝑦)
We need to scale this up by an estimate of the expected value of exp(u):
𝜎2
𝑇ℎ𝑒 𝑒𝑥𝑝𝑒𝑐𝑡𝑒𝑑 𝑣𝑎𝑙𝑢𝑒 𝑜𝑓 exp 𝑢 = exp 𝒊𝒇𝒖 is normally distributed
2
exp ෝ2
𝜎
The predicted value of Y: 𝑦ො = exp 𝑙𝑜𝑔𝑦 .
2
❑ 𝜎ො 2 : the estimated error variance which is provided by the regression results.
ෝ2
𝜎 1 𝑛
❑ Because 𝜎ො 2 > 0, exp 2
>1 ෝ𝟐 =
𝝈 𝑢ො 𝑖2
𝑛 − 𝑘 − 1 𝑖=1
ෝ2
𝜎
The predicted value of Y: 𝑦 exp
ො = exp 𝑙𝑜𝑔𝑦
Example: 2
ෝ=
𝝈 𝜎ො 2 : 𝐑𝐨𝐨𝐭 𝐦𝐞𝐚𝐧 𝐬𝐪𝐮𝐚𝐫𝐞𝐝 𝐞𝐫𝐫𝐨𝐫
Standard error of the regression
1 𝑛
ෝ𝟐 =
𝝈 𝑢ො 𝑖2
𝑛 − 𝑘 − 1 𝑖=1
Estimated variance of the error term
Predicting y in a log model
(Cont)
2 2
σ𝑛
𝑖=1 𝑟Ƹ 𝑖𝑗 𝑢
ෝ𝑖
White/Eicker standard errors: 𝛽መ𝑗 )=
𝑉𝑎𝑟(
𝑅𝑆𝑆𝑗2
❑ Log-transformation.